Manifest.json Format Changes and Discrepancies

Date: December 2024
Purpose: Document discrepancies between the original specification and the current implementation

Executive Summary

The Verdikta manifest.json format has evolved significantly from its original specification. While maintaining backward compatibility, several new features have been added and the implementation has been enhanced to support complex multi-party arbitration scenarios.

Key Discrepancies Identified

1. Missing Documentation for Implemented Features

Multi-CID Support ✅ IMPLEMENTED but NOT DOCUMENTED

Current Implementation: Full multi-CID support with bCIDs object and hierarchical archive processing
Original Documentation: No mention of multi-CID capabilities
Impact: Major feature gap in documentation

// IMPLEMENTED but not in original docs
{
  "bCIDs": {
    "plaintiffComplaint": "Description of plaintiff archive",
    "defendantRebuttal": "Description of defendant archive"
  },
  "addendum": "Real-time data description"
}

Addendum Feature ✅ IMPLEMENTED but NOT DOCUMENTED

Current Implementation: Real-time data injection with content sanitization
Original Documentation: No mention of addendum functionality
Usage: Allows injection of current market data, timestamps, etc.

Enhanced IPFS Support ✅ IMPLEMENTED but PARTIALLY DOCUMENTED

Current Implementation: Full IPFS CID support for primary, additional, and support files
Original Documentation: Limited IPFS documentation
Features: Automatic caching, multiple hash formats, error handling

2. Implementation Improvements Not Documented

Default Outcome Generation ✅ ENHANCED BEHAVIOR

Current Implementation: Automatically generates outcomes based on NUMBER_OF_OUTCOMES if not provided in primary file
Original Documentation: Unclear about default behavior
Format: ["outcome1", "outcome2", ...]
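The default behavior described above can be sketched as follows. This is a minimal illustration, not the parser's actual code: the function names `defaultOutcomes` and `resolveOutcomes` are hypothetical, and only the label pattern ("outcome1", "outcome2", ...) comes from this document.

```javascript
// Hypothetical helper: build default labels "outcome1".."outcomeN".
function defaultOutcomes(numberOfOutcomes) {
  if (!Number.isInteger(numberOfOutcomes) || numberOfOutcomes < 1) {
    throw new RangeError("numberOfOutcomes must be a positive integer");
  }
  return Array.from({ length: numberOfOutcomes }, (_, i) => `outcome${i + 1}`);
}

// Hypothetical helper: prefer outcomes from the primary file, else fall back
// to generated defaults (assumed precedence).
function resolveOutcomes(primary, numberOfOutcomes) {
  const provided = primary && primary.outcomes;
  return Array.isArray(provided) && provided.length > 0
    ? provided
    : defaultOutcomes(numberOfOutcomes);
}
```

For example, `resolveOutcomes({}, 2)` falls back to generated labels, while a primary file that lists its own outcomes is returned unchanged.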

Enhanced Error Handling ✅ IMPROVED

Current Implementation: Comprehensive error messages with specific guidance
Original Documentation: Limited error handling documentation
Improvement: Better debugging and troubleshooting support

File Type Support ✅ EXPANDED

Current Implementation: Extensive MIME type support including images, documents, data files
Original Documentation: Limited file format specification
Added Support: WebP, RTF, DOCX, Markdown, CSV, and more
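As an illustration of extension-based type detection for the formats listed above, a lookup might look like the sketch below. The table and the `guessMimeType` name are assumptions for this example; the actual implementation may detect types differently and supports more formats than shown. The MIME strings themselves are the standard registered values.

```javascript
// Assumed lookup table covering only the formats named in this document.
const MIME_BY_EXTENSION = {
  ".webp": "image/webp",
  ".rtf": "application/rtf",
  ".docx": "application/vnd.openxmlformats-officedocument.wordprocessingml.document",
  ".md": "text/markdown",
  ".csv": "text/csv",
  ".txt": "text/plain",
};

// Hypothetical helper: map a filename to a MIME type, case-insensitively.
function guessMimeType(filename) {
  const dot = filename.lastIndexOf(".");
  if (dot <= 0) return "application/octet-stream"; // no extension, or a dotfile
  const ext = filename.slice(dot).toLowerCase();
  return MIME_BY_EXTENSION[ext] ?? "application/octet-stream";
}
```

Falling back to `application/octet-stream` keeps unknown files processable as opaque binary rather than rejecting them outright; whether the real parser rejects or accepts unknown types is not stated here.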

3. Testing Tool Integration ✅ NEW FEATURE

Simplified Manifest Format ✅ NOT DOCUMENTED

Current Implementation: Supports both legacy and simplified formats
Original Documentation: Only full manifest format
Purpose: Streamlined testing and attachment-only scenarios

// NEW: Simplified format for testing
{
  "format": "simplified",
  "name": "Test Case",
  "attachments": [
    {
      "filename": "file.txt",
      "name": "Test File",
      "type": "text/plain"
    }
  ]
}
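A consumer that accepts both formats needs to tell them apart. The sketch below assumes the `"format": "simplified"` marker shown above is the discriminator and that each attachment needs a `filename` and a `type`; both assumptions, and the function names, are illustrative rather than taken from the implementation.

```javascript
// Hypothetical: treat anything without the "simplified" marker as legacy.
function detectManifestFormat(manifest) {
  return manifest && manifest.format === "simplified" ? "simplified" : "legacy";
}

// Hypothetical: collect human-readable problems instead of throwing,
// so a testing tool can report all of them at once.
function listAttachmentProblems(manifest) {
  const problems = [];
  const attachments = Array.isArray(manifest.attachments) ? manifest.attachments : [];
  attachments.forEach((attachment, i) => {
    const entry = attachment ?? {};
    for (const field of ["filename", "type"]) { // assumed required fields
      if (typeof entry[field] !== "string" || entry[field] === "") {
        problems.push(`attachments[${i}].${field} is missing`);
      }
    }
  });
  return problems;
}
```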

Feature Implementation Status

✅ Fully Implemented and Working

Multi-CID Processing - Complete implementation with validation
Addendum Support - Content injection with sanitization
IPFS Integration - Full CID support with caching
Default Outcomes - Automatic generation based on parameters
Enhanced Validation - Joi schema with comprehensive error handling
Multiple File Formats - Extensive MIME type support
Testing Tool Integration - Both legacy and simplified formats

⚠️ Partially Implemented

Schema Validation Enforcement - Schema exists but not fully enforced
Complex File Type Detection - Basic support, some edge cases

❌ Limitations Found

Hash-only Primary Files - Implementation exists but limited testing
Validation Completeness - Some inconsistencies between schema and parser

API Changes and Enhancements

Parser Output Enhancements

The manifest parser now returns additional fields that the original documentation does not mention:

// NEW: Enhanced parser output
{
  prompt: "Combined query text",
  models: [...],
  iterations: 1,
  outcomes: [...],
  name: "Case Name",          // NEW
  addendum: "Addendum desc",  // NEW
  bCIDs: {...},               // NEW
  references: [...],          // NEW
  additional: [...],
  support: [...]
}

Multi-CID Query Construction ✅ NEW FEATURE

Combines content from multiple archives
Hierarchical content organization
Reference section aggregation
Addendum injection support
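The four steps above can be sketched as a single function. The input shape (`name`, `description`, `prompt`, `references` per archive), the section headers, and the `buildCombinedQuery` name are all assumptions made for illustration; the real parser's layout of the combined prompt is not specified in this document. The addendum is assumed to be sanitized before it reaches this function.

```javascript
// Hypothetical: combine per-archive content into one prompt, in archive order.
function buildCombinedQuery(archives, addendum) {
  // One section per archive keeps each party's content distinguishable.
  const sections = archives.map(
    (a) => `[${a.name}] ${a.description}\n${a.prompt}`
  );

  // Aggregate references from every archive into a single list.
  const references = archives.flatMap((a) => a.references ?? []);
  if (references.length > 0) {
    sections.push(`[References]\n${references.map((r) => `- ${r}`).join("\n")}`);
  }

  // Append the (already sanitized) addendum last, if one was supplied.
  if (addendum) {
    sections.push(`[Addendum]\n${addendum}`);
  }

  return { prompt: sections.join("\n\n"), references };
}
```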

Migration Recommendations

For Existing Users

Continue using current manifests - Full backward compatibility maintained
Consider upgrading to new features - Multi-CID for complex cases
Update documentation references - Use new specification document

For New Users

Start with current specification - Use MANIFEST_SPECIFICATION.md
Leverage new features - Multi-CID, addendum, enhanced IPFS support
Use simplified format for testing - When appropriate

Documentation Updates Required

High Priority

Multi-CID Documentation - Complete feature documentation needed
Addendum Usage Guide - Real-time data injection examples
IPFS Integration Guide - Comprehensive CID usage documentation
Testing Tool Integration - Simplified format documentation

Medium Priority

Error Handling Guide - Comprehensive troubleshooting documentation
File Format Support - Complete MIME type reference
Performance Guidelines - Best practices for large files and multi-CID

Low Priority

API Reference Updates - Parser output documentation
Migration Examples - Upgrade path examples
Advanced Use Cases - Complex scenario documentation

Validation Schema Discrepancies

Schema vs Implementation

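The Joi validation schema in validator.js does not match the checks the parser actually performs. The schema requires primary.filename and allows primary.hash alongside it, whereas the parser expects exactly one of the two: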

// Schema allows optional primary.hash
primary: Joi.object({
  filename: Joi.string().required(),
  hash: Joi.string().optional()
}).required()

// But parser enforces: filename XOR hash (not both)
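To make the parser-side rule concrete, the XOR check can be written in plain JavaScript as below. This is a sketch of the rule as described in this section, not the parser's code; `checkPrimary` and its return shape are hypothetical.

```javascript
// Hypothetical: enforce "filename XOR hash" on the primary entry.
function checkPrimary(primary) {
  const hasFilename = typeof primary.filename === "string" && primary.filename.length > 0;
  const hasHash = typeof primary.hash === "string" && primary.hash.length > 0;

  // Both present or both absent violates the exclusive-or rule.
  if (hasFilename === hasHash) {
    return {
      valid: false,
      error: hasFilename
        ? "primary must not set both filename and hash"
        : "primary must set either filename or hash",
    };
  }
  return { valid: true, source: hasFilename ? "filename" : "hash" };
}
```

Note that a hash-only primary passes this check but would fail the schema shown above, because the schema marks `filename` as required; that is the discrepancy to resolve.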

Recommendations

Align schema with implementation - Update Joi schema to match actual validation
Enforce schema validation - Use schema more consistently in parser
Add missing validations - Some edge cases not covered

Security Considerations

Addendum Sanitization ✅ IMPLEMENTED

Content sanitization to prevent code injection
Character filtering for security
Safe string interpolation
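The three measures above might be combined roughly as follows. The specific character set that is filtered, the length cap, and both function names are assumptions for this sketch; the actual sanitization rules are not listed in this document.

```javascript
const MAX_ADDENDUM_LENGTH = 2000; // assumed cap, not from the implementation

// Hypothetical: strip control characters and characters commonly used for
// markup or template injection, then trim and cap the length.
function sanitizeAddendum(raw) {
  return String(raw)
    .replace(/[\u0000-\u0008\u000B\u000C\u000E-\u001F\u007F]/g, "") // keep \t, \n, \r
    .replace(/[<>`$\\{}]/g, "")
    .trim()
    .slice(0, MAX_ADDENDUM_LENGTH);
}

// Hypothetical: the sanitized value is appended as plain text and is never
// evaluated or used as a template itself.
function appendAddendum(prompt, description, value) {
  return `${prompt}\n\nAddendum (${description}): ${sanitizeAddendum(value)}`;
}
```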

IPFS Security ⚠️ CONSIDERATIONS NEEDED

CID validation before fetching
File size limits for IPFS content
Network timeout handling
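One possible shape for these three safeguards is sketched below. The regular expressions only check that a string looks like a CIDv0 (base58, "Qm" prefix) or a base32 CIDv1; they do not decode or verify it. The `fetchArchive` function, its `gatewayBaseUrl` parameter, and the 30-second timeout are hypothetical; the 10MB limit mirrors the archive size recommendation in this document. A runtime with a global `fetch` (for example Node 18 or later) is assumed.

```javascript
const CID_V0 = /^Qm[1-9A-HJ-NP-Za-km-z]{44}$/; // 46-char base58 string
const CID_V1_BASE32 = /^b[a-z2-7]{58,}$/;      // multibase "b" + lowercase base32

// Shape check only; a real validator would decode the CID.
function looksLikeCid(value) {
  return typeof value === "string" && (CID_V0.test(value) || CID_V1_BASE32.test(value));
}

// Hypothetical fetch wrapper: validate first, abort on timeout, enforce a size cap.
async function fetchArchive(gatewayBaseUrl, cid, options = {}) {
  const { timeoutMs = 30000, maxBytes = 10 * 1024 * 1024 } = options;
  if (!looksLikeCid(cid)) throw new Error(`Refusing to fetch malformed CID: ${cid}`);

  const controller = new AbortController();
  const timer = setTimeout(() => controller.abort(), timeoutMs);
  try {
    const response = await fetch(`${gatewayBaseUrl}/ipfs/${cid}`, { signal: controller.signal });
    if (!response.ok) throw new Error(`Gateway returned HTTP ${response.status}`);
    // Simplification: the size is checked after download; streaming the body
    // and aborting early would avoid buffering oversized content.
    const body = Buffer.from(await response.arrayBuffer());
    if (body.length > maxBytes) throw new Error(`Archive exceeds ${maxBytes} bytes`);
    return body;
  } finally {
    clearTimeout(timer);
  }
}
```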

Performance Implications

Multi-CID Processing

Sequential processing - May impact performance with many CIDs
IPFS network dependency - Latency considerations
File caching - Local storage usage

Recommendations

Archive size limits - Keep individual archives under 10MB
IPFS optimization - Cache frequently used content
Error handling - Graceful degradation for network issues
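Graceful degradation could mean that one unreachable archive does not abort the whole case. The sketch below uses `Promise.allSettled` to fetch archives concurrently and separate successes from failures. This differs from the sequential processing the current implementation is described as using, and the names `loadArchives`, `partitionSettled`, `cidsByName`, and `fetchOne` are hypothetical.

```javascript
// Split Promise.allSettled results into loaded archives and failure messages.
function partitionSettled(names, settled) {
  const loaded = {};
  const failed = {};
  settled.forEach((result, i) => {
    if (result.status === "fulfilled") {
      loaded[names[i]] = result.value;
    } else {
      failed[names[i]] = String(result.reason?.message ?? result.reason);
    }
  });
  return { loaded, failed };
}

// Hypothetical: cidsByName maps archive names to CIDs; fetchOne(cid) returns a
// promise for the archive content. Failures are reported, not thrown.
async function loadArchives(cidsByName, fetchOne) {
  const names = Object.keys(cidsByName);
  const settled = await Promise.allSettled(names.map((name) => fetchOne(cidsByName[name])));
  return partitionSettled(names, settled);
}
```

The caller can then decide whether a partial result is acceptable, for example proceeding when only an optional archive failed and stopping when a required one did.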

Future Development Recommendations

Short Term

Complete documentation updates - Address all identified gaps
Schema validation alignment - Fix discrepancies
Testing coverage - Improve multi-CID and IPFS tests

Medium Term

Performance optimization - Parallel CID processing
Enhanced validation - More comprehensive error checking
File type expansion - Additional format support

Long Term

Versioning strategy - Formal manifest version management
Breaking change process - Controlled evolution path
Advanced features - Plugin architecture, custom validators

Conclusion

The Verdikta manifest.json format has evolved significantly beyond its original specification, with several powerful new features implemented but not fully documented. The current implementation is robust and backward-compatible, but requires updated documentation to reflect its full capabilities.

The new comprehensive specification (MANIFEST_SPECIFICATION.md) addresses these gaps and provides a current, accurate reference for developers and users of the Verdikta system.

Immediate Action Required:
1. Update all references to point to new specification
2. Consider the original manifestFile-r3.docx as superseded
3. Begin using new features (multi-CID, addendum) in appropriate scenarios

Next Steps:
1. Review and validate the new specification with the development team
2. Update any external documentation or integrations
3. Consider implementing suggested improvements and fixes