Intelligent File Classification
Advanced Content Analysis & Automated File Categorization
Task Overview
Perform comprehensive analysis of files to determine their optimal placement within project directory structures using advanced NLP, code parsing, and metadata extraction techniques. This task transforms manual file organization into an intelligent, automated classification system.
Core Objectives
Analyze file content using multi-modal understanding techniques
Extract meaningful metadata from various file types and formats
Determine optimal placement based on content, context, and project structure
Maintain high accuracy through continuous learning and feedback integration
Provide transparent reasoning for all classification decisions
Input Requirements
File Analysis Context
classification_input:
file_information:
- file_path: "absolute_path_to_file"
- file_size: "size_in_bytes"
- file_extension: "file_extension"
- mime_type: "detected_mime_type"
- creation_date: "iso_8601_timestamp"
- modification_date: "iso_8601_timestamp"
project_context:
- project_type: "web_app | ml_project | research | documentation"
- directory_structure: "current_project_structure_mapping"
- naming_conventions: "established_naming_patterns"
- classification_rules: "project_specific_placement_rules"
content_analysis:
- raw_content: "file_content_as_string"
- extracted_text: "text_extracted_from_binary_files"
- metadata: "embedded_file_metadata"
- relationships: "connections_to_other_files"Execution Workflow
Phase 1: Content Extraction & Preprocessing (30-60 seconds)
Phase 2: Multi-Modal Analysis (1-2 minutes)
Phase 3: Classification Decision Making (30-60 seconds)
Classification Rules Engine
Content-Based Classification Rules
Advanced Pattern Recognition
Machine Learning Integration
Continuous Learning System
Quality Assurance
Classification Validation
Success Metrics
Performance Indicators
โ Classification Accuracy: 95%+ correct placement for standard file types
โ Processing Speed: Average 30 seconds per file for complete analysis
โ Confidence Reliability: 90%+ accuracy for high-confidence classifications
โ User Satisfaction: Less than 5% manual corrections required
Quality Standards
โ Consistency: Same file types consistently classified to same locations
โ Transparency: Clear reasoning provided for all classification decisions
โ Adaptability: Continuous improvement through feedback integration
โ Scalability: Handle increasing file volumes without performance degradation
Integration Points
Handoff to Locomoto
This task ensures intelligent, accurate file classification that forms the core of the automated file organization system, providing the intelligence needed to maintain organized, logical project structures.
Last updated