Workflow

The workflow feature is a core capability of MatrixOne Intelligence, enabling users to define and execute complex data processing tasks through a visual interface.

Workflow Creation

Enter the workspace, click Data Processing > Workflow > Create Workflow, and fill in the information according to your actual needs to complete the workflow creation. The system supports two modes: intelligent creation and manual creation. Intelligent creation allows you to quickly generate workflows through natural language.

Basic Configuration

Configuration Item	Description
Source Data	Input data storage location
Target Location	Output path for processed results. Cannot select an already occupied target location.
File Type	Supported formats: • Documents: doc/docx/ppt/pptx/txt/md/pdf/xlsx/xls • Images: jpg/jpeg/bmp/png • Videos: mp4/mov/mkv • Audio: wav/mp3/aac/flac
Priority	Options: "Low", "Medium" (default), and "High". Once set, new workflow jobs will immediately follow this priority. When multiple workflows run concurrently, the platform schedules them in order of priority (High to Low).
Processing Mode	Supported modes: • Single Run: Executes once when triggered • Scheduled: Intervals: 1/5/10/30 minutes, 1/2/4/8 hours, 1 day (default: 5 minutes). Short intervals (<1 day) trigger on the hour (e.g., 30-minute intervals at :00/:30). Long intervals (≥1 day) require manual next execution time. • Load Trigger: Executes immediately after a batch of files is loaded into the same source volume.
Processing Scope	"Single Run" mode supports processing by file type or individual files. File-level selection is only available when a single source is chosen.
Branch Name	Name of the current workflow branch (default: "main").

Processing Pipeline Configuration

Supports multiple processing operators, including: Document Parser Node, Image Parser Node, Audio Parser Node, Video Parser Node, Chunking Node, Text Embedding Node, Information Extraction Node, Data Cleaning Node, and Data Augmentation Node.

Document Parser Node

Extracts structured content from documents (text, images, tables, headings).

Module	Description
Node Name	Within 100 characters, unique within the workflow.
Image Description	Generates image content descriptions based on the Qwen/Qwen2-VL-72B-Instruct model, supports selecting description language: Chinese/English.
PPT Multimodal Parsing	When enabled, converts each page of the PPT into an image for processing.
CSV Settings	Supports configuring delimiter, quote character, backslash escape, and whether to use the first row as the table header.
OCR Recognition	Uses the ucaslcl/GOT-OCR2_0 model to extract text from images.
Notes	Node remarks.

Image Parser Node

Recognizes text and visual content in images for structured understanding.

Module	Description
Node Name	Unique within workflow (≤100 chars).
Image Captioning	Generates image descriptions using Qwen/Qwen2-VL-72B-Instruct model (supports Chinese/English).
OCR	Extracts text from images using ucaslcl/GOT-OCR2_0 model.
Notes	Optional node remarks.

Audio Parser Node

Transcribes speech content into text with high accuracy.

Module	Description
Node Name	Unique within workflow (≤100 chars).
Noise Reduction	Improves SNR and accuracy (increases compute overhead, may lose some details).
Voice Segmentation	• Uses VAD to split audio into speech/non-speech segments. • Min Silence Gap: Shortest silence to split segments (default: 0.5s, range: 0.1-2s). • Max Speech Duration: Longest segment before forced split (default: 30s, range: 5-60s).
ASR Model	Uses SenseVoice model for transcription.
Notes	Optional node remarks.

Video Parser Node

Extracts audio from video and transcribes it into text.

Module	Description
Node Name	Unique within workflow (≤100 chars).
Noise Reduction	Improves SNR and accuracy (increases compute overhead, may lose some details).
Voice Segmentation	• Uses VAD to split audio. • Min Silence Gap: (default: 0.5s, range: 0.3-3s). • Max Speech Duration: (default: 30s, range: 10-180s).
ASR Model	Uses SenseVoice model for transcription.
Notes	Optional node remarks.

Split Node

Segments text based on rules.

Module	Description
Node Name	Within 100 characters, unique within the workflow.
Document	General Documents: Text: • Segmentation Method: By single delimiter, by multiple delimiters • Segment Max Length: 100-2000 (default 1024) • Segment Overlap: Not exceeding the set field segment length. Within 100 characters, unique within the workflow. Images: • Context Overlap: Default 50. Tables: • Context Overlap: Default 50. Tables: Excel, CSV segmented by row.
Image	Image Element Merging: OCR recognized content and image description content merged as a single segment.
Excel, CSV segmented by row.
Audio/Video	Segmented by transcribed speech chunks, supports setting segment max length, default 1024.
Node Name	Within 100 characters, unique within the workflow.

Text Embedding Node

Converts text to semantic vectors for intelligent understanding/retrieval.

Module	Description
Node Name	Unique within workflow (≤100 chars).
Embedding	Uses BAAl/bge-m3 model for vector generation.
Notes	Optional node remarks.

Information Extraction Node

Information Extraction Node leverages AI model capabilities combined with preset field rules to automatically extract key information from text.

Module	Function Description
Node Name	Within 100 characters, unique within the workflow. Defaults to the node type name.
Extraction Result	• Generate results per file: Independently extracts information from each file input to the workflow and generates corresponding JSON result files. • Generate a single combined result for all files: Processes all input files as a whole, searches for and integrates required information across all files, ultimately generating only one merged JSON result file.
Extraction Model	Supports integration of Large Language Models (Qwen-Turbo) and Multimodal Models (Qwen/Qwen2.5-VL-32B-Instruct) for field extraction. When selecting a multimodal model, it can be directly connected to the Start Node within the workflow to process files containing documents and images (direct processing of htm/html/xlsx/xls/csv format files is not currently supported), eliminating the need for additional parsing nodes. Token limit is 128k.
Extraction Information**	Includes three default field extraction templates: Financial Statements (including tables), Invoices, and Resumes. Also supports custom field configuration via a form interface. The system automatically generates the corresponding JSON Schema based on field name, type, meaning, and whether it's required. Supports up to 4 levels of nested structures, with a maximum of 40 fields. During configuration, you can switch to JSON Preview mode at any time to intuitively view the actual Schema content generated by the system.
Description	Node notes/remarks.

Note

When performing information extraction, it is recommended to select the appropriate model based on the file type and content structure:For well-structured text files (e.g., parsed PDF, Word, TXT), Large Language Models are recommended. They excel at semantic understanding and field inference, making them suitable for scenarios with complex logic and strong inter-field relationships.For original PDFs, images, or scanned documents, Multimodal Models are recommended. They can directly recognize mixed text and image layouts without requiring additional parsing steps, making them more suitable for extracting information from documents with complex formats or containing charts, forms, and invoices. Choosing the right model helps improve extraction accuracy and processing efficiency.

Data Type Selection

The Information Extraction Node supports the following data types: object, string, boolean, number, array/string, array/boolean, array/number, array/object. Choosing the appropriate data type helps improve extraction accuracy and subsequent process efficiency.

Basic Types

string: Suitable for text-based fields where the content typically doesn't require numerical operations. Examples include single names, titles, IDs, descriptive fields, addresses, remarks, explanatory information, single dates, etc.Example:
- Customer Company: "xxx Co., Ltd."
- Invoice Code: "1234567890"
- Date of Birth: "1990-05-13"
number: Suitable for numerical fields that can be used for calculations or comparisons. Examples include amounts, quantities, percentages, tax rates, years of experience, counts, etc. Example:
- Amount: 12345.67
- Tax Rate: 0.13,
- Stock Quantity: 150
boolean: Used for binary judgments, with answers like "yes/no" or "true/false". Examples include whether approval was passed, whether tax is included, whether currently employed, etc.Example:
- is_active: true,
- is_tax_included: false
object: Used for structured data, where a single field contains multiple sub-fields. Suitable for complex entities or field groups. Examples include an address object (province, city, district, detailed address), an invoice information object (invoice number + issue date + amount), or a single entry within "Education Experience" or "Work Experience" in a resume. Example:

     {
       "company": "Matrix Origin",
       "position": "Engineer",
       "start_date": "2020-01",
       "end_date": "2023-01"
     }

Array Types

array/string: Used when a field may contain multiple text values. Examples include skill lists (e.g., in a resume), multiple tags, multiple recipients, multiple subject names, etc.Example:

[“Java”, “Python”, “SQL”]

array/number: Used for multiple numerical items. Examples include lists of amounts across multiple periods, multiple scores, collections of numerical attributes, etc.Example:

[95, 88, 76]

array/boolean: Suitable for combinations of multiple Boolean fields. Examples include multi-dimensional permission flags, daily attendance records (true/false lists), etc.Example:

[true, false, true]

array/object: One of the most commonly used composite structures, used for lists containing multiple similar objects. Choose this when the text contains repeating structures. Examples include multiple work experiences or education entries in a resume, multiple line items on an invoice (product name, quantity, unit price, amount), multiple account lines in financial statements, multiple transaction records in a report, etc.Example:

[
   {
      "item": "商品 A",
      "quantity": 2,
      "price": 100
   },
   {
      "item": "商品 B",
      "quantity": 1,
      "price": 200
   }
]

Summary:

Type	Usage Scenario
string	Single text field (name, ID, date, description, etc.)
number	Amount, quantity, ratio, fields for numerical calculation
boolean	Yes/No type fields (yes/no, true/false)
object	A field containing multiple sub-fields, a structured entity
array/string	Multiple text items (skill list, tags, subject name list)
array/number	Collection of numbers (multi-period amounts, score list)
array/boolean	Multiple Boolean items (e.g., daily attendance record)
array/object	Multiple structured records (work experiences, invoice line items, report rows)

Data Cleaning Node

Identifies and handles redundant/erroneous data.

Module	Description
Node Name	Unique within workflow (≤100 chars, defaults to node type).
Sensitive Info Masking	Off by default. Removes PII (phones/emails).
Text Normalization	Unicode normalization & Traditional-to-Simplified Chinese conversion.
Special Char Removal	Removes URLs, invisible chars, HTML tags.
Special Char Filtering	Auto-deletes text blocks exceeding threshold.
Sensitive Word Filtering	Removes flagged terms.
Deduplication	Uses N-Gram repetition ratio threshold.
Notes	Optional node remarks.

Data Enhancement Node

Module	Description
Node Name	Unique within workflow (≤100 chars, defaults to node type).
Samples per Chunk	Max generated samples per chunk (1-20, default: 10).
Data Format	Supports Alpaca, ShareGPT, OpenAI formats + custom (max 40 fields, 4-level nesting).
Model	Uses qwen-turbo for generation.
Notes	Optional node remarks.

Node Dependencies

Node Type	Upstream Nodes	Downstream Nodes
Start Node	None	• Doc Parser • Image Parser • Audio Parser • Video Parser • Info Extraction
Doc Parser	Start Node	• Info Extraction • Chunking • End Node
Image Parser	Start Node	• Info Extraction • Chunking • End Node
Audio Parser	Start Node	• Info Extraction • Chunking • End Node
Video Parser	Start Node	• Info Extraction • Chunking • End Node
Chunking	• Start Node • Parsers	• Text Embedding • Data Cleaning • Data Augmentation • End Node
Text Embedding	Chunking	End Node
Info Extraction	• Start Node • Parsers	• Data Augmentation • End Node
Data Cleaning	• Parsers • Chunking	• Text Embedding • Data Augmentation • End Node
Data Augmentation	• Data Cleaning • Chunking • Info Extraction	End Node
End Node	All except Start Node	None

Branch Management

Workflow branches help manage different versions of similar data processes efficiently:

Reduces management overhead (avoid duplicate workflows)
Optimizes resources (shared steps execute once)
Simplifies comparison (visual diff between branches)

Branches function like Git: each workflow has a "main" branch (default) with shared base configs (source/target volumes, file types). Branches can modify their processing pipelines independently, with results stored in subdirectories by branch name.

Create Branch

From workflow list, click "Create Branch". Select base branch (default: "main"). New branch inherits the base's pipeline. Branch names must be unique.

Note

Main branch cannot be deleted alone. Deleting all branches deletes the workflow.

New branches may execute immediately based on workflow state. All branches share execution resources (identical steps run once). Workflow state is collective; start/stop affects all branches.

Modify Branch

Only stopped workflows can be edited
Only "main" branch can edit base configs; all branches can adjust pipelines independently

Compare Branches

Includes "main" branch by default; supports multi-select comparison

Delete Branch

"Main" branch cannot be deleted alone
Deleting all branches deletes the workflow
Option to delete branch data from target volume

Workflow Management

From the workflow list, you can:

Rerun workflows
Modify/delete workflows (deleting all branches deletes the workflow)

Click a workflow name to view details. Use the edit button to modify/rerun, or click "Execution Details" to view job status.