From Text to Pixel: Advancing Long-Context Understanding in MLLMs.
Yujie LuXiujun LiTsu-Jui FuMiguel P. EcksteinWilliam Yang WangPublished in: CoRR (2024)
Keyphrases
- information retrieval
- neural network
- contextual information
- text documents
- machine learning
- theoretical and practical implications
- keywords
- text mining
- context sensitive
- free text
- pixel level
- textual data
- connected component analysis
- text information
- intensity values
- automatically extracted
- segmentation algorithm
- information extraction
- image segmentation
- multimedia
- knowledge base