2 7

Dana Aubakirova

danaaubakirova

AI & ML interests

DocumentAI, Deep Learning, Multimodal Learning, Computer Vision, Image Processing, NLP

Articles

Introducing TextImage Augmentation for Document Images

Aug 6

• 30

LAVE: Zero-shot VQA Evaluation on Docmatix with LLMs - Do We Still Need Fine-Tuning?

Jul 25

• 18

Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task

May 16

• 17

Organizations

Posts 2

Post

829

🚀 We are thrilled to introduce TextImage Data Augmentation, developed in collaboration with Albumentations AI! ✨ This multimodal technique modifies document images and text simultaneously, enhancing Vision Language Models (VLMs) for high-text datasets.

👩‍💻 Learn how this innovative approach can improve your document AI projects by checking out our full blog post here: https://huggingface.co/blog/doc_aug_hf_alb

Post

1273

The Document AI team ( @Molbap , @rwightman , @danaaubakirova ) at Hugging Face is developing a new multimodal data augmentation pipeline utilising both visual and textual aspects of document images.

Check out my latest blog post for more details:
https://huggingface.co/blog/danaaubakirova/doc-augmentation

Please, share your thoughts and suggestions with us.
And stay tuned for the updates!

models 4

datasets 2

danaaubakirova/docmatix-subset

Viewer • Updated Jul 23 • 2.13k • 51

danaaubakirova/patfig

Preview • Updated Jul 10 • 579 • 4