Ankush Singal

Andyrasika

AI & ML interests

None yet

Recent Activity

updated a collection 24 days ago
Fine-Tuning
reacted to merve's post with ❤️ 27 days ago
What a beginning to this year in open ML 🤠 Let's unwrap! https://huggingface.co/collections/merve/jan-10-releases-677fe34177759de0edfc9714 Multimodal 🖼️ > ByteDance released SA2VA: a family of vision LMs that can take image, video, text and visual prompts > moondream2 is out with new capabilities like outputting structured data and gaze detection! > Dataset: Alibaba DAMO lab released multimodal textbook — 22k hours worth of samples from instruction videos 🤯 > Dataset: SciCap captioning on scientific documents benchmark dataset is released along with the challenge! LLMs 💬 > Microsoft released Phi-4, sota open-source 14B language model 🔥 > Dolphin is back with Dolphin 3.0 Llama 3.1 8B 🐬🐬 > Prime-RL released Eurus-2-7B-PRIME a new language model trained using PRIME alignment > SmallThinker-3B is a new small reasoning LM based on Owen2.5-3B-Instruct 💭 > Dataset: QWQ-LONGCOT-500K is the dataset used to train SmallThinker, generated using QwQ-32B-preview 📕 > Dataset: @cfahlgren1 released React Code Instructions: a dataset of code instruction-code pairs 📕 > Dataset: Qwen team is on the roll, they just released CodeElo, a dataset of code preferences 👩🏻‍💻 Embeddings 🔖 > @MoritzLaurer released zero-shot version of ModernBERT large 👏 > KaLM is a new family of performant multilingual embedding models with MIT license built using Qwen2-0.5B Image/Video Generation ⏯️ > NVIDIA released Cosmos, a new family of diffusion/autoregressive World Foundation Models generating worlds from images, videos and texts 🔥 > Adobe released TransPixar: a new text-to-video model that can generate assets with transparent backgrounds (a first!) > Dataset: fal released cosmos-openvid-1m Cosmos-tokenized OpenVid-1M with samples from OpenVid-1M Others > Prior Labs released TabPFNv2, the best tabular transformer is out for classification and regression > Metagene-1 is a new RNA language model that can be used for pathogen detection, zero-shot embedding and genome understanding
updated a collection 28 days ago
multimodal
View all activity

Organizations

Keras Dreambooth Event's profile picture Stable Diffusion concepts library's profile picture Musika's profile picture MLX Community's profile picture ONNX Community's profile picture Hugging Face Discord Community's profile picture

Andyrasika's activity

published an article 8 months ago
published an article 10 months ago
view article
Article

Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+

11
published an article 10 months ago
view article
Article

RAG Empowerment: Cohere C4AI Command-R and Transformers Unveiled

10
published an article 11 months ago
published an article 11 months ago
view article
Article

Revolutionizing Video Transcription: Unveiling Gemma-2b-it and Langchain in the Era of Transformers

3
published an article 12 months ago
view article
Article

Transformers and Quadrant: Revolutionizing Data Integration for NLP Tasks

1
published an article about 1 year ago
published an article about 1 year ago
view article
Article

Unleashing the Power of Unsloth and QLora:Redefining Language Model Fine-Tuning

12
published an article about 1 year ago
view article
Article

Unleashing the Power of Logprobs in Language Models: A Practical Guide

2
published an article about 1 year ago
view article
Article

Unveiling TinyLlama: An Inspiring Dive into a Revolutionary Small-Scale Language Model

2
published an article about 1 year ago
view article
Article

Multimodal IDEFICS: Unveiling the Transparency & Power of Open Visual Language Models

published an article about 1 year ago
view article
Article

Streamlining Data Management with Hugging Face and DVC: A Seamless Integration

published an article about 1 year ago
view article
Article

Leveraging Transformers and PyTorch for Multiple Choice Question Tasks

1
published an article about 1 year ago
view article
Article

Uniting Forces: Integrating Hugging Face with Langchain for Enhanced Natural Language Processing

4
published an article about 1 year ago
view article
Article

Intel Neural-Chat 7b: Fine-Tuning on Gaudi2 for Top LLM Performance

published an article over 1 year ago
published an article over 1 year ago
view article
Article

Hearing is Believing: Revolutionizing AI with Audio Classification via Computer Vision

1
published an article over 1 year ago
view article
Article

InfiniText: Empowering Conversations & Content with Mistral-7B-Instruct-v0.1

published an article over 1 year ago
view article
Article

Samantha and Mistral 7B: A Powerful and Versatile Language Model Duo

1