April 5, 2023
Segmentation β identifying which image pixels belong to an object β is a core task in computer vision and is used in a broad array of applications, from analyzing scientific imagery to e...
Similar Articles (10 found)
π 81.1% similar
Please check out our new release on Segment Anything Model 2 (SAM 2).
- SAM 2 code: https://github.com/facebookresearch/segment-anything-2
- SAM 2 dem...
π 54.6% similar
Introduction
Artificial Intelligence (AI) dominates todayβs headlinesβhailed as a breakthrough one day, warned against as a threat the next. Yet much ...
π 53.4% similar
Table of Contents
Synthetic Data Generation Using the BLIP and PaliGemma Models
In this tutorial, we embark on the first part of a two-part series whe...
π 52.3% similar
Every year, we have a new iPhone that claims to be faster and better in every way. And yes, these new computer vision models and new image sensors can...
π 52.2% similar
2 Years of ML vs. 1 Month of Prompting
November 7, 2025
Recalls at major automakers cost hundreds of millions of dollars a year. Itβs a huge issue. To...
π 51.5% similar
I think image-encoder from CLIP (even smallest variant ViT B/32) is good enough to capture a lot of semantic information to allow natural language que...
π 50.5% similar
A few months after launching Qwen3-VL, Alibaba has released a detailed technical report on the open multimodal model. The data shows the system excels...
π 50.1% similar
Contents
Micro-Models: Origin Story
What are micro-models exactly?
Added Benefits to using Micro-models in computer vision projects
Data-oriented Prog...
π 49.9% similar
How We Cut Inference Costs from $46K to $7.5K Fine-Tuning Qwen-Image-Edit
Running quality inference at scale is something we think about a lot at Oxen...
π 49.7% similar
Table of Contents
- SmolVLM to SmolVLM2: Compact Models for Multi-Image VQA
- SmolVLM 1: A Compact Yet Capable Vision-Language Model
- What Is SmolVLM...