NVIDIA Cosmos: The Makings of a World Foundation Model
World foundation models are neural networks that simulate real-world environments and predict accurate outcomes based on text, image, or video input.
Our brains process multiple inputs simultaneously. Mixpeek brings this power to AI, enabling multimodal video understanding: search across transcripts, visuals, and more for truly intelligent content analysis.
At Mixpeek, we're on a mission to make multimodal search (images, videos, audio and text) accessible and powerful.
Find, analyze, and leverage visual information within your video library using advanced AI and natural language processing, revolutionizing how you interact with and extract value from your multimedia assets.
Building a Comprehensive Image Indexing, Retrieval, and Generation Pipeline Using Mixpeek and Replicate's FLUX
Streamline your content management with Mixpeek’s Multimodal Classification. Automatically categorize videos, images, audio files, and text into predefined categories, making data retrieval faster and more efficient. Ideal for businesses handling diverse content types.
Automatic, AI-generated captioning for video.
Build a scalable, distributed video processing pipeline using Celery and Render with FastAPI.
In the ever-evolving landscape of digital content, the ability to process vast amounts of unstructured data has become a game-changer.
Build a multimodal data processing pipeline using Apache Kafka, Apache Airflow, and Amazon SageMaker. This pipeline will handle various file types (image, video, audio, text, and documents) in parallel, process them through custom ML tasks, and store the results in a database.
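The fan-out step of such a pipeline is routing each file to a per-modality task and running those tasks in parallel. A toy in-process sketch of that routing (the handler names and outputs are invented stand-ins; in the full pipeline, Kafka topics and Airflow DAGs play this dispatch role and SageMaker runs the ML tasks):

```python
from concurrent.futures import ThreadPoolExecutor

# Illustrative stand-ins for the per-modality ML tasks the pipeline fans out to.
HANDLERS = {
    ".jpg": lambda path: f"image-embedding:{path}",
    ".mp4": lambda path: f"video-embedding:{path}",
    ".txt": lambda path: f"text-embedding:{path}",
}

def process_batch(paths):
    """Route each file to its modality handler and run the handlers in parallel."""
    def run(path):
        ext = path[path.rfind("."):]
        return HANDLERS[ext](path)
    with ThreadPoolExecutor() as pool:
        return list(pool.map(run, paths))
```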
How to deploy and run OpenAI's CLIP model on Amazon SageMaker for efficient real-time and offline inference.
Reverse video search allows us to use a video clip as an input for a query against videos that have been indexed in a vector store.
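Conceptually, reverse video search reduces to a nearest-neighbor lookup: embed the query clip, then rank indexed clips by similarity. A minimal NumPy sketch, assuming embeddings already exist (a production system would use a vector store rather than a Python dict):

```python
import numpy as np

def cosine_sim(a, b):
    # Cosine similarity between two embedding vectors.
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

def reverse_search(query_emb, index, top_k=3):
    """index: dict of video_id -> embedding; returns the best-matching ids."""
    ranked = sorted(index, key=lambda vid: cosine_sim(query_emb, index[vid]), reverse=True)
    return ranked[:top_k]
```

A vector store performs the same ranking with approximate nearest-neighbor indexes so it scales past what a linear scan can handle.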
Using semantic video understanding models to intelligently locate key scenes across petabytes of videos.
State-of-the-art video understanding model that converts videos into embeddings.
Unlock the power of your unstructured data with Mixpeek, automating ETL from S3 to MongoDB and enabling advanced question answering, content analysis, and semantic search capabilities through LangChain's cutting-edge AI models.
The standard design pattern when you want to serve non-JSON data to your client is to first store it, then hand the client a URL it can fetch.
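The store-first pattern can be sketched in a few lines; the in-memory dict stands in for object storage (e.g., an S3 bucket) and the URL scheme is a made-up example:

```python
import uuid

STORE = {}  # stands in for object storage such as an S3 bucket

def save_blob(data: bytes) -> str:
    """Store raw bytes under a fresh key and return the URL the JSON API hands back."""
    key = str(uuid.uuid4())
    STORE[key] = data
    return f"https://cdn.example.com/{key}"
```

The JSON response stays small and cacheable, and the heavy bytes travel over a path (CDN or presigned URL) built for them.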
Semantic video understanding bridges the gap of labeling, enabling a complete analysis of video content.
Visual shopping allows shoppers to search by image, text, or a combination of both. This discovery experience uses AI to increase a store's purchase rate and order size.
Semantic video search is a technology that utilizes machine learning and natural language processing to accurately analyze, retrieve, and understand the context of video content.
In this tutorial, we walked through the process of building a Python script that is able to search the contents of PDF files in an Amazon S3 bucket using Apache Tika and OpenSearch.
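The flow is: extract text from each PDF (Apache Tika's job in the tutorial), then index it for full-text querying (OpenSearch's job). A toy in-memory stand-in that shows just that flow; the function names and substring matching are simplifications, not the tutorial's actual Tika or OpenSearch calls:

```python
def index_pdf(index: dict, key: str, text: str) -> None:
    """Record one extracted PDF (stands in for an OpenSearch index request)."""
    index[key] = text.lower()

def search(index: dict, query: str) -> list:
    """Return keys whose extracted text contains the query,
    crudely mimicking an OpenSearch match query."""
    q = query.lower()
    return [key for key, text in index.items() if q in text]
```

In the real pipeline, OpenSearch replaces the dict with an inverted index, which is what makes the query side fast at scale.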
Deep dive into multimodal AI, data processing, and best practices from our engineering team.