Astra platform

Vector Search for Real-world GenAI 

An easy-to-use API with all of the vector and structured data for production retrieval-augmented generation (RAG) applications.

Build Accurate AI with Real-time Data and Streaming

Astra DB vector is for mixed workloads with vector, non-vector, and streaming data.

Only Astra DB vector gives you simultaneous query/update with ultra-low latency.

Build Accurate AI with Real-time Data and Streaming

Deploy Production-level Security and Compliance

Protect users and data with the depth enterprise security features and compliance certifications for production AI.

Standalone vector databases were designed for simple applications and workloads—not for enterprise-scale requirements.

Deploy Production-level Security and Compliance

Developer-friendly APIs, Pricing and Community Support

Code today and launch tomorrow with vector-native APIs and ecosystem integrations.

Astra DB gives developers the industry’s most generous developer plans, APIs, community support, and choice of deployment options.

Developer-friendly APIs, Pricing and Community Support

Market Leaders Shaping Their Industries with Vector Search from DataStax

Developers

RAG Made Easier

An intuitive API and powerful integrations for production-level RAG workloads.

Install

Install the API clients

pythontypescriptjava
pip install astrapy

Connect

Create or connect to your database & collections

pythontypescriptjava
# The DataAPIClient lets you connect to your database
client = DataAPIClient("AstraCS:...")
db = client.get_database(
    "https://<id>-<region>.apps.astra.datastax.com"
)

# create_collection() will return the newly created collection
collection = db.create_collection(
    name="collection_test",
    dimension=5,
)

# Or you can connect to an existing connection directly
collection = astra_db.get_collection("collection_test")

Insert

Insert into your vector store (collection)

pythontypescriptjava
collection.insert_one(
    {
        "name": "Coded Cleats",
        "description": "Chat bot integrated sneakers that talk to you",
        "$vector": [0.1, 0.15, 0.3, 0.12, 0.05],
    }
)

Find

Find documents using vector search

pythontypescriptjava
documents = list(collection.find(
    sort={"$vector": [0.15, 0.1, 0.1, 0.35, 0.55]},
    limit=100,
))
Try for Free
Try for Free

$300/year in free credit and no credit card required.

Explore examples
Explore examples

Tutorials and sample generative AI apps with best practices.

DOCS
DOCS

Get started in minutes with generative AI and RAG.

Vector Crash Course with Ania Kubow

Build a chatbot with LangChain, Open AI and Astra DB chatbot.

Watch Now

Key Vector Search Use Cases

Generate real-time AI applications with vector search, empowered by advanced language models (LLMs) and Chat AI Agents.

Integrations

Enhance your AI/ML applications and ecosystem with contextual data insights and automations.

Join the Community

Plug into the real-time conversation on the community’s Planet Cassandra Discord channel.

One-stop Data API for Production GenAI

Astra DB gives JavaScript developers a complete data API and out-of-the-box integrations that make it easier to build production RAG apps with high relevancy and low latency.

FAQ

What is a vector database?

Vector databases like DataStax Astra DB (built on Apache Cassandra) are designed to provide optimized storage and data access capabilities specifically for vector embeddings, which is the mathematical representation of data. Vector databases provide multi-dimensional representation of structured and unstructured data and enable functions like vector search on large corpora of data.

What is vector search and how does it relate to a vector database?

Vector search associates similar mathematical representations of data, and vector representations, converting queries into the same vector representation. With both query and data represented as vectors, finding related data becomes a function of searching for any data representations that are the closest to your query representation, known as nearest neighbors. Vector databases provide the storage and retrieval of data representations for vector search called vector embeddings. Since data is represented across multiple dimensions, vector databases need to be highly scalable and highly performant.

How does vector search work?

The concept of nearest neighbor is at the core of how vector search works and there are a number of different algorithms that can be used for finding nearest neighbors depending on how much compute resources you want to allocate and/or how accurate you are looking for your result to be.

Is vector search compatible with cloud-based and on-premises data environments?

Vector search doesn’t have a concept of where the data is stored so can be used for cloud-based or on-premise data environments. Solutions like Astra DB are built to provide a cloud-native data platform ideally suited for building generative AI applications powered by vector search, however, on-premise solutions like DataStax Enterprise (DSE) are also being used for vector search capabilities.

Cloud-based solutions tend to be more commonly deployed as they provide the scalability for additional storage and compute resources on demand depending on the application's requirements.



Which industries can benefit from vector search?

Vector search is not limited by a specific industry and can be leveraged by use cases across all industries. Building recommendation engines using vector search offers improved customer engagement and visibility. Vector search can also be used to build natural language processing chatbots that interact with product documentation in real time to provide the right answer at the right time.


Vector search is the latest approach to data organization and access and allows applications the ability to leverage generative AI across all industries.

Is vector search suitable for large-scale data sets?

Vector search can be used on small, medium, or large data sets interchangeably. However, the important thing to remember is that with small datasets a lot of the compute and storage overhead can be maintained in the application space. For medium to large datasets applications, you should leverage a high-performance vector database like Astra DB, allowing for the decoupling of data storage from the application. This allows for applications to reuse and leverage vector data across multiple application instances and frees up resources in the application.

What sets DataStax's vector search apart from other similar solutions on the market?

One of the primary differences between DataStax vector search and other offerings in the market is that DataStax Astra DB is built on Apache Cassandra, which for over 15 years has been used to provide a highly scalable, highly performant approach to unstructured data storage and retrieval via NoSQL functionality. Most of the solutions in the market today are single-solution approaches to providing vector databases for vector storage. DataStax provides a proven/hardened solution to handling the massive scalable and performance demands generative AI applications need.


In addition, while many solutions are available for vector search, DataStax Astra provides a completely integrated platform for building generative AI applications. More than just a vector database, more than just vector search, DataStax Astra provides the ability to leverage orchestration frameworks like LlamaIndex and LangChain to simplify the generative AI application development and enable end-to-end vector lifecycle management.