Today, we’re pleased to announce that semantic_text now defaults to the jina-embeddings-v5-text family on Elastic Inference Service (EIS), bringing built-in multilingual inference with no additional configuration required.
EIS provides managed, GPU-accelerated inference tightly integrated with Elasticsearch. With EIS, you don’t need to host, scale, or maintain infrastructure for embedding models.
Semantic search retrieves results based on meaning. Text is converted into vector embeddings so queries can match related concepts, even when the exact words differ.
The semantic_text field type simplifies this entire workflow, with automatic chunking, embedding generation at index time, and seamless querying via the semantic query, without building custom pipelines or managing separate model inference.
The jina-embeddings-v5-text model family just launched on EIS, giving developers powerful multilingual embeddings accessible as part of the core semantic_text workflow. So now your semantic search works across languages out of the box, and global datasets, such as support articles, product descriptions, user reviews, and multilingual websites, work without extra configuration.
This default opens up broad, globe-spanning semantic retrieval with no operational overhead.
jina-embeddings-v5-text
The jina-embeddings-v5-text models represent the latest generation of compact, high-performance multilingual embedding models on EIS.
- State-of-the-art multilingual performance: Top scores on MMTEB benchmarks across hundreds of languages.
jina-embeddings-v5-text-nanoleads models under 500M parameters, andjina-embeddings-v5-text-smalloutperforms significantly larger alternatives. - Multiple task capabilities: Spanning across retrieval, semantic matching, clustering, and classification.
- Flexible choices to fit your use case: Two model sizes (
small,nano) let you balance speed, cost, and quality. - Long-context support: Embed long texts efficiently, ideal for document collections with extended context.
Get started
1. Create index
Define a semantic_text field with no additional configuration. Embeddings will be generated automatically at index time using the default model. For production workloads, explicitly specify the model to ensure consistent behavior and results.
PUT /multilingual-reviews
{
"mappings": {
"properties": {
"product": { "type": "keyword" },
"review": { "type": "semantic_text" },
"language": { "type": "keyword" }
}
}
}2. Index multilingual documents
Add product reviews in six different languages. Each document’s review field is automatically embedded at ingest time, with no separate pipeline or preprocessing needed.
POST /multilingual-reviews/_bulk?refresh=wait_for
{ "index": { "_id": "1" } }
{ "product": "wireless-headphones", "review": "Amazing noise cancellation and the battery lasts all day. Perfect for long flights.", "language": "en" }
{ "index": { "_id": "2" } }
{ "product": "wireless-headphones", "review": "La cancelación de ruido es impresionante. Muy cómodos incluso después de horas de uso.", "language": "es" }
{ "index": { "_id": "3" } }
{ "product": "wireless-headphones", "review": "ノイズキャンセリングが素晴らしく、長時間つけていても耳が痛くなりません。", "language": "ja" }
{ "index": { "_id": "4" } }
{ "product": "wireless-headphones", "review": "Réduction de bruit excellente et très confortable pour les longs trajets en avion.", "language": "fr" }
{ "index": { "_id": "5" } }
{ "product": "wireless-headphones", "review": "Hervorragende Geräuschunterdrückung. Ideal für Pendler und Vielflieger.", "language": "de" }
{ "index": { "_id": "6" } }
{ "product": "wireless-headphones", "review": "O cancelamento de ruído é excelente e a bateria dura o dia todo.", "language": "pt" }3. Search across languages with a query in English
GET /multilingual-reviews/_search
{
"query": {
"match": {
"review": "comfortable for long flights"
}
}
}The results show all six reviews ranked by semantic relevance to the English query:
{
"took": 83,
"timed_out": false,
"_shards": {
"total": 6,
"successful": 6,
"skipped": 0,
"failed": 0
},
"hits": {
"total": {
"value": 6,
"relation": "eq"
},
"max_score": 0.8275735,
"hits": [
{
"_index": "multilingual-reviews",
"_id": "4",
"_score": 0.8275735,
"_source": {
"product": "wireless-headphones",
"review": "Réduction de bruit excellente et très confortable pour les longs trajets en avion.",
"language": "fr"
}
},
{
"_index": "multilingual-reviews",
"_id": "1",
"_score": 0.7616198,
"_source": {
"product": "wireless-headphones",
"review": "Amazing noise cancellation and the battery lasts all day. Perfect for long flights.",
"language": "en"
}
},
{
"_index": "multilingual-reviews",
"_id": "5",
"_score": 0.72122526,
"_source": {
"product": "wireless-headphones",
"review": "Hervorragende Geräuschunterdrückung. Ideal für Pendler und Vielflieger.",
"language": "de"
}
},
{
"_index": "multilingual-reviews",
"_id": "2",
"_score": 0.6867013,
"_source": {
"product": "wireless-headphones",
"review": "La cancelación de ruido es impresionante. Muy cómodos incluso después de horas de uso.",
"language": "es"
}
},
{
"_index": "multilingual-reviews",
"_id": "3",
"_score": 0.66513836,
"_source": {
"product": "wireless-headphones",
"review": "ノイズキャンセリングが素晴らしく、長時間つけていても耳が痛くなりません。",
"language": "ja"
}
},
{
"_index": "multilingual-reviews",
"_id": "6",
"_score": 0.61658823,
"_source": {
"product": "wireless-headphones",
"review": "O cancelamento de ruído é excelente e a bateria dura o dia todo.",
"language": "pt"
}
}
]
}
}Notice that the French review ranks first, even above the English one. That's because "très confortable pour les longs trajets en avion" ("very comfortable for long trips by plane") is a closer semantic match to the query than the English review, which splits its focus across noise cancellation, battery life, and flights. This demonstrates the jina-embeddings-v5-text-small ability to rank by meaning, not language.
4. Search across languages with a Japanese query
GET /multilingual-reviews/_search
{
"query": {
"match": {
"review": "長時間のフライトに最適"
}
}
}The results show all six reviews ranked by semantic relevance to the Japanese query (“Ideal for long-haul flights”):
{
"took": 89,
"timed_out": false,
"_shards": {
"total": 6,
"successful": 6,
"skipped": 0,
"failed": 0
},
"hits": {
"total": {
"value": 6,
"relation": "eq"
},
"max_score": 0.7556782,
"hits": [
{
"_index": "multilingual-reviews",
"_id": "4",
"_score": 0.7556782,
"_source": {
"product": "wireless-headphones",
"review": "Réduction de bruit excellente et très confortable pour les longs trajets en avion.",
"language": "fr"
}
},
{
"_index": "multilingual-reviews",
"_id": "1",
"_score": 0.7395687,
"_source": {
"product": "wireless-headphones",
"review": "Amazing noise cancellation and the battery lasts all day. Perfect for long flights.",
"language": "en"
}
},
{
"_index": "multilingual-reviews",
"_id": "5",
"_score": 0.68835545,
"_source": {
"product": "wireless-headphones",
"review": "Hervorragende Geräuschunterdrückung. Ideal für Pendler und Vielflieger.",
"language": "de"
}
},
{
"_index": "multilingual-reviews",
"_id": "3",
"_score": 0.6487931,
"_source": {
"product": "wireless-headphones",
"review": "ノイズキャンセリングが素晴らしく、長時間つけていても耳が痛くなりません。",
"language": "ja"
}
},
{
"_index": "multilingual-reviews",
"_id": "6",
"_score": 0.6241487,
"_source": {
"product": "wireless-headphones",
"review": "O cancelamento de ruído é excelente e a bateria dura o dia todo.",
"language": "pt"
}
},
{
"_index": "multilingual-reviews",
"_id": "2",
"_score": 0.6183049,
"_source": {
"product": "wireless-headphones",
"review": "La cancelación de ruido es impresionante. Muy cómodos incluso después de horas de uso.",
"language": "es"
}
}
]
}
}The ranking is nearly identical to the English query: French and English still lead because they're the most semantically relevant to "perfect for long flights," regardless of query language. The Japanese review didn't get artificially boosted just because the query was in Japanese. It ranks fourth because it discusses wearing comfort, not flights. Semantic relevance takes priority over language matching.
Note: For English-only use cases
If you prefer a sparse representation or would like to continue to use Elastic Learned Sparse EncodeR (ELSER) for English workloads, ELSER remains available and fully supported as an option for semantic_text.
You can explicitly choose ELSER by specifying inference_id: ".elser-2-elastic in your mappings when creating an index.
Conclusion: Semantic search without borders
With semantic_text now defaulting to the jina-embeddings-v5-text family on Elastic Inference Service, multilingual semantic search becomes the standard developer experience in Elasticsearch. This means developers can build search, retrieval augmented generation (RAG), and AI applications that work across global datasets without stitching pipelines together.
Create a semantic_text field, index your data, and start searching. All Elastic Cloud trials have access to Elastic Inference Service. Try it now on Elastic Cloud Serverless or Elastic Cloud Hosted, or use EIS via Cloud Connect with your self-managed cluster.
Ready to try this out on your own? Start a free trial.
Want to get Elastic certified? Find out when the next Elasticsearch Engineer training is running!
Related content

May 11, 2026
One index, all media: Introducing jina-embeddings-v5-omni
jina-embeddings-v5-omni lets you embed text, images, video, and audio into a single Elasticsearch index and query across all of them at once.

April 22, 2026
Jina embeddings v3 now available on Gemini Enterprise Agent Platform Model Garden
Jina search foundation model, jina-embeddings-v3, is now self-deployable on Gemini Enterprise Agent Platform Model Garden, with more to follow. Run jina-embeddings-v3 on a single L4 GPU inside your own VPC.

April 10, 2026
Unsupervised document clustering with Elasticsearch + Jina embeddings
A practical, reproducible approach to unsupervised document clustering with Elasticsearch and Jina embeddings.

February 24, 2026
Build a "chat with your website data" agent with Jina Embeddings v5 and Elasticsearch
Learn how to build a working, grounded "chat with your website" experience in under an hour by using Elasticsearch Serverless, Jina Embeddings v5, Elastic Open Web Crawler, and Elastic Agent Builder.

February 23, 2026
jina-embeddings-v5-text: Compact state-of-the-art text embeddings for search and intelligent applications
Introducing jina-embeddings-v5-text models, including jina-embeddings-v5-text-small and jina-embeddings-v5-text-nano, and explaining how to use these multilingual embedding models via Elastic Inference Service (EIS).