Concrete ML v1.9: TFHE-rs Compatibility and Faster LLM Fine-tuning

April 10, 2025
Andrei Stoian

Concrete ML v1.9 introduces support for the TFHE-rs ciphertext format, enabling seamless integration of Concrete ML models into Rust-based FHE pipelines using TFHE-rs. This release also brings performance improvements to the LoRA LLM fine-tuning protocol, along with new example notebooks demonstrating its use.

In parallel, Zama is launching Concrete ML Extensions, a new client SDK designed for building FHE-enabled browser and mobile applications. This SDK highlights the potential of fully homomorphic encryption to let mobile users securely process their sensitive data—without ever exposing it in the clear.

In this blog post, we’ll dive into the key features of this release and explore what’s possible with the latest advancements in Concrete ML.

TFHE-rs ciphertext format support

Concrete ML now supports the TFHE-rs radix ciphertext format, making encrypted ML workflows compatible with the Rust ecosystem. TFHE-rs uses a universal parameter set with backward compatibility, meaning that ciphertexts encrypted with these parameters today remain compatible in the future.
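A radix ciphertext encodes a large integer as several small encrypted blocks, each holding a few bits of the value. The decomposition itself can be sketched in cleartext Python; the 2-bit block size below is an illustrative assumption, not the only configuration TFHE-rs supports:

```python
def radix_decompose(value: int, num_blocks: int, bits_per_block: int = 2) -> list[int]:
    """Split an integer into little-endian digits, the way a TFHE-rs radix
    ciphertext splits a large plaintext into small encrypted blocks."""
    base = 1 << bits_per_block
    digits = []
    for _ in range(num_blocks):
        digits.append(value % base)
        value //= base
    return digits

def radix_recompose(digits: list[int], bits_per_block: int = 2) -> int:
    """Inverse operation: rebuild the integer from its digit blocks."""
    return sum(d << (i * bits_per_block) for i, d in enumerate(digits))

# A 16-bit value represented as 8 blocks of 2 bits each
digits = radix_decompose(0xBEEF, num_blocks=8)
assert radix_recompose(digits) == 0xBEEF
```

Homomorphic operations in TFHE-rs act on these blocks, with carry propagation handled internally, which is why a fixed, universal parameter set can cover many integer widths.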

With this new compatibility in Concrete ML v1.9, you can now use these ciphertexts as inputs and outputs of ML models. The following snippet shows how to compile models to use TFHE-rs ciphertexts.

# Compile so encrypted inputs/outputs use the TFHE-rs radix ciphertext format
model.compile(x, ciphertext_format=CiphertextFormat.TFHE_RS)
# Run inference under FHE on the test data
y_pred_tfhers = model.predict(fhe_test_data, fhe="execute")

You can also use TFHE-rs ciphertexts with the client/server API. A new use-case demonstrates TFHE-rs post-processing on the logits output by a decision tree classifier. Note that using the TFHE-rs ciphertext format requires a conversion layer in the ML model, which may introduce a 4–5x latency overhead.
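In that use-case, the post-processing step runs in TFHE-rs over the encrypted logits. As a cleartext reference, and assuming the post-processing is a class selection by argmax (the actual notebook may do more), the operation looks like this:

```python
def argmax_postprocess(logits: list[float]) -> int:
    """Cleartext reference for selecting the predicted class from logits.
    In the use-case, the equivalent computation runs over encrypted values
    in TFHE-rs; argmax is assumed here for illustration."""
    best_index, best_value = 0, logits[0]
    for i, v in enumerate(logits[1:], start=1):
        if v > best_value:
            best_index, best_value = i, v
    return best_index

# Class 1 has the highest logit
assert argmax_postprocess([0.1, 2.5, -0.3]) == 1
```

Keeping the logits encrypted end to end means the server never learns the prediction—only the client, after decryption, does.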

New use-case: Fine-tuned LLAMA for math problem solving

Concrete ML v1.9 brings a new example of a fully functional encrypted fine-tuning pipeline on GPU. In the example notebook, the LLAMA 1B model is fine-tuned with LoRA on a math problem dataset, entirely under FHE and accelerated on GPU.
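LoRA keeps the base weight matrix W frozen and trains only a low-rank update B·A, which is what makes fine-tuning tractable: only a small fraction of the parameters change. A minimal cleartext sketch of a LoRA linear layer, with illustrative shapes and NumPy standing in for the real training framework:

```python
import numpy as np

class LoRALinear:
    """Minimal LoRA layer sketch: frozen weight W plus a trainable
    low-rank update scaled by alpha / r (shapes follow the LoRA paper)."""
    def __init__(self, d_out: int, d_in: int, r: int = 8, alpha: int = 16, seed: int = 0):
        rng = np.random.default_rng(seed)
        self.W = rng.standard_normal((d_out, d_in))   # frozen base weight
        self.A = rng.standard_normal((r, d_in)) * 0.01  # trainable down-projection
        self.B = np.zeros((d_out, r))                 # trainable, zero-init so the update starts at 0
        self.scale = alpha / r

    def forward(self, x: np.ndarray) -> np.ndarray:
        return self.W @ x + self.scale * (self.B @ (self.A @ x))

layer = LoRALinear(d_out=4, d_in=3)
x = np.ones(3)
# With B zero-initialized, the LoRA output equals the frozen base output
assert np.allclose(layer.forward(x), layer.W @ x)
```

The trainable parameter count is r·(d_in + d_out) per adapted layer instead of d_in·d_out, which keeps the encrypted training workload small.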

We compare model quality using perplexity scores between encrypted and cleartext training runs. With this release's performance optimizations, the FHE fine-tuning pipeline now reaches up to 64 tokens/second on a desktop GPU.
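Perplexity is the exponential of the average negative log-likelihood per token, so closely matching scores between the encrypted and cleartext runs indicate the FHE pipeline preserves model quality. A minimal implementation:

```python
import math

def perplexity(token_log_probs: list[float]) -> float:
    """Perplexity = exp of the mean negative log-likelihood per token.
    Lower is better; a uniform model over V tokens scores exactly V."""
    nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(nll)

# A model assigning probability 0.25 to every token has perplexity 4
assert abs(perplexity([math.log(0.25)] * 10) - 4.0) < 1e-9
```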

Here’s how fine-tuning improves the model's reasoning.

Before fine-tuning, the original LLAMA model produces the following output for a simple math problem:

Prompt: When you multiply a number by 7, it becomes 98. What is that number?
Response: If you multiply a number by 7, it becomes 98. So, the number you're asking about is 98.

After fine-tuning, the model solves the problem correctly:

Prompt: When you multiply a number by 7, it becomes 98. What is that number?
Response: To find the number, you need to divide 98 by 7. 98 ÷ 7 = 14

Training on the full dataset takes 28 hours across 50 desktop GPUs.
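As a back-of-envelope check of these figures—assuming the 64 tokens/second throughput is per GPU and sustained over the whole run, an assumption since the post does not break this down:

```python
# Rough estimate of total tokens processed during the full training run,
# from the figures quoted above (64 tok/s assumed per GPU and sustained)
tokens_per_second = 64
num_gpus = 50
hours = 28

total_tokens = tokens_per_second * num_gpus * hours * 3600
print(f"~{total_tokens / 1e6:.0f}M tokens processed")  # ~323M tokens processed
```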

Mobile client SDK: Concrete ML Extensions

Today, mobile phones store sensitive information for billions of people, making privacy and security more important than ever. At the same time, the rise of AI has shown how personal data can unlock powerful services—from healthcare insights and genetic analysis to personalized recommendations and targeted ads. It’s a double-edged sword: the same data that can benefit us also puts our privacy at risk. By integrating FHE into mobile apps, we can enable personalized features while keeping user data completely private and secure.

Concrete ML v1.9 introduces Concrete ML Extensions, a new SDK designed for building FHE-enabled client-side apps. Developers can compile this SDK to Swift, enabling iOS applications to perform encryption, decryption, and key generation natively.

A step-by-step tutorial is available to guide you through compiling the Swift library and integrating it into your iOS apps. In the coming weeks, we’ll also be releasing a series of demo iOS applications—stay tuned!
