Learning with errors
Learning with errors (LWE) is a problem in machine learning. A generalization of the parity learning problem, it has recently been used to create public-key cryptosystems based on the worst-case hardness of some lattice problems. The problem was introduced by Oded Regev in 2005.

Given access to samples $(x, y)$ where $x \in \mathbb{Z}_q^n$ and $y \in \mathbb{Z}_q$, with the assurance that, for some fixed linear function $f : \mathbb{Z}_q^n \to \mathbb{Z}_q$, $y = f(x)$ with high probability and deviates from it according to some known noise model otherwise, the algorithm must be able to recreate $f$ or some close approximation of it.

Definition

Denote by $\mathbb{T} = \mathbb{R}/\mathbb{Z}$ the additive group on reals modulo one. Denote by $A_{\mathbf{s},\phi}$ the distribution on $\mathbb{Z}_q^n \times \mathbb{T}$ obtained by choosing a vector $\mathbf{a} \in \mathbb{Z}_q^n$ uniformly at random, choosing $e$ according to a probability distribution $\phi$ on $\mathbb{T}$, and outputting $(\mathbf{a}, \langle \mathbf{a}, \mathbf{s} \rangle / q + e)$ for some fixed vector $\mathbf{s} \in \mathbb{Z}_q^n$, where the division is done in the field of reals and the addition in $\mathbb{T}$.

The learning with errors problem $\mathrm{LWE}_{q,\phi}$ is to find $\mathbf{s} \in \mathbb{Z}_q^n$, given access to polynomially many samples of choice from $A_{\mathbf{s},\phi}$.

For every $\alpha > 0$, denote by $D_\alpha$ the one-dimensional Gaussian with density function $D_\alpha(x) = \rho_\alpha(x)/\alpha$, where $\rho_\alpha(x) = e^{-\pi (x/\alpha)^2}$, and let $\Psi_\alpha$ be the distribution on $\mathbb{T}$ obtained by considering $D_\alpha$ modulo one. The version of LWE considered in most of the results is $\mathrm{LWE}_{q,\Psi_\alpha}$.
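As a concrete illustration, the following sketch generates samples from a discretized variant of the distribution above: the error is drawn as a Gaussian of width $\alpha q$ rounded to an integer, and everything is reduced modulo $q$ (the function name and parameters are illustrative, not part of the original presentation):

```python
import random

def lwe_sample(s, q, alpha):
    """One sample (a, b) with b = <a, s> + e (mod q).

    A common discretization of the distribution A_{s,chi}: the error e
    is a Gaussian of width alpha*q rounded to the nearest integer.
    """
    n = len(s)
    a = [random.randrange(q) for _ in range(n)]
    e = round(random.gauss(0, alpha * q))
    b = (sum(ai * si for ai, si in zip(a, s)) + e) % q
    return a, b
```

With $\alpha = 0$ the samples are noiseless linear equations and $\mathbf{s}$ can be recovered by Gaussian elimination; the noise is what makes the problem hard.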

Decision version

The LWE problem described above is the search version of the problem. In the decision version, the goal is to distinguish between noisy inner products and uniformly random samples from $\mathbb{Z}_q^n \times \mathbb{T}$ (practically, some discretized version of it). Regev showed that the decision and search versions are equivalent when $q$ is a prime bounded by some polynomial in $n$.

Solving decision assuming search

Intuitively, if we have a procedure for the search problem, the decision version can be solved easily: feed the input samples for the decision version to the procedure for the search problem, and check whether the $\mathbf{s}$ returned by the search procedure can generate the input pairs modulo some noise.
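A minimal sketch of this direction, in a discretized setting and with a hypothetical `search_oracle` standing in for the search procedure:

```python
def looks_like_lwe(samples, q, noise_bound, search_oracle):
    """Decide 'LWE or uniform' given a solver for the search problem.

    Run the (hypothetical) search oracle on the samples, then accept
    iff its candidate s explains every pair (a, b) up to the noise
    bound, measuring each residue b - <a, s> mod q as a centred value.
    """
    s = search_oracle(samples)
    for a, b in samples:
        r = (b - sum(ai * si for ai, si in zip(a, s))) % q
        if min(r, q - r) > noise_bound:
            return False  # some pair is not explained: looks uniform
    return True
```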

Solving search assuming decision

For the other direction, suppose that we have a procedure for the decision problem; the search version can then be solved as follows: recover $\mathbf{s}$ one co-ordinate at a time. Suppose we are guessing the first co-ordinate and trying to test whether $s_1 = k$ for a fixed $k \in \mathbb{Z}_q$. Choose a number $r \in \mathbb{Z}_q$ uniformly at random. Denote the given samples by $\{(\mathbf{a}_i, b_i)\} \subset \mathbb{Z}_q^n \times \mathbb{T}$. Feed the transformed pairs $(\mathbf{a}_i + (r, 0, \ldots, 0), b_i + rk/q)$ to the decision problem. If the guess was correct, the transformation takes the distribution $A_{\mathbf{s},\chi}$ to itself; otherwise, since $q$ is prime, it takes it to the uniform distribution. Since we have a procedure for the decision version that distinguishes between these two types of distributions and errs with very small probability, we can test whether the guess $k$ equals the first co-ordinate. Since $q$ is a prime bounded by some polynomial in $n$, $k$ can take only polynomially many values, and each co-ordinate can be efficiently guessed with high probability.
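The coordinate-by-coordinate test can be sketched as follows, again in a discretized setting with a hypothetical `decision_oracle`; here a fresh $r$ is drawn for every sample, which is how the transformation is applied in practice:

```python
import random

def recover_first_coordinate(samples, q, decision_oracle):
    """Recover s_1 using a (hypothetical) decision oracle.

    For each guess k in Z_q, map every sample (a, b) to
    (a + r*e_1, b + r*k) with a fresh uniform r.  If k == s_1 this
    preserves the LWE distribution; otherwise, q being prime, the
    extra term r*(k - s_1) makes b uniform, and the oracle rejects.
    """
    for k in range(q):
        transformed = []
        for a, b in samples:
            r = random.randrange(q)
            transformed.append(([(a[0] + r) % q] + a[1:], (b + r * k) % q))
        if decision_oracle(transformed):
            return k
    return None
```

Repeating the same test on each coordinate (shifting a different entry of $\mathbf{a}$ by $r$) recovers all of $\mathbf{s}$.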

Regev's result

For an $n$-dimensional lattice $L$, let the smoothing parameter $\eta_\epsilon(L)$ denote the smallest $s$ such that $\rho_{1/s}(L^* \setminus \{\mathbf{0}\}) \leq \epsilon$, where $L^*$ is the dual of $L$ and $\rho_\alpha(x) = e^{-\pi (x/\alpha)^2}$ is extended to sets by summing over function values at each element in the set. Let $D_{L,r}$ denote the discrete Gaussian distribution on $L$ of width $r$ for a lattice $L$ and real $r > 0$. The probability of each $x \in L$ is proportional to $\rho_r(x)$.

The discrete Gaussian sampling problem (DGS) is defined as follows: an instance of $\mathrm{DGS}_\phi$ is given by an $n$-dimensional lattice $L$ and a number $r \geq \phi(L)$. The goal is to output a sample from $D_{L,r}$. Regev shows that there is a reduction from $\mathrm{GapSVP}_{100\sqrt{n}\,\gamma(n)}$ to $\mathrm{DGS}_{\sqrt{n}\,\gamma(n)/\lambda(L^*)}$ for any function $\gamma(n)$.
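For the simplest lattice $L = \mathbb{Z}$, a sample from $D_{\mathbb{Z},r}$ can be drawn directly by truncating the support (an illustrative sketch only; the reductions work with arbitrary lattices):

```python
import math
import random

def sample_discrete_gaussian_Z(r, tail=10):
    """Sample x in Z with probability proportional to
    rho_r(x) = exp(-pi * (x/r)^2), i.e. a sample from D_{Z,r}.

    The support is truncated to |x| <= tail*r; the probability mass
    beyond that cutoff is negligible for any reasonable tail value.
    """
    bound = int(math.ceil(tail * r))
    support = list(range(-bound, bound + 1))
    weights = [math.exp(-math.pi * (x / r) ** 2) for x in support]
    return random.choices(support, weights=weights)[0]
```

The resulting samples are centred at $0$ with variance close to $r^2 / (2\pi)$, matching the continuous Gaussian that $\rho_r$ restricts to the lattice.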

Regev then shows that there exists an efficient quantum algorithm for $\mathrm{DGS}_{\sqrt{2n}\,\eta_\epsilon(L)/\alpha}$ given access to an oracle for $\mathrm{LWE}_{q,\Psi_\alpha}$ for integer $q$ and $\alpha \in (0,1)$ such that $\alpha q > 2\sqrt{n}$. This implies the hardness of LWE. Although the proof of this assertion works for any $q$, for creating a cryptosystem the modulus $q$ has to be polynomial in $n$.

Peikert's result

Peikert proves that there is a probabilistic polynomial time reduction from the $\mathrm{GapSVP}_{\zeta,\gamma}$ problem in the worst case to solving $\mathrm{LWE}_{q,\Psi_\alpha}$ using $\mathrm{poly}(n)$ samples for parameters $\alpha \in (0,1)$, $\gamma(n) \geq n/(\alpha\sqrt{\log n})$, $\zeta(n) \geq \gamma(n)$ and $q \geq (\zeta/\sqrt{n})\,\omega(\sqrt{\log n})$.

Use in Cryptography

The LWE problem serves as a versatile problem used in the construction of several cryptosystems. In 2005, Regev showed that the decision version of LWE is hard assuming quantum hardness of the lattice problems $\mathrm{GapSVP}_\gamma$ (for $\gamma$ as above) and $\mathrm{SIVP}_t$ with $t = \tilde{O}(n/\alpha)$. In 2009, Peikert proved a similar result assuming only the classical hardness of the related problem $\mathrm{GapSVP}_{\zeta,\gamma}$. The disadvantage of Peikert's result is that it bases itself on a non-standard version of a problem, $\mathrm{GapSVP}$, that is easier than $\mathrm{SIVP}$.

Public-key cryptosystem

Regev proposed a public-key cryptosystem based on the hardness of the LWE problem. The cryptosystem, as well as the proofs of security and correctness, is completely classical. The system is characterized by $m$, $q$ and a probability distribution $\chi$ on $\mathbb{T}$. The setting of the parameters used in the proofs of correctness and security is
  • $q$, a prime number between $n^2$ and $2n^2$,
  • $m = (1+\epsilon)(n+1)\log q$ for an arbitrary constant $\epsilon > 0$,
  • $\chi = \Psi_{\alpha(n)}$ for $\alpha(n) \in o(1/(\sqrt{n}\log n))$.


The cryptosystem is then defined by:
  • Private Key: The private key is an $\mathbf{s} \in \mathbb{Z}_q^n$ chosen uniformly at random.
  • Public Key: Choose $m$ vectors $\mathbf{a}_1, \ldots, \mathbf{a}_m \in \mathbb{Z}_q^n$ uniformly and independently. Choose error offsets $e_1, \ldots, e_m \in \mathbb{T}$ independently according to $\chi$. The public key consists of $(\mathbf{a}_i, b_i = \langle \mathbf{a}_i, \mathbf{s} \rangle / q + e_i)_{i=1}^m$.
  • Encryption: The encryption of a bit $x \in \{0,1\}$ is done by choosing a random subset $S$ of $[m]$ and then defining $\mathrm{Enc}(x)$ as $\left(\sum_{i \in S} \mathbf{a}_i,\; x/2 + \sum_{i \in S} b_i\right)$.
  • Decryption: The decryption of $(\mathbf{a}, b)$ is $0$ if $b - \langle \mathbf{a}, \mathbf{s} \rangle / q$ is closer to $0$ than to $\tfrac{1}{2}$ (modulo $1$), and $1$ otherwise.


The proof of correctness follows from the choice of parameters and some probability analysis. The proof of security is by reduction to the decision version of LWE: an algorithm for distinguishing between encryptions (with the above parameters) of $0$ and $1$ can be used to distinguish between $A_{\mathbf{s},\chi}$ and the uniform distribution over $\mathbb{Z}_q^n \times \mathbb{T}$.
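A toy, discretized version of the scheme can be sketched as follows, with illustrative small parameters, integer errors, and the bit encoded at $q/2$ instead of $1/2$ on $\mathbb{T}$ (not the asymptotic settings above):

```python
import random

def keygen(n, m, q, alpha):
    """Toy Regev key pair: secret s, public pairs (a_i, b_i = <a_i, s> + e_i mod q)."""
    s = [random.randrange(q) for _ in range(n)]
    A = [[random.randrange(q) for _ in range(n)] for _ in range(m)]
    b = [(sum(x * y for x, y in zip(row, s))
          + round(random.gauss(0, alpha * q))) % q for row in A]
    return s, (A, b)

def encrypt(pub, q, bit):
    """Encrypt one bit: sum a random subset of the public pairs, add bit * q/2."""
    A, b = pub
    S = [i for i in range(len(A)) if random.random() < 0.5]
    a = [sum(A[i][j] for i in S) % q for j in range(len(A[0]))]
    c = (sum(b[i] for i in S) + bit * (q // 2)) % q
    return a, c

def decrypt(s, q, ct):
    """Decrypt: the residue c - <a, s> mod q is near 0 for bit 0, near q/2 for bit 1."""
    a, c = ct
    r = (c - sum(x * y for x, y in zip(a, s))) % q
    return 0 if min(r, q - r) < abs(r - q // 2) else 1
```

Correctness holds as long as the accumulated error of the chosen subset stays below $q/4$, which is exactly what the parameter constraints above guarantee asymptotically.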

CCA-secure cryptosystem

Peikert proposed a system that is secure even against any chosen-ciphertext attack.
The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 