Rustで高次抽象構文を表現することは可能ですか？

Question

Haskellでは、関数を使用して代数データ型（ADT）を記述するのは非常に簡単です。これにより、置換のためにネイティブ関数に依存するインタープリター、つまり、非常に効率的であることが知られている高次抽象構文（HOAS）を作成できます。たとえば、これはその手法を使用した単純なλ計算インタープリターです。

_data Term = Hol Term | Var Int | Lam (Term -> Term) | App Term Term pretty :: Term -> String pretty = go 0 where go lvl term = case term of Hol hol -> go lvl hol Var idx -> "x" ++ show idx Lam bod -> "λx" ++ show lvl ++ ". " ++ go (lvl+1) (bod (Hol (Var lvl))) App fun arg -> "(" ++ go lvl fun ++ " " ++ go lvl arg ++ ")" reduce :: Term -> Term reduce (Hol hol) = hol reduce (Var idx) = Var idx reduce (Lam bod) = Lam (\v -> reduce (bod v)) reduce (App fun arg) = case reduce fun of Hol fhol -> App (Hol fhol) (reduce arg) Var fidx -> App (Var fidx) (reduce arg) Lam fbod -> fbod (reduce arg) App ffun farg -> App (App ffun farg) (reduce arg) main :: IO () main = putStrLn . pretty . reduce $ App (Lam$ \x -> App x x) (Lam$ \s -> Lam$ \z -> App s (App s (App s z))) _

De Bruijnインデックスではなく、ネイティブ関数がどのように使用されたかに注目してください。これにより、アプリケーションを手動で置き換えた場合よりもインタープリターがかなり高速になります。

Rustにはクロージャーと多くのFn()タイプがありますが、この状況でHaskellクロージャーとまったく同じように動作するかどうかはわかりません。 Rustの低レベルの性質を前提にRustでHOASを表すことはできますか？Termデータ型はどのように表現されますか？

ljedrz · Accepted Answer

ラムダ計算のファンとして、私はこれを試みることにしました、そして、それはHaskell（ playground link ）よりも少し見た目が悪いけれども、それは確かに可能です

use std::rc::Rc; use Term::*; #[derive(Clone)] enum Term { Hol(Box<Term>), Var(usize), Lam(Rc<dyn Fn(Term) -> Term>), App(Box<Term>, Box<Term>), } impl Term { fn app(t1: Term, t2: Term) -> Self { App(Box::new(t1), Box::new(t2)) } fn lam<F: Fn(Term) -> Term + 'static>(f: F) -> Self { Lam(Rc::new(f)) } fn hol(t: Term) -> Self { Hol(Box::new(t)) } } fn pretty(term: Term) -> String { fn go(lvl: usize, term: Term) -> String { match term { Hol(hol) => go(lvl, *hol), Var(idx) => format!("x{}", idx), Lam(bod) => format!("λx{}. {}", lvl, go(lvl + 1, bod(Term::hol(Var(lvl))))), App(fun, arg) => format!("({} {})", go(lvl, *fun), go(lvl, *arg)), } } go(0, term) } fn reduce(term: Term) -> Term { match term { Hol(hol) => *hol, Var(idx) => Var(idx), Lam(bod) => Term::lam(move |v| reduce(bod(v))), App(fun, arg) => match reduce(*fun) { Hol(fhol) => Term::app(Hol(fhol), reduce(*arg)), Var(fidx) => Term::app(Var(fidx), reduce(*arg)), Lam(fbod) => fbod(reduce(*arg)), App(ffun, farg) => Term::app(Term::app(*ffun, *farg), reduce(*arg)), }, } } fn main() { // (λx. x x) (λs. λz. s (s (s z))) let term1 = Term::app( Term::lam(|x| Term::app(x.clone(), x.clone())), Term::lam(|s| Term::lam(move |z| Term::app( s.clone(), Term::app( s.clone(), Term::app( s.clone(), z.clone() )))))); // λb. λt. λf. b t f let term2 = Term::lam(|b| Term::lam(move |t| Term::lam({ let b = b.clone(); // necessary to satisfy the borrow checker move |f| Term::app(Term::app(b.clone(), t.clone()), f) }) )); println!("{}", pretty(reduce(term1))); // λx0. λx1. (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 x1))))))))))))))))))))))))))) println!("{}", pretty(reduce(term2))); // λx0. λx1. λx2. ((x0 x1) x2) }

Rcを使用するよう提案してくれたBurntSushi5と、BoxのRcの下にある不要なLamを削除することを提案してくれたShepmasterに感謝します。より長いLamチェーンのチェッカー。

Earth Engine · Answer

承認されたソリューションは、Rcを使用して、クローン可能なヒープ割り当てクロージャーを作成します。

技術的に言えば、ランタイム参照カウントが必要ないため、これは必要ありません。必要なのは、特性オブジェクトとしてのクロージャーであり、それもクローン化可能です。

ただし、Rust 1.29.2ではdyn Clone + FnOnce(Term) -> Termなどを使用できません。この制限は将来緩和される可能性があります。制限には2つの要素があります：Cloneはオブジェクトセーフではありません（リラックスすることはまずありません）。2つの特性を組み合わせる場合、そのうちの1つを自動特性にする必要があります（これはリラックスできます）。

言語の改善を待っている間に、これを回避するための新しい特性を導入できます。

// Combination of FnOnce(Term) -> Term and Clone trait TermLam { // The FnOnce part, declared like an Fn, because we need object safety fn app(&self, t: Term) -> Term; // The Clone part, but we have to return sized objects // (not Self either because of object safety), so it is in a box fn clone_box(&self) -> Box<dyn TermLam>; } // Blanket implementation for appropriate types impl<F> TermLam for F where F: 'static/*' highlighting fix */ + Clone + FnOnce(Term) -> Term { // Note: when you have a Clone + FnOnce, you effectively have an Fn fn app(&self, t: Term) -> Term { (self.clone())(t) } fn clone_box(&self) -> Box<dyn TermLam> { Box::new(self.clone()) } } // We can now clone the box impl Clone for Box<dyn TermLam> { fn clone(&self) -> Self { self.clone_box() } }

その後、Rcを使用する必要性を削除できます。

#[derive(Clone)] enum Term { Hol(Box<Term>), Var(usize), Lam(Box<dyn TermLam>), App(Box<Term>, Box<Term>), } impl Term { fn app(t1: Term, t2: Term) -> Self { App(Box::new(t1), Box::new(t2)) } fn lam<F>(f: F) -> Self where F: 'static/*' highlighting fix */ + Clone + FnOnce(Term) -> Term { Lam(Box::new(f)) } fn hol(t: Term) -> Self { Hol(Box::new(t)) } } fn pretty(term: Term) -> String { fn go(lvl: usize, term: Term) -> String { match term { Hol(hol) => go(lvl, *hol), Var(idx) => format!("x{}", idx), Lam(bod) => format!("λx{}. {}", lvl, go(lvl + 1, bod.app(Term::hol(Var(lvl))))), App(fun, arg) => format!("({} {})", go(lvl, *fun), go(lvl, *arg)), } } go(0, term) } fn reduce(term: Term) -> Term { match term { Hol(hol) => *hol, Var(idx) => Var(idx), Lam(bod) => Term::lam(move |v| reduce(bod.app(v))), App(fun, arg) => match reduce(*fun) { Hol(fhol) => Term::app(Hol(fhol), reduce(*arg)), Var(fidx) => Term::app(Var(fidx), reduce(*arg)), Lam(fbod) => fbod.app(reduce(*arg)), App(ffun, farg) => Term::app(Term::app(*ffun, *farg), reduce(*arg)), }, } } fn main() { // (λx. x x) (λs. λz. s (s (s z))) let term1 = Term::app( Term::lam(|x| Term::app(x.clone(), x.clone())), Term::lam(|s| { Term::lam(move |z| { Term::app( s.clone(), Term::app(s.clone(), Term::app(s.clone(), z.clone())), ) }) }), ); // λb. λt. λf. b t f let term2 = Term::lam(|b| { Term::lam(move |t| { Term::lam({ //let b = b.clone(); No longer necessary for Rust 1.29.2 move |f| Term::app(Term::app(b.clone(), t.clone()), f) }) }) }); println!("{}", pretty(reduce(term1))); // λx0. λx1. (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 x1))))))))))))))))))))))))))) println!("{}", pretty(reduce(term2))); // λx0. λx1. λx2. ((x0 x1) x2) }

これは、他の回答が試みた最初の方法であり、著者は解決できませんでした。

最適化！

さびは、安全性を犠牲にすることなく性能を達成することが知られています。ただし、上記の実装は常にTermsを値で渡し、多くの不必要なclone呼び出しを行うため、いくつかの最適化を行うことができます。

また、Rustデータの一部を文字列化する標準的な方法は、Display特性を使用することです。

use std::fmt::{Display, Error, Formatter}; use Term::*; // Combination of FnOnce(Term) -> Term and Clone trait TermLam { // The FnOnce part, declared like an Fn, because we need object safety fn app(&self, t: Term) -> Term; // The Clone part, but we have to return sized objects // (not Self either because of object safety), so it is in a box fn clone_box(&self) -> Box<dyn TermLam>; } // Blanket implementation for appropriate types impl<F> TermLam for F where F: 'static/*' highlighting fix */ + Clone + FnOnce(Term) -> Term, { // Note: when you have a Clone + FnOnce, you effectively have an Fn fn app(&self, t: Term) -> Term { (self.clone())(t) } fn clone_box(&self) -> Box<dyn TermLam> { Box::new(self.clone()) } } // We can now clone the box impl Clone for Box<dyn TermLam> { fn clone(&self) -> Self { self.clone_box() } } #[derive(Clone)] enum Term { Hol(Box<Term>), Var(usize), Lam(Box<dyn TermLam>), App(Box<Term>, Box<Term>), } impl Term { fn app(t1: Term, t2: Term) -> Self { App(Box::new(t1), Box::new(t2)) } fn lam<F>(f: F) -> Self where F: 'static/*' highlighting fix */ + Clone + FnOnce(Term) -> Term, { Lam(Box::new(f)) } fn hol(t: Term) -> Self { Hol(Box::new(t)) } // `reduce` is now a by-reference method fn reduce(&self) -> Term { match self { Hol(_) => self.clone(), Var(_) => self.clone(), Lam(bod) => { let bod = bod.clone(); Term::lam(move |v| bod.app(v).reduce()) }, // We reuse the reduced object when possible, // to avoid unnecessary clone. App(fun, arg) => match fun.reduce() { other @ Hol(_) => Term::app(other, arg.reduce()), other @ Var(_) => Term::app(other, arg.reduce()), Lam(fbod) => fbod.app(arg.reduce()), other @ App(_, _) => Term::app(other, arg.reduce()), }, } } } //The standard way of `pretty` is `Display` impl Display for Term { fn fmt(&self, fmt: &mut Formatter) -> Result<(), Error> { // As the API is different from `pretty`, the way we do recursion is // a bit different as well struct LvlTerm<'a>(usize, &'a Term); impl<'a> Display for LvlTerm<'a> { fn fmt(&self, fmt: &mut Formatter) -> Result<(), Error> { match self { LvlTerm(lvl, Hol(hol)) => write!(fmt, "{}", LvlTerm(*lvl, hol)), LvlTerm(_, Var(idx)) => write!(fmt, "x{}", idx), LvlTerm(lvl, Lam(bod)) => write!( fmt, "λx{}. {}", *lvl, LvlTerm(*lvl + 1, &bod.app(Term::hol(Var(*lvl)))) ), LvlTerm(lvl, App(fun, arg)) => { write!(fmt, "({} {})", LvlTerm(*lvl, fun), LvlTerm(*lvl, arg)) } } } } write!(fmt, "{}", LvlTerm(0, self)) } } fn main() { // In general, if you need to use a value n+1 times, you need to // call clone it n times. You don't have to clone it in the last use. // (λx. x x) (λs. λz. s (s (s z))) let term1 = Term::app( Term::lam(|x| Term::app(x.clone(), x)), Term::lam(|s| { Term::lam(move |z| Term::app(s.clone(), Term::app(s.clone(), Term::app(s, z)))) }), ); // No clone is required if all values are used exactly once. // λb. λt. λf. b t f let term2 = Term::lam(|b| Term::lam(move |t| Term::lam(move |f| Term::app(Term::app(b, t), f)))); println!("{}", term1.reduce()); // λx0. λx1. (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 (x0 x1))))))))))))))))))))))))))) println!("{}", term2.reduce()); // λx0. λx1. λx2. ((x0 x1) x2) }

プレイグラウンド

上記のコードはさらに簡略化できることがわかります。reduceのマッチアームにはコードの重複があるため、それらをまとめて折りたたむことができます。ただし、この回答の目的は、問題のHaskellコードと同じ方法で物事を行うことを示すことなので、これはそのままにしておきました。

また、クロージャーがFnOnceではなくFnであることのみを要求することにより、使用サイトで、すべての変数を複数回使用する場合にのみクローンを作成する要件を緩和しました。トレードオフは、クロージャが呼び出されるたびに、それがキャプチャするすべての変数が複製されることです。プロファイリングするまで、どちらが優れているかを判断するのは困難です。そのため、コードの見栄えが良くなるものを選択します。

繰り返しますが、Rustはそれらの決定を明示的にし、選択の違いを念頭に置いて、これは良いことです！