Feat : Query complexity model for algorithms #275


Draft

Shreyas4991 wants to merge 54 commits into leanprover:main from Shreyas4991:query-complexity-freeM-shreyas

+956 −85

Contributor

Shreyas4991 commented Jan 21, 2026 (edited)

Generalizing @tannerduve's query complexity model. This model of algorithmic complexity provides a lightweight approach to verifying the complexity of algorithms, similar to #165.

However, it offers several improvements over #165:

No explicit ticks are needed, which removes the chance of mistakes such as forgetting to count specific calls.
The operations that are counted are made explicit as an inductive type together with a model of this type.
The model of a query type accepts custom cost structures, as long as you can define addition and a 0 on them (basically, additive monoids suffice).
With a notion of reductions between models, this approach is more modular for the algorithm specifier: operations can be specified at a high level and translated into simpler query models later.
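A minimal sketch of the idea in the list above, written against core Lean only. The names `FreeM`, `Model`, `evalQuery`, `.pure` and `.liftBind` appear in the review threads of this PR; everything else here (`ListQ`, the `cost` field, `evalProg`, `costProg`, and fixing the cost type to `Nat`) is an assumption made for illustration and not the PR's actual definitions.

```lean
/-- Simplified free monad over a query signature `Q` (illustrative only). -/
inductive FreeM (Q : Type → Type) (α : Type) : Type 1 where
  | pure : α → FreeM Q α
  | liftBind {ι : Type} : Q ι → (ι → FreeM Q α) → FreeM Q α

/-- Hypothetical queries on a list; the type index is the answer type. -/
inductive ListQ : Type → Type where
  | get (l : List Nat) (i : Nat) : ListQ Nat
  | len (l : List Nat) : ListQ Nat

/-- A model answers every query and assigns it a cost. -/
structure Model (Q : Type → Type) (Cost : Type) where
  evalQuery : (ι : Type) → Q ι → ι
  cost : (ι : Type) → Q ι → Cost

/-- Run a program, answering each query with the model. -/
def evalProg {Q : Type → Type} {Cost α : Type} (M : Model Q Cost) : FreeM Q α → α
  | .pure a => a
  | .liftBind op k => evalProg M (k (M.evalQuery _ op))

/-- Sum the cost of every query the program performs (cost fixed to `Nat` here). -/
def costProg {Q : Type → Type} {α : Type} (M : Model Q Nat) : FreeM Q α → Nat
  | .pure _ => 0
  | .liftBind op k => M.cost _ op + costProg M (k (M.evalQuery _ op))

def ListQ.eval : (ι : Type) → ListQ ι → ι
  | _, .get l i => l.getD i 0
  | _, .len l => l.length

/-- Every query costs one unit in this model. -/
def listModel : Model ListQ Nat :=
  { evalQuery := ListQ.eval, cost := fun _ _ => 1 }

/-- Two `get` queries; no tick annotations are written anywhere. -/
def firstPlusSecond (l : List Nat) : FreeM ListQ Nat :=
  .liftBind (.get l 0) fun a =>
  .liftBind (.get l 1) fun b =>
  .pure (a + b)

#eval evalProg listModel (firstPlusSecond [10, 20, 30])  -- 30
#eval costProg listModel (firstPlusSecond [10, 20, 30])  -- 2
```

The cost is derived from the structure of the program rather than from annotations, so a query cannot be issued without being counted. Swapping `listModel` for a model with a different `cost` field changes the accounting without touching `firstPlusSecond`.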

Update to the drawback below: The model requires users to choose a cost for each pure operation upfront, via a typeclass instance for a given Cost type. This is a departure from TimeM's model. But with custom Cost types that have a specific field for pure, these costs can be separated from the cost of calls to queries.

One drawback: It is still possible to sneak free operations inside pure. However, this is unavoidable in any lightweight monadic approach; a deeply embedded DSL is the only foolproof way to avoid it. Nevertheless, this approach removes the annotation and review burden and ensures that any actual call to a query will be counted. Thus it is easy to notice when a monadic operation is not being called.

Zulip Discussion Thread: https://leanprover.zulipchat.com/#narrow/channel/513188-CSLib/topic/Query.20model.20for.20Algorithms.20complexity.20.3A.20updates/near/569606456

tannerduve and others added 22 commits

December 5, 2025 15:11


          initial commit

7c47d30


          fix up docs

c0c3426


          docs

a5389ab


          fix up

a32b00d


          Merge branch 'main' into query-complexity-freeM

927e4e8


          effects fix

c8d3206


          Add TimeM time/cost monad

9c3c26c

Co-authored-by: Sorrachai Yingchareonthawornhcai <sorrachai@users.noreply.github.com>


          Add TimeM time/cost monad

2136afe

Co-authored-by: Sorrachai Yingchareonthawornhcai <sorrachai@users.noreply.github.com>


          rename

f5397b0


          update cslib.lean

9e6fcd4


          Test PR to PR ability with doc comment

9381b9a


          Yet to fix monad instance issue in timeProg function

aeed9a7


          Remove unnecessary instance

b8c9a1e


          monadLift instance needed

9f1ba8a


          Merge branch 'main' of github.com:leanprover/cslib into query-complex…

80c704d

…ity-freeM-shreyas


          Merge branch 'main' of github.com:leanprover/cslib into query-complex…

c1fb032

…ity-freeM-shreyas


          Need to restart

5e1eb09


          Merge branch 'main' of github.com:leanprover/cslib into query-complex…

ba95a81

…ity-freeM-shreyas


          Revert to FreeM, but use extra type param

aaf1d87


          Some progress on the example

3267a70


          Dangers of using only pure ops

48755f0


          experiments

da183e1

Shreyas4991 changed the title ~~Query complexity free m shreyas~~ Query complexity formalisation for describing algorithmic models

Shreyas4991 added 7 commits

January 22, 2026 17:33


          Simple example

29a5908


          Simple example

65d820d


          Developing array sort example

6c2835f


          Developing array sort example

0684bdd


          Vectors

b8c18b0


          Vectors

f859a41


          Made coercions work. Added custom cost structures

912890d

eric-wieser reviewed


Cslib/Foundations/Control/Monad/Free/Effects.lean

Comment on lines -349 to -351

    
    /-- Type constructor for reader operations. -/
    inductive ReaderF (σ : Type u) : Type u → Type u where
      | read : ReaderF σ σ

eric-wieser Jan 23, 2026

Deleting this seems counter-productive

Contributor Author

Shreyas4991 Jan 23, 2026

That came from main. I don't use it in this PR. I am just not sure how to get rid of it.

eric-wieser Jan 23, 2026

I think "came from main" means "I did a bad merge of main"

eric-wieser reviewed


Cslib/Algorithms/QueryModel.lean Outdated

Comment on lines 68 to 72

    
    evalQuery q :=
      match q with
      | .write l i x => l.set i x
      | .find l elem =>  l.findIdx (· = elem)
      | .get l i => l[i]

eric-wieser Jan 23, 2026

Suggested change

      
    - evalQuery q :=
    -   match q with
    -   | .write l i x => l.set i x
    -   | .find l elem =>  l.findIdx (· = elem)
    -   | .get l i => l[i]
    + evalQuery
    +   | .write l i x => l.set i x
    +   | .find l elem =>  l.findIdx (· = elem)
    +   | .get l i => l[i]

etc
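To illustrate the suggestion above: a structure field can be given either by naming the argument and matching on it, or by listing the pattern-matching alternatives directly. The sketch below uses a hypothetical `ListOp` type and `Interp` structure (neither is from the PR) and checks that both spellings agree.

```lean
/-- Hypothetical operations on lists of naturals. -/
inductive ListOp : Type where
  | head (l : List Nat)
  | size (l : List Nat)

/-- Hypothetical interpreter structure with a single field. -/
structure Interp where
  evalQuery : ListOp → Nat

-- Style 1: bind the argument, then `match` on it.
def viaMatch : Interp where
  evalQuery q :=
    match q with
    | .head l => l.headD 0
    | .size l => l.length

-- Style 2: give the alternatives directly, with no named argument.
def viaEqns : Interp where
  evalQuery
    | .head l => l.headD 0
    | .size l => l.length

-- Both definitions compute the same function.
example (q : ListOp) : viaMatch.evalQuery q = viaEqns.evalQuery q := by
  cases q <;> rfl
```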

eric-wieser reviewed


Cslib/Algorithms/QueryModel.lean

Comment on lines +125 to +126

    
    instance {Q α} : Coe (Q α) (FreeM Q α) where
      coe := FreeM.lift

eric-wieser Jan 23, 2026

This belongs in the file defining FreeM

Contributor Author

Shreyas4991 Jan 23, 2026

Agreed. Do note that the current file has a lot of things that should move to separate files.

Shreyas4991 added 2 commits

January 23, 2026 01:13


          Model is a structure now

b7a95d5


          Remove redundant TimeM interpretation in the beginning. The TimeM sec…

…tion suffices

eric-wieser reviewed


Cslib/Algorithms/QueryModel.lean Outdated

Comment on lines 130 to 136

    
    def eval [Add Cost] [Zero Cost]
      (P : Prog Q α) (M : Model Q Cost) : α :=
      match P with
      | .pure x => x
      | .liftBind op cont  =>
          let qval := M.evalQuery op
          eval (cont qval) M

eric-wieser Jan 23, 2026

Suggested change

      
    - def eval [Add Cost] [Zero Cost]
    -   (P : Prog Q α) (M : Model Q Cost) : α :=
    -   match P with
    -   | .pure x => x
    -   | .liftBind op cont  =>
    -       let qval := M.evalQuery op
    -       eval (cont qval) M
    + def eval [Add Cost] [Zero Cost]
    +   (P : Prog Q α) (M : Model Q Cost) : α :=
    +   Id.run <| P.liftM fun x => pure (M.evalQuery x)
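The suggestion above expresses evaluation as a monadic fold specialised to `Id`. A self-contained sketch of that pattern follows, assuming a simplified `FreeM` and a hypothetical `interpret` function standing in for the library's `liftM` (whose real signature may differ); `Ask`, `askEval`, `evalFold` and `prog` are also illustrative names.

```lean
/-- Simplified free monad over a query signature `F` (illustrative only). -/
inductive FreeM (F : Type → Type) (α : Type) : Type 1 where
  | pure : α → FreeM F α
  | liftBind {ι : Type} : F ι → (ι → FreeM F α) → FreeM F α

/-- Fold a program into a monad `m`, given an interpretation `h` of each query. -/
def interpret {F m : Type → Type} [Monad m] {α : Type}
    (h : (ι : Type) → F ι → m ι) : FreeM F α → m α
  | .pure a => pure a
  | .liftBind op k => h _ op >>= fun x => interpret h (k x)

/-- Evaluation is the fold into `Id` that answers each query purely. -/
def evalFold {Q : Type → Type} {α : Type}
    (evalQuery : (ι : Type) → Q ι → ι) (P : FreeM Q α) : α :=
  Id.run (interpret (m := Id) (fun ι op => pure (evalQuery ι op)) P)

/-- A one-query signature used only for this example. -/
inductive Ask : Type → Type where
  | get : Ask Nat

def askEval : (ι : Type) → Ask ι → ι
  | _, .get => 41

/-- Ask once, then add one to the answer. -/
def prog : FreeM Ask Nat :=
  .liftBind .get fun n => .pure (n + 1)

#eval evalFold askEval prog  -- 42
```

Writing `eval` this way reuses one recursion for every interpretation: only the target monad and the handler change.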

Shreyas4991 added 3 commits

January 23, 2026 01:27


          Cslib.Init imports

e4407cc


          Set up linear search example. Only theorem statements. Proofs come later

c59aa5d


          extra space removed

903f5e4

eric-wieser reviewed


Cslib/Algorithms/QueryModel.lean Outdated

    
    section VectorLinearSearch
    inductive VecSearch (α : Type) : Type → Type  where
      | compare :  (a : Vector α n) → (i : ℕ) → (val : α) →  VecSearch α Bool

eric-wieser Jan 23, 2026 (edited)

Why not write these to look more like functions?

Suggested change

      
    - | compare :  (a : Vector α n) → (i : ℕ) → (val : α) →  VecSearch α Bool
    + | compare (a : Vector α n) (i : ℕ) (val : α) : VecSearch α Bool
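The two constructor spellings in this suggestion declare the same type; only the surface syntax differs. A small check, using hypothetical types `OpArrow` and `OpBinder` and `List` in place of `Vector` so the snippet stays dependency-free:

```lean
-- Arrow style: the constructor's type is written out as a function type.
inductive OpArrow (α : Type) : Type → Type where
  | compare : (a : List α) → (i : Nat) → (val : α) → OpArrow α Bool

-- Binder style: arguments are listed like function parameters.
inductive OpBinder (α : Type) : Type → Type where
  | compare (a : List α) (i : Nat) (val : α) : OpBinder α Bool

-- Both constructors take the same three arguments and return a `Bool`-indexed query.
#check @OpArrow.compare
#check @OpBinder.compare
```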

eric-wieser reviewed


Cslib/Algorithms/QueryModel.lean Outdated

Comment on lines 216 to 222

    
    inductive VecSortOps (α : Type) : Type → Type  where
      | swap : (a : Vector α n) → (i j : Fin n) → VecSortOps α (Vector α n)
      | cmp :  (a : Vector α n) → (i j : Fin n) → VecSortOps α Bool
      | write : (a : Vector α n) → (i : Fin n) → (x : α) → VecSortOps α (Vector α n)
      | read : (a : Vector α n) → (i : Fin n) → VecSortOps α α
      | push : (a : Vector α n) → (elem : α) → VecSortOps α (Vector α (n + 1))

eric-wieser Jan 23, 2026

This is unsafe, as it lets the algorithm access the underlying data structure. At minimum you need

Suggested change

      
    - inductive VecSortOps (α : Type) : Type → Type  where
    -   | swap : (a : Vector α n) → (i j : Fin n) → VecSortOps α (Vector α n)
    -   | cmp :  (a : Vector α n) → (i j : Fin n) → VecSortOps α Bool
    -   | write : (a : Vector α n) → (i : Fin n) → (x : α) → VecSortOps α (Vector α n)
    -   | read : (a : Vector α n) → (i : Fin n) → VecSortOps α α
    -   | push : (a : Vector α n) → (elem : α) → VecSortOps α (Vector α (n + 1))
    + inductive VecSortOps (VecType : Nat → Type) (α : Type) : Type → Type  where
    +   | swap : (a : VecType n) → (i j : Fin n) → VecSortOps VecType α (VecType n)
    +   | cmp : (a : VecType n) → (i j : Fin n) → VecSortOps VecType α Bool
    +   | write : (a : VecType n) → (i : Fin n) → (x : α) → VecSortOps VecType α (VecType n)
    +   | read : (a : VecType n) → (i : Fin n) → VecSortOps VecType α α
    +   | push : (a : VecType n) → (elem : α) → VecSortOps VecType α (VecType (n + 1))

Contributor Author

Shreyas4991 Jan 23, 2026

Good point. I'll get back to this tomorrow noon. A Finvec would be the better option.

eric-wieser Jan 23, 2026 (edited)

On second thoughts, this still doesn't help; you can't let the algorithm have access to `a` at all, otherwise it can do `classical if VecType = Vector then cheat else alg`.

Contributor Author

Shreyas4991 Jan 23, 2026

Any monadic DSL will have this issue.

eric-wieser Jan 23, 2026

I think the issue goes away if the DSL can only access the vector through opaque variable indices?

Contributor Author

Shreyas4991 Jan 23, 2026

That was my data structure hiding idea. Make the vector opaque. That is, remove the vector from the query and add it to the model. The problem is removing it means we can't write monadic Progs that are recursive on specific subvectors.

Contributor Author

Shreyas4991 Jan 24, 2026 (edited)

Another issue with this idea is that currently the second parameter of the query type is the return type of the operation. If we can't have that, then we can't have the monad structure. Without freedom in the return type of a query, we lose a lot of flexibility in our queries.
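One possible reading of the "opaque indices" idea discussed in this thread, as a sketch only: queries mention positions, and the array lives in the interpreter, so a program never receives the data structure as a value. The names `IdxQ` and `IdxQ.eval` are hypothetical, and this is not claimed to be the design either participant had in mind.

```lean
/-- Queries that name positions only; the array itself is never exposed. -/
inductive IdxQ : Type → Type where
  | cmp (i j : Nat) : IdxQ Bool   -- is the element at `i` ≤ the element at `j`?
  | size : IdxQ Nat

/-- The interpreter owns the array and answers queries against it. -/
def IdxQ.eval (store : Array Nat) : (ι : Type) → IdxQ ι → ι
  | _, .cmp i j => decide (store[i]! ≤ store[j]!)
  | _, .size => store.size

#eval IdxQ.eval #[3, 1, 2] Bool (.cmp 1 0)  -- true, since 1 ≤ 3
#eval IdxQ.eval #[3, 1, 2] Nat .size        -- 3
```

The query type here still carries its answer type as an index. The cost raised in the thread remains, though: because queries no longer take the vector as an argument, a program cannot directly recurse on a specific subvector.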

Shreyas4991 added 3 commits

January 23, 2026 13:55


          Let's tail-recursive linear search. It is still monadic

b8e89df


          Author and license stuff

d7fa83e


          Another attempt at proofs

c1061d3

chenson2018 reviewed


Cslib.lean Outdated

    
    public import Cslib.Foundations.Control.Monad.Free.Fold
    public import Cslib.Foundations.Data.FinFun
    public import Cslib.Foundations.Control.Monad.Time
    import Cslib.Foundations.Data.FinFun

Collaborator

chenson2018 Jan 24, 2026

Now that we have moved to the module system, make sure you run `lake exe mk_all --module`.

chenson2018 reviewed


Cslib/Foundations/Control/Monad/Time.lean

Comment on lines +15 to +16

    
    `TimeM` is a monad that tracks execution time alongside computations, using natural numbers
    as a simple cost model. As plain types it is isomorphic to `WriterT Nat Id`.

Collaborator

chenson2018 Jan 24, 2026

Can you clarify what you envision happening to this module? If what you are proposing subsumes this, I vote that we replace it outright with the new framework. I think we can make room for a lightweight monadic DSL and Boole's deep embedding, but I don't want to encourage any more fragmentation than that.

Contributor Author

Shreyas4991 Jan 24, 2026 (edited)

This PR largely subsumes TimeM. In principle, because TimeM allows arbitrary (occurrences of) operations to be checkmarked, it is more flexible, but as I argued in December, this flexibility is a footgun and a limitation in many ways. As long as we are systematically counting operations, this model clearly subsumes TimeM.

There is one footgun any monadic DSL will have, namely sneaking in computations with pure. This is fundamental to a monad. However, even here I have managed to improve the situation a bit by adding a notion of PureCosts as a typeclass and adding custom pure fields to some cost models. As you can see in the #evals at lines 222 and 401, this model can also capture how many pure operations are used. This breaks the equivalence with TimeM's model of assigning 0 cost to pure, but gives us room for a sanity check as reviewers of code.

Overall I agree with you that this model can entirely replace TimeM.
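For comparison, the explicit-tick style discussed in this thread can be sketched as a value paired with a natural-number cost. `Timed`, `tick`, `search` and `searchNoTick` below are hypothetical stand-ins written for this illustration, not the definitions in `Cslib/Foundations/Control/Monad/Time.lean`.

```lean
/-- A result paired with an accumulated cost (illustrative stand-in for `TimeM`). -/
structure Timed (α : Type) where
  ret : α
  time : Nat

instance : Monad Timed where
  pure a := ⟨a, 0⟩
  bind m f := let r := f m.ret; ⟨r.ret, m.time + r.time⟩

/-- Charge `c` units of time. -/
def tick (c : Nat) : Timed Unit := ⟨(), c⟩

/-- Linear search that charges one unit per comparison. -/
def search (x : Nat) : List Nat → Timed Bool
  | [] => pure false
  | y :: ys => do
    tick 1
    if x == y then pure true else search x ys

/-- The same search with the tick forgotten: still type-checks, but reports zero cost. -/
def searchNoTick (x : Nat) : List Nat → Timed Bool
  | [] => pure false
  | y :: ys => if x == y then pure true else searchNoTick x ys

#eval (search 3 [1, 2, 3]).time        -- 3
#eval (searchNoTick 3 [1, 2, 3]).time  -- 0
```

Nothing in the types distinguishes the two searches, which is the annotation and review burden the query model aims to remove.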

Shreyas4991 added 4 commits

January 24, 2026 12:02


          Let's count pure oerations as well


          Let's count pure oerations as well

fb185b9


          Add missing public annotation to import in CSLib

3bf4db0


          Address some review comments

5ca8828

Shreyas4991 mentioned this pull request

feat: Adding proofs of correctness and time complexity of insertion sort #280

Open

Shreyas4991 added 7 commits

January 25, 2026 03:28


          List linear search proofs were simpler

22a5af4


          Try Vector proofs by list induction

7ef2750


          Split the files

72a74c7


          Add copyright header

c2529f7


          Improve docs

db100ea


          Circuit complexitygit add *!


          add copyright comment to CircuitProgs.lean

5acdf88

eric-wieser reviewed


Cslib/Algorithms/CircuitProgs.lean

Comment on lines +59 to +63

eric-wieser Jan 26, 2026

Suggested change

      
    - if id ∉ s₂
    - then
    -   id :: s₂
    - else
    -   s₂
    + insert id s₂

eric-wieser reviewed


Cslib/Algorithms/CircuitProgs.lean

Comment on lines +54 to +55

    
    | .const id _ =>
        if id ∉ countedIDs then id :: countedIDs else countedIDs

eric-wieser Jan 26, 2026 (edited)

Suggested change

      
    - | .const id _ =>
    -     if id ∉ countedIDs then id :: countedIDs else countedIDs
    + | .const id _ => insert id countedIDs
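A quick check of why the two suggestions above preserve behaviour, assuming the lists involved are ordinary `List`s: core Lean's `List.insert` prepends an element only when it is not already present, which is what the hand-written conditional does. `insertManual` is a hypothetical name for that conditional.

```lean
/-- The hand-written conditional from the original code, specialised to `Nat`. -/
def insertManual (x : Nat) (s : List Nat) : List Nat :=
  if x ∉ s then x :: s else s

-- Absent element: both prepend it.
#eval insertManual 3 [1, 2]              -- [3, 1, 2]
#eval List.insert 3 ([1, 2] : List Nat)  -- [3, 1, 2]

-- Present element: both leave the list unchanged.
#eval insertManual 2 [1, 2]              -- [1, 2]
#eval List.insert 2 ([1, 2] : List Nat)  -- [1, 2]
```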


Reviewers

chenson2018 left review comments

fmontesi (code owner): review will be requested when the pull request is marked ready for review

+1 more reviewer

eric-wieser left review comments

At least 1 approving review is required to merge this pull request.

Labels

None yet