Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make module system more robust #139

Open
jeaye opened this issue Nov 27, 2024 · 6 comments
Open

Make module system more robust #139

jeaye opened this issue Nov 27, 2024 · 6 comments
Assignees

Comments

@jeaye
Copy link
Member

jeaye commented Nov 27, 2024

More details to be filled in soon.

JIT

Add dynamic var for loaded libs

Clojure uses a clojure.core/*loaded-libs* dynamic var, which contains a set of loaded libs. It's initialized in clojure.core, like so:

(defonce ^:dynamic
  ^{:private true
     :doc "A ref to a sorted set of symbols representing loaded libs"}
  *loaded-libs* (ref (sorted-set)))

We can do the same with jank but make it an atom and leave a TODO for ref usage. Then follow all of the usages of *loaded-libs* in clojure.core and make sure jank is doing the same work in the various functions.

Add cyclical dep check

Clojure also uses another dynamic var, *pending-paths*, to track cyclical deps. It's defined like so:

(defonce ^:dynamic
  ^{:private true
     :doc "A stack of paths currently being loaded by this thread"}
  *pending-paths* ())

We'll want to add the same thing, along with the check-cyclic-dependency function and its usages. Manually test this to ensure it's working well.

Add transactionality

Based on my testing, Clojure will still mark a namespace as loaded if you throw an exception while requiring it. This means we don't need anything clever other than thread safety when updating the loaded libs set. Please do your own testing (check loaded libs, try to load a ns which isn't loaded, have it fail, and then check again) and ensure jank matches Clojure's behavior. The loaded libs var is private, but you can get to it like so:

(deref (intern 'clojure.core '*loaded-libs*))

Again, an atom will do fine here, as far as I can tell. I think it'd be good to ask about why a ref is used here, compared to an atom, in the Clojurian Slack; we might learn something neat.

Add ability to look up module by source

I have recently added an origin enum to load_module. It allows us to explicitly load from source or load from the latest, where latest is either binary or source depending on timestamps (timestamp checking not yet implemented).

For future functionality, it would be helpful to extract some of the behavior we have in load_module into a find_module function which also takes an origin. It can then return some data containing both the entry and which part of the entry should be used (based on the origin).

Add reloading support

When a pass :reload along to require, we want to load the module from the latest origin again, even if it's already loaded. This will require passing in a flag to ignore the early exit for skipping modules which are already loaded. This also needs to work with :reload-all, which reloads the specified module and all of its dependencies from their latest origins. Look into how Clojure does this in the load-lib function.

AOT

Ensure module dependencies are compiled and loaded properly

We can AOT compile a module to an object file right now. If that module requires other modules, we need to fork off and compile each of them into their own object files. This has not been tested and may not be working. Start simply, with a.jank and b.jank, where a requires b. Then tell jank to compile a with jank compile a (ensure that a.jank is on the module path).

Firstly, we want to be sure that both a and b get generated separately, with the correct LLVM IR modules. You can comment out the print in runtime::context::write_module to see the IR that's getting written for each module.

Secondly, we want to be sure that when we load a, b gets loaded as well. You can verify this by putting a println at the top of both of them, starting a jank repl, and requiring a. You should see both prints.

Load binaries only if the source isn't newer

When loading a module and the origin is latest, we can consider binaries for loading. If a binary is present, we need to check its timestamp against the source file. If the source file is missing, we need to not load the binary, since we always require source distributions. If the binary is newer or at least as new as the source, we can load the binary. If the source is newer, we need to load the source. For timestamps, we want to check the last modified time.

Skip module compilation based on timestamp

When compiling, if a module has a binary which has a sufficient timestamp, we can skip compilation of the source.

Update binary cache path based on compilation flags

Right now, our binary cache path is based on a few different values. This is in binary_version, in dir.cpp. We'll need to parameterize binary_version to take in more inputs. Here's a list of those I can think of right now:

  1. Optimization setting (from CLI flags)
  2. Include dirs and preprocessor defines (from CLI flags)
  3. Target architecture (from CLI flags -- not yet implemented)
  4. Direct linking (from CLI flags -- not yet implemented)

This will mean that changing any of the above will result in a new binary cache dir which will then result in a recompile of every source (unless that cache dir has up-to-date binaries in it already).

Prevent duplicate symbols from being generated

For this, we need a few things:

  1. Fully qualify generated symbols, such that foo_456_0 becomes clojure_core_foo_456_0
    • Start with runtime::munge
    • Then take a look at the existing fns like module_to_load_function for how to replace . with _
  2. Change runtime::context::unique_{string,symbol} to use the current ns
    • Move the static atomic counter into ns
  3. Add a jank_set_module_symbol_counter(ns_name, count) to the C API
    • Intern the ns and then set its atomic count, return void
  4. Add call to the new C API fn in codegen
    • Find the usage of module_to_load_function in create_function and add a call in there
    • Specify the ns name and the current value of the counter as params

Add .cpp module AOT compilation

If a module is backed by a .cpp file, we still want to be able to AOT compile it to a .o file. For example, if we have src/foo_native.cpp, we should be able to jank compile foo-native. In order to do this, we'll probably need to invoke Clang, make sure it has the right flags (includes, defines, etc), and tell it to compile that .cpp file to the target .o file, while handling any failures.

HOWEVER, before digging into this, I would ask in LLVM's #jit to see if there's a way to just get the IR from C++ that we're JIT compiling. That would allow us to not compile it a different way, since we can just use our normal JIT stuff and then extract the IR module and save it to an object file.

@Samy-33
Copy link
Contributor

Samy-33 commented Nov 27, 2024

@jeaye, I am on this.

@frenchy64
Copy link
Contributor

frenchy64 commented Jan 16, 2025

I think there might be a bit more to *loaded-libs*. IIRC it might be mutated in a transaction that is rolled back if loading fails.

OTOH I seem to remember being surprised that side effects happened in a transaction.

@Samy-33
Copy link
Contributor

Samy-33 commented Jan 17, 2025

There's not much else happening in the transactions that mutate *loaded-libs* apart from just that, updating itself.

So it can very well be just an atom. If swap! fails on an atom, the value it holds is retained.

@frenchy64
Copy link
Contributor

There's not much else happening in the transactions that mutate loaded-libs apart from just that, updating itself.

The dosync transaction inside clojure.core/load-all would call load each time it is retried, wouldn't it?

While (and if) this is true, clearly this is an unsupported scenario since loading is not thread-safe and hence would never be retried. But in the spirit of Clojure parity, it's worth pondering.

I don't think the claim in this issue is correct that "Clojure will still mark a namespace as loaded if you throw an exception while requiring it". But I also think Clojure parity can be achieved by changing dosync to swap!.

Maybe the scenario that was observed was one where an ns form was evaluated outside of a require.

If swap! fails on an atom, the value it holds is retained.

It sounds like you're suggesting calling load within a swap!. I think we can do loading outside of any transaction like this:

(defonce ^:dynamic *loaded-libs* (atom (sorted-set)))

(defn- load-one
  [lib need-ns require]
  (load (root-resource lib))
  (throw-if (and need-ns (not (find-ns lib)))
            "namespace '%s' not found after loading '%s'"
            lib (root-resource lib))
  (when require
    (swap! *loaded-libs* conj lib)))

(defn- load-all
  "Loads a lib given its name and forces a load of any libs it directly or
  indirectly loads. If need-ns, ensures that the associated namespace
  exists after loading. If require, records the load so any duplicate loads
  can be skipped."
  [lib need-ns require]
  (swap! *loaded-libs* #(reduce1 conj %1 %2)
         (binding [*loaded-libs* (atom (sorted-set))]
           (load-one lib need-ns require)
           @*loaded-libs*)))

(defmacro ns
  (let [...]
    `(do
       ...
        (if (.equals '~name 'clojure.core) 
          nil
          (do (swap! @#'*loaded-libs* conj '~name) nil)))))

@Samy-33
Copy link
Contributor

Samy-33 commented Jan 17, 2025

The dosync transaction inside clojure.core/load-all would call load each time it is retried, wouldn't it?

That's right.

since loading is not thread-safe and hence would never be retried

I don't quite follow this. Can you elaborate?


The following experiment leads to the claim Clojure will still mark a namespace as loaded if you throw an exception while requiring it

;; a.clj
;; (ns a (:require b))

;; b.clj
;; (ns b)
;; (throw (Exception.))

➜  test clj -Sdeps '{:paths ["src"] :deps {com.clojure-goes-fast/clj-java-decompiler {:mvn/version "0.3.6"}}}'
Clojure 1.12.0
user=> (require 'a)
Execution error at b/eval152 (b.clj:3).
null
user=> (in-ns 'clojure.core)
#object[clojure.lang.Namespace 0x982bb90 "clojure.core"]
clojure.core=> @*loaded-libs*
#{b clojure.core.protocols clojure.core.server clojure.core.specs.alpha clojure.edn clojure.instant clojure.java.basis clojure.java.basis.impl clojure.java.browse clojure.java.io clojure.java.javadoc clojure.java.process clojure.java.shell clojure.main clojure.pprint clojure.repl clojure.repl.deps clojure.spec.alpha clojure.spec.gen.alpha clojure.string clojure.tools.deps.interop clojure.uuid clojure.walk}
clojure.core=>

The idea is that while a is not marked as loaded, we still see b as loaded, even when we experienced an exception while loading it.

The example implementation you mentioned, is exactly how it's currently implemented in Jank.
https://github.com/jank-lang/jank/blob/main/compiler%2Bruntime/src/jank/clojure/core.jank#L3818-L3827

@frenchy64
Copy link
Contributor

frenchy64 commented Jan 17, 2025

I don't quite follow this. Can you elaborate?

My impression is that it's undefined behavior to load files in parallel (per serialized-require docstring) and since any write contention on *loaded-libs* only happens in that (undefined) scenario, we can ignore it.

The following experiment leads to the claim ...

Thanks. I realized it was :reload-all I was thinking of:

$ clj -Sdeps '{:paths ["src"]}'
Clojure 1.12.0
user=> (require 'a :reload-all)
Execution error at b/eval152 (b.clj:2).
null
user=> @@#'clojure.core/*loaded-libs*
#{clojure.core.protocols clojure.core.server clojure.edn clojure.instant clojure.java.basis clojure.java.basis.impl clojure.java.browse clojure.java.io clojure.java.javadoc clojure.java.process clojure.java.shell clojure.main clojure.pprint clojure.repl clojure.repl.deps clojure.spec.alpha clojure.spec.gen.alpha clojure.string clojure.tools.deps.interop clojure.uuid clojure.walk}

I think that's achieved by dynamic binding so jank would probably also inherit these semantics.

The example implementation you mentioned, is exactly how it's currently implemented in Jank.

Whoops, I didn't see any commits referencing this issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants