Permit FFI files in subdirectories #3601

PgBiel · 2024-09-08T23:11:03Z

Closes #1562

~~Still WIP (finishing the details regarding the duplicate module check)~~

Changes:

FFI files in subdirectories are now copied. For JavaScript specifically, they can be used as @external(javascript, "./file.mjs", "function"). For Erlang, you'd use the module's name as usual.
An additional check was included to ensure no .erl files with the same name exist within the project, as Erlang cannot have duplicate modules.
Added support for directories to the in-memory filesystem. (Otherwise, the is_directory function would return true for files and read_dir would also list the directory itself, causing tests to enter infinite recursion and crash.)

Potential improvements:

I'd like to add explicit integration tests for using @external with relative paths in JS (I tested locally with .mjs with node.js, Deno and Bun and it worked, but could be worth it to test other potential edge cases). Let me know where would be the best spot in the codebase to do so.
Consider having an error when a native file would overlap with a .gleam file's generated code (.mjs or .erl).

lpil

Thank you!

You'll see there's integration tests in the test directory which can be updated with nested FFI files

compiler-core/src/build/native_file_copier.rs

compiler-core/src/io/memory.rs

PgBiel · 2024-09-15T18:02:25Z

Problem: My handmade recursive directory walker is too naive and can enter infinite symlink loops. I'll probably add a new method to FileSystemReader where we can address this at the IO boundary, similarly to how it's done for gleam_source_files.

PgBiel · 2024-09-16T02:24:50Z

Alright, I've made the implementation more robust (it also now supports copying nested Erlang modules), added detection of conflicting .gleam and .mjs files (but not of conflicting .gleam and .erl files yet due to the issue outlined here: #1562 (comment) - we could leave this to a future PR), added detection of conflicting .erl files in separate subpaths, added more unit tests and added integration tests. 😄

inoas · 2024-09-16T13:45:12Z

Hello,

thanks for taking this on, this is lovely <3 for FFI interop!

In case this is desired and-or not yet considered:

Can we enforce some namespacing on Erlang, Elixir and JavaScript modules that maps to the directory structure and else throw a compiler error?

Edit:

... if possible, this might also make this check obsolete?

An additional check was included to ensure no .erl files with the same name exist within the project, as Erlang cannot have duplicate modules.

PgBiel · 2024-09-16T16:12:28Z

I'm not sure that would be desirable since the main point of FFI is to have full control over what the final transpiled code does. Besides, we would have to parse all Erlang files, though I guess we could do with some regex in this case. But I think it's more a matter of flexibility, and the error just tells you if you ever mess it up somehow.

The error is also not perfect since I guess it's theoretically possible for you to shadow some module from some other package. But it's an attempt.

inoas · 2024-09-16T18:54:41Z

I'm not sure that would be desirable since the main point of FFI is to have full control over what the final transpiled code does. Besides, we would have to parse all Erlang files, though I guess we could do with some regex in this case. But I think it's more a matter of flexibility, and the error just tells you if you ever mess it up somehow.

The error is also not perfect since I guess it's theoretically possible for you to shadow some module from some other package. But it's an attempt.

maybe it could warn.
I don't know... I think these FFI files will all be new and setting up strictness is nice there. You can still build any erlang/elixir library as a deprendency.

PgBiel · 2024-09-16T19:38:00Z

I think this is fine to be honest... Forcing people to name their modules like folder@folder2@something in every folder doesn't sound good or useful realistically. And it can clash either way.

lpil

Very nice, thank you.

It seems now that there's a concept of directories in the in memory file system and also the OS file system, and each is responsible for implementing traversal. I think it would make sense to remove this duplication and instead to have the trait know nothing about directory walking and then the walking logic depends on the trait and is shared between all implementations.

PgBiel · 2024-09-20T20:38:50Z

I think it would make sense to remove this duplication and instead to have the trait know nothing about directory walking and then the walking logic depends on the trait and is shared between all implementations.

If I understood correctly, you are proposing making a dir walking function which uses only the trait's read_dir function (and other trait functions if needed), right? Though I'm not sure if we would want to reinvent the wheel here, given we already have a package which implements graph traversal logic to avoid loops etc. What if we compromise and simply add a walk_dir function to the trait and use that where applicable?

Although I guess a re-implementation will be needed if we add symlinks to the in-memory FS down the road, so we could alternatively consider searching for existing platform agnostic implementations and using that, or even just re-implement while attempting to avoid reinventing the wheel as much as possible, e.g. by using special data structure crates if needed. Still, I'm not fully aware of all edge cases to know whether we would be able to match existing OS-specific implementations in terms of not only performance but also correctness.

lpil · 2024-09-21T20:16:02Z

Yes, that's right. We should not have two implementations of the same thing, and removing a dependency is beneficial for maintenance. Having one less dependency to verify for the EU Cyber Resilience Act is nice too!

I'm not worried about correctness seeing as this is a trivial algorithm, will be fine.

PgBiel · 2024-10-07T00:55:25Z

By the way, I believe this is very close to completion (the dir walker algorithm was already pushed, but I haven't tested it well yet) - I'll be picking this up again very soon, once I have a bit more time :)

PgBiel · 2024-11-06T01:35:08Z

Ready for another look 😄

The dir walker has been implemented with symlink loop detection. I've switched other places using recursive search to use the dir walker. Try it out :)

I've also made a few fixes and explicit design decisions in the implementation of directories in IMFS. Of note, I now ensure the root path always exists, for consistency. (We don't really check for this anywhere at the moment though, so I can remove that, but I felt like not leaving a time bomb for the future here 😄.)

Ideally, `src/abc.gleam` and `src/abc.erl` would cause an error. However, this can actually occur legitimately when downloading a Hex package with precompiled `.erl` source files, which would trigger false positives. Wonder what would be the best way to detect this situation, but doesn't seem trivial to solve.

- Test header file in separate subdir - Test parent folder ffi in Erlang

- Prevent problems with infinite symlink loops

add mjs bug fix to changelog update changelog to include erl ffi files update changelog with js ffi change

We were previously including all paths in the filesystem, which included, in particular, the root. Therefore, the "initial files" included the root directory, causing everything to be deleted, not only the initial files.

lpil reviewed Sep 10, 2024

View reviewed changes

compiler-core/src/build/native_file_copier.rs Outdated Show resolved Hide resolved

compiler-core/src/io/memory.rs Outdated Show resolved Hide resolved

compiler-core/src/io/memory.rs Outdated Show resolved Hide resolved

compiler-core/src/io/memory.rs Outdated Show resolved Hide resolved

lpil marked this pull request as draft September 10, 2024 15:22

PgBiel force-pushed the nested-js-ffi branch from 29b67b1 to 0880a75 Compare September 12, 2024 23:31

PgBiel changed the title ~~Permit JS FFI files in subdirectories~~ Permit FFI files in subdirectories Sep 13, 2024

PgBiel force-pushed the nested-js-ffi branch from a7903ba to cb3980e Compare September 15, 2024 17:36

PgBiel force-pushed the nested-js-ffi branch from 5da6877 to df8cbd8 Compare September 16, 2024 01:42

PgBiel marked this pull request as ready for review September 16, 2024 02:22

lpil reviewed Sep 20, 2024

View reviewed changes

lpil marked this pull request as draft September 20, 2024 13:53

PgBiel force-pushed the nested-js-ffi branch 5 times, most recently from ea4fbd3 to 7410d22 Compare October 24, 2024 04:39

PgBiel force-pushed the nested-js-ffi branch 2 times, most recently from 3808565 to 977041b Compare November 5, 2024 15:02

PgBiel marked this pull request as ready for review November 6, 2024 01:34

PgBiel force-pushed the nested-js-ffi branch from 951f374 to 8506b46 Compare November 8, 2024 02:09

PgBiel added 2 commits November 7, 2024 23:32

add directories to in-memory FS

d37d7b8

fix test-package-compiler fs test

d0756c4

PgBiel added 29 commits November 7, 2024 23:32

create output subdir if it doesn't exist

effa596

adjustments to io/memory

1398858

add erlang duplicate module check

eacd83e

detect clash between gleam and erl modules

34430f1

detect clashing gleam and javascript modules

bec2bbe

separate .gleam codepath

bdd7411

add subdir_ffi integration tests

9d5000b

add subdir_ffi test to ci

81da601

use EcoString, add subdir Erlang file tests

6a20050

more robust subdir_ffi tests

cbf75a1

- Test header file in separate subdir - Test parent folder ffi in Erlang

use walkdir for native file search

86057e0

- Prevent problems with infinite symlink loops

add module and native file clash error

9ebd89f

add error for duplicate native erlang module

0aa81b7

update changelog with ffi subdir info

70e765b

add mjs bug fix to changelog update changelog to include erl ffi files update changelog with js ffi change

final nested ffi improvements

5095adc

custom dirwalker

9cd46f6

fix test project compiler

d3cabcb

dir walker: don't canonicalize returned paths

a3425b5

replace reader methods by dirwalker

c08de93

add dir walking test

32da5d1

fix phony in subdir_ffi test

d9f2345

add trace back to gleam_source_files

7b6749f

improve dir walker comments

65a0694

use imfs subpaths for initial files in tests

17e3364

We were previously including all paths in the filesystem, which included, in particular, the root. Therefore, the "initial files" included the root directory, causing everything to be deleted, not only the initial files.

add note about root path

5998b62

ensure imfs has root

23cead6

cannot delete root from imfs

8efc284

fix tests by using 'files'

9769b61

PgBiel force-pushed the nested-js-ffi branch from c4a6cf1 to 9769b61 Compare November 8, 2024 02:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Permit FFI files in subdirectories #3601

Permit FFI files in subdirectories #3601

PgBiel commented Sep 8, 2024 •

edited

Loading

lpil left a comment

PgBiel commented Sep 15, 2024 •

edited

Loading

PgBiel commented Sep 16, 2024 •

edited

Loading

inoas commented Sep 16, 2024 •

edited

Loading

PgBiel commented Sep 16, 2024 •

edited

Loading

inoas commented Sep 16, 2024

PgBiel commented Sep 16, 2024

lpil left a comment

PgBiel commented Sep 20, 2024 •

edited

Loading

lpil commented Sep 21, 2024

PgBiel commented Oct 7, 2024

PgBiel commented Nov 6, 2024 •

edited

Loading

Permit FFI files in subdirectories #3601

Are you sure you want to change the base?

Permit FFI files in subdirectories #3601

Conversation

PgBiel commented Sep 8, 2024 • edited Loading

lpil left a comment

Choose a reason for hiding this comment

PgBiel commented Sep 15, 2024 • edited Loading

PgBiel commented Sep 16, 2024 • edited Loading

inoas commented Sep 16, 2024 • edited Loading

PgBiel commented Sep 16, 2024 • edited Loading

inoas commented Sep 16, 2024

PgBiel commented Sep 16, 2024

lpil left a comment

Choose a reason for hiding this comment

PgBiel commented Sep 20, 2024 • edited Loading

lpil commented Sep 21, 2024

PgBiel commented Oct 7, 2024

PgBiel commented Nov 6, 2024 • edited Loading

PgBiel commented Sep 8, 2024 •

edited

Loading

PgBiel commented Sep 15, 2024 •

edited

Loading

PgBiel commented Sep 16, 2024 •

edited

Loading

inoas commented Sep 16, 2024 •

edited

Loading

PgBiel commented Sep 16, 2024 •

edited

Loading

PgBiel commented Sep 20, 2024 •

edited

Loading

PgBiel commented Nov 6, 2024 •

edited

Loading