Skip to content

v2.0.0 (Superposition)

Compare
Choose a tag to compare
@danieldk danieldk released this 16 Apr 13:17
· 1 commit to main since this release
8debb21

✨ New features and improvements

  • Register models using catalogue to support external models in Auto{Decoder,Encoder,CausalLM} (#351, #352).
  • Add support for loading parameters in-place (#370).
  • Support for ELECTRA models (#358).
  • Add support for write/upload operations with HFHubRepository (#354).
  • Add support for converting Curated Transformer configs to HF-compatible configs (#333).

🔴 Bug fixes

  • Support PyTorch 2.2 (#360).

⚠️ Backwards incompatibilities

  • Support for TorchScript tracing is removed (#361).
  • The qkv_split argument is now mandatory for AttentionHeads, AttentionHeads.uniform, AttentionHeads.multi_query, and AttentionHeads.key_value_broadcast (#374).
  • All FromHFHub mixins are renamed to FromHF (#374).
  • FromHF.convert_hf_state_dict is removed in favor of FromHF.state_dict_from_hf (#374).

👥 Contributors

@danieldk, @honnibal, @ines, @KennethEnevoldsen, @shadeMe