Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ENH: Using msgpack instead of json #1819

Draft
wants to merge 45 commits into
base: develop
Choose a base branch
from

Commits on Jul 15, 2022

  1. REL: 2.3.2 Maintenance version

    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    c30a5c9 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    63b848a View commit details
    Browse the repository at this point in the history
  3. ENH: New expert bot for uWhoisd (#1918)

    * ENH: New expert bot for uWhoisd
    * ENH: Code cleanup, improve tests
    * DOC: Add documentation for uWhoisd
    Rafiot authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    e03c14d View commit details
    Browse the repository at this point in the history
  4. New Bot for Fireeye Appliances

    Referring to my mail I created a Parser& Bot which is able to Collect and parse data from fireeye mail and file analysis appliances.
    We are collection sha1 and MD5 hashed and if there is network communication we are also collecting urls and domains.
    CysihZ authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    2dab6a1 View commit details
    Browse the repository at this point in the history
  5. MAINT+BUG: Various fixes in fireeye bots, tests and documentation

    rewrite bot bots' parameter handling to current methodology
    use HTTP mixin for fireeye collector
    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    e5f55e1 View commit details
    Browse the repository at this point in the history
  6. rename fireeye collector to fireeye mas

    Fireeye has more products than the MAS system
    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    f25c996 View commit details
    Browse the repository at this point in the history
  7. DOC: bots: add anchors for all sections

    adds the module name of bots as section anchor
    adds the module name to the information block if it was missing or
    incomplete
    fixes part of certtools/intelmq-api#4
    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    a325159 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    2ddb625 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    7476e31 View commit details
    Browse the repository at this point in the history
  10. FIX: skip sieve expert tests

    monoidic authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    184d8ff View commit details
    Browse the repository at this point in the history
  11. FIX: RDAP checks if entity is valid entity-type

    As some RDAP implementation may vary, we check if the entity
    is a valid entity-type. As of the RFC for RDAP only JSON dicts
    are allowed.
    
    Fixes #1942
    
    Signed-off-by: Sebastian Waldbauer <[email protected]>
    waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    ad7f6ab View commit details
    Browse the repository at this point in the history
  12. BUG/DOC: updated feodotracker browse parser confiuration

    - adapted docs
    - added news entry
    - added upgrade config
    - added upgrade config test
    - added html table parser test
    
    fixes #1938
    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    1f24793 View commit details
    Browse the repository at this point in the history
  13. ENH: add honeypot_brute_force and corresponding tests

    ... and add legacy tag to drone_brute_force which it replaces
    Birger Schacht authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    9639874 View commit details
    Browse the repository at this point in the history
  14. ENH: add honeypot_ddos_amp and corresponding tests (#1950)

    add legacy tag to application_ddos_victim
    
    Co-authored-by: Wagner <[email protected]>
    2 people authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    7c79954 View commit details
    Browse the repository at this point in the history
  15. DOC+ENH: shadowserver support for new feeds

    add documentation for #1950 and #1952
    fix filename detection for the new feeds
    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    89cec4c View commit details
    Browse the repository at this point in the history
  16. ENH: Added cache mixin

    The cache mixin replaces the cache lib and adds the possibility
    to inherit the mixin in classes.
    
    Signed-off-by: Sebastian Waldbauer <[email protected]>
    waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    c48716e View commit details
    Browse the repository at this point in the history
  17. FIX: Removed some old cache code

    Signed-off-by: Sebastian Waldbauer <[email protected]>
    waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    0810f7d View commit details
    Browse the repository at this point in the history
  18. DOC: add license information to all the files

    This commit adds license information to a lot of files and adds a
    .reuse/dep5 file that lists the license information for some folders
    
    The commit also changes the main license in setup.cfg from AGPL-3.0-only
    to AGPL-3.0-or-later because only one file has the AGPL-3.0-only file as
    license and multiple files have the AGPL-3.0-or-later in the license
    header.
    
    It also removes the cef_logo.png file, as there is no information about
    the licese anywhere to be found. It is now included directly from the
    website of the european union.
    
    Closes #1633
    Birger Schacht authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    3816ef0 View commit details
    Browse the repository at this point in the history
  19. Configuration menu
    Copy the full SHA
    a869320 View commit details
    Browse the repository at this point in the history
  20. Configuration menu
    Copy the full SHA
    4b4ac75 View commit details
    Browse the repository at this point in the history
  21. ENH: add event_ip_spoofer shadowserver config and corresponding tests

    and add legacy tag to shadowserver caida config
    Birger Schacht authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    bdec455 View commit details
    Browse the repository at this point in the history
  22. ENH: add event4_honeypot_darknet shadowserver config & tests

    and add legacy tag to darknet config
    Birger Schacht authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    0926b38 View commit details
    Browse the repository at this point in the history
  23. ENH: add event46_sinkhole shadowserver config & tests

    and add legacy tag to the configs it replaces
    
    and update changelog and documentation accordingly
    Birger Schacht authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    df77cda View commit details
    Browse the repository at this point in the history
  24. FIX: change copyright header syntax in md files

    Closes #1972
    Birger Schacht authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    cffc303 View commit details
    Browse the repository at this point in the history
  25. Configuration menu
    Copy the full SHA
    0adf048 View commit details
    Browse the repository at this point in the history
  26. ENH+DOC: shadowserver exchange feed

    fix mapping
    use compromised type if the data indicates an active webshell
    plus add testcases
    add changelog
    update bots documentation
    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    9aec5f5 View commit details
    Browse the repository at this point in the history
  27. Configuration menu
    Copy the full SHA
    8f115c8 View commit details
    Browse the repository at this point in the history
  28. DOC for PR#1971

    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    b6dcaf9 View commit details
    Browse the repository at this point in the history
  29. DOC/ENH: shadowserver feed

    enhance mappings
    add 4/6 agnostic mapping for `Sinkhole-Events` as well
    document feeds with IPv4 and IPv6 better and shorter
    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    a5941ce View commit details
    Browse the repository at this point in the history
  30. DOC: add changelog entry for PR#1975

    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    8e0463b View commit details
    Browse the repository at this point in the history
  31. DOC: azure collector: document minimum azure version

    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    9754252 View commit details
    Browse the repository at this point in the history
  32. DOC: Document the licenses of all the files

    This commit adds a license header or a license file to most of the
    files, or documents the license in the .reuse/dep5 license file.
    
    Some of the process was automated, first by listing all the files that
    are not reuse lint compliant:
    > reuse lint > ../reuse.lst
    This list was then modified to remove metainformation and only list
    filenames. Also a couple of filenames that need manual modification were
    removed.
    
    Then using git and reuse:
    > for file in `cat ../reuse.lst`; do year=`git log --reverse --pretty="format:%ai" $file | head -1 | cut -d "-" -f 1`;  author=`git log --reverse --pretty="format:%an" $file|head -1`; reuse addheader --copyright="$author" --year="$year" --license="AGPL-3.0-or-later" --skip-unrecognised $file; done
    
    Then the same process was repeated for files reuse does not recognize,
    like csv and json files or REQUIREMENTS.txt files.
    Birger Schacht authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    c7f9cd2 View commit details
    Browse the repository at this point in the history
  33. Configuration menu
    Copy the full SHA
    1443ebf View commit details
    Browse the repository at this point in the history
  34. Configuration menu
    Copy the full SHA
    30eeb56 View commit details
    Browse the repository at this point in the history
  35. DOC: eventdb: describe the events table itself

    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    9b05242 View commit details
    Browse the repository at this point in the history
  36. harm: rename compromised to system-compromise

    match with RSIT in the taxonomy intrusions:
    compromised -> system-compromise
    unauthorized-command -> system-compromise
    unauthorized-login -> system-compromise
     adapt bots depending on the name
    add changelog and news entries, including SQL update statements
    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    8c21411 View commit details
    Browse the repository at this point in the history
  37. harm: remove type defacement

    merged into information-content-security > unauthorised-information-modification
    
    adapt bots depending on the name
    add changelog and news entries, including SQL update statements
    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    f64c422 View commit details
    Browse the repository at this point in the history
  38. Configuration menu
    Copy the full SHA
    75acec4 View commit details
    Browse the repository at this point in the history
  39. DEP: rmeove deprecated ripencc expert

    was renamed and marked as deprecated in 2.0.0.beta1
    #1404
    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    37acdea View commit details
    Browse the repository at this point in the history
  40. DEP: modify expert: remove compat with old format

    Compatibility with the deprecated configuration format (before 1.0.0.dev7) was removed.
    #1404
    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    2ca649e View commit details
    Browse the repository at this point in the history
  41. DEP: remove deprecated database update scripts

    The deprecated shell scripts
    - `update-asn-data`
    - `update-geoip-data`
    - `update-tor-nodes`
    - `update-rfiprisk-data`
    have been removed in favor of the built-in update-mechanisms (see the bots' documentation). A crontab file for calling all new update command can be found in `contrib/cron-jobs/intelmq-update-database`.
    
    #1404
    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    ac2ae40 View commit details
    Browse the repository at this point in the history
  42. DOC: n6: add more illustrations

    add two n6 images directly to the repository, as they are not displayed
    on readthedocs otherwise: The other websites hosting the images block
    loading images if the referer does not match a whitelist. we can't add a
    noreferer HTML attribute in rst as well. the option left is to add the
    files, that only implies adding the licensing information and the
    AGPL-3.0 license text as well.
    
    add two illustrations on the the flow n6 to intelmq and vice versa, own
    work.
    
    some textual improvements in the document itself.
    Sebastian Wagner authored and waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    b55afef View commit details
    Browse the repository at this point in the history
  43. ENH: Aggregate expert

    The Aggregate Expert might be used to aggregate events within a given
    timespan and threshold.
    
    Signed-off-by: Sebastian Waldbauer <[email protected]>
    waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    94fa7f8 View commit details
    Browse the repository at this point in the history
  44. [ENH] Using msgpack instead of json

    Using msgpack instead of json results in faster (de)serialize and
    less memory usage. Redis is also capable of msgpack within its lua
    api i.e. https://github.com/kengonakajima/lua-msgpack-native.
    
    ====== Benchmark =======
    JSON median size: 387
    MSGPACK median size: 329
    ------------------------
    Diff: 16.20%
    
    JSON
    * Serialize: 39286
    * Deserialize: 30713
    
    MSGPACK
    * Serialize: 23483
    * Deserialize: 12602
    ---------------------
    DIFF
    * Serialize: 50.35%
    * Deserialize: 83.62%
    
    Data extracted from spamhaus-collector
    Measurements based on deduplicator-expert
    460 events in total process by deducplicator-expert
    
    Signed-off-by: Sebastian Waldbauer <[email protected]>
    waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    e97db41 View commit details
    Browse the repository at this point in the history
  45. FIX: More generic way of using different (de)-serializers

    Signed-off-by: Sebastian Waldbauer <[email protected]>
    waldbauer-certat committed Jul 15, 2022
    Configuration menu
    Copy the full SHA
    1253c3e View commit details
    Browse the repository at this point in the history