(Python) Validate code components extraction #20

betogaona7 · 2023-08-10T20:54:59Z

For python documents, we can validate (or replace) GPT's code components extraction by using the ast library: Ex.

def extract_classes_and_functions(source_code):

    parsed_tree = ast.parse(source_code)

    classes = []
    functions = []

    for node in ast.walk(parsed_tree):
        if isinstance(node, ast.ClassDef):
            classes.append(node)
        elif isinstance(node, ast.FunctionDef):
            functions.append(node)

    return classes, functions

The text was updated successfully, but these errors were encountered:

betogaona7 · 2023-08-10T21:04:21Z

This is language-dependent, so maybe could work as part of a set of functions for validation depending in the user's input, and not as an optimization in the pipeline. Right now, the extraction is language-agnostic limited only by Langchain splitter's supported programming languages: https://python.langchain.com/docs/modules/data_connection/document_transformers/text_splitters/code_splitter

betogaona7 added enhancement New feature or request research labels Aug 10, 2023

betogaona7 self-assigned this Aug 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(Python) Validate code components extraction #20

(Python) Validate code components extraction #20

betogaona7 commented Aug 10, 2023 •

edited

Loading

betogaona7 commented Aug 10, 2023 •

edited

Loading

(Python) Validate code components extraction #20

(Python) Validate code components extraction #20

Comments

betogaona7 commented Aug 10, 2023 • edited Loading

betogaona7 commented Aug 10, 2023 • edited Loading

betogaona7 commented Aug 10, 2023 •

edited

Loading

betogaona7 commented Aug 10, 2023 •

edited

Loading