Recursive retriever
EmbeddedTablesUnstructuredRetrieverPack #
Bases: BaseLlamaPack
Embedded Tables + Unstructured.io Retriever pack.
Use unstructured.io to parse out embedded tables from an HTML document, build a node graph, and then run our recursive retriever against that.
NOTE: must take in a single HTML file.
Source code in llama_index/packs/recursive_retriever/embedded_tables_unstructured/base.py
16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 |
|
get_modules #
get_modules() -> Dict[str, Any]
Get modules.
Source code in llama_index/packs/recursive_retriever/embedded_tables_unstructured/base.py
59 60 61 62 63 64 65 |
|
run #
run(*args: Any, **kwargs: Any) -> Any
Run the pipeline.
Source code in llama_index/packs/recursive_retriever/embedded_tables_unstructured/base.py
67 68 69 |
|
RecursiveRetrieverSmallToBigPack #
Bases: BaseLlamaPack
Small-to-big retrieval (with recursive retriever).
Given input documents, and an initial set of "parent" chunks, subdivide each chunk further into "child" chunks. Link each child chunk to its parent chunk, and index the child chunks.
Source code in llama_index/packs/recursive_retriever/small_to_big/base.py
15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 |
|
get_modules #
get_modules() -> Dict[str, Any]
Get modules.
Source code in llama_index/packs/recursive_retriever/small_to_big/base.py
76 77 78 79 80 81 82 83 |
|
run #
run(*args: Any, **kwargs: Any) -> Any
Run the pipeline.
Source code in llama_index/packs/recursive_retriever/small_to_big/base.py
85 86 87 |
|