@cdklabs/generative-ai-cdk-constructs • Docs

@cdklabs/generative-ai-cdk-constructs / bedrock / ChunkingStrategy

Class: `abstract` ChunkingStrategy

Properties

configuration

abstract configuration: ChunkingConfigurationProperty

The CloudFormation property representation of this configuration

DEFAULT

readonly static DEFAULT: ChunkingStrategy

Fixed Sized Chunking with the default chunk size of 300 tokens and 20% overlap.

FIXED_SIZE

readonly static FIXED_SIZE: ChunkingStrategy

Fixed Sized Chunking with the default chunk size of 300 tokens and 20% overlap. You can adjust these values based on your specific requirements using the ChunkingStrategy.fixedSize(params) method.

HIERARCHICAL_COHERE

readonly static HIERARCHICAL_COHERE: ChunkingStrategy

Hierarchical Chunking with the default for Cohere Models.

Overlap tokens: 30
Max parent token size: 500
Max child token size: 100

HIERARCHICAL_TITAN

readonly static HIERARCHICAL_TITAN: ChunkingStrategy

Hierarchical Chunking with the default for Titan Models.

Overlap tokens: 60
Max parent token size: 1500
Max child token size: 300

NONE

readonly static NONE: ChunkingStrategy

Amazon Bedrock treats each file as one chunk. Suitable for documents that are already pre-processed or text split.

SEMANTIC

readonly static SEMANTIC: ChunkingStrategy

Semantic Chunking with the default of bufferSize: 0, breakpointPercentileThreshold: 95, and maxTokens: 300. You can adjust these values based on your specific requirements using the ChunkingStrategy.semantic(params) method.

Methods

fixedSize()

static fixedSize(props): ChunkingStrategy

Method for customizing a fixed sized chunking strategy.

Parameters

• props: FixedSizeChunkingConfigurationProperty

Returns

ChunkingStrategy

hierarchical()

static hierarchical(props): ChunkingStrategy

Method for customizing a hierarchical chunking strategy. For custom chunking, the maximum token chunk size depends on the model.

Amazon Titan Text Embeddings: 8192
Cohere Embed models: 512

Parameters

• props: HierarchicalChunkingProps

Returns

ChunkingStrategy

semantic()

static semantic(props): ChunkingStrategy

Method for customizing a semantic chunking strategy. For custom chunking, the maximum token chunk size depends on the model.

Amazon Titan Text Embeddings: 8192
Cohere Embed models: 512

Parameters

• props: SemanticChunkingConfigurationProperty

Returns

ChunkingStrategy

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ChunkingStrategy.md

ChunkingStrategy.md

Class: `abstract` ChunkingStrategy

Properties

configuration

DEFAULT

FIXED_SIZE

HIERARCHICAL_COHERE

HIERARCHICAL_TITAN

NONE

SEMANTIC

Methods

fixedSize()

Parameters

Returns

hierarchical()

Parameters

Returns

semantic()

Parameters

Returns

Files

ChunkingStrategy.md

Latest commit

History

ChunkingStrategy.md

File metadata and controls

Class: abstract ChunkingStrategy

Properties

configuration

DEFAULT

FIXED_SIZE

HIERARCHICAL_COHERE

HIERARCHICAL_TITAN

NONE

SEMANTIC

Methods

fixedSize()

Parameters

Returns

hierarchical()

Parameters

Returns

semantic()

Parameters

Returns

Class: `abstract` ChunkingStrategy