vllm.distributed.kv_transfer.kv_connector.v1.hf3fs.utils.common ¶
AtomicCounter ¶
Thread-safe atomic counter for round-robin operations.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/hf3fs/utils/common.py
HF3FSConnectorMetadata ¶
Bases: KVConnectorMetadata
Container for HF3FS connector metadata.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/hf3fs/utils/common.py
add_request ¶
add_request(request_metadata: HF3FSRequestMetadata) -> None
HF3FSRequestMetadata dataclass ¶
Metadata for a single request in HF3FS connector.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/hf3fs/utils/common.py
from_scheduling_state staticmethod ¶
from_scheduling_state(
state: RequestSchedulingState,
block_size: int,
load_op: LoadBlockInfo | None = None,
skip_leading_blocks: int | None = None,
) -> Optional[HF3FSRequestMetadata]
Create request metadata from scheduling state.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/hf3fs/utils/common.py
LoadBlockInfo dataclass ¶
Operation for loading blocks from external storage.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/hf3fs/utils/common.py
RequestSchedulingState dataclass ¶
Unified request scheduling state management.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/hf3fs/utils/common.py
_normalize_block_ids ¶
Normalize block_ids to list format.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/hf3fs/utils/common.py
update_tokens_and_blocks ¶
Update with new tokens and blocks.