AlphaSampler

is an advanced text generation sampling system that dynamically optimizes sampling parameters based on entropy and attention patterns. It provides more controlled and context-aware text generation by adaptively adjusting sampling parameters during the generation process.

Features

Dynamic Parameter Optimization: Automatically adjusts temperature, top-k, top-p, repetition penalty, and candidate size based on context
Entropy-Based Analysis: Utilizes both token distribution and attention pattern entropy
Moving Average Window: Maintains stability through alpha value smoothing
Comprehensive Error Handling: Robust error checking and logging throughout the pipeline
Compatible with Hugging Face Models: Works with AutoModelForCausalLM models

Requirements

torch
transformers
logging
typing

Installation

Clone the repository
Install dependencies:

pip install torch transformers

Quick Start

from transformers import AutoModelForCausalLM, AutoTokenizer
from alpha import generate_response

# Initialize model and tokenizer
model_name = "Qwen/Qwen2.5-0.5B-Instruct"
model = AutoModelForCausalLM.from_pretrained(model_name, output_attentions=True)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Generate text
prompt = "Your prompt here"
response = generate_response(model, tokenizer, prompt)
print(response)

Core Components

SamplerConfig

Configuration class that defines:

Base parameters for sampling
Parameter bounds
Optimization settings
Smoothing factors

cfg = SamplerConfig()
cfg.base_temp = 0.2
cfg.base_top_p = 0.80
# ... other configurations

AdaptiveEntropixSampler

Main sampling class with the following key methods:

calculate_alpha: Computes the adaptive scaling factor using:
- Token distribution entropy
- Token distribution varentropy
- Attention pattern entropy
- Attention pattern varentropy
find_optimal_parameters: Optimizes sampling parameters based on:
- Current alpha value
- Parameter bounds
- Gradient-based optimization
sample: Main sampling method that:
- Applies all optimized parameters
- Handles repetition penalty
- Generates next token probabilities

Parameter Ranges

Parameter	Min	Max	Description
Temperature	0.1	1.0	Controls randomness
Top-K	5	100	Number of top tokens to consider
Top-P	0.1	1.0	Cumulative probability threshold
Repetition Penalty	1.1	1.5	Penalty for repeated tokens
Candidate Size	1	100	Final pool of candidates

Generation Flow

flowchart TD
    Start[User Prompt] --> Tokenize[Tokenize Input]
    Tokenize --> Loop[Token Generation Loop]
    
    subgraph Loop[Token Generation Loop]
        direction TB
        subgraph EntropyCalc[Alpha Calculation]
            E1[Calculate Token Distribution Entropy & Varentropy]
            E2[Calculate Attention Pattern Entropy & Varentropy]
            E1 & E2 --> Alpha[Compute Combined Alpha]
        end

        subgraph Optimization[Parameter Optimization]
            direction TB
            Alpha --> Params[Optimize Parameters]
            Params --> P1[Temperature: 0.1-1.0]
            Params --> P2[Top K: 5-100]
            Params --> P3[Top P: 0.1-1.0]
            Params --> P4[Rep Penalty: 1.1-1.5]
            Params --> P5[Candidate Size: 1-100]
        end

        EntropyCalc --> Optimization
        Optimization --> Sample[Sample Token from Distribution]
        Sample --> RepPenalty[Apply Repetition Penalty]
        RepPenalty --> Generate[Generate Next Token]
        Generate --> |Next Token| EntropyCalc
    end

    Generate --> |EOS Token| End[Complete Response]

Error Handling

The system includes comprehensive error handling:

Input validation
Tensor shape verification
Parameter bound checking
Logging at multiple levels
Graceful fallbacks

Logging

The system uses Python's built-in logging module with configurable levels:

logging.basicConfig(level=logging.INFO,
                   format='%(asctime)s - %(levelname)s - %(message)s')

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
docs		docs
alpha.py		alpha.py
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AlphaSampler

Features

Requirements

Installation

Quick Start

Core Components

SamplerConfig

AdaptiveEntropixSampler

Parameter Ranges

Generation Flow

Error Handling

Logging

About

Releases

Packages

Languages

EdwardDali/alpha

Folders and files

Latest commit

History

Repository files navigation

AlphaSampler

Features

Requirements

Installation

Quick Start

Core Components

SamplerConfig

AdaptiveEntropixSampler

Parameter Ranges

Generation Flow

Error Handling

Logging

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages