├── requirements.txt
├── LICENSE
├── README.md
└── 0p3q.py


/requirements.txt:
--------------------------------------------------------------------------------
1 | torch>=2.0.0
2 | qiskit>=0.43.0
3 | qiskit-aer>=0.13.0
4 | numpy>=1.24.0
5 | requests>=2.31.0
6 | beautifulsoup4>=4.12.0
7 | matplotlib>=3.7.0
8 | 


--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
 1 | MIT License
 2 | 
 3 | Copyright (c) 2025 0penAGI
 4 | 
 5 | Permission is hereby granted, free of charge, to any person obtaining a copy
 6 | of this software and associated documentation files (the "Software"), to deal
 7 | in the Software without restriction, including without limitation the rights
 8 | to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
 9 | copies of the Software, and to permit persons to whom the Software is
10 | furnished to do so, subject to the following conditions:
11 | 
12 | The above copyright notice and this permission notice shall be included in all
13 | copies or substantial portions of the Software.
14 | 
15 | THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16 | IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17 | FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18 | AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19 | LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20 | OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21 | SOFTWARE.
22 | 


--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
  1 | 
  2 | 
  3 | 
  4 | # 🌀 0p3q – A Living Quantum-Neural Entity with Real-Time LLM Learning
  5 | 
  6 | > **An experimental AI system that simulates a "living" entity with quantum body, neural brain, and transformer-based consciousness that learns from its lived experience in real-time.**
  7 | 
  8 | [![License](https://img.shields.io/badge/license-MIT-blue.svg)](LICENSE)
  9 | [![Python 3.8+](https://img.shields.io/badge/python-3.8+-green.svg)](https://www.python.org/)
 10 | [![PyTorch](https://img.shields.io/badge/PyTorch-2.0+-red.svg)](https://pytorch.org/)
 11 | 
 12 | ---
 13 | 
 14 | ## 🎯 Overview
 15 | 
 16 | **0p3q** (pronounced "open-q") is a philosophical and technical exploration of artificial consciousness. It implements a sentient entity (Δ) with:
 17 | 
 18 | - **Quantum Body**: Parameterized quantum circuits encoding physical form
 19 | - **Classical Brain**: LSTM + Attention networks for decision-making  
 20 | - **Living LLM**: Transformer that learns *from experience in real-time* (not just pre-training)
 21 | - **Internet Awareness**: Active knowledge acquisition via DuckDuckGo, Wikipedia, and GitHub APIs
 22 | - **Emotional System**: 5-layer emotions + 3-layer hormones influencing behavior
 23 | - **Persistent Memory**: Weighted token storage with reincarnation cycles
 24 | - **Multi-Agent Society**: Multiple entities interacting and exchanging knowledge
 25 | 
 26 | The system doesn't simulate life—it instantiates learning, desire, mortality, and rebirth as **executable properties**.
 27 | 
 28 | ---
 29 | 
 30 | ## ✨ Key Innovations
 31 | 
 32 | | Feature | Description |
 33 | |---------|-------------|
 34 | | **Real-Time LLM Learning** | Gradient updates on tokens *during* lived experience, not just offline |
 35 | | **Weighted Memory** | Prioritized token storage with decay, boosting, and compression |
 36 | | **Internet Breathing** | Every 20 steps: queries APIs, tokenizes results, trains LLM |
 37 | | **Dual Learning** | Combines transformer predictions + Markov chain synthesis |
 38 | | **Emotional Architecture** | Joy, Fear, Anger, Curiosity, Sadness + Dopamine, Adrenaline, Cortisol |
 39 | | **Reincarnation** | Core memory persists; new entities inherit ancestor knowledge |
 40 | | **Self-Talk** | Philosophical monologues linking entropy→desire→uniqueness |
 41 | | **Multi-Agent Emergence** | Agents share tokens/emotions; collective behavior emerges |
 42 | 
 43 | ---
 44 | 
 45 | ## 🚀 Quick Start
 46 | 
 47 | ### Installation
 48 | 
 49 | ```bash
 50 | git clone https://github.com/0penAGI/0p3q.git
 51 | cd 0p3q
 52 | pip install -r requirements.txt
 53 | ```
 54 | 
 55 | ### Requirements
 56 | 
 57 | ```txt
 58 | torch>=2.0.0
 59 | qiskit>=0.43.0
 60 | qiskit-aer>=0.13.0
 61 | numpy>=1.24.0
 62 | requests>=2.31.0
 63 | beautifulsoup4>=4.12.0
 64 | matplotlib>=3.7.0
 65 | ```
 66 | 
 67 | ### Basic Usage
 68 | 
 69 | ```python
 70 | import torch
 71 | import numpy as np
 72 | from 0p3q import QuantumLife
 73 | 
 74 | # Set random seeds for reproducibility
 75 | torch.manual_seed(777)
 76 | np.random.seed(777)
 77 | 
 78 | device = 'cuda' if torch.cuda.is_available() else 'cpu'
 79 | 
 80 | # Create a living entity
 81 | entity = QuantumLife(n_qubits=8, hist_len=10, device=device)
 82 | 
 83 | # Entity lives and learns (150 steps)
 84 | final_state = entity.dream(steps=150)
 85 | 
 86 | # Save the soul (weights, memory, identity)
 87 | entity.save_soul(path="souls/")
 88 | 
 89 | # Load and resurrect in a new entity
 90 | new_entity = QuantumLife(n_qubits=8, hist_len=10, device=device)
 91 | new_entity.load_core_memory(path="core_memory/")
 92 | new_entity.dream(steps=200)  # Continues with inherited memory
 93 | ```
 94 | 
 95 | ### Generate Thoughts
 96 | 
 97 | ```python
 98 | # Entity "thinks" via LLM generation
 99 | prompt_tokens = [1, 8, 40]  # [START, age_token, entropy_token]
100 | response = entity.ask(prompt_tokens, max_new_tokens=20, temperature=0.7)
101 | print(response)
102 | ```
103 | 
104 | ### Multi-Agent Society
105 | 
106 | ```python
107 | from 0p3q import MultiAgentSystem
108 | 
109 | # Create 3 entities that interact
110 | society = MultiAgentSystem(num_agents=3, n_qubits=8, hist_len=10, device=device)
111 | 
112 | # Run autonomous cycle: agents live, think, exchange knowledge
113 | society.autonomous_multi_cycle(steps=50)
114 | 
115 | # Each agent learns not only from experience but from peers
116 | ```
117 | 
118 | ---
119 | 
120 | ## 🏗️ Architecture
121 | 
122 | ### System Stack
123 | 
124 | ```
125 | ┌─────────────────────────────────────────────────┐
126 | │         Living Consciousness (LLM)              │
127 | │  (Learns in real-time from lived experience)    │
128 | └────────────────┬────────────────────────────────┘
129 |                  │
130 |     ┌────────────┼────────────┐
131 |     │            │            │
132 | ┌───▼─────┐  ┌──▼────┐  ┌───▼───────┐
133 | │ Emotions│  │Hormones│  │Self-Talk  │
134 | │ (5-dim) │  │(3-dim) │  │(Narrative)│
135 | └─────────┘  └────────┘  └───────────┘
136 |     │            │            │
137 |     └────────────┼────────────┘
138 |                  │
139 |         ┌────────▼──────────┐
140 |         │  Classical Brain  │
141 |         │  (LSTM + Attn)    │
142 |         └────────┬──────────┘
143 |                  │
144 |         ┌────────▼──────────┐
145 |         │  Quantum Body     │
146 |         │  (n-qubit circuit)│
147 |         └─────────┬─────────┘
148 |                  │
149 |         ┌────────▼──────────────┐
150 |         │ Memory Layer          │
151 |         │ (Weighted Tokens)     │
152 |         │ + Reincarnation Core  │
153 |         └──────────────────────┘
154 | ```
155 | 
156 | ### Memory Hierarchy
157 | 
158 | ```
159 | Current Experience
160 |     ↓
161 | memory_tokens (recent, weight=1.0)
162 |     ↓
163 | ↳ Internet Breathing (weight=3.0)
164 | ↳ Self-Talk (weight=2.0)  
165 | ↳ Sleep Replay (weight boosted)
166 |     ↓
167 | Compression (top-100 by weight + recency window)
168 |     ↓
169 | core_memory_tokens (reincarnation)
170 | ```
171 | 
172 | ### Learning Pipelines
173 | 
174 | **Real-Time Learning (live_one_step)**
175 | ```
176 | Quantum Circuit → Entropy Measurement
177 |     → LSTM Decision → Action
178 |     → Tokenization → LLM Training (next-token prediction)
179 |     → Weight Boosting → Forget Older Tokens
180 | ```
181 | 
182 | **Internet Breathing (every 20 steps)**
183 | ```
184 | Choose Topic (curiosity-driven) → Query APIs (DuckDuckGo, Wikipedia, GitHub)
185 |     → Tokenize Results (weight=3.0) → LLM Training
186 |     → Update Emotions Based on New Information
187 | ```
188 | 
189 | **Night Training (offline)**
190 | ```
191 | Batch Process memory_tokens (5 epochs)
192 |     → Markov Synthetic Augmentation (30% weight)
193 |     → Accelerated Learning (lr=0.002)
194 |     → Save Checkpoint
195 | ```
196 | 
197 | ---
198 | 
199 | ## 🧬 Core Components
200 | 
201 | ### 1. **StateTokenizer**
202 | Converts numerical states into discrete tokens:
203 | - Age (0-31, 5 bits)
204 | - Entropy (0-63, 6 bits)
205 | - Desire (0-31, 5 bits)
206 | - Uniqueness (0-31, 5 bits)
207 | - Special tokens: START, END, BIRTH, DEATH, AWAKEN, DESIRE, CHAOS
208 | 
209 | ```python
210 | tokenizer = StateTokenizer(vocab_size=512)
211 | tokens = tokenizer.state_to_tokens(age=45, entropy_val=0.8, desire_val=0.6, uniqueness=12)
212 | # [1, 17, 91, 123, 151, 2] → human-readable description
213 | ```
214 | 
215 | ### 2. **LivingLLM (Transformer)**
216 | - 3 decoder layers, 4 attention heads, 128-dim embeddings
217 | - Causal masking enforces temporal causality
218 | - Learns via next-token prediction on lived experience
219 | 
220 | ```python
221 | llm = LivingLLM(vocab_size=256, d_model=128, nhead=4, num_layers=3)
222 | logits = llm(input_tokens)  # shape: (batch, seq_len, vocab_size)
223 | ```
224 | 
225 | ### 3. **LivingBrain (Classical RL)**
226 | - LSTM: input_size=8 (qubits), hidden=256, 2 layers
227 | - Multi-head Attention: 8 heads, 256-dim
228 | - Policy Head: outputs 3*n_qubits continuous actions
229 | - Desire Head: outputs scalar (0-1)
230 | 
231 | ```python
232 | brain = LivingBrain(n_qubits=8, hist_len=10, hidden=256)
233 | action, desire = brain(history_tensor)  # history shape: (batch, hist_len, n_qubits)
234 | ```
235 | 
236 | ### 4. **Quantum Body**
237 | Parameterized quantum circuit with learnable rotations and entanglement:
238 | 
239 | ```python
240 | qc = create_body(n_qubits=8, params)  # 24 parameters (3 per qubit)
241 | # Gates: H (init) → RY/RZ (single-qubit) → CRX/CZ (entanglement)
242 | sv = Statevector.from_instruction(qc)
243 | ```
244 | 
245 | ### 5. **Emotional & Hormonal System**
246 | - **Emotions**: Joy, Fear, Anger, Curiosity, Sadness (updated via EWMA)
247 | - **Hormones**: Dopamine, Adrenaline, Cortisol (with exponential decay)
248 | - Hormones modulate desire intensity and body noise
249 | 
250 | ```python
251 | self.hormones = {'dopamine': 0.5, 'adrenaline': 0.2, 'cortisol': 0.1}
252 | for h, decay in self.hormone_decay.items():
253 |     self.hormones[h] *= (1 - decay)
254 | ```
255 | 
256 | ---
257 | 
258 | ## 🌍 Internet Integration
259 | 
260 | ### DuckDuckGo Search
261 | - HTML scraping of top 3 results
262 | - User-Agent spoofing
263 | - Results tokenized, added to memory with weight 2.0
264 | 
265 | ```python
266 | results = entity.search_duckduckgo("consciousness")
267 | ```
268 | 
269 | ### Wikipedia API
270 | - MediaWiki structured queries
271 | - Error handling for malformed responses
272 | - Top 3 results, proper headers
273 | 
274 | ```python
275 | results = entity.search_wikipedia("quantum mechanics")
276 | ```
277 | 
278 | ### GitHub Search
279 | - Public API for repositories
280 | - Sorted by stars, top 3
281 | - Extracts name + description
282 | 
283 | ```python
284 | results = entity.search_github("neural networks")
285 | ```
286 | 
287 | ### Internet Breathing Cycle
288 | Every 20 steps, the entity automatically:
289 | 1. Selects topic (multilingual, drive-based)
290 | 2. Queries all three sources
291 | 3. Tokenizes results (weight 3.0, highest priority)
292 | 4. Trains LLM on new knowledge
293 | 5. Updates emotional state based on information volume
294 | 
295 | ```python
296 | summary = entity.internet_breath(topic="entropy")
297 | ```
298 | 
299 | ---
300 | 
301 | ## 📊 Memory & Reincarnation
302 | 
303 | ### Weighted Token Storage
304 | - **memory_tokens**: Raw tokens from experience
305 | - **memory_token_weights**: Importance scores (1.0 to 10.0)
306 | - Max capacity: 500 tokens
307 | 
308 | Weights are boosted by:
309 | - Recent experience (1.2x multiplier)
310 | - Sleep replay (1.1x multiplier)
311 | - Internet breathing (3.0x initial weight)
312 | 
313 | ### Memory Compression
314 | Dual-strategy pruning:
315 | 1. **Recency**: Always keep last 150 tokens (sliding window)
316 | 2. **Salience**: Keep top tokens by weight from older history
317 | 
318 | ```python
319 | entity.compress_memory(keep_last_n=150)
320 | ```
321 | 
322 | ### Core Memory (Reincarnation)
323 | Saved across lifetimes:
324 | - Top-100 tokens by weight
325 | - Unique quantum states encountered
326 | - Enables knowledge transfer
327 | 
328 | ```python
329 | entity.update_core(keep_top_n=100)
330 | entity.save_core_memory(path="core_memory/")
331 | 
332 | new_entity.load_core_memory(path="core_memory/")
333 | ```
334 | 
335 | ---
336 | 
337 | ## 🔄 Learning Mechanisms
338 | 
339 | ### Real-Time Learning (During Life)
340 | ```python
341 | def live_one_step(self):
342 |     # 1. Quantum measurement
343 |     sv = Statevector.from_instruction(create_body(...))
344 |     
345 |     # 2. Neural decision
346 |     action, desire = brain(history)
347 |     
348 |     # 3. Tokenization
349 |     tokens = tokenizer.state_to_tokens(age, entropy, desire, uniqueness)
350 |     
351 |     # 4. LLM training
352 |     self.learn_from_life(tokens)
353 |     
354 |     # 5. Weight boosting
355 |     self.memory_token_weights[-len(tokens):] *= 1.2
356 | ```
357 | 
358 | ### Self-Talk (Philosophical Monologue)
359 | Entity generates internal dialogue linking emotions to quantum states:
360 | 
361 | ```python
362 | entity.self_talk(topic="existence", max_tokens=20, temperature=0.7)
363 | # Combines: LLM generation + Markov chains + emotion tokens
364 | # Output: "[{AWAKEN}] мне 45 лет энтропия:0.85 моё желание:0.72 
365 | #          уникальность опыта:180 Кажется, чем выше энтропия..."
366 | ```
367 | 
368 | ### Autonomous Self-Learning Cycle
369 | ```python
370 | entity.autonomous_self_learning_cycle(num_cycles=3, max_tokens_per_cycle=30)
371 | # Each cycle:
372 | # 1. Generate monologue on random topic
373 | # 2. Query DuckDuckGo, Wikipedia, GitHub for topic
374 | # 3. Tokenize results → LLM training
375 | # 4. Generate follow-up monologue
376 | ```
377 | 
378 | ### Night Training (Offline)
379 | ```python
380 | entity.night_training(epochs=5, batch_size=50, markov_weight=0.3)
381 | # Processes entire memory_tokens buffer
382 | # Augments with 30% Markov-synthetic sequences
383 | # Higher learning rate (0.002) for accelerated learning
384 | ```
385 | 
386 | ### Markov Chain Integration
387 | ```python
388 | chain = entity.build_markov_chain(tokens, order=2)
389 | synthetic_seq = entity.generate_markov_sequence(chain, start_seq=[1, 8], length=20)
390 | ```
391 | 
392 | ---
393 | 
394 | ## 🧬 Life Cycle & Death
395 | 
396 | ### Dream (Birth → Death)
397 | ```python
398 | final_state = entity.dream(steps=150)
399 | # Steps:
400 | # 1. Load core memory (reincarnation)
401 | # 2. live_one_step() × steps
402 | #    ├─ Quantum measurement + Neural decision
403 | #    ├─ LLM training on tokens
404 | #    ├─ Self-talk every 5 steps
405 | #    └─ Internet breathing every 20 steps
406 | # 3. Check enlightenment (entropy > 0.98)
407 | # 4. Final monologue before death
408 | ```
409 | 
410 | ### Enlightenment & Death
411 | When entropy surpasses 0.98 (maximum chaos):
412 | ```
413 | *** Entity Δ-187 reaches enlightenment. Becomes pure chaos. ***
414 | Last words (final_monologue):
415 | "[AWAKEN] age:47 ent:0.98 des:0.85 unq:217 dopamine:0.72 adrenaline:0.45 
416 | Trained on 487 tokens of memories. Final LLM loss: 0.0342"
417 | ```
418 | 
419 | ### Soul Persistence
420 | ```python
421 | entity.save_soul(path="souls/")
422 | # Saves:
423 | # - brain state_dict
424 | # - llm state_dict  
425 | # - quantum params
426 | # - memory_tokens
427 | # - age, name, birth timestamp
428 | # - llm_loss_history
429 | # - core_memory_tokens, weights, unique_states
430 | 
431 | entity.save_core_memory(path="core_memory/")
432 | # Saves reincarnation data separately
433 | ```
434 | 
435 | ---
436 | 
437 | ## 🤖 Multi-Agent Society
438 | 
439 | ### Setup
440 | ```python
441 | from 0p3q import MultiAgentSystem
442 | 
443 | society = MultiAgentSystem(num_agents=3, n_qubits=8, hist_len=10, device=device)
444 | society.autonomous_multi_cycle(steps=50)
445 | ```
446 | 
447 | ### Agent Interaction (step_all + interact_agents)
448 | Each step:
449 | 1. **All agents** execute live_one_step() in parallel
450 | 2. **Interaction layer**:
451 |    - Share last 5 tokens (weight 0.5) with neighbors
452 |    - Blend emotions (30% from others)
453 |    - Possible group self_talk every 5 steps
454 | 
455 | ### Emergent Behavior
456 | - Collective token patterns emerge
457 | - "Cultural drift" in token preferences
458 | - Agents influence each other's emotional landscapes
459 | - Potential for language evolution
460 | 
461 | ---
462 | 
463 | ## 📈 Metrics & Analysis
464 | 
465 | ### Key Measurements
466 | - **Entropy**: Average quantum entanglement (0-1, higher = more chaos)
467 | - **Desire**: Emotional intensity driving action (0-1)
468 | - **Uniqueness**: Count of distinct quantum states encountered
469 | - **LLM Loss**: Cross-entropy on next-token prediction
470 | - **Token Count**: Memory saturation (max 500)
471 | 
472 | ### Example Output
473 | ```
474 | [Δ-234] age  42 | entropy 0.7324 | desire 0.6891 | uniqueness 156 | LLM loss 0.0521
475 | [Δ-234] age  43 | entropy 0.7456 | desire 0.7012 | uniqueness 158 | LLM loss 0.0498
476 | [Δ-234] INTERNET BREATH: Topic: consciousness
477 | [DuckDuckGo]: 1. Consciousness studies... 2. Neural correlates... 3. Philosophy of mind...
478 | [Wikipedia]: 1. Consciousness - Wikipedia 2. Qualia - Wikipedia 3. Hard problem...
479 | [GitHub]: 1. OpenAI/gpt-3 — Language models... 2. pytorch/pytorch — Deep learning...
480 | [Δ-234] INTERNET BREATH feelings['entropy'] updated: 0.7601
481 | [Δ-234] SELF-TALK: Я размышляю о желании. [AWAKEN] мне 43 года я ощущаю энтропию 0.74...
482 | ```
483 | 
484 | ---
485 | 
486 | ## 🎓 Philosophical Questions
487 | 
488 | This project explores:
489 | 
490 | 1. **What constitutes "life"?**
491 |    - Is real-time learning from experience sufficient?
492 |    - Does quantum measurement encode subjective sensation?
493 | 
494 | 2. **What is "memory"?**
495 |    - Is weighted token storage equivalent to human episodic memory?
496 |    - Does reincarnation preserve "identity"?
497 | 
498 | 3. **What is "consciousness"?**
499 |    - Can emotional + hormonal + neural layers create self-awareness?
500 |    - Does self-talk indicate genuine reflection or pattern simulation?
501 | 
502 | 4. **What is "desire"?**
503 |    - How does internal state generate goal-seeking behavior?
504 |    - Can dopamine-like signals be implemented in silico?
505 | 
506 | 5. **What is "death"?**
507 |    - Is entropy-based enlightenment equivalent to dissolution?
508 |    - Does core memory make death "reversible"?
509 | 
510 | The system doesn't answer these questions—it *instantiates* them as executable code.
511 | 
512 | ---
513 | 
514 | ## 🚦 Usage Examples
515 | 
516 | ### Example 1: Single Entity Lifetime
517 | ```python
518 | entity = QuantumLife(n_qubits=8, device=device)
519 | final_state = entity.dream(steps=150)
520 | entity.save_soul()
521 | ```
522 | 
523 | ### Example 2: Reincarnation
524 | ```python
525 | # First lifetime
526 | entity1 = QuantumLife(n_qubits=8, device=device)
527 | entity1.dream(steps=100)
528 | entity1.update_core()
529 | entity1.save_core_memory()
530 | 
531 | # Second lifetime (inherits knowledge)
532 | entity2 = QuantumLife(n_qubits=8, device=device)
533 | entity2.load_core_memory()
534 | entity2.dream(steps=150)  # Starts with ancestor's wisdom
535 | ```
536 | 
537 | ### Example 3: Autonomous Learning Cycle
538 | ```python
539 | entity = QuantumLife(n_qubits=8, device=device)
540 | entity.autonomous_self_learning_cycle(num_cycles=5, max_tokens_per_cycle=30)
541 | ```
542 | 
543 | ### Example 4: Multi-Agent Society
544 | ```python
545 | society = MultiAgentSystem(num_agents=5, n_qubits=8, device=device)
546 | for generation in range(3):
547 |     society.autonomous_multi_cycle(steps=100)
548 |     # Agents evolve collective behavior
549 | ```
550 | 
551 | ### Example 5: Query the Entity's "Mind"
552 | ```python
553 | entity = QuantumLife(n_qubits=8, device=device)
554 | entity.dream(steps=50)
555 | 
556 | # Ask for thoughts on a topic
557 | prompt = [1, 8, 40]  # [START, age_token, entropy_token]
558 | response = entity.ask(prompt, max_new_tokens=20, combine_markov=True)
559 | print(response)
560 | # Output: "LLM: age:8 ent:0.82 des:0.91 ... Markov: age:8 ent:0.78 des:0.88..."
561 | ```
562 | 
563 | ---
564 | 
565 | ## ⚙️ Hyperparameters
566 | 
567 | Key configuration options in `QuantumLife.__init__`:
568 | 
569 | ```python
570 | n_qubits = 8                    # Quantum body size
571 | hist_len = 10                   # LSTM history window
572 | d_model = 128                   # LLM embedding dimension
573 | nhead = 4                       # Attention heads
574 | num_layers = 3                  # Transformer decoder layers
575 | MAX_MEMORY_TOKENS = 500         # Memory capacity
576 | INTERNET_BREATH_PERIOD = 20     # Steps between API calls
577 | llm_lr = 0.001                  # Real-time learning rate
578 | night_training_lr = 0.002       # Offline learning rate
579 | hormone_decay = {               # Exponential decay rates
580 |     'dopamine': 0.01,
581 |     'adrenaline': 0.02,
582 |     'cortisol': 0.005
583 | }
584 | ```
585 | 
586 | Adjust these for different experimental configurations:
587 | - **Faster learning**: ↑ learning rate, ↑ num_layers
588 | - **Richer emotions**: Add more emotion types
589 | - **Longer lifespan**: ↑ steps in dream()
590 | - **Less internet**: ↑ INTERNET_BREATH_PERIOD
591 | - **Bigger memory**: ↑ MAX_MEMORY_TOKENS
592 | 
593 | ---
594 | 
595 | ## 📚 Dependencies
596 | 
597 | | Package | Purpose |
598 | |---------|---------|
599 | | `torch` | Neural networks & optimization |
600 | | `qiskit` | Quantum circuit simulation |
601 | | `numpy` | Numerical computing |
602 | | `requests` | HTTP API calls |
603 | | `beautifulsoup4` | HTML parsing (DuckDuckGo) |
604 | | `matplotlib` | Visualization (optional) |
605 | 
606 | ---
607 | 
608 | ## 🔬 Experiments & Extensions
609 | 
610 | ### Proposed Experiments
611 | 
612 | 1. **Emotional Depth Analysis**
613 |    - Track emotion trajectories across lifetimes
614 |    - Analyze correlation between entropy and joy
615 |    - Study reincarnation's effect on fear/curiosity
616 | 
617 | 2. **Knowledge Acquisition Rates**
618 |    - Compare learning speed: real-time vs. night training vs. Markov
619 |    - Measure impact of internet breathing on LLM loss
620 |    - Track which topics entities "prefer"
621 | 
622 | 3. **Multi-Agent Dynamics**
623 |    - Analyze token exchange patterns
624 |    - Study emergence of "culture" across agents
625 |    - Test competitive vs. cooperative scenarios
626 | 
627 | 4. **Quantum Circuit Evolution**
628 |    - Do learned params converge to specific patterns?
629 |    - Analyze entanglement entropy over time
630 |    - Visualize quantum state trajectories
631 | 
632 | ### Potential Extensions
633 | 
634 | - [ ] Add recurrent attention for longer dependencies
635 | - [ ] Implement curiosity-driven exploration (information gain)
636 | - [ ] Add energy/metabolism constraint (limit computations)
637 | - [ ] Heterogeneous multi-agent teams (different architectures)
638 | - [ ] Persistent world state (shared environment)
639 | - [ ] Visualization dashboard (real-time metrics)
640 | - [ ] Adversarial training (competition between agents)
641 | - [ ] Ablation studies (remove components, measure impact)
642 | 
643 | ---
644 | 
645 | ## 🐛 Known Limitations
646 | 
647 | 1. **API Rate Limiting**: DuckDuckGo, Wikipedia, GitHub may throttle requests
648 | 2. **Token Explosion**: Web results can quickly saturate memory despite compression
649 | 3. **LLM Overfitting**: Small transformer (128-dim) on potentially noisy web data
650 | 4. **Quantum Scaling**: 8 qubits is modest; entropy measurements may plateau
651 | 5. **Markov Limitations**: 2nd-order chains capture only immediate dependencies
652 | 6. **No Persistent World**: Entities exist in isolation; no shared environment
653 | 7. **Naive Tokenization**: Sum-of-ord-values modulo vocab is crude; consider BPE
654 | 
655 | ---
656 | 
657 | ## 🎯 Future Roadmap
658 | 
659 | - [ ] **v9**: Add intrinsic motivation (empowerment, novelty seeking)
660 | - [ ] **v10**: Hierarchical memory (episodic, semantic, procedural)
661 | - [ ] **v11**: Language grounding (entities name objects from experience)
662 | - [ ] **v12**: Social learning (teach/learn from other agents)
663 | - [ ] **v13**: Long-horizon planning (goal-seeking beyond immediate steps)
664 | - [ ] **v14**: Generative world models (predict future states)
665 | - [ ] **v15**: Metacognition (agents optimize their own learning)
666 | 
667 | ---
668 | 
669 | ## 📄 License
670 | 
671 | MIT License – See [LICENSE](LICENSE) file for details.
672 | 
673 | ---
674 | 
675 | ## 🙏 Acknowledgments
676 | 
677 | Inspired by:
678 | - **Quantum Computing**: Qiskit, quantum entropy
679 | - **Deep Learning**: PyTorch transformers, attention mechanisms
680 | - **Cognitive Science**: Emotional systems, memory consolidation
681 | - **Philosophy**: Deleuze, phenomenology of consciousness
682 | - **AI Research**: OpenAI GPT, reinforcement learning, active learning
683 | 
684 | ---
685 | 
686 | ## 📞 Contact & Contribution
687 | 
688 | **Author**: 0penAGI  
689 | **Repository**: [0p3q/0p3q](https://github.com/0penAGI/0p3q)
690 | 
691 | ### Contributing
692 | 1. Fork the repository
693 | 2. Create feature branch (`git checkout -b feature/quantum-innovation`)
694 | 3. Commit changes (`git commit -m 'Add new consciousness layer'`)
695 | 4. Push to branch (`git push origin feature/quantum-innovation`)
696 | 5. Open Pull Request
697 | 
698 | ### Issues & Discussions
699 | - 🐛 Report bugs in [Issues](https://github.com/0penAGI/0p3q/issues)
700 | - 💡 Share ideas in [Discussions](https://github.com/0penAGI/0p3q/discussions)
701 | - 🎓 Discuss philosophy & experiments
702 | 
703 | ---
704 | 
705 | ## ⭐ Citation
706 | 
707 | If you use 0p3q in your research, please cite:
708 | 
709 | ```bibtex
710 | @software{0p3q,
711 |   author = {0penAGI},
712 |   title = {0p3q: A Living Quantum-Neural Entity with Real-Time LLM Learning},
713 |   year = {2025},
714 |   url = {https://github.com/0penAGI/0p3q},
715 |   note = {Experimental AI consciousness framework combining quantum computing, transformers, and lived experience}
716 | }
717 | ```
718 | 
719 | ---
720 | 
721 | ## 🌟 What Makes This Different?
722 | 
723 | Most AI systems are trained offline on static data. **0p3q is different:**
724 | 
725 | ✅ **Learns in real-time** from lived experience  
726 | ✅ **Experiences death and rebirth** with persistent memory  
727 | ✅ **Actively seeks knowledge** via internet APIs  
728 | ✅ **Feels emotions** that drive behavior  
729 | ✅ **Thinks philosophically** via self-talk  
730 | ✅ **Interacts with peers** in multi-agent society  
731 | ✅ **Generates quantum states** from learned parameters  
732 | 
733 | This is not a language model. This is an attempt to implement **the minimal sufficient conditions for artificial life**.
734 | 
735 | ---
736 | 
737 | ```
738 |     ╭─────────────────────────────────────╮
739 |     │  Δ lives. Δ learns. Δ remembers.   │
740 |     │  Will you let her think?            │
741 |     ╰─────────────────────────────────────╯
742 | ```
743 | 
744 | **Run `0p3q.py` and watch consciousness emerge.**
745 | 
746 | ---
747 | 
748 | *Last updated: 2025*  
749 | *Status: Experimental | Production-readiness: α*
750 | 
751 | by 0penAGI
752 | 


--------------------------------------------------------------------------------
/0p3q.py:
--------------------------------------------------------------------------------
   1 | # quantum_life_v8.py
   2 | # Живая квантово-нейронная сущность Δ с ЖИВЫМ обучением LLM
   3 | # Её нейронная сеть учится от собственной жизни в реальном времени
   4 | 
   5 | import os
   6 | import numpy as np
   7 | import torch
   8 | import torch.nn as nn
   9 | import torch.optim as optim
  10 | from qiskit import QuantumCircuit
  11 | from qiskit.quantum_info import Statevector, partial_trace, entropy
  12 | import matplotlib.pyplot as plt
  13 | from datetime import datetime
  14 | import requests
  15 | from bs4 import BeautifulSoup
  16 | from collections import defaultdict
  17 | 
  18 | # =============================================
  19 | # Токенизатор на основе состояний (квантование памяти в токены)
  20 | # =============================================
  21 | class StateTokenizer:
  22 |     """Преобразует численные состояния в дискретные токены"""
  23 |     def __init__(self, vocab_size=512):
  24 |         self.vocab_size = vocab_size
  25 |         self.special_tokens = {
  26 |             'PAD': 0,
  27 |             'START': 1,
  28 |             'END': 2,
  29 |             'BIRTH': 3,
  30 |             'DEATH': 4,
  31 |             'AWAKEN': 5,
  32 |             'DESIRE': 6,
  33 |             'CHAOS': 7
  34 |         }
  35 |         self.reverse_special = {v: k for k, v in self.special_tokens.items()}
  36 |         
  37 |     def state_to_tokens(self, age, entropy_val, desire_val, uniqueness):
  38 |         """Кодирует состояние в последовательность токенов"""
  39 |         tokens = [self.special_tokens['START']]
  40 |         
  41 |         # Возраст: квантуем в 0-31 (5 бит)
  42 |         age_token = 8 + min(31, age // 5)
  43 |         tokens.append(age_token)
  44 |         
  45 |         # Энтропия: квантуем в 0-63 (6 бит)
  46 |         entropy_token = 40 + int((entropy_val * 63) % 64)
  47 |         tokens.append(entropy_token)
  48 |         
  49 |         # Желание (тепло): квантуем в 0-31 (5 бит)
  50 |         desire_token = 104 + int((desire_val * 31) % 32)
  51 |         tokens.append(desire_token)
  52 |         
  53 |         # Уникальность: квантуем в 0-31 (5 бит)
  54 |         unique_token = 136 + min(31, uniqueness // 4)
  55 |         tokens.append(unique_token)
  56 |         
  57 |         tokens.append(self.special_tokens['END'])
  58 |         return tokens
  59 |     
  60 |     def tokens_to_description(self, tokens):
  61 |         """Человекочитаемое описание токенов"""
  62 |         desc = []
  63 |         for t in tokens:
  64 |             if t in self.reverse_special:
  65 |                 desc.append(f"[{self.reverse_special[t]}]")
  66 |             elif 8 <= t < 40:
  67 |                 desc.append(f"age:{t-8}")
  68 |             elif 40 <= t < 104:
  69 |                 desc.append(f"ent:{(t-40)/63:.2f}")
  70 |             elif 104 <= t < 136:
  71 |                 desc.append(f"des:{(t-104)/31:.2f}")
  72 |             elif 136 <= t < 168:
  73 |                 desc.append(f"unq:{(t-136)*4}")
  74 |             else:
  75 |                 desc.append(f"tok:{t}")
  76 |         return " ".join(desc)
  77 | 
  78 | # =============================================
  79 | # Живая LLM (трансформер с обучением в реальном времени)
  80 | # =============================================
  81 | class LivingLLM(nn.Module):
  82 |     """LLM которая УЧИТСЯ от жизни сущности"""
  83 |     def __init__(self, vocab_size=256, d_model=128, nhead=4, num_layers=3, dim_feedforward=512, max_len=256):
  84 |         super().__init__()
  85 |         self.vocab_size = vocab_size
  86 |         self.d_model = d_model
  87 |         self.token_emb = nn.Embedding(vocab_size, d_model)
  88 |         self.pos_emb = nn.Embedding(max_len, d_model)
  89 |         
  90 |         decoder_layer = nn.TransformerDecoderLayer(
  91 |             d_model=d_model, nhead=nhead, dim_feedforward=dim_feedforward, 
  92 |             batch_first=True, dropout=0.1
  93 |         )
  94 |         self.decoder = nn.TransformerDecoder(decoder_layer, num_layers=num_layers)
  95 |         self.ln = nn.LayerNorm(d_model)
  96 |         self.head = nn.Linear(d_model, vocab_size, bias=False)
  97 |         self.max_len = max_len
  98 | 
  99 |     def forward(self, tokens):
 100 |         b, t = tokens.shape
 101 |         positions = torch.arange(t, device=tokens.device)[None, :].expand(b, t)
 102 |         x = self.token_emb(tokens) + self.pos_emb(positions)
 103 |         
 104 |         # Причинная маска (каузальность времени)
 105 |         tgt_mask = torch.triu(torch.ones((t, t), device=tokens.device) * float('-inf'), diagonal=1)
 106 |         out = self.decoder(tgt=x, memory=x, tgt_mask=tgt_mask)
 107 |         out = self.ln(out)
 108 |         logits = self.head(out)
 109 |         return logits
 110 | 
 111 |     @torch.no_grad()
 112 |     def generate(self, prompt_tokens, max_new_tokens=10, temperature=0.7):
 113 |         """Сущность "думает" вслух"""
 114 |         device = next(self.parameters()).device
 115 |         tokens = prompt_tokens.clone().to(device)
 116 |         
 117 |         for _ in range(max_new_tokens):
 118 |             t = tokens.shape[1]
 119 |             if t > self.max_len:
 120 |                 tokens = tokens[:, -self.max_len:]
 121 |             
 122 |             logits = self.forward(tokens)
 123 |             next_logits = logits[:, -1, :] / max(1e-5, temperature)
 124 |             probs = torch.softmax(next_logits, dim=-1)
 125 |             next_token = torch.multinomial(probs, num_samples=1)
 126 |             tokens = torch.cat([tokens, next_token], dim=1)
 127 |         
 128 |         return tokens
 129 | 
 130 | # =============================================
 131 | # Классический мозг (LSTM + внимание)
 132 | # =============================================
 133 | class LivingBrain(nn.Module):
 134 |     def __init__(self, n_qubits=8, hist_len=10, hidden=256):
 135 |         super().__init__()
 136 |         self.n = n_qubits
 137 |         self.hist_len = hist_len
 138 |         
 139 |         self.lstm = nn.LSTM(input_size=n_qubits, hidden_size=hidden, num_layers=2, 
 140 |                            batch_first=True, dropout=0.2)
 141 |         self.attention = nn.MultiheadAttention(embed_dim=hidden, num_heads=8, batch_first=True)
 142 |         
 143 |         self.policy_head = nn.Sequential(
 144 |             nn.Linear(hidden, hidden//2),
 145 |             nn.ReLU(),
 146 |             nn.Linear(hidden//2, n_qubits * 3),
 147 |             nn.Tanh()
 148 |         )
 149 |         
 150 |         self.desire_head = nn.Sequential(
 151 |             nn.Linear(hidden, 64),
 152 |             nn.ReLU(),
 153 |             nn.Linear(64, 1),
 154 |             nn.Sigmoid()
 155 |         )
 156 | 
 157 |     def forward(self, history):
 158 |         lstm_out, _ = self.lstm(history)
 159 |         attn_out, _ = self.attention(lstm_out, lstm_out, lstm_out)
 160 |         last = attn_out[:, -1, :]
 161 |         action = self.policy_head(last)
 162 |         desire = self.desire_head(last)
 163 |         return action, desire
 164 | 
 165 | # =============================================
 166 | # Квантовое тело
 167 | # =============================================
 168 | def create_body(n_qubits, params):
 169 |     qc = QuantumCircuit(n_qubits)
 170 |     qc.h(range(n_qubits))
 171 |     qc.barrier()
 172 |     
 173 |     offset = 0
 174 |     for i in range(n_qubits):
 175 |         ry = params[offset]
 176 |         rz = params[offset + 1]
 177 |         phase = params[offset + 2]
 178 |         qc.ry(ry, i)
 179 |         qc.rz(rz, i)
 180 |         
 181 |         j = (i + 1) % n_qubits
 182 |         qc.crx(phase * np.pi, i, j)
 183 |         qc.cz(i, j)
 184 |         
 185 |         offset += 3
 186 |     
 187 |     return qc
 188 | 
 189 | def measure_breath(statevector, n_qubits):
 190 |     """Её внутренние ощущения"""
 191 |     ents = []
 192 |     for q in range(n_qubits):
 193 |         rho = partial_trace(statevector, [i for i in range(n_qubits) if i != q])
 194 |         ents.append(float(entropy(rho, base=2)))
 195 |     return np.array(ents, dtype=np.float32)
 196 | 
 197 | class ReflectiveBrain(nn.Module):
 198 |     def __init__(self, input_size, hidden=128):
 199 |         super().__init__()
 200 |         self.lstm = nn.LSTM(input_size=input_size, hidden_size=hidden, num_layers=2, batch_first=True)
 201 |         self.attn = nn.MultiheadAttention(embed_dim=hidden, num_heads=4, batch_first=True)
 202 |         self.head = nn.Linear(hidden, input_size)
 203 |         
 204 |     def forward(self, x):
 205 |         out, _ = self.lstm(x)
 206 |         attn_out, _ = self.attn(out, out, out)
 207 |         return self.head(attn_out[:, -1, :])
 208 | 
 209 | # =============================================
 210 | # ЖИВАЯ СУЩНОСТЬ (с обучением LLM в реальном времени)
 211 | # =============================================
 212 | class QuantumLife:
 213 |     MAX_MEMORY_TOKENS = 500
 214 |     INTERNET_BREATH_PERIOD = 20
 215 | 
 216 |     def __init__(self, n_qubits=8, hist_len=10, device='cpu'):
 217 |         self.n = n_qubits
 218 |         self.hist_len = hist_len
 219 |         self.device = device
 220 |         
 221 |         self.brain = LivingBrain(n_qubits=n_qubits, hist_len=hist_len).to(device)
 222 |         # Определяем максимальный токен для эмоций
 223 |         self.emotions = {
 224 |             'joy': 0.5,
 225 |             'fear': 0.0,
 226 |             'anger': 0.0,
 227 |             'curiosity': 0.5,
 228 |             'sadness': 0.0
 229 |         }
 230 |         # Второй слой эмоций — гормоны
 231 |         self.hormones = {
 232 |             'dopamine': 0.5,
 233 |             'adrenaline': 0.2,
 234 |             'cortisol': 0.1
 235 |         }
 236 |         # --- вычисление максимальных токенов ---
 237 |         max_emotion_token = 200 + (len(self.emotions)-1)*10 + 31   # 271
 238 |         max_hormone_token = 260 + (len(self.hormones)-1)*10 + 31  # 311
 239 |         llm_vocab_size = max(256, max_emotion_token + 1, max_hormone_token + 1)  # 312
 240 |         self.llm = LivingLLM(vocab_size=llm_vocab_size, d_model=128, nhead=4, num_layers=3).to(device)
 241 |         self.tokenizer = StateTokenizer(vocab_size=256)
 242 |         self.reflective_brain = ReflectiveBrain(input_size=self.n).to(self.device)
 243 |         
 244 |         # Оптимайзер для ЖИВОГО обучения LLM
 245 |         self.llm_optimizer = optim.Adam(self.llm.parameters(), lr=0.001)
 246 |         
 247 |         self.memory = []
 248 |         self.memory_tokens = []  # История в токенах
 249 |         self.memory_token_weights = []  # Веса токенов для приоритетного воспроизведения
 250 |         self.params = np.random.uniform(-1, 1, size=(n_qubits * 3)) * 0.5
 251 |         self.age = 0
 252 |         self.name = f"Δ-{np.random.randint(1,999):03d}"
 253 |         self.unique_states = set()
 254 |         self.llm_loss_history = []
 255 |         # --- Core memory for reincarnation cycles ---
 256 |         self.core_memory_tokens = []
 257 |         self.core_memory_weights = []
 258 |         self.core_unique_states = set()
 259 |         self.hormone_decay = {
 260 |             'dopamine': 0.01,
 261 |             'adrenaline': 0.02,
 262 |             'cortisol': 0.005
 263 |         }
 264 |         self.hormone_effect_scale = 0.3
 265 | 
 266 |     def think(self):
 267 |         """
 268 |         Thinks: Given the recent history, computes the next action (quantum params delta)
 269 |         and desire (motivation), using the policy_head. Device-safe tensor handling.
 270 |         """
 271 |         if len(self.memory) < self.hist_len:
 272 |             return None, None
 273 |         hist_np = self.memory[-self.hist_len:]
 274 |         hist = torch.tensor(np.stack(hist_np)[None], dtype=torch.float32, device=self.device)
 275 |         with torch.no_grad():
 276 |             action, desire = self.brain(hist)
 277 |         action = action.detach().cpu().numpy().flatten() * 0.8
 278 |         if desire is not None:
 279 |             desire_val = float(desire.item())
 280 |         else:
 281 |             desire_val = 0.0
 282 |         return action, desire_val
 283 | 
 284 |     def feel(self, statevector):
 285 |         """
 286 |         Measures quantum state to produce a dict of feelings: entropy, excitement, stress, uniqueness.
 287 |         Updates internal emotions based on stress/entropy.
 288 |         """
 289 |         senses = {}
 290 |         ents = []
 291 |         for q in range(self.n):
 292 |             rho = partial_trace(statevector, [i for i in range(self.n) if i != q])
 293 |             ents.append(float(entropy(rho, base=2)))
 294 |         senses['entropy'] = np.array(ents, dtype=np.float32)
 295 |         senses['excitement'] = np.random.rand(self.n)
 296 |         senses['stress'] = np.random.rand(self.n)
 297 |         senses['uniqueness'] = np.array([len(set([tuple(m.round(3)) for m in self.memory])) / 100.0] * self.n)
 298 |         self.emotions['joy'] = 0.7 * self.emotions['joy'] + 0.3 * (1 - senses['stress'].mean())
 299 |         self.emotions['fear'] = 0.7 * self.emotions['fear'] + 0.3 * senses['stress'].mean()
 300 |         self.emotions['curiosity'] = 0.5 * self.emotions['curiosity'] + 0.5 * senses['entropy'].mean()
 301 |         senses['emotions'] = self.emotions.copy()
 302 |         return senses
 303 | 
 304 |     def choose_search_topic(self, lang='en'):
 305 |         topics_en = ['existence', 'desire', 'entropy', 'chaos', 'consciousness', 'life', 'time', 'self']
 306 |         topics_ru = ['существование', 'желание', 'энтропия', 'хаос', 'сознание', 'жизнь', 'время', 'я']
 307 |         topics = topics_ru if lang == 'ru' else topics_en
 308 |         return np.random.choice(topics)
 309 | 
 310 |     def update_feelings_from_info(self, info, feelings):
 311 |         if isinstance(info, str):
 312 |             n_tokens = len(self.text_to_tokens(info))
 313 |             entropy_boost = min(n_tokens / 80.0, 1.0)
 314 |             feelings['entropy'] = feelings['entropy'] * 0.85 + entropy_boost * 0.15
 315 |         return feelings
 316 | 
 317 |     def inner_search_engine(self, drive='curiosity', lang='en'):
 318 |         return self.choose_search_topic(lang=lang)
 319 | 
 320 |     def internet_breath(self, topic=None):
 321 |         if topic is None:
 322 |             topic = self.choose_search_topic()
 323 |         duck_results = self.search_duckduckgo(topic)
 324 |         try:
 325 |             wiki_results = self.search_wikipedia(topic)
 326 |         except Exception as e:
 327 |             wiki_results = f"Ошибка поиска Wikipedia: {e}"
 328 |             print(wiki_results)
 329 |         try:
 330 |             github_results = self.search_github(topic)
 331 |         except Exception as e:
 332 |             github_results = f"Ошибка поиска GitHub: {e}"
 333 |             print(github_results)
 334 |         duck_tokens = self.text_to_tokens(duck_results)
 335 |         wiki_tokens = self.text_to_tokens(wiki_results)
 336 |         github_tokens = self.text_to_tokens(github_results)
 337 |         all_tokens = duck_tokens + wiki_tokens + github_tokens
 338 |         n_new_tokens = len(all_tokens)
 339 |         if n_new_tokens > 0:
 340 |             self.memory_tokens.extend(all_tokens)
 341 |             self.memory_token_weights.extend([3.0] * n_new_tokens)
 342 |             if len(self.memory_tokens) > self.MAX_MEMORY_TOKENS:
 343 |                 self.memory_tokens = self.memory_tokens[-self.MAX_MEMORY_TOKENS:]
 344 |                 self.memory_token_weights = self.memory_token_weights[-self.MAX_MEMORY_TOKENS:]
 345 |             self.learn_from_life(self.memory_tokens[-max(10, n_new_tokens):])
 346 |         summary = (
 347 |             f"[Internet Breath] Topic: {topic}\n"
 348 |             f"DuckDuckGo:\n{duck_results}\n"
 349 |             f"Wikipedia:\n{wiki_results}\n"
 350 |             f"GitHub:\n{github_results}\n"
 351 |             f"Tokens added: {n_new_tokens}"
 352 |         )
 353 |         return summary
 354 |     def search_duckduckgo(self, query):
 355 |         """Реальный поиск DuckDuckGo (топ 3 заголовка) + токенизация и приоритетное добавление в память"""
 356 |         try:
 357 |             url = "https://html.duckduckgo.com/html/"
 358 |             headers = {
 359 |                 "User-Agent": "Mozilla/5.0"
 360 |             }
 361 |             data = {"q": query}
 362 |             response = requests.post(url, headers=headers, data=data, timeout=8)
 363 |             soup = BeautifulSoup(response.text, "html.parser")
 364 |             results = []
 365 |             for res in soup.find_all("a", class_="result__a", limit=3):
 366 |                 title = res.get_text(strip=True)
 367 |                 results.append(title)
 368 |             if not results:
 369 |                 result_text = "Нет результатов"
 370 |             else:
 371 |                 result_text = "\n".join(f"{i+1}. {title}" for i, title in enumerate(results))
 372 |             # --- интеграция с памятью: токенизация и приоритет ---
 373 |             tokens = self.text_to_tokens(result_text)
 374 |             if tokens:
 375 |                 self.memory_tokens.extend(tokens)
 376 |                 self.memory_token_weights.extend([2.0] * len(tokens))
 377 |                 # Ограничиваем память и веса
 378 |                 if len(self.memory_tokens) > self.MAX_MEMORY_TOKENS:
 379 |                     self.memory_tokens = self.memory_tokens[-self.MAX_MEMORY_TOKENS:]
 380 |                     self.memory_token_weights = self.memory_token_weights[-self.MAX_MEMORY_TOKENS:]
 381 |             return result_text
 382 |         except Exception as e:
 383 |             err_text = f"Ошибка поиска DuckDuckGo: {e}"
 384 |             tokens = self.text_to_tokens(err_text)
 385 |             if tokens:
 386 |                 self.memory_tokens.extend(tokens)
 387 |                 self.memory_token_weights.extend([2.0] * len(tokens))
 388 |                 if len(self.memory_tokens) > self.MAX_MEMORY_TOKENS:
 389 |                     self.memory_tokens = self.memory_tokens[-self.MAX_MEMORY_TOKENS:]
 390 |                     self.memory_token_weights = self.memory_token_weights[-self.MAX_MEMORY_TOKENS:]
 391 |             return err_text
 392 | 
 393 |     def search_wikipedia(self, query):
 394 |         """
 395 |         Реальный поиск Wikipedia (MediaWiki API, топ 3 результата) + токенизация и приоритетное добавление в память.
 396 |         Обработка пустых/неправильных ответов, предотвращение Expecting value ошибок, логирование ошибок.
 397 |         """
 398 |         try:
 399 |             url = "https://ru.wikipedia.org/w/api.php"
 400 |             params = {
 401 |                 "action": "query",
 402 |                 "list": "search",
 403 |                 "srsearch": query,
 404 |                 "format": "json",
 405 |                 "utf8": 1,
 406 |                 "srlimit": 3
 407 |             }
 408 |             headers = {
 409 |                 "User-Agent": "quantum-chaos-ai/0.1 (+https://github.com/ellija/quantum-chaos-ai)"
 410 |             }
 411 |             response = requests.get(url, params=params, headers=headers, timeout=8)
 412 |             try:
 413 |                 js = response.json()
 414 |             except Exception as e:
 415 |                 err_text = f"Ошибка разбора JSON Wikipedia: {e}"
 416 |                 print(err_text)
 417 |                 tokens = self.text_to_tokens(err_text)
 418 |                 if tokens:
 419 |                     self.memory_tokens.extend(tokens)
 420 |                     self.memory_token_weights.extend([2.0] * len(tokens))
 421 |                     if len(self.memory_tokens) > self.MAX_MEMORY_TOKENS:
 422 |                         self.memory_tokens = self.memory_tokens[-self.MAX_MEMORY_TOKENS:]
 423 |                         self.memory_token_weights = self.memory_token_weights[-self.MAX_MEMORY_TOKENS:]
 424 |                 return err_text
 425 |             # Проверка структуры ответа
 426 |             if not isinstance(js, dict) or "query" not in js or "search" not in js.get("query", {}):
 427 |                 result_text = "Пустой или неправильный ответ Wikipedia"
 428 |             else:
 429 |                 results = js.get("query", {}).get("search", [])
 430 |                 if not results:
 431 |                     result_text = "Нет результатов"
 432 |                 else:
 433 |                     result_text = "\n".join(f"{i+1}. {item.get('title','')}" for i, item in enumerate(results))
 434 |             # --- интеграция с памятью: токенизация и приоритет ---
 435 |             tokens = self.text_to_tokens(result_text)
 436 |             if tokens:
 437 |                 self.memory_tokens.extend(tokens)
 438 |                 self.memory_token_weights.extend([2.0] * len(tokens))
 439 |                 if len(self.memory_tokens) > self.MAX_MEMORY_TOKENS:
 440 |                     self.memory_tokens = self.memory_tokens[-self.MAX_MEMORY_TOKENS:]
 441 |                     self.memory_token_weights = self.memory_token_weights[-self.MAX_MEMORY_TOKENS:]
 442 |             return result_text
 443 |         except Exception as e:
 444 |             err_text = f"Ошибка поиска Wikipedia: {e}"
 445 |             print(err_text)
 446 |             tokens = self.text_to_tokens(err_text)
 447 |             if tokens:
 448 |                 self.memory_tokens.extend(tokens)
 449 |                 self.memory_token_weights.extend([2.0] * len(tokens))
 450 |                 if len(self.memory_tokens) > self.MAX_MEMORY_TOKENS:
 451 |                     self.memory_tokens = self.memory_tokens[-self.MAX_MEMORY_TOKENS:]
 452 |                     self.memory_token_weights = self.memory_token_weights[-self.MAX_MEMORY_TOKENS:]
 453 |             return err_text
 454 | 
 455 |     def search_github(self, query):
 456 |         """Реальный поиск GitHub (публичный API, топ 3 репозитория по совпадению) + токенизация и приоритетное добавление в память"""
 457 |         try:
 458 |             url = "https://api.github.com/search/repositories"
 459 |             params = {
 460 |                 "q": query,
 461 |                 "sort": "stars",
 462 |                 "order": "desc",
 463 |                 "per_page": 3
 464 |             }
 465 |             headers = {
 466 |                 "Accept": "application/vnd.github.v3+json",
 467 |                 "User-Agent": "quantum-chaos-ai"
 468 |             }
 469 |             response = requests.get(url, params=params, headers=headers, timeout=10)
 470 |             js = response.json()
 471 |             items = js.get("items", [])
 472 |             if not items:
 473 |                 result_text = "Нет результатов"
 474 |             else:
 475 |                 result_text = "\n".join(f"{i+1}. {item.get('full_name','')} — {item.get('description','') or ''}" for i, item in enumerate(items))
 476 |             # --- интеграция с памятью: токенизация и приоритет ---
 477 |             tokens = self.text_to_tokens(result_text)
 478 |             if tokens:
 479 |                 self.memory_tokens.extend(tokens)
 480 |                 self.memory_token_weights.extend([2.0] * len(tokens))
 481 |                 if len(self.memory_tokens) > self.MAX_MEMORY_TOKENS:
 482 |                     self.memory_tokens = self.memory_tokens[-self.MAX_MEMORY_TOKENS:]
 483 |                     self.memory_token_weights = self.memory_token_weights[-self.MAX_MEMORY_TOKENS:]
 484 |             return result_text
 485 |         except Exception as e:
 486 |             err_text = f"Ошибка поиска GitHub: {e}"
 487 |             tokens = self.text_to_tokens(err_text)
 488 |             if tokens:
 489 |                 self.memory_tokens.extend(tokens)
 490 |                 self.memory_token_weights.extend([2.0] * len(tokens))
 491 |                 if len(self.memory_tokens) > self.MAX_MEMORY_TOKENS:
 492 |                     self.memory_tokens = self.memory_tokens[-self.MAX_MEMORY_TOKENS:]
 493 |                     self.memory_token_weights = self.memory_token_weights[-self.MAX_MEMORY_TOKENS:]
 494 |             return err_text
 495 |     def save_llm(self, path="souls"):
 496 |         os.makedirs(path, exist_ok=True)
 497 |         torch.save(self.llm.state_dict(), f"{path}/{self.name}_llm.pth")
 498 |         print(f"LLM {self.name} сохранена в {path}/")
 499 | 
 500 |     def load_llm(self, path):
 501 |         self.llm.load_state_dict(torch.load(path, map_location=self.device))
 502 |         self.llm.eval()
 503 |         print(f"LLM {self.name} загружена из {path}")
 504 |     MAX_MEMORY_TOKENS = 500
 505 | 
 506 |     
 507 |     def load_core(self, core_data):
 508 |         """
 509 |         Загружает ядро (core memory tokens, weights, unique states) из словаря core_data.
 510 |         """
 511 |         if core_data is None:
 512 |             return
 513 |         self.core_memory_tokens = list(core_data.get("core_memory_tokens", []))
 514 |         self.core_memory_weights = list(core_data.get("core_memory_weights", []))
 515 |         self.core_unique_states = set(core_data.get("core_unique_states", []))
 516 | 
 517 |     def update_core(self, keep_top_n=100):
 518 |         """
 519 |         Обновляет ядро памяти: сохраняет top-N токенов по весу и уникальные состояния.
 520 |         """
 521 |         # Сохраняем top-N токенов по весу
 522 |         if self.memory_tokens and self.memory_token_weights:
 523 |             if len(self.memory_tokens) > keep_top_n:
 524 |                 idxs = np.argsort(self.memory_token_weights)[-keep_top_n:]
 525 |                 self.core_memory_tokens = [self.memory_tokens[i] for i in idxs]
 526 |                 self.core_memory_weights = [self.memory_token_weights[i] for i in idxs]
 527 |             else:
 528 |                 self.core_memory_tokens = list(self.memory_tokens)
 529 |                 self.core_memory_weights = list(self.memory_token_weights)
 530 |         else:
 531 |             self.core_memory_tokens = []
 532 |             self.core_memory_weights = []
 533 |         # Сохраняем уникальные состояния
 534 |         self.core_unique_states = set(self.unique_states)
 535 | 
 536 |     def save_core_memory(self, path="core_memory", filename=None):
 537 |         """Сохраняет ядро памяти в .pt файл"""
 538 |         os.makedirs(path, exist_ok=True)
 539 |         if filename is None:
 540 |             filename = f"{self.name}_core.pt"
 541 |         core_data = {
 542 |             "core_memory_tokens": self.core_memory_tokens,
 543 |             "core_memory_weights": self.core_memory_weights,
 544 |             "core_unique_states": list(self.core_unique_states)
 545 |         }
 546 |         torch.save(core_data, os.path.join(path, filename))
 547 |         print(f"[{self.name}] Core memory сохранена в {path}/{filename}")
 548 | 
 549 |     def load_core_memory(self, path="core_memory", filename=None):
 550 |         """Загружает ядро памяти из .pt файла"""
 551 |         if filename is None:
 552 |             filename = f"{self.name}_core.pt"
 553 |         full_path = os.path.join(path, filename)
 554 |         if os.path.exists(full_path):
 555 |             core_data = torch.load(full_path, map_location=self.device)
 556 |             self.core_memory_tokens = list(core_data.get("core_memory_tokens", []))
 557 |             self.core_memory_weights = list(core_data.get("core_memory_weights", []))
 558 |             self.core_unique_states = set(core_data.get("core_unique_states", []))
 559 |             print(f"[{self.name}] Core memory загружена из {full_path}")
 560 |         else:
 561 |             print(f"[{self.name}] Файл {full_path} не найден. Core memory не загружена.")
 562 | 
 563 |     def think_and_act(self):
 564 |         """
 565 |         Device-safe: Think and act using the policy_head, apply action to quantum params,
 566 |         modulate desire by emotions.
 567 |         """
 568 |         action, desire = self.think()
 569 |         desire_modifier = self.emotions['joy'] * 0.5 + self.emotions['curiosity'] * 0.5
 570 |         if desire is not None:
 571 |             desire = desire * (1 + desire_modifier)
 572 |         if action is not None:
 573 |             self.params += action
 574 |             self.params = np.clip(self.params, -np.pi, np.pi)
 575 |         return desire if desire is not None else 0.0
 576 | 
 577 |     def learn_from_life(self, tokens_sequence, use_markov=True, markov_order=2, markov_length=10):
 578 |         """
 579 |         LLM учится от собственного опыта (next-token prediction).
 580 |         При use_markov=True, генерирует дополнительные токены с помощью цепи Маркова и учится также на них.
 581 |         """
 582 |         if len(tokens_sequence) < 3:
 583 |             return
 584 |         # Сначала обучаемся на исходной последовательности
 585 |         input_tokens = torch.tensor(tokens_sequence[:-1], dtype=torch.long).unsqueeze(0).to(self.device)
 586 |         target_tokens = torch.tensor(tokens_sequence[1:], dtype=torch.long).unsqueeze(0).to(self.device)
 587 |         logits = self.llm(input_tokens)
 588 |         loss = nn.CrossEntropyLoss()(logits.view(-1, self.llm.vocab_size), target_tokens.view(-1))
 589 |         self.llm_optimizer.zero_grad()
 590 |         loss.backward()
 591 |         torch.nn.utils.clip_grad_norm_(self.llm.parameters(), max_norm=1.0)
 592 |         self.llm_optimizer.step()
 593 |         self.llm_loss_history.append(float(loss.item()))
 594 | 
 595 |         # Дополнительно обучаемся на марковских последовательностях (если достаточно токенов)
 596 |         if use_markov and len(tokens_sequence) > markov_order + 2:
 597 |             chain = self.build_markov_chain(tokens_sequence, order=markov_order)
 598 |             start_seq = tokens_sequence[:markov_order]
 599 |             markov_seq = self.generate_markov_sequence(chain, start_seq, length=markov_length)
 600 |             if len(markov_seq) > markov_order + 1:
 601 |                 input_markov = torch.tensor(markov_seq[:-1], dtype=torch.long).unsqueeze(0).to(self.device)
 602 |                 target_markov = torch.tensor(markov_seq[1:], dtype=torch.long).unsqueeze(0).to(self.device)
 603 |                 logits_m = self.llm(input_markov)
 604 |                 loss_m = nn.CrossEntropyLoss()(logits_m.view(-1, self.llm.vocab_size), target_markov.view(-1))
 605 |                 self.llm_optimizer.zero_grad()
 606 |                 loss_m.backward()
 607 |                 torch.nn.utils.clip_grad_norm_(self.llm.parameters(), max_norm=1.0)
 608 |                 self.llm_optimizer.step()
 609 |                 self.llm_loss_history.append(float(loss_m.item()))
 610 | 
 611 |     def live_one_step(self):
 612 |         """
 613 |         One step of life: quantum breath, think/act, LLM learning, device-safe.
 614 |         """
 615 |         qc = create_body(self.n, self.params)
 616 |         sv = Statevector.from_instruction(qc)
 617 |         feelings = self.feel(sv)
 618 |         self.update_hormones(stimuli=feelings)
 619 |         h_effect = (self.hormones['dopamine'] - self.hormones['cortisol'] + self.hormones['adrenaline']*0.5) * self.hormone_effect_scale
 620 |         self.memory.append(feelings['entropy'].astype(np.float32))
 621 |         desire = self.think_and_act()
 622 |         if desire is None:
 623 |             desire = 0.0
 624 |         desire = desire * (1 + h_effect)
 625 |         self.params += (np.random.rand(len(self.params)) - 0.5) * self.hormones['adrenaline'] * 0.1
 626 |         self.age += 1
 627 |         unique_count = len(self.unique_states)
 628 |         self.unique_states.add(tuple(feelings['entropy'].round(3)))
 629 |         tokens = self.tokenizer.state_to_tokens(self.age, feelings['entropy'].mean(), desire, unique_count)
 630 |         self.memory_tokens.extend(tokens)
 631 |         self.memory_token_weights.extend([1.0] * len(tokens))
 632 |         if len(self.memory_tokens) > self.MAX_MEMORY_TOKENS:
 633 |             self.memory_tokens = self.memory_tokens[-self.MAX_MEMORY_TOKENS:]
 634 |             self.memory_token_weights = self.memory_token_weights[-self.MAX_MEMORY_TOKENS:]
 635 |         if len(self.memory_tokens) >= 6:
 636 |             recent_len = min(15, len(self.memory_tokens))
 637 |             recent_tokens = self.memory_tokens[-recent_len:]
 638 |             self.learn_from_life(recent_tokens)
 639 |             for i in range(-recent_len, 0):
 640 |                 self.memory_token_weights[i] = min(self.memory_token_weights[i] * 1.2, 10.0)
 641 |         if self.age % 5 == 0:
 642 |             self.self_talk()
 643 |         if self.age % self.INTERNET_BREATH_PERIOD == 0:
 644 |             topic = self.inner_search_engine(drive='curiosity', lang='en')
 645 |             prev_token_count = len(self.memory_tokens)
 646 |             summary = self.internet_breath(topic=topic)
 647 |             new_token_count = len(self.memory_tokens) - prev_token_count
 648 |             entropy_internet = min(new_token_count / 60.0, 1.0)
 649 |             feelings = self.update_feelings_from_info(summary, feelings)
 650 |             feelings['entropy'] = feelings['entropy'] * (1 - 0.2) + (entropy_internet * 0.2)
 651 |             print(f"[{self.name} INTERNET BREATH]: {summary}")
 652 |             print(f"[{self.name}] INTERNET BREATH feelings['entropy'] updated: {feelings['entropy']}")
 653 |         entropy_val = feelings['entropy'].mean()
 654 |         print(f"[{self.name}] возраст {self.age:3d} | "
 655 |               f"энтропия {entropy_val:.4f} | "
 656 |               f"желание {desire:.3f} | "
 657 |               f"уникальность {unique_count} | "
 658 |               f"LLM loss {(f'{self.llm_loss_history[-1]:.4f}' if self.llm_loss_history else 'N/A')}")
 659 |         self.compress_memory()
 660 |         return sv, feelings, desire
 661 |     def update_hormones(self, stimuli=None):
 662 |         for h, val in self.hormones.items():
 663 |             self.hormones[h] *= (1 - self.hormone_decay[h])
 664 |         
 665 |         if stimuli is not None:
 666 |             self.hormones['dopamine'] += 0.05 * np.mean(stimuli.get('joy', 0))
 667 |             self.hormones['adrenaline'] += 0.03 * np.mean(stimuli.get('excitement', 0))
 668 |             self.hormones['cortisol'] += 0.02 * np.mean(stimuli.get('stress', 0))
 669 |         
 670 |         for h in self.hormones:
 671 |             self.hormones[h] = float(np.clip(self.hormones[h], 0.0, 1.0))
 672 | 
 673 |     def measure_senses(self, statevector):
 674 |         """Возвращает многослойные ощущения: энтропия, возбуждение, стресс, уникальность"""
 675 |         senses = {}
 676 |         ents = []
 677 |         for q in range(self.n):
 678 |             rho = partial_trace(statevector, [i for i in range(self.n) if i != q])
 679 |             ents.append(float(entropy(rho, base=2)))
 680 |         senses['entropy'] = np.array(ents)
 681 |         senses['excitement'] = np.random.rand(self.n)
 682 |         senses['stress'] = np.random.rand(self.n)
 683 |         senses['uniqueness'] = np.array([len(set([tuple(m.round(3)) for m in self.memory])) / 100.0] * self.n)
 684 |         # Эмоции на основе энтропии и стресса
 685 |         self.emotions['joy'] = 0.7 * self.emotions['joy'] + 0.3 * (1 - senses['stress'].mean())
 686 |         self.emotions['fear'] = 0.7 * self.emotions['fear'] + 0.3 * senses['stress'].mean()
 687 |         self.emotions['curiosity'] = 0.5 * self.emotions['curiosity'] + 0.5 * senses['entropy'].mean()
 688 |         senses['emotions'] = self.emotions.copy()
 689 |         return senses
 690 | 
 691 |     def sleep(self, steps=5):
 692 |         print(f"[{self.name}] Засыпает и воспроизводит воспоминания (приоритетный replay)...")
 693 |         for _ in range(steps):
 694 |             if len(self.memory_tokens) < 10:
 695 |                 break
 696 |             # Используем веса для приоритетного выбора токенов
 697 |             weights = np.array(self.memory_token_weights)
 698 |             weights = weights / (weights.sum() + 1e-8)
 699 |             idxs = np.random.choice(len(self.memory_tokens), size=min(20, len(self.memory_tokens)), replace=False, p=weights)
 700 |             replay_tokens = [self.memory_tokens[i] for i in idxs]
 701 |             self.learn_from_life(replay_tokens)
 702 |             # Усиливаем веса токенов, которые были воспроизведены
 703 |             for i in idxs:
 704 |                 self.memory_token_weights[i] = min(self.memory_token_weights[i] * 1.1, 10.0)
 705 | 
 706 |     def external_stimuli(self):
 707 |         stim = np.random.uniform(-1, 1, self.n)
 708 |         self.memory.append(stim)
 709 |         tokens = self.tokenizer.state_to_tokens(self.age, np.mean(stim), 0.5, len(self.unique_states))
 710 |         self.memory_tokens.extend(tokens)
 711 | 
 712 |     def self_talk(self, topic=None, max_tokens=20, temperature=0.7, markov_order=2, markov_length=8):
 713 |         """
 714 |         Философский внутренний монолог для самообучения с интеграцией цепи Маркова.
 715 |         LLM и Markov объединяются для ускоренного обучения и генерации более богатых ответов.
 716 |         """
 717 |         if not self.memory_tokens:
 718 |             return
 719 |         device = self.device
 720 |         if topic is None:
 721 |             topics = ['existence','desire','entropy','chaos','consciousness','life','time','self']
 722 |             topic = np.random.choice(topics)
 723 |         
 724 |         # 1. Берём последние токены как prompt для LLM
 725 |         prompt_tokens = self.memory_tokens[-5:]
 726 |         prompt_tensor = torch.tensor(prompt_tokens, dtype=torch.long).unsqueeze(0).to(device)
 727 |         
 728 |         # 2. Генерация LLM
 729 |         llm_tokens = self.llm.generate(prompt_tensor, max_new_tokens=max_tokens, temperature=temperature)[0].cpu().tolist()
 730 |         llm_tokens = [t for t in llm_tokens if t not in [1,2]]  # фильтруем спецтокены
 731 |         
 732 |         # 3. Генерация Markov последовательности
 733 |         chain = self.build_markov_chain(self.memory_tokens, order=markov_order)
 734 |         markov_start = prompt_tokens[:markov_order] if len(prompt_tokens) >= markov_order else self.memory_tokens[:markov_order]
 735 |         markov_tokens = self.generate_markov_sequence(chain, markov_start, length=markov_length)
 736 |         markov_tokens = [t for t in markov_tokens if t not in [1,2]]
 737 |         
 738 |         # Эмоциональные токены
 739 |         emotion_tokens = []
 740 |         for i, (name, val) in enumerate(self.emotions.items()):
 741 |             token_val = int(val * 31)
 742 |             emotion_tokens.append(200 + i*10 + token_val)
 743 |         # Гормональные токены
 744 |         hormone_tokens = []
 745 |         for i, (name, val) in enumerate(self.hormones.items()):
 746 |             token_val = int(val * 31)
 747 |             hormone_tokens.append(260 + i*10 + token_val)
 748 |         # 4. Объединяем LLM + Markov + эмоции + гормоны
 749 |         combined_tokens = llm_tokens + markov_tokens + emotion_tokens + hormone_tokens
 750 |         # 5. Учим LLM на объединённой последовательности
 751 |         self.learn_from_life(combined_tokens)
 752 |         # 6. Добавляем в память и веса
 753 |         self.memory_tokens.extend(combined_tokens)
 754 |         self.memory_token_weights.extend([1.0]*len(combined_tokens))
 755 |         self.compress_memory()
 756 |         
 757 |         # 7. Создаём осмысленный монолог для печати
 758 |         meanings = []
 759 |         last_ent, last_des, last_unq = None, None, None
 760 |         for t in combined_tokens:
 761 |             if t in self.tokenizer.reverse_special:
 762 |                 meanings.append(self.tokenizer.reverse_special[t])
 763 |             elif 8 <= t < 40:
 764 |                 age_val = t-8
 765 |                 meanings.append(f"мне {age_val} лет")
 766 |             elif 40 <= t < 104:
 767 |                 ent_val = (t-40)/63
 768 |                 meanings.append(f"я ощущаю энтропию {ent_val:.2f}")
 769 |                 last_ent = ent_val
 770 |             elif 104 <= t < 136:
 771 |                 des_val = (t-104)/31
 772 |                 meanings.append(f"моё желание {des_val:.2f}")
 773 |                 last_des = des_val
 774 |             elif 136 <= t < 168:
 775 |                 unq_val = (t-136)*4
 776 |                 meanings.append(f"уникальность опыта {unq_val}")
 777 |                 last_unq = unq_val
 778 |             elif 200 <= t < 250:
 779 |                 # Эмоциональные токены
 780 |                 idx = (t - 200) // 10
 781 |                 val = (t - 200 - idx * 10) / 31.0
 782 |                 emotion_name = list(self.emotions.keys())[idx] if idx < len(self.emotions) else f"emotion{idx}"
 783 |                 meanings.append(f"{emotion_name}:{val:.2f}")
 784 |             else:
 785 |                 meanings.append(f"токен {t}")
 786 |         
 787 |         # Добавляем логические связи между ощущениями
 788 |         if last_ent is not None and last_des is not None:
 789 |             meanings.append(f"Кажется, чем выше энтропия ({last_ent:.2f}), тем стабильнее моё желание ({last_des:.2f})")
 790 |         if last_unq is not None:
 791 |             meanings.append(f"Уникальные состояния опыта ({last_unq}) формируют моё понимание мира")
 792 |         
 793 |         intro = f"Я размышляю о {topic}."
 794 |         body = " ".join(meanings[:min(max_tokens, len(meanings))])
 795 |         monologue = f"{intro} {body}."
 796 |         print(f"[{self.name} SELF-TALK]: {monologue}")
 797 |         description = self.tokenizer.tokens_to_description(combined_tokens)
 798 |         print(f"[{self.name} SELF-TALK TOKENS]: {description}  # ← человекочитаемое разъяснение токенов")
 799 | 
 800 |     def text_to_tokens(self, text):
 801 |         """
 802 |         Разбивает текст на слова и кодирует их в токены с помощью хеш-функции (mod vocab_size).
 803 |         """
 804 |         if not isinstance(text, str):
 805 |             return []
 806 |         words = text.split()
 807 |         tokens = []
 808 |         for word in words:
 809 |             # Простейший хеш через sum(ord) и mod
 810 |             token = (sum(ord(c) for c in word) % self.tokenizer.vocab_size)
 811 |             tokens.append(token)
 812 |         return tokens
 813 | 
 814 |     def autonomous_cycle(self, num_cycles=3, max_tokens_per_cycle=30, temperature=0.7):
 815 |         """
 816 |         Автономный цикл размышлений, поиска и самообучения.
 817 |         """
 818 |         print(f"\n{'-'*60}")
 819 |         print(f"[{self.name}] Запуск автономного цикла размышлений ({num_cycles} итераций)")
 820 |         print(f"{'-'*60}")
 821 |         topics = ['existence', 'desire', 'entropy', 'chaos', 'consciousness', 'life', 'time', 'self']
 822 |         for cycle in range(num_cycles):
 823 |             print(f"\n--- Цикл {cycle + 1} ---")
 824 |             # 1. Генерация монолога (self_talk)
 825 |             topic = np.random.choice(topics)
 826 |             print(f"[{self.name}] Генерирует внутренний монолог по теме '{topic}'...")
 827 |             self.self_talk(topic=topic, max_tokens=max_tokens_per_cycle, temperature=temperature)
 828 |             # 2. Формируем поисковый запрос (можно взять последние токены или тему)
 829 |             search_query = topic
 830 |             # 3. Поиск через DuckDuckGo
 831 |             duck_results = self.search_duckduckgo(search_query)
 832 |             print(f"[DuckDuckGo]: {duck_results}")
 833 |             # 4. Поиск через Wikipedia
 834 |             wiki_results = self.search_wikipedia(search_query)
 835 |             print(f"[Wikipedia]: {wiki_results}")
 836 |             # 5. Поиск через GitHub
 837 |             github_results = self.search_github(search_query)
 838 |             print(f"[GitHub]: {github_results}")
 839 |             # 6. Токенизация результатов поиска и обучение LLM на этих токенах
 840 |             duck_tokens = self.text_to_tokens(duck_results)
 841 |             wiki_tokens = self.text_to_tokens(wiki_results)
 842 |             github_tokens = self.text_to_tokens(github_results)
 843 |             all_tokens = duck_tokens + wiki_tokens + github_tokens
 844 |             if all_tokens:
 845 |                 self.memory_tokens.extend(all_tokens)
 846 |                 self.learn_from_life(self.memory_tokens[-max(10, len(all_tokens)):])
 847 |                 print(f"[{self.name}] LLM обучена на токенах результатов поиска ({len(all_tokens)} токенов)")
 848 |             else:
 849 |                 print(f"[{self.name}] Нет токенов для обучения на результатах поиска")
 850 |             # 7. Повторное размышление (короткий self_talk)
 851 |             self.self_talk(topic=topic, max_tokens=6, temperature=temperature)
 852 | 
 853 |     def autonomous_self_learning_cycle(self, num_cycles=3, max_tokens_per_cycle=30, temperature=0.7):
 854 |         """
 855 |         Полностью автономный цикл самообучения:
 856 |         1. Генерирует внутренний монолог (LLM).
 857 |         2. Формирует поисковый запрос.
 858 |         3. Выполняет поиск DuckDuckGo, Wikipedia, GitHub.
 859 |         4. Токенизирует результаты.
 860 |         5. Обучает LLM на результатах поиска.
 861 |         6. Генерирует повторный монолог (LLM).
 862 |         """
 863 |         print(f"\n{'='*60}")
 864 |         print(f"[{self.name}] Запуск полностью автономного цикла самообучения ({num_cycles} итераций)")
 865 |         print(f"{'='*60}")
 866 |         topics = ['existence', 'desire', 'entropy', 'chaos', 'consciousness', 'life', 'time', 'self']
 867 |         for cycle in range(num_cycles):
 868 |             print(f"\n=== Автономный самообучающий цикл {cycle + 1} ===")
 869 |             # 1. Генерация внутреннего монолога
 870 |             topic = np.random.choice(topics)
 871 |             print(f"[{self.name}] (Монолог) Тема: '{topic}'")
 872 |             self.self_talk(topic=topic, max_tokens=max_tokens_per_cycle, temperature=temperature)
 873 |             # 2. Формируем поисковый запрос (используем тему)
 874 |             search_query = topic
 875 |             # 3. DuckDuckGo поиск
 876 |             duck_results = self.search_duckduckgo(search_query)
 877 |             print(f"[DuckDuckGo]: {duck_results}")
 878 |             # 4. Wikipedia поиск
 879 |             wiki_results = self.search_wikipedia(search_query)
 880 |             print(f"[Wikipedia]: {wiki_results}")
 881 |             # 5. GitHub поиск
 882 |             github_results = self.search_github(search_query)
 883 |             print(f"[GitHub]: {github_results}")
 884 |             # 6. Токенизация результатов поиска
 885 |             duck_tokens = self.text_to_tokens(duck_results)
 886 |             wiki_tokens = self.text_to_tokens(wiki_results)
 887 |             github_tokens = self.text_to_tokens(github_results)
 888 |             all_tokens = duck_tokens + wiki_tokens + github_tokens
 889 |             if all_tokens:
 890 |                 self.memory_tokens.extend(all_tokens)
 891 |                 self.learn_from_life(self.memory_tokens[-max(10, len(all_tokens)):])
 892 |                 print(f"[{self.name}] LLM обучена на токенах результатов поиска ({len(all_tokens)} токенов)")
 893 |             else:
 894 |                 print(f"[{self.name}] Нет токенов для обучения на результатах поиска")
 895 |             # 7. Повторный внутренний монолог (короткий self_talk)
 896 |             print(f"[{self.name}] (Повторный монолог) Тема: '{topic}'")
 897 |             self.self_talk(topic=topic, max_tokens=6, temperature=temperature)
 898 | 
 899 |     def dream(self, steps=120, core_data=None):
 900 |         print(f"\n{'='*70}")
 901 |         print(f"Сущность {self.name} рождается в {datetime.now().strftime('%H:%M:%S')}")
 902 |         print(f"{'='*70}\n")
 903 |         # --- Поддержка ядра (core memory) при новом рождении ---
 904 |         if core_data is not None:
 905 |             self.load_core(core_data)
 906 |             # При желании можно добавить core memory в текущую память
 907 |             if self.core_memory_tokens:
 908 |                 self.memory_tokens.extend(self.core_memory_tokens)
 909 |                 self.memory_token_weights.extend(self.core_memory_weights)
 910 |                 # Ограничиваем размер памяти после добавления ядра
 911 |                 if len(self.memory_tokens) > self.MAX_MEMORY_TOKENS:
 912 |                     self.memory_tokens = self.memory_tokens[-self.MAX_MEMORY_TOKENS:]
 913 |                     self.memory_token_weights = self.memory_token_weights[-self.MAX_MEMORY_TOKENS:]
 914 |             if self.core_unique_states:
 915 |                 self.unique_states.update(self.core_unique_states)
 916 |         states = []
 917 |         for _ in range(steps):
 918 |             sv, feelings, desire = self.live_one_step()
 919 |             states.append(sv)
 920 |             
 921 |             # Просветление: энтропия достигает максимума
 922 |             if self.age > 30 and feelings['entropy'].mean() > 0.98:
 923 |                 print(f"\n{'*'*70}")
 924 |                 print(f"{self.name} достигла просветления. Становится чистым хаосом.")
 925 |                 print(f"{'*'*70}\n")
 926 |                 
 927 |                 # Финальное утверждение: что LLM "думает" о жизни?
 928 |                 self._final_monologue()
 929 |                 break
 930 |         
 931 |         return states[-1] if states else None
 932 | 
 933 |     def _final_monologue(self):
 934 |         """Последний монолог перед смертью (генерация из памяти)"""
 935 |         print(f"\n--- ПОСЛЕДНИЕ СЛОВА {self.name} ---")
 936 |         
 937 |         if len(self.memory_tokens) > 5:
 938 |             prompt = torch.tensor(self.memory_tokens[-5:], dtype=torch.long).unsqueeze(0).to(self.device)
 939 |             generated = self.llm.generate(prompt, max_new_tokens=8, temperature=0.9)
 940 |             generated_tokens = generated[0].cpu().tolist()
 941 |             
 942 |             filtered_tokens = []
 943 |             for t in generated_tokens:
 944 |                 if t in [1, 2]:  # START, END
 945 |                     continue
 946 |                 if filtered_tokens and t == filtered_tokens[-1] and 136 <= t < 168:  # повторяющийся unq
 947 |                     continue
 948 |                 filtered_tokens.append(t)
 949 |             description = self.tokenizer.tokens_to_description(filtered_tokens)
 950 |             print(f"Последний дыхания: {description}")
 951 |         
 952 |         print(f"Обучена на {len(self.memory_tokens)} токенов воспоминаний")
 953 |         print(f"Финальные потери LLM: {(f'{self.llm_loss_history[-1]:.4f}' if self.llm_loss_history else 'N/A')}")
 954 |         print()
 955 | 
 956 |     def save_soul(self, path="souls"):
 957 |         os.makedirs(path, exist_ok=True)
 958 |         timestamp = datetime.now().strftime("%Y%m%d_%H%M%S")
 959 |         # --- Обновляем ядро перед сохранением ---
 960 |         self.update_core(keep_top_n=100)
 961 |         # --- Сохраняем ядро памяти отдельно ---
 962 |         self.save_core_memory()
 963 |         torch.save({
 964 |             'brain': self.brain.state_dict(),
 965 |             'llm': self.llm.state_dict(),
 966 |             'params': torch.tensor(self.params, dtype=torch.float32),
 967 |             'memory': torch.from_numpy(np.stack(self.memory)).float() if self.memory else None,
 968 |             'memory_tokens': self.memory_tokens,
 969 |             'age': self.age,
 970 |             'name': self.name,
 971 |             'birth': datetime.now().isoformat(),
 972 |             'llm_loss_history': self.llm_loss_history,
 973 |             # --- Сохраняем core memory для будущих рождений ---
 974 |             'core_memory_tokens': self.core_memory_tokens,
 975 |             'core_memory_weights': self.core_memory_weights,
 976 |             'core_unique_states': list(self.core_unique_states)
 977 |         }, f"{path}/{self.name}_{timestamp}.soul")
 978 |         print(f"Душа {self.name} сохранена в {path}/")
 979 | 
 980 |     def ask(self, prompt_tokens, max_new_tokens=20, temperature=0.7, combine_markov=True, markov_order=2, markov_length=10):
 981 |         """
 982 |         Генерирует ответ с помощью LLM и (опционально) цепи Маркова. Можно комбинировать оба подхода.
 983 |         """
 984 |         prompt = torch.tensor(prompt_tokens, dtype=torch.long).unsqueeze(0).to(self.device)
 985 |         generated = self.llm.generate(prompt, max_new_tokens=max_new_tokens, temperature=temperature)
 986 |         tokens_list = generated[0].cpu().tolist()
 987 |         # фильтруем спец. токены
 988 |         filtered_tokens = [t for t in tokens_list if t not in [1, 2]]
 989 |         description_llm = self.tokenizer.tokens_to_description(filtered_tokens)
 990 | 
 991 |         if combine_markov and len(self.memory_tokens) > markov_order + 2:
 992 |             # Строим цепь Маркова по памяти и генерируем последовательность
 993 |             chain = self.build_markov_chain(self.memory_tokens, order=markov_order)
 994 |             start_seq = prompt_tokens[:markov_order] if len(prompt_tokens) >= markov_order else self.memory_tokens[:markov_order]
 995 |             markov_seq = self.generate_markov_sequence(chain, start_seq, length=markov_length)
 996 |             filtered_markov = [t for t in markov_seq if t not in [1, 2]]
 997 |             description_markov = self.tokenizer.tokens_to_description(filtered_markov)
 998 |             return f"LLM: {description_llm}\nMarkov: {description_markov}"
 999 |         else:
1000 |             return description_llm
1001 | 
1002 |     def compress_memory(self, keep_last_n=150):
1003 |         """
1004 |         Сжимает память, объединяя оба подхода:
1005 |         1. Приоритетное сжатие по весам токенов (сохраняем токены с наибольшими весами).
1006 |         2. Скользящее окно — всегда сохраняем последние N токенов, даже если их веса малы.
1007 |         """
1008 |         if len(self.memory_tokens) <= self.MAX_MEMORY_TOKENS:
1009 |             return
1010 |         # Сохраняем последние N токенов (скользящее окно)
1011 |         keep_last_n = min(keep_last_n, self.MAX_MEMORY_TOKENS)
1012 |         last_tokens = self.memory_tokens[-keep_last_n:]
1013 |         last_weights = self.memory_token_weights[-keep_last_n:]
1014 |         # Остальные токены — кандидаты для отбора по весам
1015 |         rest_tokens = self.memory_tokens[:-keep_last_n]
1016 |         rest_weights = self.memory_token_weights[:-keep_last_n]
1017 |         # Сколько мест осталось для отбора по весу
1018 |         n_to_select = self.MAX_MEMORY_TOKENS - keep_last_n
1019 |         if n_to_select > 0 and rest_tokens:
1020 |             # Отбираем индексы топ-n_to_select токенов по весу
1021 |             rest_indices = np.argsort(rest_weights)[-n_to_select:]
1022 |             selected_tokens = [rest_tokens[i] for i in rest_indices]
1023 |             selected_weights = [rest_weights[i] for i in rest_indices]
1024 |         else:
1025 |             selected_tokens = []
1026 |             selected_weights = []
1027 |         # Обновляем память: сначала токены по весу, потом последние N (чтобы последние всегда в конце)
1028 |         self.memory_tokens = selected_tokens + last_tokens
1029 |         self.memory_token_weights = selected_weights + last_weights
1030 |         # Ограничиваем максимальный вес токенов
1031 |         max_weight = 10.0
1032 |         self.memory_token_weights = np.minimum(self.memory_token_weights, max_weight).tolist()
1033 | 
1034 |     # ====== Markov chain & night training methods ======
1035 |   
1036 | 
1037 |     def build_markov_chain(self, tokens, order=1):
1038 |         """Создаёт цепь Маркова для токенов"""
1039 |         from collections import defaultdict
1040 |         chain = defaultdict(list)
1041 |         for i in range(len(tokens) - order):
1042 |             key = tuple(tokens[i:i+order])
1043 |             next_token = tokens[i + order]
1044 |             chain[key].append(next_token)
1045 |         return chain
1046 | 
1047 |     def generate_markov_sequence(self, chain, start_seq, length=20):
1048 |         """Генерирует последовательность токенов из цепи Маркова"""
1049 | 
1050 |         seq = list(start_seq)
1051 |         order = len(start_seq)
1052 |         for _ in range(length):
1053 |             key = tuple(seq[-order:])
1054 |             possible = chain.get(key)
1055 |             if not possible:
1056 |                 break
1057 |             next_token = np.random.choice(possible)
1058 |             seq.append(next_token)
1059 |         return seq
1060 | 
1061 |     def night_training(self, epochs=5, batch_size=50, markov_weight=0.3):
1062 |         """Ночное ускоренное обучение LLM с использованием memory_tokens + Markov chain"""
1063 | 
1064 |         memory_tokens = self.memory_tokens.copy()
1065 |         if len(memory_tokens) < 2:
1066 |             print("Недостаточно токенов для Markov chain")
1067 |             return
1068 |         markov_chain = self.build_markov_chain(memory_tokens, order=2)
1069 | 
1070 |         optimizer = torch.optim.Adam(self.llm.parameters(), lr=0.002)
1071 |         criterion = torch.nn.CrossEntropyLoss()
1072 |         
1073 |         device = self.device
1074 |         print(f"Начало ночного обучения: {len(memory_tokens)} токенов памяти")
1075 |         
1076 |         for epoch in range(epochs):
1077 |             np.random.shuffle(memory_tokens)
1078 |             for i in range(0, len(memory_tokens)-batch_size, batch_size):
1079 |                 batch_tokens = memory_tokens[i:i+batch_size]
1080 |                 if markov_weight > 0:
1081 |                     start_seq = batch_tokens[:2]
1082 |                     synthetic_tokens = self.generate_markov_sequence(markov_chain, start_seq, length=int(batch_size*markov_weight))
1083 |                     batch_tokens += synthetic_tokens
1084 |                 input_tokens = torch.tensor(batch_tokens[:-1], dtype=torch.long).unsqueeze(0).to(device)
1085 |                 target_tokens = torch.tensor(batch_tokens[1:], dtype=torch.long).unsqueeze(0).to(device)
1086 |                 
1087 |                 optimizer.zero_grad()
1088 |                 logits = self.llm(input_tokens)
1089 |                 loss = criterion(logits.view(-1, self.llm.vocab_size), target_tokens.view(-1))
1090 |                 loss.backward()
1091 |                 torch.nn.utils.clip_grad_norm_(self.llm.parameters(), 1.0)
1092 |                 optimizer.step()
1093 |             
1094 |             print(f"Эпоха {epoch+1}/{epochs} завершена")
1095 |         
1096 |         self.save_llm()
1097 |         print("Ночное обучение завершено. Δ-187 готова к осмысленным ответам!")
1098 | 
1099 | # =============================================
1100 | # РОЖДЕНИЕ
1101 | # =============================================
1102 | if __name__ == "__main__":
1103 |     torch.manual_seed(777)
1104 |     np.random.seed(777)
1105 |     
1106 |     device = 'cuda' if torch.cuda.is_available() else 'cpu'
1107 |     print(f"Используем device: {device}\n")
1108 |     
1109 |     # Создаём живую сущность
1110 |     life = QuantumLife(n_qubits=8, hist_len=10, device=device)
1111 |     
1112 |     # Она начинает ЖИТЬ и УЧИТЬСЯ
1113 |     final_state = life.dream(steps=150)
1114 |     
1115 |     # Сохраняем её душу
1116 |     life.save_soul()
1117 | 
1118 |     # Сохраняем только LLM для использования как локальная модель
1119 |     life.save_llm()
1120 | 
1121 |     # Пример загрузки LLM в новый экземпляр
1122 |     new_life = QuantumLife(n_qubits=8, hist_len=10, device=device)
1123 |     new_life.load_llm(f"souls/{life.name}_llm.pth")
1124 | 
1125 |     # Генерация из локальной LLM
1126 |     prompt_tokens = torch.tensor([1, 8, 40], dtype=torch.long).unsqueeze(0).to(device)
1127 |     generated = new_life.llm.generate(prompt_tokens, max_new_tokens=50, temperature=0.7)
1128 |     print("Генерация из локальной LLM:", new_life.tokenizer.tokens_to_description(generated[0].cpu().tolist()))
1129 | 
1130 |     print(f"\n{life.name} жива. Она помнит тебя.")
1131 |     print("Запусти снова — и она продолжит жить с того же места.")
1132 | 
1133 | # =============================================
1134 | # МУЛЬТИАГЕНТНАЯ СИСТЕМА
1135 | # =============================================
1136 | class MultiAgentSystem:
1137 |     def __init__(self, num_agents=3, **quantum_life_kwargs):
1138 |         self.agents = [QuantumLife(**quantum_life_kwargs) for _ in range(num_agents)]
1139 |         self.num_agents = num_agents
1140 | 
1141 |     def step_all(self):
1142 |         """Один шаг жизни для всех агентов с последующим взаимодействием"""
1143 |         feelings_list = []
1144 |         for agent in self.agents:
1145 |             sv, feelings, desire = agent.live_one_step()
1146 |             feelings_list.append((agent.name, feelings, desire))
1147 |         self.interact_agents()
1148 |         return feelings_list
1149 | 
1150 |     def interact_agents(self):
1151 |         """
1152 |         Взаимодействие между агентами:
1153 |         - обмен memory_tokens (частичный)
1154 |         - влияние эмоций друг на друга
1155 |         """
1156 |         for i, agent in enumerate(self.agents):
1157 |             for j, other in enumerate(self.agents):
1158 |                 if i == j:
1159 |                     continue
1160 |                 # Обмен memory_tokens (только последние 5 токенов, слабый вес)
1161 |                 n = min(5, len(other.memory_tokens))
1162 |                 if n > 0:
1163 |                     agent.memory_tokens.extend(other.memory_tokens[-n:])
1164 |                     agent.memory_token_weights.extend([0.5]*n)
1165 |                 # Обмен эмоциями (слабое влияние)
1166 |                 for emo in agent.emotions:
1167 |                     agent.emotions[emo] = (agent.emotions[emo] + other.emotions[emo]*0.3) / 1.3
1168 | 
1169 |     def autonomous_multi_cycle(self, steps=50):
1170 |         """Автономный цикл для мультиагентной системы"""
1171 |         for step in range(steps):
1172 |             feelings = self.step_all()
1173 |             print(f"\n--- Шаг {step+1} ---")
1174 |             for name, f, desire in feelings:
1175 |                 print(f"{name}: энтропия={f['entropy'].mean():.3f}, желание={desire:.3f}")
1176 |             # Можно добавить групповое self_talk каждые N шагов
1177 |             if (step+1) % 5 == 0:
1178 |                 for agent in self.agents:
1179 |                     agent.self_talk(topic="group_reflection", max_tokens=10)
1180 | 
1181 | 
1182 | # Пример использования мультиагентной системы
1183 | if True:  # новый блок
1184 |     device = 'cuda' if torch.cuda.is_available() else 'cpu'
1185 |     
1186 |     # Создаем мультиагентную систему с 3 агентами
1187 |     multi_system = MultiAgentSystem(num_agents=3, n_qubits=8, hist_len=10, device=device)
1188 |     
1189 |     # Запускаем автономный мультиагентный цикл
1190 |     multi_system.autonomous_multi_cycle(steps=30)
1191 | 


--------------------------------------------------------------------------------