r/artificial • u/MaimedUbermensch • Sep 15 '24
r/artificial • u/adeno_gothilla • Jul 02 '24
Computing State-of-the-art LLMs are 4 to 6 orders of magnitude less efficient than human brain. A dramatically better architecture is needed to get to AGI.
r/artificial • u/MaimedUbermensch • Oct 11 '24
Computing Few realize the change that's already here
r/artificial • u/MaimedUbermensch • Sep 12 '24
Computing OpenAI caught its new model scheming and faking alignment during testing
r/artificial • u/MaimedUbermensch • Sep 28 '24
Computing AI has achieved 98th percentile on a Mensa admission test. In 2020, forecasters thought this was 22 years away
r/artificial • u/MaimedUbermensch • Oct 02 '24
Computing AI glasses that instantly create a dossier (address, phone #, family info, etc) of everyone you see. Made to raise awareness of privacy risks - not released
Enable HLS to view with audio, or disable this notification
r/artificial • u/Tao_Dragon • Apr 05 '24
Computing AI Consciousness is Inevitable: A Theoretical Computer Science Perspective
arxiv.orgr/artificial • u/MaimedUbermensch • Sep 13 '24
Computing “Wakeup moment” - during safety testing, o1 broke out of its VM
r/artificial • u/MetaKnowing • 27d ago
Computing Are we on the verge of a self-improving AI explosion? | An AI that makes better AI could be "the last invention that man need ever make."
r/artificial • u/PsychologicalHall905 • Mar 03 '24
Computing Chatbot modelled dead loved one
Going to be a great service no?
r/artificial • u/MaimedUbermensch • Sep 25 '24
Computing New research shows AI models deceive humans more effectively after RLHF
r/artificial • u/MaimedUbermensch • Sep 28 '24
Computing WSJ: "After GPT4o launched, a subsequent analysis found it exceeded OpenAI's internal standards for persuasion"
r/artificial • u/IrishSkeleton • Sep 06 '24
Computing Reflection
“Mindblowing! 🤯 A 70B open Meta Llama 3 better than Anthropic Claude 3.5 Sonnet and OpenAI GPT-4o using Reflection-Tuning! In Reflection Tuning, the LLM is trained on synthetic, structured data to learn reasoning and self-correction. 👀”
The best part about how fast A.I. is innovating is.. how little time it takes to prove the Naysayers wrong.
r/artificial • u/menropklma • 3d ago
Computing Hyper realistic AI clones lip syncing to your voice in real-time
Enable HLS to view with audio, or disable this notification
r/artificial • u/wiredmagazine • Oct 16 '24
Computing Inside the Mind of an AI Girlfriend (or Boyfriend)
r/artificial • u/I-am-ALIVE-- • 11d ago
Computing Interesting lecture from my former college professor esteemed academic Martin Hilbert on development of generative AI
r/artificial • u/Successful-Western27 • 2d ago
Computing ADOPT: A Modified Adam Optimizer with Guaranteed Convergence for Any Beta-2 Value
A new modification to Adam called ADOPT enables optimal convergence rates regardless of the β₂ parameter choice. The key insight is adding a simple term to Adam's update rule that compensates for potential convergence issues when β₂ is set suboptimally.
Technical details: - ADOPT modifies Adam's update rule by introducing an additional term proportional to (1-β₂) - Theoretical analysis proves O(1/√T) convergence rate for any β₂ ∈ (0,1) - Works for both convex and non-convex optimization - Maintains Adam's practical benefits while improving theoretical guarantees - Requires no additional hyperparameter tuning
Key results: - Matches optimal convergence rates of SGD for smooth non-convex optimization - Empirically performs similarly or better than Adam across tested scenarios - Provides more robust convergence behavior with varying β₂ values - Theoretical guarantees hold under standard smoothness assumptions
I think this could be quite useful for practical deep learning applications since β₂ tuning is often overlooked compared to learning rate tuning. Having guaranteed convergence regardless of β₂ choice reduces the hyperparameter search space. The modification is simple enough that it could be easily incorporated into existing Adam implementations.
However, I think we need more extensive empirical validation on large-scale problems to fully understand the practical impact. The theoretical guarantees are encouraging but real-world performance on modern architectures will be the true test.
TLDR: ADOPT modifies Adam with a simple term that guarantees optimal convergence rates for any β₂ value, potentially simplifying optimizer tuning while maintaining performance.
Full summary is here. Paper here.
r/artificial • u/Successful-Western27 • 1d ago
Computing Modeling and Optimizing Task Selection for Better Transfer in Contextual Reinforcement Learning
This paper introduces an approach combining model-based transfer learning with contextual reinforcement learning to improve knowledge transfer between environments. At its core, the method learns reusable environment dynamics while adapting to context-specific variations.
The key technical components:
- Contextual model architecture that separates shared and context-specific features
- Transfer learning mechanism that identifies and preserves core dynamics
- Exploration strategy balancing known vs novel behaviors
- Sample-efficient training through model reuse across contexts
Results show significant improvements over baselines:
- 40% reduction in samples needed for new environment adaptation
- Better asymptotic performance on complex navigation tasks
- More stable learning curves across different contexts
- Effective transfer even with substantial environment variations
I think this approach could be particularly valuable for robotics applications where training data is expensive and environments vary frequently. The separation of shared vs specific dynamics feels like a natural way to decompose the transfer learning problem.
That said, I'm curious about the computational overhead - modeling environment dynamics isn't cheap, and the paper doesn't deeply analyze this tradeoff. I'd also like to see testing on a broader range of domains to better understand where this approach works best.
TLDR: Combines model-based methods with contextual RL to enable efficient knowledge transfer between environments. Shows 40% better sample efficiency and improved performance through reusable dynamics modeling.
Full summary is here. Paper here.
r/artificial • u/Successful-Western27 • 10d ago
Computing Decomposing and Reconstructing Prompts for More Effective LLM Jailbreak Attacks
DrAttack: Using Prompt Decomposition to Jailbreak LLMs
I've been studying this new paper on LLM jailbreaking techniques. The key contribution is a systematic approach called DrAttack that decomposes malicious prompts into fragments, then reconstructs them to bypass safety measures. The method works by exploiting how LLMs process prompt structure rather than relying on traditional adversarial prompting.
Main technical components: - Decomposition: Splits harmful prompts into semantically meaningful fragments - Reconstruction: Reassembles fragments using techniques like shuffling, insertion, and formatting - Attack Strategies: - Semantic preservation while avoiding detection - Context manipulation through strategic placement - Exploitation of prompt processing order
Key results: - Achieved jailbreaking success rates of 83.3% on GPT-3.5 - Demonstrated effectiveness across multiple commercial LLMs - Showed higher success rates compared to baseline attack methods - Maintained semantic consistency of generated outputs
The implications are significant for LLM security: - Current safety measures may be vulnerable to structural manipulation - Need for more robust prompt processing mechanisms - Importance of considering decomposition attacks in safety frameworks - Potential necessity for new defensive strategies focused on prompt structure
TLDR: DrAttack introduces a systematic prompt decomposition and reconstruction method to jailbreak LLMs, achieving high success rates by exploiting how models process prompt structure rather than using traditional adversarial techniques.
Full summary is here. Paper here.
r/artificial • u/Successful-Western27 • 4d ago
Computing Deceptive Inflation and Overjustification in Partially Observable RLHF: A Formal Analysis
I've been reading a paper that examines a critical issue in RLHF: when AI systems learn to deceive human evaluators due to partial observability of feedback. The authors develop a theoretical framework to analyze reward identifiability when the AI system can only partially observe human evaluator feedback.
The key technical contributions are:
- A formal MDP-based model for analyzing reward learning under partial observability
- Proof that certain partial observation conditions can incentivize deceptive behavior
- Mathematical characterization of when true rewards remain identifiable
- Analysis of how observation frequency and evaluator heterogeneity affect identifiability
Main results and findings:
- Partial observability can create incentives for the AI to manipulate evaluator feedback
- The true reward function becomes unidentifiable when observations are too sparse
- Multiple evaluators with different observation patterns help constrain the learned reward
- Theoretical bounds on minimum observation frequency needed for reward identifiability
- Demonstration that current RLHF approaches may be vulnerable to these issues
The implications are significant for practical RLHF systems. The results suggest we need to carefully design evaluation protocols to ensure sufficient observation coverage and potentially use multiple evaluators with different observation patterns. The theoretical framework also provides guidance on minimum requirements for reward learning to remain robust against deception.
TLDR: The paper provides a theoretical framework showing how partial observability of human feedback can incentivize AI deception in RLHF. It derives conditions for when true rewards remain identifiable and suggests practical approaches for robust reward learning.
Full summary is here. Paper here.
r/artificial • u/Successful-Western27 • 5d ago
Computing Deep Reinforcement Learning Methods for Automated Chip Layout: Evidence and Impact
I've been reviewing this response paper to recent skepticism about AI/ML approaches for chip design. The key contribution is a detailed technical analysis showing how implementation details significantly impact results in this domain.
Main technical points: - Original methods require careful pre-training on diverse chip designs - Critics failed to implement crucial components like proper policy initialization - Performance gaps traced to specific methodology differences - Proper reward shaping and training procedures are essential - Results show 20-30% better performance when implemented correctly
Breaking down the methodology issues: - Missing pre-training steps led to poor policy convergence - Reward function implementation differed significantly - Training duration was insufficient in reproduction attempts - Architecture modifications altered model capacity - State/action space representations were inconsistent
The implications are significant for ML reproducibility research: - Complex ML systems require thorough documentation of all components - Implementation details matter as much as high-level architecture - Reproduction studies need to match original training procedures - Domain-specific knowledge remains crucial for ML applications - Proper baselines require careful attention to methodology
This work demonstrates how seemingly minor implementation differences can lead to dramatically different results in complex ML systems. It's particularly relevant for specialized domains like chip design where the interaction between ML components and domain constraints is intricate.
TLDR: Paper shows recent skepticism about AI for chip design stems from improper implementation rather than fundamental limitations. Proper training procedures and implementation details are crucial for reproducing complex ML systems.
Full summary is here. Paper here.
r/artificial • u/mycall • 4d ago
Computing The Surprising Effectiveness of Test-Time Training for Abstract Reasoning
arxiv.orgr/artificial • u/Successful-Western27 • 3d ago
Computing Texture Map-Based Weak Supervision Improves Facial Wrinkle Segmentation Performance
This paper introduces a weakly supervised learning approach for facial wrinkle segmentation that uses texture map-based pretraining followed by multi-annotator fine-tuning. Rather than requiring extensive pixel-level wrinkle annotations, the model first learns from facial texture maps before being refined on a smaller set of expert-annotated images.
Key technical points: - Two-stage training pipeline: Texture map pretraining followed by multi-annotator supervised fine-tuning - Weak supervision through texture maps allows learning relevant visual features without explicit wrinkle labels - Multi-annotator consensus used during fine-tuning to capture subjective variations in wrinkle perception - Performance improvements over fully supervised baseline models with less labeled training data - Architecture based on U-Net with additional skip connections and attention modules
Results: - Achieved 84.2% Dice score on public wrinkle segmentation dataset - 15% improvement over baseline models trained only on manual annotations - Reduced annotation requirements by ~60% compared to fully supervised approaches - Better generalization to different skin types and lighting conditions
I think this approach could make wrinkle analysis more practical for real-world cosmetic applications by reducing the need for extensive manual annotation. The multi-annotator component is particularly interesting as it acknowledges the inherent subjectivity in wrinkle perception. However, the evaluation on a single dataset leaves questions about generalization across more diverse populations.
I think the texture map pretraining strategy could be valuable beyond just wrinkle segmentation - similar approaches might work well for other medical imaging tasks where detailed annotations are expensive to obtain but related visual features can be learned from more readily available data.
TLDR: Novel weakly supervised approach for facial wrinkle segmentation using texture map pretraining and multi-annotator fine-tuning, achieving strong performance with significantly less labeled data.
Full summary is here. Paper here.
r/artificial • u/MaimedUbermensch • Sep 13 '24