Picture for Ulf Schlichtmann

Ulf Schlichtmann

KV Packet: Recomputation-Free Context-Independent KV Caching for LLMs

Add code
Apr 14, 2026
Viaarxiv icon

OptINC: Optical In-Network-Computing for Scalable Distributed Learning

Add code
Mar 30, 2026
Viaarxiv icon

Large Language Models (LLMs) for Electronic Design Automation (EDA)

Add code
Aug 27, 2025
Figure 1 for Large Language Models (LLMs) for Electronic Design Automation (EDA)
Figure 2 for Large Language Models (LLMs) for Electronic Design Automation (EDA)
Figure 3 for Large Language Models (LLMs) for Electronic Design Automation (EDA)
Figure 4 for Large Language Models (LLMs) for Electronic Design Automation (EDA)
Viaarxiv icon

GENIE-ASI: Generative Instruction and Executable Code for Analog Subcircuit Identification

Add code
Aug 26, 2025
Viaarxiv icon

BasisN: Reprogramming-Free RRAM-Based In-Memory-Computing by Basis Combination for Deep Neural Networks

Add code
Jul 04, 2024
Viaarxiv icon

LiveMind: Low-latency Large Language Models with Simultaneous Inference

Add code
Jun 20, 2024
Figure 1 for LiveMind: Low-latency Large Language Models with Simultaneous Inference
Figure 2 for LiveMind: Low-latency Large Language Models with Simultaneous Inference
Figure 3 for LiveMind: Low-latency Large Language Models with Simultaneous Inference
Figure 4 for LiveMind: Low-latency Large Language Models with Simultaneous Inference
Viaarxiv icon

Memory Is All You Need: An Overview of Compute-in-Memory Architectures for Accelerating Large Language Model Inference

Add code
Jun 12, 2024
Figure 1 for Memory Is All You Need: An Overview of Compute-in-Memory Architectures for Accelerating Large Language Model Inference
Figure 2 for Memory Is All You Need: An Overview of Compute-in-Memory Architectures for Accelerating Large Language Model Inference
Figure 3 for Memory Is All You Need: An Overview of Compute-in-Memory Architectures for Accelerating Large Language Model Inference
Figure 4 for Memory Is All You Need: An Overview of Compute-in-Memory Architectures for Accelerating Large Language Model Inference
Viaarxiv icon

EncodingNet: A Novel Encoding-based MAC Design for Efficient Neural Network Acceleration

Add code
Feb 25, 2024
Viaarxiv icon

Logic Design of Neural Networks for High-Throughput and Low-Power Applications

Add code
Sep 19, 2023
Figure 1 for Logic Design of Neural Networks for High-Throughput and Low-Power Applications
Figure 2 for Logic Design of Neural Networks for High-Throughput and Low-Power Applications
Figure 3 for Logic Design of Neural Networks for High-Throughput and Low-Power Applications
Figure 4 for Logic Design of Neural Networks for High-Throughput and Low-Power Applications
Viaarxiv icon

MLonMCU: TinyML Benchmarking with Fast Retargeting

Add code
Jun 15, 2023
Figure 1 for MLonMCU: TinyML Benchmarking with Fast Retargeting
Figure 2 for MLonMCU: TinyML Benchmarking with Fast Retargeting
Figure 3 for MLonMCU: TinyML Benchmarking with Fast Retargeting
Figure 4 for MLonMCU: TinyML Benchmarking with Fast Retargeting
Viaarxiv icon