">
 

How Senior Engineers Use AI Without Burning Through Token Limits - Reduce AI Token Usage by 60–90%

Iniciado por joomlamz, Hoje at 14:25

Respostas: 1   |   Visualizações: 1

Tópico anterior - Tópico seguinte

0 Membros e 1 Visitante estão a ver este tópico.

Olá a todos os membros do fórum webmastersmz.com. Hoje, vamos discutir sobre uma atualização de segurança importante para o sistema operacional AlmaLinux, relacionada ao componente CIFSwitch (CVE-2026-46243). Vamos analisar os principais pontos desta atualização e discutir como ela afeta os servidores empresariais.

**Análise Técnica**

A atualização de segurança CIFSwitch (CVE-2026-46243) foi identificada como uma vulnerabilidade crítica no sistema operacional AlmaLinux. A vulnerabilidade permite ataques remotos de execução de código, o que pode levar a uma perda de confidencialidade, integridade e disponibilidade dos dados dos servidores.

Para corrigir esta vulnerabilidade, é necessário atualizar o componente CIFSwitch para a versão mais recente. A atualização é recomendada para todos os servidores que executam o sistema operacional AlmaLinux e têm o componente CIFSwitch instalado.

**Consequências da Inação**

Se não atualizar o componente CIFSwitch, os servidores podem ser vulneráveis a ataques maliciosos. Em casos extremos, a falta de atualização pode levar a uma perda total dos dados dos servidores, o que pode ser devastador para as empresas.

**Precauções ao Atualizar**

Quando atualizar o componente CIFSwitch, é importante seguir as instruções do fornecedor do sistema operacional e do componente. Além disso, é recomendável realizar um backup dos dados dos servidores antes da atualização.

**Conclusão**

A atualização de segurança CIFSwitch (CVE-2026-46243) é uma medida importante para garantir a segurança dos servidores empresariais que executam o sistema operacional AlmaLinux. É fundamental atualizar o componente CIFSwitch para a versão mais recente para evitar ataques maliciosos e proteger os dados dos servidores.

**Convidação ao Serviço de Alojamento da AplicHost**

Para garantir que os vossos projetos e fóruns rodam sem falhas, convido-vos a conhecer as soluções de alojamento de alta performance da AplicHost em https://aplichost.com. Nossa equipe experiente está à disposição para ajudar a encontrar a melhor opção de alojamento para os vossos necessidades.

How Senior Engineers Use AI Without Burning Through Token Limits - Reduce AI Token Usage by 60–90%



Tópico: How Senior Engineers Use AI Without Burning Through Token Limits - Reduce AI Token Usage by 60–90%
Categoria: Tutoriais | Programação & Tecnologia
Idioma Principal: Português (Conteúdo de Tecnologia)

Descrição do Conteúdo / Informações:
-------------------------------------------------------------------------
Last month I watched a developer exhaust their Claude usage limit in less than a week.

They weren't generating massive applications.

They weren't building complex AI systems.

They were simply asking AI to repeatedly scan the same repository, read the same files, and explain the same architecture over and over again.

Sound familiar?

As AI-assisted development becomes mainstream, many teams are discovering a new engineering challenge:

Token efficiency.

Just as experienced engineers learned to optimize cloud spend, senior engineers are now learning to optimize AI context.

The difference between a developer who runs out of tokens every few days and one who comfortably works all month often isn't the AI model.

It's how they manage context.

Here's the toolkit and workflow I've seen work consistently.



The Hidden Cost of Vibe Coding


Imagine you ask:

Fix a bug in PaymentService.ts

Your AI assistant proceeds to:

• Scan the entire repository

• Read infrastructure code

• Explore frontend folders

• Traverse documentation

• Load previous conversations

• Inspect unrelated dependencies

You asked about one file.

The model consumed context from hundreds.

That's where your tokens disappear.

The goal isn't to reduce intelligence.

The goal is to reduce unnecessary context.



1. RTK: Stop Paying For Useless Command Output


One of the biggest hidden token sinks is terminal output.

Many AI coding agents automatically consume:

• npm install logs

• build outputs

• test results

• deployment logs

• dependency resolution messages

Most of this information is irrelevant.

Tools like RTK solve this problem.



What RTK Does


RTK acts as a proxy layer between your development environment and the LLM.

Instead of forwarding everything:

npm install

RTK filters:

• redundant messages

• repeated warnings

• progress indicators

• noise

before they ever reach the model.



Benefits


Reported reductions:

• 60–90% reduction in token consumption for common development workflows

• Faster agent reasoning

• Cleaner context windows

The principle is simple:

If a human wouldn't read it, the model probably shouldn't either.



2. Lean-CTX: Compress Context Before It Reaches The Model


Most developers optimize prompts.

Few optimize files.

Large source files often contain:

• generated code

• comments

• repetitive structures

• boilerplate

Lean-CTX dynamically compresses and optimizes file content before it gets sent to the model.



Why It Matters


Instead of sending:

4,000 line file

you might send:

Relevant functions
Dependencies
Symbols
Interfaces

The AI receives the information it needs while consuming significantly fewer tokens.

Think of it as:

gzip for AI context.



3. AI Codex & Repository Indexers


One of the most expensive activities in AI coding is:

"Explore my codebase."

The model begins reading dozens of files trying to understand:

• routes

• APIs

• schemas

• services

• components

This exploratory phase can easily burn tens of thousands of tokens.

Repository indexing tools solve this.



What They Generate


Instead of scanning everything:

Generate:

ROUTES.md
DATABASE_SCHEMA.md
COMPONENTS.md
SERVICES.md
DEPENDENCIES.md

Now the AI can understand the system from five small files instead of 500 source files.



Typical Savings


Many teams report avoiding:

• 30k–50k tokens

during initial codebase exploration.

This is one of the highest ROI improvements you can make.



The Caveman Rule: My Favorite Token Hack


This sounds ridiculous.

But it works.

When you need code, you don't need essays.

You don't need:

Certainly! Here's a detailed explanation...

You need:

Bug here.
Fix this.
Run test.
Done.

The Caveman Rule instructs the AI to:

• skip conversational filler

• avoid lengthy summaries

• communicate with minimal words

Example:

Instead of:

I've identified several possible root causes...

You get:

Null value here.
Add guard clause.
Problem solved.

The technical accuracy remains.

The verbosity disappears.

Many developers report output token reductions approaching 75%.



Create A Project Brain


One of the biggest mistakes I see:

Developers repeatedly explaining their project.

Every new session starts with:

We're using:
- Node.js
- PostgreSQL
- Kubernetes
- OpenTelemetry
- GitHub Actions

Again.

And again.

And again.

Instead create:

CLAUDE.md
AGENTS.md
PROJECT_CONTEXT.md
ARCHITECTURE.md

Store:

• architecture

• conventions

• coding standards

• deployment patterns

• repository structure

Now every session starts with shared understanding.

The AI spends less time learning.

You spend fewer tokens teaching.



The Fragmented Code Approach


Another expensive habit:

Rewrite the entire file.

The AI responds with:

2,000 lines

You pay for all of it.

Instead ask:

Modify only lines 120–150.
Return patch only.
No summary.

Benefits:

• fewer output tokens

• smaller future context

• easier reviews

• lower costs

The best AI engineers increasingly think in patches, not rewrites.



Native IDE Features Most Developers Ignore


Many modern AI IDEs already provide token optimization features.

Most people never use them.



Cost Caps


Set:

• maximum tool calls

• session budgets

• usage limits

Treat tokens like cloud spend.

Because they are.



Compact Sessions


Claude and other tools support context compaction.

Example:

/compact

This removes:

• redundant conversation history

• obsolete decisions

• resolved issues

while preserving important context.

Think:

garbage collection for conversations.



New Session, New Problem


One of the easiest wins:

Start fresh.

When:

• a feature is complete

• a bug is resolved

• you're switching domains

create a new session.

Old conversations become baggage.

The model keeps re-reading:

• mistakes

• abandoned approaches

• irrelevant context

Fresh context often produces better results.



My Personal Context Engineering Checklist


Before asking AI anything:



Repository


Exclude:

node_modules/
dist/
coverage/
build/
.next/
target/



Context


Maintain:

CLAUDE.md
AGENTS.md
ARCHITECTURE.md
PROJECT_CONTEXT.md



Tooling


Use:

• RTK

• Lean-CTX

• AI Codex

• Repository Indexers

• Semantic Search

• Code Graphs



Prompting


Prefer:

Patch only.
No summary.

instead of:

Explain everything.



Sessions


• Compact regularly

• Start fresh often

• Keep contexts small



Final Thoughts


For years we optimized:

• cloud costs

• compute costs

• storage costs

• network costs

Now we need to optimize:

• context costs

The next generation of high-performing AI engineers won't be the people with the biggest context windows.

They'll be the people who know exactly what context to send.

Prompt engineering helped us talk to AI.

Context engineering helps us scale AI.

And in the age of vibe coding, context is the new compute.


Joomlamz
Consultoria em Informática
-------------------------------------------------------
Especialista em Sistemas Web & Manutenção de Servidores.
A desenvolver o novo AplPortal com suporte a PHP 8.
Precisa de ajuda profissional? Contacte-me.

Tags: