Skip to content

feat: improve coding skill score from 52% to 75%#81

Open
yogesh-tessl wants to merge 1 commit into
griddynamics:mainfrom
yogesh-tessl:improve/skill-review-optimization
Open

feat: improve coding skill score from 52% to 75%#81
yogesh-tessl wants to merge 1 commit into
griddynamics:mainfrom
yogesh-tessl:improve/skill-review-optimization

Conversation

@yogesh-tessl
Copy link
Copy Markdown

Hey @isolomatov-gd 👋

seriously thoughtful work. Rosetta is going after exactly the right problem - meta-prompting, context engineering, and centralised instruction management are where things get hard, and you’re tackling it head-on. The 31 well-structured skills, covering everything from workspace discovery to risk assessment, show real depth.

also ran your skills through tessl skill review at work and found some targeted improvements for the coding skill. Here's the before/after:

Skill Before After Change
coding 52% 75% +23%

The other skills I baselined (planning 66%, debugging 71%, self-organization 78%, orchestrator-contract 75%, etc.) were left untouched. This PR focuses on the single highest-impact change.

Changes made

Description improvements (biggest impact):

  • Replaced abstract buzzwords ("KISS/SOLID/DRY principles, zero-tolerance quality") with concrete actions: "implement features, fix bugs, refactor code with environment-aware configuration (local/dev/test/prod), dependency-ordered validation (databases → APIs → browser → mobile)"
  • Added explicit "Use when..." clause with specific triggers: code changes, DevOps scripts, CI/CD pipelines, code review, impact analysis
  • Description specificity went from 1/3 → 3/3 and completeness from 2/3 → 3/3

Content improvements:

  • Added explicit 6-step implementation workflow: Load context → Analyse impact → Implement → Validate → Review → Update docs
  • Added feedback loop for validation failures: diagnose with debugging skill → fix → re-validate from step 4
  • Compressed well-known principle acronyms (KISS, SOLID, DRY, YAGNI, MECE) into actionable constraints - Claude already knows these; the tokens are better spent on workflow structure
  • Expanded pitfalls from 1 → 4 items covering common failure modes (missed callers, redundant patterns, cross-environment validation, working directory issues)
  • Moved best_practices guidance into the workflow steps where it's actionable, eliminating a separate section
  • Workflow clarity went from 2/3 → 3/3

also stress-tested your coding skill against a task eval involving dependency-ordered validation across database schema changes, API endpoints, and browser flows. It navigated the layered validation sequence really well. Kudos for that.

Honest disclosure. I work at https://github.com/tesslio where we build tooling around skills like these. Not a pitch, just saw room for improvement and wanted to contribute.

If you want to self-improve your skills, or define your own scenarios to pressure test, just ask your agent (Claude Code, Codex, etc.) to evaluate and optimize your skill with Tessl. Ping me @yogesh-tessl, if you hit any snags.

Hey @isolomatov-gd 👋

I ran your skills through `tessl skill review` at work and found some targeted improvements for the `coding` skill. Here's the before/after:

| Skill | Before | After | Change |
|-------|--------|-------|--------|
| coding | 52% | 75% | **+23%** |

The other skills I baselined (planning 66%, debugging 71%, self-organization 78%, orchestrator-contract 75%, etc.) were left untouched — this PR focuses on the single highest-impact change.

<details>
<summary>Changes made</summary>

**Description improvements (biggest impact):**
- Replaced abstract buzzwords ("KISS/SOLID/DRY principles, zero-tolerance quality") with concrete actions: "implement features, fix bugs, refactor code with environment-aware configuration (local/dev/test/prod), dependency-ordered validation (databases → APIs → browser → mobile)"
- Added explicit "Use when..." clause with specific triggers: code changes, DevOps scripts, CI/CD pipelines, code review, impact analysis
- Description specificity went from 1/3 → 3/3 and completeness from 2/3 → 3/3

**Content improvements:**
- Added explicit 6-step implementation workflow: Load context → Analyse impact → Implement → Validate → Review → Update docs
- Added feedback loop for validation failures: diagnose with `debugging` skill → fix → re-validate from step 4
- Compressed well-known principle acronyms (KISS, SOLID, DRY, YAGNI, MECE) into actionable constraints — Claude already knows these; the tokens are better spent on workflow structure
- Expanded pitfalls from 1 → 4 items covering common failure modes (missed callers, redundant patterns, cross-environment validation, working directory issues)
- Moved `best_practices` guidance into the workflow steps where it's actionable, eliminating a separate section
- Workflow clarity went from 2/3 → 3/3

</details>

I also stress-tested your `coding` skill against a task eval involving dependency-ordered validation across database schema changes, API endpoints, and browser flows — it navigated the layered validation sequence really well. Kudos for that.

Honest disclosure — I work at @tesslio where we build tooling around skills like these. Not a pitch — just saw room for improvement and wanted to contribute.

Want to self-improve your skills? Just point your agent (Claude Code, Codex, etc.) at [this Tessl guide](https://docs.tessl.io/evaluate/optimize-a-skill-using-best-practices) and ask it to optimize your skill. Ping me — [@yogesh-tessl](https://github.com/yogesh-tessl) — if you hit any snags.

Thanks in advance 🙏
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant