Performance Measurement - AI Verification and Evaluation - AI Agents in Action: Evaluation and Governance

07.AI/7. AI Benchmarks 2026. 3. 28. 07:56

https://nsp.nanet.go.kr/plan/subject/detail.do?nationalPlanControlNo=PLAN0000058313&newReportChk=list&highNationalPlanSubjectSn=4

AI Agents in Action: Foundations for Evaluation and Governance

(Korean title: AI 에이전트의 실제 활용: 평가 및 거버넌스의 기초)

Table of contents

Foreword

Executive summary

Introduction

1 Evolving technical foundations of AI agents

1.1 The software architecture of an AI agent

1.2 Communication protocols and interoperability

1.3 Cybersecurity considerations

2 Foundations for AI agent evaluation and governance

2.1 Classification

2.2 Evaluation

2.3 Risk assessment

2.4 Governance considerations for AI agents: a progressive approach

3 Looking ahead: multi-agent ecosystems

Conclusion

Contributors

Endnotes

Posted by Mr. Slumber
