Eukaryotic NAT genes

 

 

Original List (2013)

 

For additional NAT genes annotated in fungi and protists, see here

 

 

PROTISTS: See table 1

FUNGI: See table 2

PLANTS: No NAT genes identified to date

ANIMALS: See table 3

 

 

Table 1: Annotation of NAT genes in sequenced genomes of protists(a),(b)

 

Taxon

Organism scientific name

Organism identification code

Taxonomy identifier

Gene

ORF (nt)

Protein (aa)

Exon span

Number of introns

Locus tag

TPA accession number

Heterolobosea (1)

Naegleria gruberi NEG-M

NAEGR

5762

NAT1

969

322

1 (1-969)

0

NAEGRDRAFT_80760

BN001457

[41]

NAT2

990

329

1 (1-990)

0

NAEGRDRAFT_56390

BN001458

[41]

Amoebozoa (5)

Dictyosteliida (2)

Dictyostelium discoideum AX4

DICDI

352472

NAT1

921

306

1 (1-310)

2 (410-1020)

1

DDB_G0288509

BN001450

[41]

NAT2

978

325

1 (1-301)

2 (415-1091)

1

DDB_G0288503

BN001451

[41]

NAT3

900

299

1 (1-310)

2 (402-991)

1

DDB_G0288507

BN001452

[41]

NAT4

837

278

1 (1-837)

0

DDB_G0287121

BN001453

[41]

NAT5

870

289

1 (1-870)

0

DDB_G0276993

BN001454

[41]

NAT6

1464

487

1 (1-256)

2 (324-784)

3 (1959-2220)

4 (2321-2776)

5 (2805-2833)

4

DDB_G0277549

BN001455

[41]

Polysphondylium pallidum PN500

POLPA

670386

NAT1

822

273

1 (1-822)

0

PPL_12417

BN001460

[41]

NAT2

777

258

1 (1-562)

2 (623-837)

1

PPL_12418

BN001461

[41]

Stramenopiles (7)

Oomycetes (5)

Saprolegnia parasitica CBS 223.65

SAPPC

695850

NAT1

951

316

1 (1-951)

0

 

BN001462

[41]

NAT2

933

310

1 (1-933)

0

 

BN001463

[41]

Bacillariophyta (diatoms) (2)

Phaeodactylum tricornutum CCAP 1055/1

PHATR

556484

NAT1

981

326

1 (1-981)

0

 

BN001459

[41]

Alveolata (18)

Ciliophora (2)

Paramecium tetraurelia d4-2

PARTE

412030

NAT1

861

286

1 (1-861)

0

GSPATT00024900001

BN001464

[41]

NAT2

861

286

1 (1-861)

0

GSPATT00029589001

BN001465

[41]

NAT3

867

288

1 (1-867)

0

GSPATT00029590001

BN001466

[41]

NAT4

441

146

1 (441)

0

GSPATT00029592001

BN001467

[41]

Tetrahymena thermophila SB210

TETTH

312017

NAT1

858

285

1 (1-858)

0

TTHERM_00616140

BN001468

[41]

NAT2

873

290

1 (1-873)

0

TTHERM_00408820

BN001469

[41]

NAT3

 

 

 

885

 

 

 

294

 

 

 

1 (1-491)

2 (605-758)

3 (1905-2001)

4 (2276-2418)

3

 

 

 

TTHERM_00693000

 

 

 

BN001470

[41]

NAT4

861

286

1 (1-861)

0

TTHERM_01222540

BN001471

[41]

NAT5

858

285

1 (1-858)

0

TTHERM_00888050

BN001472

[41]

NAT6

843

280

1 (1-843)

0

TTHERM_00335770

BN001473

[41]

Choanoflagellida (1)

Monosiga brevicollis MX1

MONBE

431895

NAT1

1029

342

1 (1-1029)

0

MONBRDRAFT_7289

BN001456

[41]

Chlorophyta (2)

Chlamydomonas reinhardtii CC-503 cw92 mt+

CHLRE

3055

NAT1

924 (partial 3' end)

308 (partial C-terminus)

1 (1-99)

2 (390-470)

3 (757-822)

4 (1639-1698)

5 (1862-1942)

6 (2429-2491)

7 (3163-3255)

8 (3760-3821)

9 (4573-4704)

10 (5094-5118)

11 (5911-6051)

12 (6339-6504) 13 (7099-7145)

12

CHLREDRAFT_187400

BN001479

[41]

Footnotes, Table 1:

(a)          The term "protists" is used to describe a diverse range of lower eukaryotic organisms and does not refer to any particular monophyletic taxonomic group. The appropriate reference sequence for each taxonomic group of protists should be requested from the manager of this website.

(b)          Column key – Column 1, 2: The numbers in brackets are sequenced genomes of protist species; Columns 2-4: The scientific name, official identification code and unique taxonomy identifier for each organism are available from the UniProt Taxonomy Browser (http://www.uniprot.org/taxonomy/); Columns 5-9: The retrieval and annotation of NAT-homologous sequences (ORF span and exon-intron structure) were carried out as described [41], and the gene symbols were assigned according to the guidelines of the NAT Gene Nomenclature Committee. The sequence length of open reading frames (ORF) is shown in nucleotides (nt) and of deduced proteins in amino acids (aa); Column 10: Where available, the locus tags are provided to anchor each annotated NAT gene to the corresponding genomic map; Column 11: Third-party annotation (TPA) accession numbers were assigned by the EMBL database and can be used to retrieve the complete details of NAT genes in the table.

 

 

 

Table 2: Annotation of NAT genes in sequenced genomes of fungi(a),(b)

 

Phylum

 

 

Organism scientific name

 

Organism

identification

code

Taxonomy identifier

Gene

ORF (nt)

Protein (aa)

Exon span

 

Number

of introns

 

Locus tag

 

 

TPA accession number

Chytridiomycota (2)

 

Batrachochytrium dendrobatidis

BATDE

 

109871

 

NAT1

 

897

 

298

 

1 (1-891)

2 (981-986)

1

 

BDEG_00384

 

BN001378

[41]

Basidiomycota (16)

Pucciniomycotina (2)

Puccinia graminis f. sp. tritici CRL 75-36-700-3

 

PUCGR

 

 

418459

 

 

ΝΑΤ1

 

 

1008

 

 

335

 

 

1 (1-345)

2 (430-837)

3 (986-1240)

2

 

 

PGTG_03435

 

 

BN001423

[41]

Ustilaginomycotina (4)

Malassezia globosa CBS 7966

MALGO

 

425265

 

ΝΑΤ1

 

834

 

277

 

1 (1-834)

 

0

 

MGL_4060

 

BN001438

[41]

Ustilago maydis 521

 

USTMA

 

237631

 

ΝΑΤ1

 

1119

 

372

 

1 (1-102)

2 (177-1193)

1

 

UM01576

 

BN001424

[41]

Agaricomycotina (10)

Coprinus cinereus (Coprinopsis cinerea) okayama 7-130

COPC7

240176

ΝΑΤ1

 

 

 

1089

 

 

 

362

 

 

 

1 (1-233)

2 (285-414)

3 (464-725)

4 (784-1247)

3

 

 

 

CC1G_07781

 

 

 

BN001421

[41]

 

 

ΝΑΤ2

 

 

 

1089

 

 

 

362

 

 

 

1 (1-230)

2 (287-415)

3 (493-749)

4 (831-1303)

3

 

 

 

CC1G_07803

 

 

 

BN001422

[41]

 

 

Moniliophthora perniciosa FA553

 

CRIPE

 

554373

 

NAT1

 

459 (partial 3' end)

153

(partial C-terminus)

1 (1-459)

 

0

 

 

 

BN001437

[41]

Ascomycota (122)

Pezizomycotina (78)

Ajellomyces capsulata G186AR

 

 

AJECG

 

 

447093

 

 

ΝΑΤ1

 

 

963

 

 

320

 

 

1 (1-402)

2 (534-580)

3 (644-1157)

2

 

 

 

 

 

BN001360

 

 

Ajellomyces capsulata H88

 

 

AJEC8

 

 

544711

 

 

ΝΑΤ1

 

 

963

 

 

320

 

 

1 (1-402)

2 (534-580)

3 (645-1158)

2

 

 

HCEG_02416

 

 

BN001361

[41]

 

Ajellomyces capsulata H143

 

 

AJECH

 

544712

 

ΝΑΤ1

 

963

 

320

 

1 (1-402)

2 (534-580)

3 (645-1158)

2

 

HCDG_05981

 

BN001362

[41]

Ajellomyces capsulata G217B

 

 

AJECA

 

 

447094

 

 

ΝΑΤ1

 

 

963

 

 

320

 

 

1 (1-402)

2 (534-580)

3 (645-1158)

2

 

 

 

 

BN001442

[41]

 

Ajellomyces capsulata NAm1

 

 

AJECN

 

 

339724

 

 

ΝΑΤ1

 

 

963

 

 

320

 

 

1 (1-402)

2 (534-580)

3 (645-1158)

2

 

 

HCAG_01594

 

 

BN001443

[41]

 

Ajellomyces dermatitidis SLH14081

 

AJEDS

 

 

559298

 

 

ΝΑΤ1

 

 

963

 

 

320

 

 

1 (1-402)

2 (538-584)

3 (659-1172)

2

 

 

BDBG_03471

 

 

BN001364

[41]

 

Ajellomyces dermatitidis ER-3

 

 

AJEDR

 

 

559297

 

 

ΝΑΤ1

 

 

963

 

 

320

 

 

1 (1-402)

2 (538-584)

3 (659-1172)

2

 

 

BDCG_08161

 

 

BN001363

[41]

 

Alternaria brassicicola ATCC 96836

ALTBR

 

622360

 

ΝΑΤ1

 

924

 

307

 

1 (1-924)

 

0

 

 

BN001365

[41]

Aspergillus clavatus NRRL 1

 

 

ASPCL

 

 

344612

 

 

ΝΑΤ1

 

 

963

 

 

320

 

 

1 (1-410)

2 (462-507)

3 (581-1087)

2

 

 

ACLA_050060

 

 

BN001369

[41]

 

Aspergillus flavus NRRL-3357  

ASPFN

332952

ΝΑΤ1

 

1017

 

338

 

1 (1-14)

2 (863-1865)

1

 

 

 

BN001444

[41]

ΝΑΤ2

 

 

981

 

 

326

 

 

1 (1-426)

2 (482-528)

3 (588-1095)

2

 

 

AFL2G_01915

 

 

BN001445

[41]

 

ΝΑΤ3

 

957

 

318

 

1 (1-396)

2 (450-1010)

1

 

AFL2G_11316

 

BN001446

[41]

ΝΑΤ4

 

641

 

possible pseudogene

1 (1-92)

2 (144-692)

1

 

 

 

BN001447

[41]

Aspergillus fumigatus CEA 10 / Af293  

 

ASPFC

 

 

451804

 

 

ΝΑΤ1

 

 

990

 

 

329

 

 

1 (1-410)

2 (440-514)

3 (574-1078)

2

 

 

Afu4g10930

 

 

BN001370

[41]

 

Aspergillus niger CBS 513.88 / ATCC 1015  

ASPNC

425011

ΝΑΤ1

1047

348

1 (1-1047)

0

fge1_pg_C_12000355

BN001371

[41]

ΝΑΤ2

 

990

 

329

 

1 (1-441)

2 (508-1056)

1

 

fge1_pm_C_7000250

 

BN001372

[41]

ΝΑΤ3

 

 

948

 

 

315

 

 

1 (1-397)

2 (491-536)

3 (592-1096)

2

 

 

fge1_pg_C_17000131

 

 

BN001373

[41]

 

ΝΑΤ4

 

1050

 

349

 

1 (1-539)

2 (600-1110)

1

 

fge1_pg_C_12000261

 

BN001374

[41]

ΝΑΤ5

975

324

1 (1-975)

0

 

BN001375

[41]

Aspergillus terreus NIH 2624

ASPTN

341663

NAT1

 

 

957

 

 

318

 

 

1 (1-411)

2 (493-539)

3 (593-1091)

2

 

 

ATEG_03982

 

 

BN001376

[41]

 

NAT2

 

969

 

322

 

1 (1-393)

2 (448-1023)

1

 

ATEG_05747

 

BN001377

[41]

Botryotinia fuckeliana B05.10

 

BOTFB

 

332648

 

NAT1

 

981

 

326

 

1 (1-380)

2 (470-1070)

1

 

 

BN001379

[41]

Chaetomium globosum CBS 148.51

CHAGB

306901

NAT1

 

927

 

308

 

1 (1-348)

2 (417-995)

1

 

CHGG_09458

 

BN001380

[41]

NAT2

984

327

1 (1-984)

0

CHGG_06975

BN001381

[41]

Coccidioides immitis H538.4

 

COCIT

 

396776

 

NAT1

 

984

 

327

 

1 (1-411)

2 (473-1045)

1

 

CIHG_06736

 

BN001383

[41]

Coccidioides immitis RS

 

COCIM

 

246410

 

NAT1

 

984

 

327

 

1 (1-411)

2 (473-1045)

1

 

CIMG_01522

 

BN001384

[41]

Coccidioides immitis RMSCC 3703

COCIT

 

454286

 

NAT1

 

984

 

327

 

1 (1-411)

2 (473-1045)

1

 

CISG_01266

 

BN001385

[41]

Coccidioides immitis RMSCC 2394

COCIT

 

404692

 

NAT1

 

984

 

327

 

1 (1-411)

2 (473-1045)

1

 

CIRG_01630

 

BN001382

[41]

Coccidioides posadasii C735

 

COCP7

 

222929

 

NAT1

 

984

 

327

 

1 (1-411)

2 (473-1045)

1

 

 

 

BN001440

[41]

Coccidioides posadasii RMSCC 3488

COCPO

 

454284

 

NAT1

 

984

 

327

 

1 (1-411)

2 (473-1045)

1

 

CPAG_01321

 

BN001441

[41]

Coccidioides posadasii RMSCC 2133

COCPO

 

469470

 

NAT1

 

984

 

327

 

1 (1-411)

2 (473-1045)

1

 

 

 

BN001386

[41]

Coccidioides posadasii CPA 0020

COCPO

 

490068

 

NAT1

 

984

 

327

 

1 (1-411)

2 (473-1045)

1

 

 

 

BN001387

[41]

Coccidioides posadasii RMSCC 1038

COCPO

 

490066

 

NAT1

 

984

 

327

 

1 (1-411)

2 (473-1045)

1

 

 

 

BN001388

[41]

Coccidioides posadasii CPA 0001

COCPO

 

469472

 

NAT1

 

984

 

327

 

1 (1-411)

2 (473-1045)

1

 

 

 

BN001389

[41]

Coccidioides posadasii RMSCC 1037

COCPO

 

490065

 

NAT1

 

984

 

327

 

1 (1-411)

2 (473-1045)

1

 

 

 

BN001390

[41]

Coccidioides posadasii str. Silveira

COCPS

 

443226

 

NAT1

 

984

 

327

 

1 (1-411)

2 (473-1045)

1

 

CPSG_01649

 

BN001391

[41]

Emericella nidulans (Aspergillus nidulans) FGSC A4

 

EMENI

 

 

227321

 

 

ΝΑΤ1

 

 

960

 

 

319

 

 

1 (1-407)

2 (463-510)

3 (568-1072)

2

 

 

ANID_10723

 

 

BN001425

[41]

 

Fusarium oxysporum f. sp. Lycopersici 4287

FUSO4

426428

ΝΑΤ1

1053

350

1 (1-1053)

0

FOXG_15318

BN001426

[41]

ΝΑΤ2

 

957

 

318

 

1 (1-372)

2 (424-1008)

1

FOXG_06095

 

BN001427

[41]

ΝΑΤ3

999

332

1 (1-999)

0

FOXG_03795

BN001428

[41]

ΝΑΤ4

 

963

 

320

 

1 (1-366)

2 (549-1145)

1

 

 

 

BN001429

[41,43]

Gibberella moniliformis (Fusarium verticillioides) 7600

GIBM7

334819

ΝΑΤ1

1038

345

1 (1-1038)

0

FVEG_12636

BN001430

[41]

ΝΑΤ2

957

318

1 (1-957)

0

FVEG_03961

BN001431

[41]

ΝΑΤ3

978

325

1 (1-978)

0

FVEG_12062

BN001432

[41]

ΝΑΤ4

 

 

1011

 

 

336

 

 

1 (1-295)

2 (628-1024)

3 (1079-1397)

2

 

 

 

 

 

BN001433

[41]

 

Gibberella zeae (Fusarium graminearum) RRL31084

GIBZE

229533

ΝΑΤ1

1032

343

1 (1-1032)

0

FGSG_00080

BN001434

[41]

ΝΑΤ2

 

957

 

318

 

1 (1-372)

2 (431-1015)

1

 

FGSG_09400

 

BN001435

[41]

ΝΑΤ3

960

319

1 (1-960)

0

FGSG_07888

BN001436

[41]

Glomerella graminicola M1.001

COLGM

 

645133

 

NAT1

 

927

 

308

 

1 (1-360)

2 (425-991)

1

 

 

 

BN001392

[41]

Magnaporthe grisea 70-15

MAGO7

242507

ΝΑΤ1

 

963

 

320

 

1 (1-354)

2 (454-1062)

1

 

MGG_10560

 

BN001393

[41]

ΝΑΤ2

1113

370

1 (1-1113)

0

 

BN001394

[41]

Microsporum canis CBS 113480

 

ARTOC

 

 

554155

 

 

ΝΑΤ1

 

 

1002

 

 

333

 

 

1 (1-422)

2 (486- 533)

3 (589-1120)

2

 

 

MCYG_06389

 

 

BN001368

 

 

Microsporum gypseum CBS 118893

ARTGP

535722

ΝΑΤ1

 

942

 

313

 

1 (1-416)

2 (585-1110)

1

 

MGYG_06842

 

BN001366

[41]

ΝΑΤ2

 

1008

 

335

 

1 (1-414)

2 (534-1127)

1

 

MGYG_02694

 

BN001367

[41]

Nectria haematococca mpVI 77-13-4

NECH7

660122

NAT1

 

891

 

296

 

1 (1-331)

2 (403-962)

1

 

NECHADRAFT_33380

 

BN001395

[41]

NAT2

 

966

 

321

 

1 (1-381)

2 (435-1019)

1

 

NECHADRAFT_75515

 

BN001396

[41]

NAT3

 

897

 

298

 

1 (1-344)

2 (401-953)

1

 

NECHADRAFT_87499

BN001397

[41]

NAT4

1008

335

1 (1-1008)

0

NECHADRAFT_34602

BN001398

[41]

NAT5

 

 

930

 

 

309

 

 

1 (1-366)

2 (418-563)

3 (588-1005)

2

 

 

NECHADRAFT_44961

 

BN001399

[41]

 

NAT6

 

960

 

319

 

1 (1-366)

2 (413-1006)

1

 

NECHADRAFT_51017

BN001400

[41]

Neosartorya fischeri (Aspergillus fischerianus)

NRRL 181

NEOFI

331117

ΝΑΤ1

 

 

963

 

 

320

 

 

1 (1-410)

2 (467-514)

3 (574-1078)

2

 

 

NFIA_104890

 

 

BN001401

[41]

 

ΝΑΤ2

 

948

 

315

 

1 (1-378)

2 (436-1005)

1

 

NFIA_004340

 

BN001402

[41]

Neurospora crassa OR74A

 

NEUCR

 

367110

 

ΝΑΤ1

 

933

 

310

 

1 (1-333)

2 (402-1001)

1

 

NCU09784

 

BN001448

[41]

Paracoccidioides brasiliensis Pb01

 

PARBA

 

 

502779

 

 

ΝΑΤ1

 

 

951

 

 

possible pseudogene

 

1 (1-402)

2 (527-573)

3 (641-1142)

2

 

 

 

 

 

BN001404

[41]

 

Paracoccidioides brasiliensis Pb03

 

PARBP

 

 

482561

 

 

ΝΑΤ1

 

 

951

 

 

316

 

 

1 (1-402)

2 (528-574)

3 (642-1143)

2

 

 

PABG_03148

 

 

BN001403

[41]

 

Paracoccidioides brasiliensis Pb18

 

PARBD

 

 

502780

 

 

ΝΑΤ1

 

 

951

 

 

316

 

 

1 (1-402)

2 (527-573)

3 (641-1142)

2

 

 

PADG_01689

 

 

BN001405

[41]

 

Penicillium chrysogenum Wisconsin 54-1255

PENCW

500485

NAT1

 

942

 

313

 

1 (1-396)

2 (522-1067)

1

 

Pc20g04880

 

BN001474

[41]

NAT2

 

972

 

323

 

1 (1-401)

2 (497-1067)

1

 

Pc22g06840

 

BN001475

[41]

NAT3

 

948

 

315

 

1 (1-396)

2 (447-998)

1

 

Pc12g02960

 

BN001476

[41]

Phaeosphaeria nodorum SN15

PHANO

321614

ΝΑΤ1

933

310

1 (1-933)

0

SNOG_06352

BN001406

ΝΑΤ2

1053

350

1 (1-1053)

0

SNOG_06959

BN001407

Podospora anserina DSM980

PODAN

515849

NAT1

 

912

 

303

 

1 (1-342)

2 (401-970)

1

 

PODANSg3401

 

[44]

 

NAT2

1002

333

1(1-1002)

0

PODANSg9179

[44]

Pyrenophora tritici-repentis Pt-1C-BFP

PYRTR

426418

ΝΑΤ1

1029

342

1 (1-1029)

0

PTRG_06864

BN001408

[41]

ΝΑΤ2

924

307

1 (1-924)

0

PTRG_02465

BN001409

[41]

Sordaria macrospora k-hell

SORMA

 

5147

 

NAT1

 

1101

 

366

 

1 (1-477)

2 (536-1159)

1

 

 

 

BN001439

[41]

Talaromyces stipitatus ATCC 10500

TALSN

 

441959

 

ΝΑΤ1

 

993

 

330

 

1 (1-438)

2 (555-1109)

1

 

TSTA_053130

 

BN001410

[41]

Trichoderma atroviride IMI 206040

HYPAI

 

452589

 

ΝΑΤ1

 

915

 

304

 

1 (1-339)

2 (417-992)

1

 

 

 

BN001411

[41]

Trichoderma reesei (Hypocrea jecorina) QM6a

HYPJQ

 

431241

 

ΝΑΤ1

 

918

 

305

 

1 (1-336)

2 (430-1011)

1

 

 

 

BN001412

[41]

Trichoderma virens (Hypocrea virens) Gv29-8

HYPVG

413071

ΝΑΤ1

 

918

 

305

 

1 (1-333)

2 (426-1010)

1

 

 

 

BN001413

[41]

ΝΑΤ2

 

1026

 

341

 

1 (1-420)

2 (490-1095)

1

 

 

 

BN001414

[41]

Trichophyton equinum CBS 127.97

 

TRIEC

 

 

559882

 

 

ΝΑΤ1

 

 

981

 

 

326

 

 

1 (1-416)

2 (481-528)

3 (593-1109)

2

 

 

TEQG_03758

 

 

BN001415

[41]

 

Trichophyton rubrum CBS 118892

 

TRIRC

 

 

559305

 

 

ΝΑΤ1

 

 

981

 

 

326

 

 

1 (1-416)

2 (481-528)

3 (593-1109)

2

 

 

TERG_07215

 

 

BN001416

[41]

 

Trichophyton tonsurans CBS 112818

 

TRIT1

 

 

647933

 

 

ΝΑΤ1

 

 

981

 

 

326

 

 

1 (1-416)

2 (481-528)

3 (593-1109)

2

 

 

TESG_05215

 

 

BN001417

[41]

 

Uncinocarpus reesii UAMH 1704

UNCRE

 

336963

 

ΝΑΤ1

 

990

 

329

 

1 (1-417)

2 (467-1039)

1

 

UREG_05260

 

BN001418

[41]

Verticillium albo-atrum VaMs.102

 

VERA1

 

 

526221

 

 

ΝΑΤ1

 

 

762

 

 

253

 

 

1 (1-360)

2 (484-708)

3 (832-1008)

2

 

 

 

 

 

BN001419

[41]

 

Verticillium dahliae VdLs.17

VERDV

498257

ΝΑΤ1

951

316

1 (1-951)

0

VDAG_09857

BN001420

[41]

 

Footnotes, Table 2:

(a)          Fungal NAT sequences have been described in [41,42,43,44]. The reference sequence for annotation of fungal NATs is the NAT1 protein of the ascomycete Fusarium verticillioides (Gibberella moniliformis), GenBank ID: EU552489 [5,43].

(b)          Column key – Colums 1, 2: The numbers in brackets are sequenced fungal genomes; Columns 2-4: The scientific name, official identification code and unique taxonomy identifier for each organism are available from the UniProt Taxonomy Browser (http://www.uniprot.org/taxonomy/); Columns 5-9: The retrieval and annotation of NAT-homologous sequences (ORF span and exon-intron structure) were carried out as described [41], and the gene symbols were assigned according to the guidelines of the NAT Gene Nomenclature Committee. The sequence length of open reading frames (ORF) is shown in nucleotides (nt) and of deduced proteins in amino acids (aa); Column 10: Where available, the locus tags are provided to anchor each annotated NAT gene to the corresponding genomic map; Column 11: Third-party annotation (TPA) accession numbers were assigned by the EMBL database and can be used to retrieve the complete details of NAT genes in the table.

 

 

Table 3: Annotation of NAT genes in sequenced genomes of animals(a),(b)

 

Organism scientific name(c)

 

Organism

identification code (c)

Taxonomy

identifier (c)

Gene

(alias) (c)

ORF

(nt) (c)

Protein

(aa) (c)

Identities

(%)

Positives

(%)

Literature

METAZOA

EUMETAZOA

Bilateria

Deuterostomia

Echinozoa

Strongylocentrotus purpuratus

(purple sea urchin)

STRPU

7668

NAT1

933

310

27

44

1, 41

NAT2

654

217

24

42

NAT3

933

310

23

43

Hemichordata

Saccoglossus kowalevskii (Acorn worm)

SACKO

10224

NAT1

903

300

20

38

41

Chordata

 

 

 

 

 

 

 

 

Tunicata

 

 

 

 

 

 

 

 

Ciona intestinalis (Transparent sea squirt)

CIOIN

7719

NAT1

789

ORF consists of 3 exons

Exon 1: 1-335

Exon 2: 651-838

Exon 3: 1472-1737

(Gene size: 1737nt)

262

28

45

41, BN001483

NAT2

942

ORF consists of 5 exons

Exon 1: 1-335

Exon 2: 685-866

Exon 3: 1863-1968

Exon 4: 3067-3157

Exon 5: 3624-3851

(Gene size: 3851nt)

313

25

47

41, BN001484

Cephalochordata

Branchiostoma floridae (Florida lancelet)

BRAFL

7739

NAT1

780

ORF consists of 4 exons

Exon 1: 1-119

Exon 2: 639-957

Exon 3: 1238-1277

Exon 4: 1347-1648

(Gene size: 1648nt)

259

30

48

41,

BN001480

NAT2

864

ORF consists of 3 exons

Exon 1: 1-119

Exon 2: 1713-2037

Exon 3: 2282-2701

(Gene size: 2701nt)

287

27

48

41,

BN001481

NAT3

909

ORF consists of 3 exons

Exon 1: 1-176

Exon 2: 779-906

Exon 3: 4565-5169

(Gene size: 5169nt)

302

23

42

41, BN001482

Vertebrata

Euteleostomi

Actinopterygii

Danio rerio (zebrafish)

DANRE

7955

NAT1

867

288

43

62

1-4

NAT2

867

288

42

62

NAT3

867

288

42

61

NAT4

873

290

40

59

Gasterosteus aculeatus (three-spined stickleback)

GASAC

69293

NAT1

912

303

42

62

1

NAT2

870

289

41

61

Oryzias latipes (medaka fish)

ORYLA

8090

NAT1

873

290

44

61

1

NAT2

873

290

40

61

NAT3

846

281

35

52

Takifugu (Fugu) rubripes (Japanese pufferfish)

TAKRU

31033

NAT1

846

281

33

50

1,2

Tetraodon nigroviridis (green spotted pufferfish)

TETNG

99883

NAT1

 

843

 

280

 

34

 

49

 

1,2

Sarcopterygii

Amniota

Mammalia (d)

Primates (e)

Allenopithecus nigroviridis (Allen’s swamp monkey)

ALLNI

54135

NAT1

873

290

94

98

46

NAT2

873

290

82

90

Cebus apella (Brown-capped capuchin monkey)

CEBAP

9515

NAT1

873

290

94

98

46

NAT2

873

290

81

90

Cercopithecus diana (Diana monkey)

CERDI

36224

NAT1

873

290

95

98

46

NAT2

873

290

82

90

Colobus guereza (black and white colobus monkey)

COLGU

33548

NAT1

873

290

95

99

46

NAT2

873

290

82

89

Chlorocebus tantalus (tantalus monkey)

CERTA

60712

NAT1

873

290

96

99

46

NAT2

873

290

82

89

Erythrocebus patas (Red guenon)

ERYPA

9538

NAT1

873

290

96

99

46

NAT2

873

290

82

90

Gorilla gorilla (Lowland gorilla)

GORGO

9595

NAT1

873

290

98

99

45,46

NAT2

873

290

82

89

Homo sapiens (human)

HUMAN

9606

NAT1

873

290

100

100

9,10

NAT2

873

290

81

89

Hylobates lar (White handed gibbon)

HYLLA

9580

NAT1

873

290

96

99

46

NAT2

873

290

81

88

Presbytis cristatus (silvered-leaf monkey)

TRACR

122765

NAT2

873

290

96

99

46

NAT1

873

290

82

90

Macaca fascicularis (Crab-eating macaque)

MACFA

9541

NAT1

873

290

94

98

45

NAT2

873

290

82

89

Macaca mulatta (Rhesus macaque)

MACMU

9544

NAT1

873

290

94

99

11

NAT2

873

290

82

90

Macaca sylvanus (Barbary ape)

MACSY

9546

NAT1

873

290

95

99

46

NAT2

873

290

82

90

Mandrillus sphinx (Mandrill)

MANSP

9561

NAT1

873

290

95

99

46

NAT2

873

290

82

89

Microcebus murinus (gray mouse lemur)

MICMU

30608

NAT1

873

290

80

90

1

NAT2

873

290

74

85

Nomascus gabriellae (Red cheeked gibbon)

NOMGA

61852

NAT1

873

290

96

99

46

NAT2

873

290

82

89

Otolemur garnettii (small-eared galago)

OTOGA

30611

NAT1

873

290

82

93

1

Pan paniscus (Pygmy chimpanzee)

PANPA

9597

NAT1

873

290

98

100

46

NAT2

873

290

81

89

Pan troglodytes (chimpanzee)

PANTR

9598

NAT1

873

290

97

100

1

NAT2

873

290

81

 89

1

Papio anubis (Olive baboon)

PAPAN

9555

NAT1

873

290

95

99

45

NAT2

873

290

82

89

Pithecia pithecia (White faced saki)

PITPI

43777

NAT1

873

290

95

99

46

NAT2

873

290

82

91

Pongo abelii (Sumatran orangutan)

PONAB

9601

NAT1

873

290

96

99

45

NAT2

873

290

81

89

Saimiri boliviensis boliviensis (Bolivian squirrel monkey)

SAIBB

39432

NAT1

873

290

93

97

 

 

Other mammals

Bos taurus (domestic cattle)

BOVIN

9913

NAT1

873

290

82

93

1,4,8

Cavia porcellus (domestic Guinea pig)

CAVPO

10141

Nat1

870

289

79

91

1

Nat2

873

290

81

93

Nat3

873

290

71

86

Cricetulus griseus (Chinese hamster)

CRIGR

10029

Nat2

873

290

80

92

AJ289858

Echinops telfairi (lesser hedgehog tenrec)

ECHTE

9371

NAT1

873

290

80

88

1

Equus caballus (domestic horse)

HORSE

9796

NAT1

873

290

82

92

1

Loxodonta Africana (African bush elephant)

LOXAF

9785

NAT1

873

290

84

93

1

NAT2

873

290

82

91

NAT3

873

290

77

87

Mesocricetus auratus (Syrian hamster)

MESAU

10036

Nat1

873

290

71

83

12-16

Nat2

873

290

81

92

Monodelphis domestica (gray short-tailed opossum)

MONDO

13616

NAT1

 

870

 

289

 

68

 

85

 

1

Mus musculus (house mouse)

MOUSE

10090

Nat1

873

290

74

85

17,18

Nat2

873

290

82

94

Nat3

873

290

68

 81

Myotis lucifugus (little brown bat)

MYOLU

59463

NAT1

873

290

82

92

1

NAT2

873

290

73

86

Ornithorhynchus anatinus (platypus)

ORNAN

9258

NAT1

870

289

56

73

1

Oryctolagus cuniculus (European rabbit)

RABIT

9986

NAT1

873

290

75

87

19-21

NAT2

873

290

74

87

 

 

 

NAT2

870

289

81

88

 

Rattus norvegicus (brown rat)

RAT

10116

Nat1

873

290

76

87

22,23

Nat2

873

290

81

93

Nat3

873

290

68

81

Sus scrofa (pig)

PIG

9823

NAT1

873

290

84

92

1,4

Sauria

Lepidosauria

Anolis carolinensis (Carolina green anole)

ANOCA

28377

NAT1

867

288

54

72

1

NAT2

867

288

53

72

NAT3

816

271

35

50