; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0002758 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0002758
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCTP synthase family protein
Genome locationchr4:45386253..45395066
RNA-Seq ExpressionLag0002758
SyntenyLag0002758
Gene Ontology termsGO:0006221 - pyrimidine nucleotide biosynthetic process (biological process)
GO:0003883 - CTP synthase activity (molecular function)
InterPro domainsIPR004468 - CTP synthase
IPR017926 - Glutamine amidotransferase
IPR021109 - Aspartic peptidase domain superfamily
IPR029062 - Class I glutamine amidotransferase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015944519.1 uncharacterized protein LOC107469655 [Arachis duranensis]5.9e-1847.33Show/hide
Query:  EPEPPILSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKECF-------GTYSFRALCDLGASINIIPLSLCKKLDIG
        +P+ P   P ++ P+  +K  K K    QF +F+  F  L INI FAEALE M  Y KFMKE          + + R LCDLGAS N IPLSL +KL I 
Subjt:  EPEPPILSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKECF-------GTYSFRALCDLGASINIIPLSLCKKLDIG

Query:  EIKSTPVKLQLADQSVVRPVGIVENVLIRVG
        E+KST + LQLAD+S+  P+G+VEN++++VG
Subjt:  EIKSTPVKLQLADQSVVRPVGIVENVLIRVG

XP_028804017.1 uncharacterized protein LOC114759088 [Prosopis alba]2.5e-1643.57Show/hide
Query:  DEAKKPEPEPPILSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKEC------FGT-----YSFRALCDLGASINIIP
        D+ +K  P P    P L  P    ++ +KK+  VQF KF++ F  L+INIPFAEALE M +Y KF+K+       FG      ++ +ALCDLGAS+N++P
Subjt:  DEAKKPEPEPPILSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKEC------FGT-----YSFRALCDLGASINIIP

Query:  LSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRV
        L++  +L++GE + T V L  AD+S+  P GIVE+VL++V
Subjt:  LSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRV

XP_030497486.1 uncharacterized protein LOC115713139 [Cannabis sativa]4.7e-1535.2Show/hide
Query:  EVEPESEDYDTPTGEAEEDTSSDEAKKPEPEPPI-LSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKECF-------
        + +PE+ + + P    E+  + D  ++ E  PPI +   + +P    ++ +K N   QF KF+  F  L+INIPFAEALE M  Y KFMKE         
Subjt:  EVEPESEDYDTPTGEAEEDTSSDEAKKPEPEPPI-LSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKECF-------

Query:  ------------------------------------GTYSFRALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVG
                                            G+   +ALCDLGASIN++PLS+ K+L +GE K T V LQ+AD+S+  P GI+E+VL++VG
Subjt:  ------------------------------------GTYSFRALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVG

XP_031255354.1 uncharacterized protein LOC116113346 [Pistacia vera]7.9e-1535Show/hide
Query:  ILQEEDEVEPESEDYDTPTGEAEEDTSSDEAKKPEPEPPILSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMK-----
        ++ EE    P     + P  E       ++   PE  P +   T +VP    ++ +K+N   QF KF+N F  L+INIPFA+AL+ MS Y KF+K     
Subjt:  ILQEEDEVEPESEDYDTPTGEAEEDTSSDEAKKPEPEPPILSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMK-----

Query:  --------------EC-----------------------FGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRV
                      EC                        G  +F + LCDLGASIN+IP S+ +KL IGE+K T + LQLAD+S+  P GI+E+VL++V
Subjt:  --------------EC-----------------------FGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRV

XP_039118013.1 uncharacterized protein LOC120253863 [Dioscorea cayenensis subsp. rotundata]7.9e-1543.17Show/hide
Query:  KKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKECF--------------------------GTYSF----------RALCDLGASINIIPL
        KKKK+NQ QF KF++ F  L+INIPFAE LE M  Y  FMKE                            G+++           RALCDLGASIN++PL
Subjt:  KKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKECF--------------------------GTYSF----------RALCDLGASINIIPL

Query:  SLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRV
        S+ KKL++GE + T V LQLAD S+  P G++E+VL+++
Subjt:  SLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRV

TrEMBL top hitse value%identityAlignment
A0A1S3UKE1 uncharacterized protein LOC1067662765.6e-1439.87Show/hide
Query:  PTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMK-------------------ECF--------------GTYSF------
        PT+  P+  KK+++ K    QF +F++ F  L+INIPFAEALE M  Y KFMK                   EC               G++        
Subjt:  PTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMK-------------------ECF--------------GTYSF------

Query:  ----RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRV
            +ALCDLGASIN++PLS+ K+L IG++K T + LQLAD+S+  P GIVE+VL++V
Subjt:  ----RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRV

A0A2G9GC30 DNA-directed DNA polymerase3.3e-1444.37Show/hide
Query:  EEDTSSDEAKKPEPEPPILSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKEC------FGT-YSFRALCDLGASINI
        EE+    EA     +P  L P    P+  +K+K KK    QF  F   F  L+INIPFA+ALE M  Y KFMK+        GT +S  ALCDLGASIN+
Subjt:  EEDTSSDEAKKPEPEPPILSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKEC------FGT-YSFRALCDLGASINI

Query:  IPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRV
         P S+ + L +GE+K T + LQLAD+S+  P G++E++L++V
Subjt:  IPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRV

A0A5E4GII9 PREDICTED: LOW QUALITY PROTEIN (Fragment)1.5e-1439.89Show/hide
Query:  TSSDEAKKPEPEPPILSPTL--MVPKEKKKKKKKKNN-QVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKEC------FGTY---------------
        T+++++++ E   PI SP L   VP+    ++ +KN    QF KF+  F  L+INIPFAEALE M  Y KFMK+       FG +               
Subjt:  TSSDEAKKPEPEPPILSPTL--MVPKEKKKKKKKKNN-QVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKEC------FGTY---------------

Query:  ----------SF------------RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRV
                  SF            RALCDLG+SIN++PLS+ KK+ IGEIK T V LQ+AD+S+  P GI+E+VL++V
Subjt:  ----------SF------------RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRV

A0A6J1DTZ8 uncharacterized protein LOC1110239796.6e-1537.91Show/hide
Query:  EEDTSSDEAKKPEP----EPPILSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKEC---------------------
        ++D S +  K+  P    E P+L   L+      +  +KK    QF KF++ F  LNINI FA ALE M  Y KFMKE                      
Subjt:  EEDTSSDEAKKPEP----EPPILSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKEC---------------------

Query:  ---------------------FGTYSFRALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVG
                              G+Y FR LCDLG +IN  PLSLC+KL+IGEIK T + +QL D+S   P G++ENVLI+VG
Subjt:  ---------------------FGTYSFRALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVG

A0A6P4C5S3 uncharacterized protein LOC1074696552.8e-1847.33Show/hide
Query:  EPEPPILSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKECF-------GTYSFRALCDLGASINIIPLSLCKKLDIG
        +P+ P   P ++ P+  +K  K K    QF +F+  F  L INI FAEALE M  Y KFMKE          + + R LCDLGAS N IPLSL +KL I 
Subjt:  EPEPPILSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MSQYNKFMKECF-------GTYSFRALCDLGASINIIPLSLCKKLDIG

Query:  EIKSTPVKLQLADQSVVRPVGIVENVLIRVG
        E+KST + LQLAD+S+  P+G+VEN++++VG
Subjt:  EIKSTPVKLQLADQSVVRPVGIVENVLIRVG

SwissProt top hitse value%identityAlignment
A1VDL2 CTP synthase1.0e-1258.46Show/hide
Query:  ETFKGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVILQEHATKEK
        E  KG+DG+LVPGGFG RGV+GKIL  +YA ENR+PF   CL MQ  VIEF   V+  E A  E+
Subjt:  ETFKGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVILQEHATKEK

B4EUF6 CTP synthase6.5e-1256.45Show/hide
Query:  METFKGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVILQEHA
        +E  KG+D +LVPGGFG RGV+GKI+A +YA EN+IP+L  CL MQ+ +IEF   V   E A
Subjt:  METFKGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVILQEHA

Q3A371 CTP synthase6.5e-1266.67Show/hide
Query:  ETFKGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEF
        +TFK VDG+LVPGGFG RG +GKI A +YA ENRIPF   CL MQ+ V+EF
Subjt:  ETFKGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEF

Q72BL2 CTP synthase1.0e-1258.46Show/hide
Query:  ETFKGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVILQEHATKEK
        E  KG+DG+LVPGGFG RGV+GKIL  +YA ENR+PF   CL MQ  VIEF   V+  E A  E+
Subjt:  ETFKGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVILQEHATKEK

Q7N836 CTP synthase2.9e-1252Show/hide
Query:  METFKGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVILQEHATKEKSFSYEVEAE
        +E  KG+D +LVPGGFGGRGV+GKI+  +YA EN IP+L  CL MQ+ +IEF   V     A  EK+ S E E +
Subjt:  METFKGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVILQEHATKEKSFSYEVEAE

Arabidopsis top hitse value%identityAlignment
AT1G30820.1 CTP synthase family protein8.5e-1568.97Show/hide
Query:  KGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVILQEHA
        KGVDG+LVPGGFG RGV+GKILA KYA EN+IPFL  CL MQI VIEF   V+  + A
Subjt:  KGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVILQEHA

AT2G34890.1 CTP synthase family protein1.1e-1170.83Show/hide
Query:  KGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEF
        K  +GVL+PGGFG RGV+G ILA KYA EN IPFL  CL MQI VIEF
Subjt:  KGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEF

AT3G12670.1 CTP synthase family protein6.5e-1556.16Show/hide
Query:  ETFKGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVILQEHATKEKSFSYEVEA
        +  KG DG+LVPGGFG RGVQGKILATKYA EN++PFL  CL MQ+ V+E F+  IL  H      F  E  +
Subjt:  ETFKGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVILQEHATKEKSFSYEVEA

AT4G20320.1 CTP synthase family protein1.6e-1366.04Show/hide
Query:  KGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVI
        KG DGVLVPGGFG RGV+GK+LA KYA ENRIP+L  CL MQ+ VIE+   ++
Subjt:  KGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVI

AT4G20320.2 CTP synthase family protein1.6e-1366.04Show/hide
Query:  KGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVI
        KG DGVLVPGGFG RGV+GK+LA KYA ENRIP+L  CL MQ+ VIE+   ++
Subjt:  KGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACCTTTAAAGGTGTTGATGGAGTTCTAGTTCCAGGAGGTTTTGGTGGTAGGGGAGTACAAGGGAAAATTCTAGCAACCAAATATGCTGGAGAGAATAGGATTCC
CTTTCTGGACACATGCTTGAGGATGCAAATTGATGTCATTGAATTTTTTAGTCTAGTTATTCTTCAAGAACATGCAACGAAGGAAAAGTCATTCTCATATGAGGTTGAAG
CAGAGATGAATTTCCACTACATGAGTGTTTTAATGTCCATGGTCGTCGTCGAGCGTCACAAGCCACTCGTCGATGTGAAACTCTTCAGGTTGTGTCCGCCGATTTGGCAA
TTAAGGAATACTTCGTCTTCTTCGTCGTCGACTCATAACCTTCACCATAGTGGTCATGACCCTAGCGAAAGAAACTCAACAACGTCGAATAATATCAAATTTCCATCATC
TTTCCATGACAGTGTTCTCTATGGCGAAGTCGTTGCCTCCTCGACGTCGGCTTTGAGTGGTTCGGAAATTCCTACTCCACTCCCCAAGACTCCTTCTTTGGTATTTTACT
ACAACGACTCTCTTACGGGTGAGAACCAAACCGACTCTTTTTCCAAGTTCAAGAGTAAACCTCTACCGATCATTTTACAGGAGGAAGATGAAGTGGAACCTGAGTCTGAG
GATTATGATACGCCTACAGGGGAAGCTGAGGAGGACACATCATCAGATGAGGCTAAAAAACCTGAACCTGAGCCTCCTATTCTTTCTCCAACACTGATGGTTCCCAAAGA
AAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAGTTTATGAATGCTTTTATGAATCTGAACATTAATATTCCTTTTGCAGAGGCATTAGAGATGT
CACAGTATAACAAGTTCATGAAGGAGTGTTTTGGTACTTATTCTTTCAGAGCATTATGTGATTTAGGTGCTAGCATCAATATCATTCCTTTATCTCTGTGCAAAAAGTTA
GACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTGATCAGTCTGTGGTCAGACCAGTTGGCATTGTAGAAAATGTTTTAATCAGAGTAGGTGCACCCCG
CGGGGTGAGGCTCGCAAGCCTACCACGGAATGGTCTCATCGTGGCACCGTCCCTTCTGGTGCGGACATCGGAAAGCATGACACAGCCCATTCTTTCTGTTGGCACTACCC
TTCGGCCAACTCTCAACTGGCCACCTTCATGTGGCAATCGCTGCACCGATGCTTCTGTCGGAACTTGCAGCAGCTTCTTTTTCTTTTTCTGCCCTGTCTCCAGCCTGGTG
CTCCATCTCGAGCAAATTGCTGGTCTTGACATCTGTGAAGCCACCCACTTAGGCATGTGGAATGACTTAAGGGACACACTCGCTCGACACACGAGCGTCTTCACGGGACA
AGCAACCCATTCTCTCGGGTTCTCTCCCAAACTTGGCTCTACGCAGCCCAAATCGCCCACCCATATAGGCTTTCTTGATGCTCAACTCTGGCACTTGACCGGCGCATGTC
ACACTGACCCCCGGCTTTCTCTGCTTGCAACCATTAACACTATCCCTCTGGCGGTAATGGTTCTTTCGAAGGCACCCTACCTGTGGGGCTCTGATCGTAGCCCAAAGCCT
AAACTAGGCTCTGATACCAACTGTCACGGCGAGGTTCTTCATCCTCAACCGAACTCAAATCATCTTGCAGCGGAATACATTCGGTCAATACACGTCGCCCGTACGTGTAT
TTTCACTCATTGCGACTCTAGCCGACTTGCCTCTACTCTAGGCCAAGTCCTAAGGCTCAACCTTGCCTCTACATCAGGCCAAGGTCATCTGAACGCAACGTCGCATCTCA
TTTCATCATCTTCCATCTCGATAACACGGCCCACACGCTTCTACATCGATCCGGAACTTCGTCGGATGTGTTTGGACCCTTCAACGGGTCAAAACGGCCTTAAACGCTCA
ACCGGAGCTTACCGGGTGATTTCAGCACTTACCGGGACCTACCGGGGCATTATAACGCTTACCGGGGCTTTTCGTCGCCTACGGGGCTTATCGGGCCATTATAATGCTTA
CCGGGACATTTCGGGCAAATACCGTGCCAACCGGGAACTATCGGGTTCCACCGATGCTTCACGATGCCACCCGAGCCTCGGGATGTCATCTGACTTAACCGCTCACTCGG
ACCTGCTTCTCGAGAACGGACCCATGGGCTTTACTTATAATCGGGGCTTCCCGGACATCCATCCATCCGGGCATATCGAGGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAACCTTTAAAGGTGTTGATGGAGTTCTAGTTCCAGGAGGTTTTGGTGGTAGGGGAGTACAAGGGAAAATTCTAGCAACCAAATATGCTGGAGAGAATAGGATTCC
CTTTCTGGACACATGCTTGAGGATGCAAATTGATGTCATTGAATTTTTTAGTCTAGTTATTCTTCAAGAACATGCAACGAAGGAAAAGTCATTCTCATATGAGGTTGAAG
CAGAGATGAATTTCCACTACATGAGTGTTTTAATGTCCATGGTCGTCGTCGAGCGTCACAAGCCACTCGTCGATGTGAAACTCTTCAGGTTGTGTCCGCCGATTTGGCAA
TTAAGGAATACTTCGTCTTCTTCGTCGTCGACTCATAACCTTCACCATAGTGGTCATGACCCTAGCGAAAGAAACTCAACAACGTCGAATAATATCAAATTTCCATCATC
TTTCCATGACAGTGTTCTCTATGGCGAAGTCGTTGCCTCCTCGACGTCGGCTTTGAGTGGTTCGGAAATTCCTACTCCACTCCCCAAGACTCCTTCTTTGGTATTTTACT
ACAACGACTCTCTTACGGGTGAGAACCAAACCGACTCTTTTTCCAAGTTCAAGAGTAAACCTCTACCGATCATTTTACAGGAGGAAGATGAAGTGGAACCTGAGTCTGAG
GATTATGATACGCCTACAGGGGAAGCTGAGGAGGACACATCATCAGATGAGGCTAAAAAACCTGAACCTGAGCCTCCTATTCTTTCTCCAACACTGATGGTTCCCAAAGA
AAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAGTTTATGAATGCTTTTATGAATCTGAACATTAATATTCCTTTTGCAGAGGCATTAGAGATGT
CACAGTATAACAAGTTCATGAAGGAGTGTTTTGGTACTTATTCTTTCAGAGCATTATGTGATTTAGGTGCTAGCATCAATATCATTCCTTTATCTCTGTGCAAAAAGTTA
GACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTGATCAGTCTGTGGTCAGACCAGTTGGCATTGTAGAAAATGTTTTAATCAGAGTAGGTGCACCCCG
CGGGGTGAGGCTCGCAAGCCTACCACGGAATGGTCTCATCGTGGCACCGTCCCTTCTGGTGCGGACATCGGAAAGCATGACACAGCCCATTCTTTCTGTTGGCACTACCC
TTCGGCCAACTCTCAACTGGCCACCTTCATGTGGCAATCGCTGCACCGATGCTTCTGTCGGAACTTGCAGCAGCTTCTTTTTCTTTTTCTGCCCTGTCTCCAGCCTGGTG
CTCCATCTCGAGCAAATTGCTGGTCTTGACATCTGTGAAGCCACCCACTTAGGCATGTGGAATGACTTAAGGGACACACTCGCTCGACACACGAGCGTCTTCACGGGACA
AGCAACCCATTCTCTCGGGTTCTCTCCCAAACTTGGCTCTACGCAGCCCAAATCGCCCACCCATATAGGCTTTCTTGATGCTCAACTCTGGCACTTGACCGGCGCATGTC
ACACTGACCCCCGGCTTTCTCTGCTTGCAACCATTAACACTATCCCTCTGGCGGTAATGGTTCTTTCGAAGGCACCCTACCTGTGGGGCTCTGATCGTAGCCCAAAGCCT
AAACTAGGCTCTGATACCAACTGTCACGGCGAGGTTCTTCATCCTCAACCGAACTCAAATCATCTTGCAGCGGAATACATTCGGTCAATACACGTCGCCCGTACGTGTAT
TTTCACTCATTGCGACTCTAGCCGACTTGCCTCTACTCTAGGCCAAGTCCTAAGGCTCAACCTTGCCTCTACATCAGGCCAAGGTCATCTGAACGCAACGTCGCATCTCA
TTTCATCATCTTCCATCTCGATAACACGGCCCACACGCTTCTACATCGATCCGGAACTTCGTCGGATGTGTTTGGACCCTTCAACGGGTCAAAACGGCCTTAAACGCTCA
ACCGGAGCTTACCGGGTGATTTCAGCACTTACCGGGACCTACCGGGGCATTATAACGCTTACCGGGGCTTTTCGTCGCCTACGGGGCTTATCGGGCCATTATAATGCTTA
CCGGGACATTTCGGGCAAATACCGTGCCAACCGGGAACTATCGGGTTCCACCGATGCTTCACGATGCCACCCGAGCCTCGGGATGTCATCTGACTTAACCGCTCACTCGG
ACCTGCTTCTCGAGAACGGACCCATGGGCTTTACTTATAATCGGGGCTTCCCGGACATCCATCCATCCGGGCATATCGAGGCATGA
Protein sequenceShow/hide protein sequence
METFKGVDGVLVPGGFGGRGVQGKILATKYAGENRIPFLDTCLRMQIDVIEFFSLVILQEHATKEKSFSYEVEAEMNFHYMSVLMSMVVVERHKPLVDVKLFRLCPPIWQ
LRNTSSSSSSTHNLHHSGHDPSERNSTTSNNIKFPSSFHDSVLYGEVVASSTSALSGSEIPTPLPKTPSLVFYYNDSLTGENQTDSFSKFKSKPLPIILQEEDEVEPESE
DYDTPTGEAEEDTSSDEAKKPEPEPPILSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEMSQYNKFMKECFGTYSFRALCDLGASINIIPLSLCKKL
DIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGAPRGVRLASLPRNGLIVAPSLLVRTSESMTQPILSVGTTLRPTLNWPPSCGNRCTDASVGTCSSFFFFFCPVSSLV
LHLEQIAGLDICEATHLGMWNDLRDTLARHTSVFTGQATHSLGFSPKLGSTQPKSPTHIGFLDAQLWHLTGACHTDPRLSLLATINTIPLAVMVLSKAPYLWGSDRSPKP
KLGSDTNCHGEVLHPQPNSNHLAAEYIRSIHVARTCIFTHCDSSRLASTLGQVLRLNLASTSGQGHLNATSHLISSSSISITRPTRFYIDPELRRMCLDPSTGQNGLKRS
TGAYRVISALTGTYRGIITLTGAFRRLRGLSGHYNAYRDISGKYRANRELSGSTDASRCHPSLGMSSDLTAHSDLLLENGPMGFTYNRGFPDIHPSGHIEA