; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017680 (gene) of Snake gourd v1 genome

Gene IDTan0017680
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG08:72837537..72839575
RNA-Seq ExpressionTan0017680
SyntenyTan0017680
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3472112.1 reverse transcriptase [Gossypium australe]4.0e-2829.08Show/hide
Query:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKH-ERVKVLIDDHG-TWKEQVVKGIFSPMDIE
        RY+   +FL A  G+ PS  WRSI   REL   G  W++GNG  I I+ DPW+   G S  L V ++  H + V  LID+   TWK+ ++  +      +
Subjt:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKH-ERVKVLIDDHG-TWKEQVVKGIFSPMDIE

Query:  DILKIPIGNHLSKDEIIWNHDSKGLFSVKSAFREDWKPNVYW--------------------DWLEDN--------LSEANLKLALIILWSLWFFRNQVL
         IL IP+     +D ++W HD+ G++SVKS    +   ++ W                    D  EDN        +++   KL  I  W++W+ RN+++
Subjt:  DILKIPIGNHLSKDEIIWNHDSKGLFSVKSAFREDWKPNVYW--------------------DWLEDN--------LSEANLKLALIILWSLWFFRNQVL

Query:  HN--EITVDFQLIYRQIDTYKLKFQSRENAINVTSWSENLSSHAAWEPPQPNHWKLNCDASWFEKKNRGGCGWIARDSEDLL
        H   + ++D  + + +   Y+L      N ++++       + A W+PP P   KLN DAS+    N    G + R+   L+
Subjt:  HN--EITVDFQLIYRQIDTYKLKFQSRENAINVTSWSENLSSHAAWEPPQPNHWKLNCDASWFEKKNRGGCGWIARDSEDLL

KAA3479362.1 reverse transcriptase [Gossypium australe]2.8e-2930.5Show/hide
Query:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKH-ERVKVLIDDHG-TWKEQVVKGIFSPMDIE
        RY+   +FL A  G+ PS  WRSI   REL   G  W++GNG  + I+ DPW+   G S  L V ++  H   V  LID+H  TWK+ ++  +      +
Subjt:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKH-ERVKVLIDDHG-TWKEQVVKGIFSPMDIE

Query:  DILKIPIGNHLSKDEIIWNHDSKGLFSVKSAFREDWKPNVYW--------------------DWLEDN--------LSEANLKLALIILWSLWFFRNQVL
         IL IP+ +   +D ++W+HD+ G+++VKS    +   ++ W                    D  EDN        +++   KL  I  W+LWF RN+++
Subjt:  DILKIPIGNHLSKDEIIWNHDSKGLFSVKSAFREDWKPNVYW--------------------DWLEDN--------LSEANLKLALIILWSLWFFRNQVL

Query:  HN--EITVDFQLIYRQIDTYKLKFQSRENAINVTSWSENLSSHAAWEPPQPNHWKLNCDASWFEKKNRGGCGWIARDSEDLL
        H   + ++D  L + +   Y+LK        + +S+S+N    A W+PP P   KLN DAS+ +  N    G +  + + L+
Subjt:  HN--EITVDFQLIYRQIDTYKLKFQSRENAINVTSWSENLSSHAAWEPPQPNHWKLNCDASWFEKKNRGGCGWIARDSEDLL

XP_023870384.1 uncharacterized protein LOC111982984, partial [Quercus suber]3.6e-2933.07Show/hide
Query:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHE--RVKVLIDD-HGTWKEQVVKGIFSPMDI
        +YF +GDFL A  G+NPS  WR I   + + ++G RW+VGNG  IRI+ D WL  +GSS  +    +F H+  RV  LI     TW E+V++ IF P+D 
Subjt:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHE--RVKVLIDD-HGTWKEQVVKGIFSPMDI

Query:  EDILKIPIGNHLSKDEIIWNHDSKGLFSVKSAF---REDWKPNVYWDWLEDNLSEANLKLALIILWSLWFFRNQVLHNEITVDFQLIYRQIDTYKLKFQS
        + IL IP+   L  D IIW   + G F+V+SA+    +  +         D   E  + L + I WSLW   N V H  +    + I +    Y   + +
Subjt:  EDILKIPIGNHLSKDEIIWNHDSKGLFSVKSAF---REDWKPNVYWDWLEDNLSEANLKLALIILWSLWFFRNQVLHNEITVDFQLIYRQIDTYKLKFQS

Query:  RENAINVTSWSENLSSHAAWEPPQPNHWKLNCDASWFEKKNRGGCGWIARD
          +++ V   S +++    W PP P+  K+N D +  + +     G + RD
Subjt:  RENAINVTSWSENLSSHAAWEPPQPNHWKLNCDASWFEKKNRGGCGWIARD

XP_023897447.1 uncharacterized protein LOC112009345 [Quercus suber]1.2e-3231.37Show/hide
Query:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHERVKV---LIDDHGTWKEQVVKGIFSPMDI
        +YF   DFL A  GNNPS  WRSIL  + + EKG RW+VG+G++I+++ D WL R  S   +    LF H   KV   +  +   WKE++++ IF P+D+
Subjt:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHERVKV---LIDDHGTWKEQVVKGIFSPMDI

Query:  EDILKIPIGNHLSKDEIIWNHDSKGLFSVKSAFR---------------------EDWKPNVYWDWLEDNLSEANLKLALIILWSLWFFRNQVLHNEITV
        E IL IP+     +D +IW   S G FSV+SA+R                       WK   +      +  E  + L + + W+ W  RN++ H     
Subjt:  EDILKIPIGNHLSKDEIIWNHDSKGLFSVKSAFR---------------------EDWKPNVYWDWLEDNLSEANLKLALIILWSLWFFRNQVLHNEITV

Query:  DFQLIYRQIDTYKLKFQSRENAINVTSWSENLSSHAAWEPPQPNHWKLNCDASWFEKKNRGGCGWIARDSE
          + I + ++ Y L++ +     +V +  E +S    W PP P+  K+N D +  +  N  G G + RD +
Subjt:  DFQLIYRQIDTYKLKFQSRENAINVTSWSENLSSHAAWEPPQPNHWKLNCDASWFEKKNRGGCGWIARDSE

XP_030487365.1 uncharacterized protein LOC115704292 [Cannabis sativa]6.8e-2829.56Show/hide
Query:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPL---GVNDLFKHERVKVLIDDHGTWKEQVVKGIFSPMDI
        RYF +  FL A  G++PSL W+SI WG+EL  KG R+K+G+GNH+   +DPW+       P+   G N +     V   I     W  Q++   F  +DI
Subjt:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPL---GVNDLFKHERVKVLIDDHGTWKEQVVKGIFSPMDI

Query:  EDILKIPIGNHLSKDEIIWNHDSKGLFSVKSAF--------REDWK---------------PNVYWDWLEDNLSEANLKLALIILWSLWFFRNQVLHNEI
        + IL I +      D ++W+H   G++SVKS          +  WK                  Y   L     +   +  L +LW +W  RN+V+H  I
Subjt:  EDILKIPIGNHLSKDEIIWNHDSKGLFSVKSAF--------REDWK---------------PNVYWDWLEDNLSEANLKLALIILWSLWFFRNQVLHNEI

Query:  ----TVDFQLIYRQIDTY-KLKFQSRENAINVTSW----SENLSSHA-AWEPPQPNHWKLNCDASWFEKKNRGGCGWIARDSEDLL---------NNFST
            T   Q   R  + + +LK   R  A+   +     S +++ H   W PPQ N +KLN DA+   ++ + G G I RD  D +          +F +
Subjt:  ----TVDFQLIYRQIDTY-KLKFQSRENAINVTSW----SENLSSHA-AWEPPQPNHWKLNCDASWFEKKNRGGCGWIARDSEDLL---------NNFST

Query:  DLTEISNLVAEVELVARS
        D+ E   L   V  V++S
Subjt:  DLTEISNLVAEVELVARS

TrEMBL top hitse value%identityAlignment
A0A5B6VSY9 Reverse transcriptase1.9e-2829.08Show/hide
Query:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKH-ERVKVLIDDHG-TWKEQVVKGIFSPMDIE
        RY+   +FL A  G+ PS  WRSI   REL   G  W++GNG  I I+ DPW+   G S  L V ++  H + V  LID+   TWK+ ++  +      +
Subjt:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKH-ERVKVLIDDHG-TWKEQVVKGIFSPMDIE

Query:  DILKIPIGNHLSKDEIIWNHDSKGLFSVKSAFREDWKPNVYW--------------------DWLEDN--------LSEANLKLALIILWSLWFFRNQVL
         IL IP+     +D ++W HD+ G++SVKS    +   ++ W                    D  EDN        +++   KL  I  W++W+ RN+++
Subjt:  DILKIPIGNHLSKDEIIWNHDSKGLFSVKSAFREDWKPNVYW--------------------DWLEDN--------LSEANLKLALIILWSLWFFRNQVL

Query:  HN--EITVDFQLIYRQIDTYKLKFQSRENAINVTSWSENLSSHAAWEPPQPNHWKLNCDASWFEKKNRGGCGWIARDSEDLL
        H   + ++D  + + +   Y+L      N ++++       + A W+PP P   KLN DAS+    N    G + R+   L+
Subjt:  HN--EITVDFQLIYRQIDTYKLKFQSRENAINVTSWSENLSSHAAWEPPQPNHWKLNCDASWFEKKNRGGCGWIARDSEDLL

A0A5B6WEY4 Reverse transcriptase1.3e-2930.5Show/hide
Query:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKH-ERVKVLIDDHG-TWKEQVVKGIFSPMDIE
        RY+   +FL A  G+ PS  WRSI   REL   G  W++GNG  + I+ DPW+   G S  L V ++  H   V  LID+H  TWK+ ++  +      +
Subjt:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKH-ERVKVLIDDHG-TWKEQVVKGIFSPMDIE

Query:  DILKIPIGNHLSKDEIIWNHDSKGLFSVKSAFREDWKPNVYW--------------------DWLEDN--------LSEANLKLALIILWSLWFFRNQVL
         IL IP+ +   +D ++W+HD+ G+++VKS    +   ++ W                    D  EDN        +++   KL  I  W+LWF RN+++
Subjt:  DILKIPIGNHLSKDEIIWNHDSKGLFSVKSAFREDWKPNVYW--------------------DWLEDN--------LSEANLKLALIILWSLWFFRNQVL

Query:  HN--EITVDFQLIYRQIDTYKLKFQSRENAINVTSWSENLSSHAAWEPPQPNHWKLNCDASWFEKKNRGGCGWIARDSEDLL
        H   + ++D  L + +   Y+LK        + +S+S+N    A W+PP P   KLN DAS+ +  N    G +  + + L+
Subjt:  HN--EITVDFQLIYRQIDTYKLKFQSRENAINVTSWSENLSSHAAWEPPQPNHWKLNCDASWFEKKNRGGCGWIARDSEDLL

A0A6J1DRA0 uncharacterized protein LOC1110224232.1e-2746.56Show/hide
Query:  GRYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHERVKVLIDDHGTWKEQVVKGIFSPMDIED
        G+YFK G FL+A  G  PS  WRSILWGR+LF+KGYRWKVGNG  I +  DPWL R G+  P+  +   ++  V  L++  G W E  V+  F   + + 
Subjt:  GRYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHERVKVLIDDHGTWKEQVVKGIFSPMDIED

Query:  ILKIPIGNHLSKDEIIWNHDSKGLFSVKSAF
        IL+ P+ +    DEIIW  D  G+FSV+SA+
Subjt:  ILKIPIGNHLSKDEIIWNHDSKGLFSVKSAF

Q33AJ4 Retrotransposon protein, putative, unclassified1.3e-2727.16Show/hide
Query:  YFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHERVKVLIDDHGTWKEQVVKGIFSPMDIEDIL
        Y+ +G  +   FG N S  WR+I +G EL +KG  W++GNG  +R++ DPW+ R  S  P+      +   V  LID  G+W    +   F PMD E IL
Subjt:  YFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHERVKVLIDDHGTWKEQVVKGIFSPMDIEDIL

Query:  KIPIGNHLSKDEIIWNHDSKGLFSVKSAF------------------------REDWK-----------------PNVYWDWLEDNLSEANLKLALIILW
         I + + L  D I W+ D  G FSV+SA+                           WK                 P V  D LE +  E  + L L++LW
Subjt:  KIPIGNHLSKDEIIWNHDSKGLFSVKSAF------------------------REDWK-----------------PNVYWDWLEDNLSEANLKLALIILW

Query:  SLWFFRNQVLHNEITVDFQLIYRQIDTYKLKF-----------QSRENAINVTSWSENLSSHA------AWEPPQPNHWKLNCDASWFEKKNRGGCGWIA
         +W  RN+++H +      +  R I++Y L             +  ++ ++V     +    +       W  P P   KLN D S+ E   +GG G + 
Subjt:  SLWFFRNQVLHNEITVDFQLIYRQIDTYKLKF-----------QSRENAINVTSWSENLSSHA------AWEPPQPNHWKLNCDASWFEKKNRGGCGWIA

Query:  RDSEDLLNNFSTDLTEISNLVAEVELVARSLGLVL
        R+    +   +    +  +   E EL+A   GL L
Subjt:  RDSEDLLNNFSTDLTEISNLVAEVELVARSLGLVL

Q8S5K1 Putative retroelement1.3e-2727.16Show/hide
Query:  YFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHERVKVLIDDHGTWKEQVVKGIFSPMDIEDIL
        Y+ +G  +   FG N S  WR+I +G EL +KG  W++GNG  +R++ DPW+ R  S  P+      +   V  LID  G+W    +   F PMD E IL
Subjt:  YFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHERVKVLIDDHGTWKEQVVKGIFSPMDIEDIL

Query:  KIPIGNHLSKDEIIWNHDSKGLFSVKSAF------------------------REDWK-----------------PNVYWDWLEDNLSEANLKLALIILW
         I + + L  D I W+ D  G FSV+SA+                           WK                 P V  D LE +  E  + L L++LW
Subjt:  KIPIGNHLSKDEIIWNHDSKGLFSVKSAF------------------------REDWK-----------------PNVYWDWLEDNLSEANLKLALIILW

Query:  SLWFFRNQVLHNEITVDFQLIYRQIDTYKLKF-----------QSRENAINVTSWSENLSSHA------AWEPPQPNHWKLNCDASWFEKKNRGGCGWIA
         +W  RN+++H +      +  R I++Y L             +  ++ ++V     +    +       W  P P   KLN D S+ E   +GG G + 
Subjt:  SLWFFRNQVLHNEITVDFQLIYRQIDTYKLKF-----------QSRENAINVTSWSENLSSHA------AWEPPQPNHWKLNCDASWFEKKNRGGCGWIA

Query:  RDSEDLLNNFSTDLTEISNLVAEVELVARSLGLVL
        R+    +   +    +  +   E EL+A   GL L
Subjt:  RDSEDLLNNFSTDLTEISNLVAEVELVARSLGLVL

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003101.3e-0537.74Show/hide
Query:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWL
        RYF     ++   G  PS  WRSI+ GREL  +G    +G+G H +++ D W+
Subjt:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWL

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein2.5e-1230.08Show/hide
Query:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHERVKVLIDDHGT---WKEQVVKGIFSPMDI
        RYFKD   L A      S  W S+L G  L +KG R  +G+G +IRI  D  +D      PL   + +K   +  L +  G+   W +  +       D 
Subjt:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHERVKVLIDDHGT---WKEQVVKGIFSPMDI

Query:  EDILKIPIGNHLSKDEIIWNHDSKGLFSVKSAF
          I +I +      D+IIWN+++ G ++V+S +
Subjt:  EDILKIPIGNHLSKDEIIWNHDSKGLFSVKSAF

AT4G29090.1 Ribonuclease H-like superfamily protein4.9e-1632.39Show/hide
Query:  YLGRYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHE--------RVKVLIDDHG-TWKEQVV
        +  RYF   D L AP G+ PS  W+SI   +E+  +G R  VGNG  I I+   WLD K +S  L +  +   E        +V  LID+ G  W++ V+
Subjt:  YLGRYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHE--------RVKVLIDDHG-TWKEQVV

Query:  KGIFSPMDIEDILKIPIGNHLSKDEIIWNHDSKGLFSVKSAF
        + +F  ++ + I ++  G     D   W++ S G ++VKS +
Subjt:  KGIFSPMDIEDILKIPIGNHLSKDEIIWNHDSKGLFSVKSAF

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.3e-0737.74Show/hide
Query:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWL
        RYF     ++   G  PS  WRSI+ GREL  +G    +G+G H +++ D W+
Subjt:  RYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGTGAGCAAAAACATCTCGGAGGCCCATGCCTCGGATTTCGGTGCCATCTTGGCGGTGAACCAATCTTCCTCCCTTGGAGAGTACCTTGGGCGTTACTTTAAGGA
TGGTGATTTCCTAAAAGCCCCCTTCGGTAATAACCCCTCTTTGAACTGGAGGAGCATCCTTTGGGGCAGAGAGCTCTTTGAGAAAGGCTATCGTTGGAAGGTCGGTAATG
GGAATCACATTAGAATCTTTGAAGATCCTTGGTTGGATAGGAAAGGCAGTAGTGTCCCTTTGGGGGTCAATGATCTGTTTAAACATGAACGTGTTAAAGTGTTGATTGAT
GATCATGGTACCTGGAAAGAGCAAGTCGTTAAGGGCATTTTCTCTCCTATGGACATTGAAGATATCCTCAAAATCCCCATTGGGAACCACCTGTCTAAAGACGAAATAAT
TTGGAACCATGACTCCAAGGGCTTATTCTCTGTTAAAAGTGCCTTTCGGGAAGACTGGAAGCCTAACGTCTACTGGGATTGGCTAGAAGATAACCTTTCAGAGGCAAACC
TGAAGCTTGCCTTAATTATTCTCTGGAGTCTTTGGTTTTTCAGAAATCAAGTCTTACACAATGAAATTACTGTTGACTTTCAACTAATCTACAGGCAAATTGACACTTAT
AAGCTGAAATTTCAGTCTCGTGAAAATGCCATAAATGTTACCTCCTGGTCTGAGAACCTCTCGAGTCACGCCGCATGGGAGCCCCCCCAACCCAATCACTGGAAATTGAA
CTGCGACGCTTCCTGGTTTGAGAAGAAGAATAGAGGCGGTTGTGGTTGGATTGCCCGTGACTCTGAAGATCTTCTGAACAATTTTTCGACTGATTTAACGGAAATTTCTA
ACCTGGTGGCTGAGGTTGAGCTCGTGGCGCGCTCGTTGGGTTTGGTCTTGTTCTCCAATTGCCCTCGCGCTTGCAATTCTGCTGCTCACAACCTTGCTCGGAGAGCTTCT
GTGGAGGTCTCCTCTCCCCCGCGACCTGGAGATTTCTTCCCTTCTCCCTCTTGCATGTTTGCTTCGTCCTCTTCCCCTGTGTTCTTCTCTCCCCTCCTGTTAGACTCTTA
CGAGGATATTATTGCTTGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTAGTGAGCAAAAACATCTCGGAGGCCCATGCCTCGGATTTCGGTGCCATCTTGGCGGTGAACCAATCTTCCTCCCTTGGAGAGTACCTTGGGCGTTACTTTAAGGA
TGGTGATTTCCTAAAAGCCCCCTTCGGTAATAACCCCTCTTTGAACTGGAGGAGCATCCTTTGGGGCAGAGAGCTCTTTGAGAAAGGCTATCGTTGGAAGGTCGGTAATG
GGAATCACATTAGAATCTTTGAAGATCCTTGGTTGGATAGGAAAGGCAGTAGTGTCCCTTTGGGGGTCAATGATCTGTTTAAACATGAACGTGTTAAAGTGTTGATTGAT
GATCATGGTACCTGGAAAGAGCAAGTCGTTAAGGGCATTTTCTCTCCTATGGACATTGAAGATATCCTCAAAATCCCCATTGGGAACCACCTGTCTAAAGACGAAATAAT
TTGGAACCATGACTCCAAGGGCTTATTCTCTGTTAAAAGTGCCTTTCGGGAAGACTGGAAGCCTAACGTCTACTGGGATTGGCTAGAAGATAACCTTTCAGAGGCAAACC
TGAAGCTTGCCTTAATTATTCTCTGGAGTCTTTGGTTTTTCAGAAATCAAGTCTTACACAATGAAATTACTGTTGACTTTCAACTAATCTACAGGCAAATTGACACTTAT
AAGCTGAAATTTCAGTCTCGTGAAAATGCCATAAATGTTACCTCCTGGTCTGAGAACCTCTCGAGTCACGCCGCATGGGAGCCCCCCCAACCCAATCACTGGAAATTGAA
CTGCGACGCTTCCTGGTTTGAGAAGAAGAATAGAGGCGGTTGTGGTTGGATTGCCCGTGACTCTGAAGATCTTCTGAACAATTTTTCGACTGATTTAACGGAAATTTCTA
ACCTGGTGGCTGAGGTTGAGCTCGTGGCGCGCTCGTTGGGTTTGGTCTTGTTCTCCAATTGCCCTCGCGCTTGCAATTCTGCTGCTCACAACCTTGCTCGGAGAGCTTCT
GTGGAGGTCTCCTCTCCCCCGCGACCTGGAGATTTCTTCCCTTCTCCCTCTTGCATGTTTGCTTCGTCCTCTTCCCCTGTGTTCTTCTCTCCCCTCCTGTTAGACTCTTA
CGAGGATATTATTGCTTGGTAGTTGGGTTTTTACTTGTATTTATTTCCTTCTCAAAAAAAAAAAGTTGCATTTTTAGTTTGGC
Protein sequenceShow/hide protein sequence
MVVSKNISEAHASDFGAILAVNQSSSLGEYLGRYFKDGDFLKAPFGNNPSLNWRSILWGRELFEKGYRWKVGNGNHIRIFEDPWLDRKGSSVPLGVNDLFKHERVKVLID
DHGTWKEQVVKGIFSPMDIEDILKIPIGNHLSKDEIIWNHDSKGLFSVKSAFREDWKPNVYWDWLEDNLSEANLKLALIILWSLWFFRNQVLHNEITVDFQLIYRQIDTY
KLKFQSRENAINVTSWSENLSSHAAWEPPQPNHWKLNCDASWFEKKNRGGCGWIARDSEDLLNNFSTDLTEISNLVAEVELVARSLGLVLFSNCPRACNSAAHNLARRAS
VEVSSPPRPGDFFPSPSCMFASSSSPVFFSPLLLDSYEDIIAW