; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015413 (gene) of Snake gourd v1 genome

Gene IDTan0015413
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG02:86882952..86886580
RNA-Seq ExpressionTan0015413
SyntenyTan0015413
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAS55787.1 hypothetical protein [Oryza sativa Japonica Group]3.3e-2625.57Show/hide
Query:  REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQSTVRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYR
        ++G  W+IGNG  + I+ DPW+  +  + P+          V  L+ EDG WD  KI + F + D   I +I        + I W  +  G F+V+SAY+
Subjt:  REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQSTVRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYR

Query:  LGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMC---------------------------------
        L LQ       SSS+S      W+ +W   +PQ+++I + ++  N L T +N  +R LE    C +C                                 
Subjt:  LGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMC---------------------------------

Query:  ----------------------KKGWWSLCTDASRNDTLEAGGIGWIVRNCEGKVICAGQQFIRDQWSILNFELLGIIKGMN-SIKNRVIPLMVESDFLE
                              + GW  L  D S +   E GGIG I+RNC G VI +  + +      L  EL   ++G++ ++   ++P+ VE+D   
Subjt:  ----------------------KKGWWSLCTDASRNDTLEAGGIGWIVRNCEGKVICAGQQFIRDQWSILNFELLGIIKGMN-SIKNRVIPLMVESDFLE

Query:  AISLVEGKLVDYTEACDLRDIIWHLREEWSHITFRHIHRSTNKEAHNLAQRA
         I L+     D +   ++      L      I    + RS N  +H LA +A
Subjt:  AISLVEGKLVDYTEACDLRDIIWHLREEWSHITFRHIHRSTNKEAHNLAQRA

EEC68026.1 hypothetical protein OsI_35837 [Oryza sativa Indica Group]5.1e-2733.09Show/hide
Query:  VRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYRLGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGK
        V  L+K DG WD D+I + F   D   IL I        + + W  +  G+F+V+SAY L L       +SSS+ E  +  W  LW  ++PQ++KI + K
Subjt:  VRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYRLGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGK

Query:  IIQNCLPTKDNLIRRGLE------IDPWCPMCKK--GWWSLCTDASRNDTLEAGGIGWIVRNCEGKVICAGQQFIRDQWSILNFELLGIIKG-MNSIKNR
           N LPT +N  +R LE      I P  P  K   GW  L  D S + +L  GGIG I+RN  G VI +   FI    S L  ELL   +G + +++  
Subjt:  IIQNCLPTKDNLIRRGLE------IDPWCPMCKK--GWWSLCTDASRNDTLEAGGIGWIVRNCEGKVICAGQQFIRDQWSILNFELLGIIKG-MNSIKNR

Query:  VIPLMVESDFLEAISLVEGKLVDYTEACDLRDIIWHLREEWSHITFRHIHRSTNKEAHNLAQRAIRLRSDDF
        ++P+ +E+D LEA++L +      +E   L   I  L      IT + + R  +  +H LA R   +   +F
Subjt:  VIPLMVESDFLEAISLVEGKLVDYTEACDLRDIIWHLREEWSHITFRHIHRSTNKEAHNLAQRAIRLRSDDF

EEC73134.1 hypothetical protein OsI_07152 [Oryza sativa Indica Group]2.2e-3029.71Show/hide
Query:  REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQSTVRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYR
        ++G  W++GNG  I ++ DPW+     + P+ +        V  +LK DG WD + +   F   DV  IL+I        + + W  +  G F+V+SAY+
Subjt:  REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQSTVRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYR

Query:  LGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEI------DPWCPMC----------KKGWWSLCTDASRNDTL
        L +Q     E S S+    K  W  +WS  +PQ++KI   +   N LPT +N  +R   I        W              K GW  L  D S     
Subjt:  LGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEI------DPWCPMC----------KKGWWSLCTDASRNDTL

Query:  EAGGIGWIVRNCEGKVICAGQQFIRDQWSILNFELLGIIKGMN-SIKNRVIPLMVESDFLEAISLVEGK-LVDYTEACDLRDIIWHLREEWSHITFRHIH
          GGIG ++R   G+VI +   FI      L  EL+    G+N +++  ++P+ VESD LE I L+  K  +   E   +R+I + L      ++F  +H
Subjt:  EAGGIGWIVRNCEGKVICAGQQFIRDQWSILNFELLGIIKGMN-SIKNRVIPLMVESDFLEAISLVEGK-LVDYTEACDLRDIIWHLREEWSHITFRHIH

Query:  RSTNKEAHNLAQR
        R+ N+ +H LA +
Subjt:  RSTNKEAHNLAQR

XP_023905045.1 uncharacterized protein LOC112016795 [Quercus suber]1.1e-2638.6Show/hide
Query:  REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQ-STVRSLLKE-DGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSA
        + G+RW++G+G  I I+ D WL        +     L + +TV  L+ E  GEW+VD ++  F  DD   IL IPR    N ++++W Y P+G FTV SA
Subjt:  REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQ-STVRSLLKE-DGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSA

Query:  YR--LGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMC
        Y+  L L Q  A+E +S  S  ++ FW+ +WS++IP ++K  + +  +N LPTK NL  RG+  DP C  C
Subjt:  YR--LGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMC

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]1.6e-2534.52Show/hide
Query:  REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQSTVRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYR
        ++G RW+IGNG +I+IF+D WL       P+        S V  L+K D +WD  K+R+ F + D   IL+IP       ++++W Y+ RG ++VKS Y+
Subjt:  REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQSTVRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYR

Query:  LGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMCK
        L L+ +     S+S +E +  +W +LW++++P+++KI   +   N LP+ +NL +R +  +P C  CK
Subjt:  LGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMCK

TrEMBL top hitse value%identityAlignment
A0A2N9E949 CCHC-type domain-containing protein5.5e-2739.23Show/hide
Query:  DLYWRQREGYRWKIGNGCQISIFNDPWLTVEGRQHPM------MVSPKLAQSTVRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYN
        D  W   EG RW+IGNG ++S++ D WL       P+      MV PK+++     +  E G+WDV+K++  F  +DV  IL IP       +  +W   
Subjt:  DLYWRQREGYRWKIGNGCQISIFNDPWLTVEGRQHPM------MVSPKLAQSTVRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYN

Query:  PRGIFTVKSAYRL-GLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMCK
        P G FTV SAYRL   +  LAQ  SSS    AK  W+++W VQ+P  IK+ + + + N LPT  NL RR +    WCP CK
Subjt:  PRGIFTVKSAYRL-GLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMCK

A0A2N9HJV7 Reverse transcriptase domain-containing protein7.1e-2739.66Show/hide
Query:  WRQREGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPK-LAQSTVRSLLK-EDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTV
        W   EG RW+IGNG ++S++ D WL       P+   P+ +    V  L++ E G+WDV+K++  F  +DV  IL IP       +  +W   P G FTV
Subjt:  WRQREGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPK-LAQSTVRSLLK-EDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTV

Query:  KSAYRL-GLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMCK
         SAYRL   +  LAQ  SSS    AK  W+++W VQ+P  IK+ + + + N LPT  NL RR +    WCP CK
Subjt:  KSAYRL-GLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMCK

B8AHI8 Uncharacterized protein1.1e-3029.71Show/hide
Query:  REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQSTVRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYR
        ++G  W++GNG  I ++ DPW+     + P+ +        V  +LK DG WD + +   F   DV  IL+I        + + W  +  G F+V+SAY+
Subjt:  REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQSTVRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYR

Query:  LGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEI------DPWCPMC----------KKGWWSLCTDASRNDTL
        L +Q     E S S+    K  W  +WS  +PQ++KI   +   N LPT +N  +R   I        W              K GW  L  D S     
Subjt:  LGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEI------DPWCPMC----------KKGWWSLCTDASRNDTL

Query:  EAGGIGWIVRNCEGKVICAGQQFIRDQWSILNFELLGIIKGMN-SIKNRVIPLMVESDFLEAISLVEGK-LVDYTEACDLRDIIWHLREEWSHITFRHIH
          GGIG ++R   G+VI +   FI      L  EL+    G+N +++  ++P+ VESD LE I L+  K  +   E   +R+I + L      ++F  +H
Subjt:  EAGGIGWIVRNCEGKVICAGQQFIRDQWSILNFELLGIIKGMN-SIKNRVIPLMVESDFLEAISLVEGK-LVDYTEACDLRDIIWHLREEWSHITFRHIH

Query:  RSTNKEAHNLAQR
        R+ N+ +H LA +
Subjt:  RSTNKEAHNLAQR

B8BK40 Uncharacterized protein2.5e-2733.09Show/hide
Query:  VRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYRLGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGK
        V  L+K DG WD D+I + F   D   IL I        + + W  +  G+F+V+SAY L L       +SSS+ E  +  W  LW  ++PQ++KI + K
Subjt:  VRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYRLGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGK

Query:  IIQNCLPTKDNLIRRGLE------IDPWCPMCKK--GWWSLCTDASRNDTLEAGGIGWIVRNCEGKVICAGQQFIRDQWSILNFELLGIIKG-MNSIKNR
           N LPT +N  +R LE      I P  P  K   GW  L  D S + +L  GGIG I+RN  G VI +   FI    S L  ELL   +G + +++  
Subjt:  IIQNCLPTKDNLIRRGLE------IDPWCPMCKK--GWWSLCTDASRNDTLEAGGIGWIVRNCEGKVICAGQQFIRDQWSILNFELLGIIKG-MNSIKNR

Query:  VIPLMVESDFLEAISLVEGKLVDYTEACDLRDIIWHLREEWSHITFRHIHRSTNKEAHNLAQRAIRLRSDDF
        ++P+ +E+D LEA++L +      +E   L   I  L      IT + + R  +  +H LA R   +   +F
Subjt:  VIPLMVESDFLEAISLVEGKLVDYTEACDLRDIIWHLREEWSHITFRHIHRSTNKEAHNLAQRAIRLRSDDF

Q75M12 Reverse transcriptase domain-containing protein1.6e-2625.57Show/hide
Query:  REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQSTVRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYR
        ++G  W+IGNG  + I+ DPW+  +  + P+          V  L+ EDG WD  KI + F + D   I +I        + I W  +  G F+V+SAY+
Subjt:  REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQSTVRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYR

Query:  LGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMC---------------------------------
        L LQ       SSS+S      W+ +W   +PQ+++I + ++  N L T +N  +R LE    C +C                                 
Subjt:  LGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMC---------------------------------

Query:  ----------------------KKGWWSLCTDASRNDTLEAGGIGWIVRNCEGKVICAGQQFIRDQWSILNFELLGIIKGMN-SIKNRVIPLMVESDFLE
                              + GW  L  D S +   E GGIG I+RNC G VI +  + +      L  EL   ++G++ ++   ++P+ VE+D   
Subjt:  ----------------------KKGWWSLCTDASRNDTLEAGGIGWIVRNCEGKVICAGQQFIRDQWSILNFELLGIIKGMN-SIKNRVIPLMVESDFLE

Query:  AISLVEGKLVDYTEACDLRDIIWHLREEWSHITFRHIHRSTNKEAHNLAQRA
         I L+     D +   ++      L      I    + RS N  +H LA +A
Subjt:  AISLVEGKLVDYTEACDLRDIIWHLREEWSHITFRHIHRSTNKEAHNLAQRA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein9.0e-1426.74Show/hide
Query:  REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQSTVRSLLKEDGE---WDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKS
        ++G R  IG+G  I I  D  +       P+       + T+ +L +  G    WD  KI +   + D G I +I        +KI+W YN  G +TV+S
Subjt:  REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQSTVRSLLKEDGE---WDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKS

Query:  AYRLGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMCKK
         Y L          + +   G+ +    +W++ I  ++K    + +   L T + L  RG+ IDP CP C +
Subjt:  AYRLGLQQRLAQEASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMCKK

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.4e-0433.33Show/hide
Query:  NFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMCK
        N+   +WS++I  +IK+   K + N LP    L+ R + I+P+C  C+
Subjt:  NFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMCK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGTTCAGGGAAGTCCTAGATGAGTGTAATCTAGTAGATCCAGGCTTTCAAGGTCATGAGTTCACTTGGTTTAGAAGAGTTATTAATGGCCCGACTATCTGGGA
GCGTTTAGACCGTTTTCTCATGAATGTAGATGCTTATGCTAGGTTACGTCGATGGAACCAAAACAGGTTGGGACGTTCGCTAAGAGGCTCGATTCAGAATAAGGAGACTG
AGATAAGAAAGATTGAAGTAAATCTGACACAGGATAAAGAGGATCTCTGGAAAAATAAAATGGCGGACCTGGAGAAACTTCTTGAAGAAGAGGACCTTTACTGGCGTCAA
AGAGAAGGGTACAGATGGAAGATTGGAAATGGATGTCAGATCAGTATATTTAATGATCCATGGCTCACGGTGGAAGGTCGGCAGCACCCGATGATGGTGAGTCCCAAACT
GGCTCAAAGCACGGTTCGTAGTCTGTTAAAAGAAGACGGGGAGTGGGACGTTGATAAGATCCGAAGGTGTTTCCAGGAAGATGATGTTGGTAATATCCTCCAAATCCCAC
GTTACGGGATACATAATAATGAAAAAATTGTTTGGAAGTATAATCCACGTGGTATCTTTACTGTCAAGAGTGCTTATCGCCTGGGCCTCCAACAACGACTAGCTCAGGAA
GCCTCAAGTTCTAACAGTGAGGGTGCTAAGAACTTTTGGAAGTCATTATGGAGTGTCCAGATCCCACAAAGAATTAAAATTTGTTCTGGAAAAATTATTCAAAATTGCTT
ACCAACGAAAGACAACTTAATTCGTCGGGGTTTGGAAATCGACCCATGGTGCCCGATGTGCAAGAAGGGGTGGTGGAGCCTCTGTACAGATGCCTCTAGGAATGACACAC
TGGAAGCGGGCGGTATTGGCTGGATAGTTCGAAATTGTGAGGGTAAGGTGATCTGTGCGGGTCAACAATTCATCAGAGATCAGTGGTCTATTCTAAATTTTGAGTTGTTG
GGAATCATAAAGGGAATGAACTCTATCAAGAATAGAGTAATTCCTCTAATGGTGGAATCAGACTTTCTGGAGGCAATATCACTAGTTGAAGGTAAGCTTGTTGATTACAC
AGAGGCCTGTGACTTAAGAGACATCATCTGGCATTTACGAGAGGAATGGTCACATATTACATTCAGACATATTCACCGTTCGACGAATAAGGAAGCTCACAATTTAGCTC
AAAGGGCAATACGTTTGCGTTCTGATGACTTTTTGTTTGAGGAGTCAGCCAGGCTCCGAAGCTTTATTAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGGTTCAGGGAAGTCCTAGATGAGTGTAATCTAGTAGATCCAGGCTTTCAAGGTCATGAGTTCACTTGGTTTAGAAGAGTTATTAATGGCCCGACTATCTGGGA
GCGTTTAGACCGTTTTCTCATGAATGTAGATGCTTATGCTAGGTTACGTCGATGGAACCAAAACAGGTTGGGACGTTCGCTAAGAGGCTCGATTCAGAATAAGGAGACTG
AGATAAGAAAGATTGAAGTAAATCTGACACAGGATAAAGAGGATCTCTGGAAAAATAAAATGGCGGACCTGGAGAAACTTCTTGAAGAAGAGGACCTTTACTGGCGTCAA
AGAGAAGGGTACAGATGGAAGATTGGAAATGGATGTCAGATCAGTATATTTAATGATCCATGGCTCACGGTGGAAGGTCGGCAGCACCCGATGATGGTGAGTCCCAAACT
GGCTCAAAGCACGGTTCGTAGTCTGTTAAAAGAAGACGGGGAGTGGGACGTTGATAAGATCCGAAGGTGTTTCCAGGAAGATGATGTTGGTAATATCCTCCAAATCCCAC
GTTACGGGATACATAATAATGAAAAAATTGTTTGGAAGTATAATCCACGTGGTATCTTTACTGTCAAGAGTGCTTATCGCCTGGGCCTCCAACAACGACTAGCTCAGGAA
GCCTCAAGTTCTAACAGTGAGGGTGCTAAGAACTTTTGGAAGTCATTATGGAGTGTCCAGATCCCACAAAGAATTAAAATTTGTTCTGGAAAAATTATTCAAAATTGCTT
ACCAACGAAAGACAACTTAATTCGTCGGGGTTTGGAAATCGACCCATGGTGCCCGATGTGCAAGAAGGGGTGGTGGAGCCTCTGTACAGATGCCTCTAGGAATGACACAC
TGGAAGCGGGCGGTATTGGCTGGATAGTTCGAAATTGTGAGGGTAAGGTGATCTGTGCGGGTCAACAATTCATCAGAGATCAGTGGTCTATTCTAAATTTTGAGTTGTTG
GGAATCATAAAGGGAATGAACTCTATCAAGAATAGAGTAATTCCTCTAATGGTGGAATCAGACTTTCTGGAGGCAATATCACTAGTTGAAGGTAAGCTTGTTGATTACAC
AGAGGCCTGTGACTTAAGAGACATCATCTGGCATTTACGAGAGGAATGGTCACATATTACATTCAGACATATTCACCGTTCGACGAATAAGGAAGCTCACAATTTAGCTC
AAAGGGCAATACGTTTGCGTTCTGATGACTTTTTGTTTGAGGAGTCAGCCAGGCTCCGAAGCTTTATTAGCTAA
Protein sequenceShow/hide protein sequence
MEGFREVLDECNLVDPGFQGHEFTWFRRVINGPTIWERLDRFLMNVDAYARLRRWNQNRLGRSLRGSIQNKETEIRKIEVNLTQDKEDLWKNKMADLEKLLEEEDLYWRQ
REGYRWKIGNGCQISIFNDPWLTVEGRQHPMMVSPKLAQSTVRSLLKEDGEWDVDKIRRCFQEDDVGNILQIPRYGIHNNEKIVWKYNPRGIFTVKSAYRLGLQQRLAQE
ASSSNSEGAKNFWKSLWSVQIPQRIKICSGKIIQNCLPTKDNLIRRGLEIDPWCPMCKKGWWSLCTDASRNDTLEAGGIGWIVRNCEGKVICAGQQFIRDQWSILNFELL
GIIKGMNSIKNRVIPLMVESDFLEAISLVEGKLVDYTEACDLRDIIWHLREEWSHITFRHIHRSTNKEAHNLAQRAIRLRSDDFLFEESARLRSFIS