; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020603 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020603
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF4283 domain-containing protein
Genome locationscaffold9:17629733..17635311
RNA-Seq ExpressionSpg020603
SyntenySpg020603
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039967.1 hypothetical protein E6C27_scaffold122G002490 [Cucumis melo var. makuwa]1.1e-3533.7Show/hide
Query:  SIERKSYV--VDRRN---DIRIFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVNGGLNKIIVP
        +IE+K +V  VD R+    + I E    ++F+I++  ESL WL S F  LL+ P T +FF E R EE  LW +K   RKG+ AEI R+   G    I+VP
Subjt:  SIERKSYV--VDRRN---DIRIFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVNGGLNKIIVP

Query:  VGADKYGWKSFLSLLKDPTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENVPISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGW
         GA+K       S   + +      IK+      +     +   +   +  H   E + +  +++++       P + D+AL+  +N+E  +++C  KGW
Subjt:  VGADKYGWKSFLSLLKDPTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENVPISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGW

Query:  YKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDKWSIETFQRLGDACGGYIETTKKTLSRMDMMESS
          VG F V+FE W+ ++      IPSY GWIK+R +P+  W++E+F ++GDACGG++E  K+T    D+ E+S
Subjt:  YKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDKWSIETFQRLGDACGGYIETTKKTLSRMDMMESS

KAA0040039.1 hypothetical protein E6C27_scaffold366G00060 [Cucumis melo var. makuwa]1.7e-3632.92Show/hide
Query:  SIERKSYVVDRRNDIRIF-----ESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVNGGLNKIIVP
        +IE+K +V+   N  R F     E    ++F+I++  ESL WL S F  LL+ P T +FF E R EE  LW +K   RKG+ AEI R+   G    I+VP
Subjt:  SIERKSYVVDRRNDIRIF-----ESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVNGGLNKIIVP

Query:  VGADKYGWKSFLSLL---KDP-TKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENVPISAMHKEVSDFRS------------------------
         GA+K GW  F+SLL   KDP +K N   I   +     ++D     +  ++  R SY E V   +   E S+ R+                        
Subjt:  VGADKYGWKSFLSLL---KDP-TKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENVPISAMHKEVSDFRS------------------------

Query:  ------------------------ISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDKWSIETFQR
                                  P   D+AL+  +N+E  +++C  KGW  VG F V+FE W+ ++      IPSY GWIK+R +P+  W++E+F +
Subjt:  ------------------------ISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDKWSIETFQR

Query:  LGDACGGYIETTKKTLSRMDMMESS
        +GDACGG++E  K+T    D+ E+S
Subjt:  LGDACGGYIETTKKTLSRMDMMESS

KAA0050054.1 hypothetical protein E6C27_scaffold675G00340 [Cucumis melo var. makuwa]5.9e-3732.73Show/hide
Query:  RPSPTSSTFSIERKSYV--VDRRN---DIRIFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVN
        R  P S T  IE+K +V  VD R+    + I E    ++F+I++  ESL WL S F  LL+   T +FF E R E+Y LW +K   RKG+ AEI R+   
Subjt:  RPSPTSSTFSIERKSYV--VDRRN---DIRIFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVN

Query:  GGLNKIIVPVGADKYGWKSFLSLL---KDPTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENV------------------------------
        G    I+VP G++K GW  F+SLL   KD +K        +      +KD     E   +  R SY E V                              
Subjt:  GGLNKIIVPVGADKYGWKSFLSLL---KDPTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENV------------------------------

Query:  ------------------PISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDK
                           +  +++++       P   D+AL+  +N+E   +IC  KGW  VG F V+FE WN ++      IPSY GWIK+R +P+  
Subjt:  ------------------PISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDK

Query:  WSIETFQRLGDACGGYIETTKKTLSRMDMMESS
        W++E+F ++GDACGG+IE  K+T    D++E+S
Subjt:  WSIETFQRLGDACGGYIETTKKTLSRMDMMESS

KAA0067710.1 hypothetical protein E6C27_scaffold352G00160 [Cucumis melo var. makuwa]1.5e-4039.45Show/hide
Query:  IFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVNGGLNKIIVPVGADKYGWKSFLSLL---KD-
        I E    ++F+I++  ESL WL S F  LL+ P T +FF + R EEY LW +K   RKG+ AEI R+   G    I+VP G++K GW  F+SLL   KD 
Subjt:  IFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVNGGLNKIIVPVGADKYGWKSFLSLL---KD-

Query:  PTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENVPISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVES
         TK N      K   G  +KD L   E   +K R SY + V   +   E ++F    P   D+AL+  ++ E   +IC  KGW  VG F V+FE WN + 
Subjt:  PTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENVPISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVES

Query:  FNKEPKIPSYSGWIKIRNLPVDKWSIETFQRLGDACGGYIETTKKTLSRMDMMESS
              +PSY GWIK+R +P+  W++E+F ++GD  GG++E  K+T    D++E+S
Subjt:  FNKEPKIPSYSGWIKIRNLPVDKWSIETFQRLGDACGGYIETTKKTLSRMDMMESS

TYK10355.1 hypothetical protein E5676_scaffold367G00330 [Cucumis melo var. makuwa]1.8e-3832.42Show/hide
Query:  RPSPTSSTFSIERKSYVVDRRNDIR-----IFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVN
        R  P S T  IE+K +V+   N  R     I E    ++F+I++  ESL WL S F  LL+ P T +FF E R E+Y LW +K   RKG+ AEI R+   
Subjt:  RPSPTSSTFSIERKSYVVDRRNDIR-----IFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVN

Query:  GGLNKIIVPVGADKYGWKSFLSLLKDPTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENV---------------------------------
        G    I+VP G++K GW  F+SLL    K +      +      +KD     E   +  R SY E V                                 
Subjt:  GGLNKIIVPVGADKYGWKSFLSLLKDPTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENV---------------------------------

Query:  ---------------PISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDKWSI
                        +  +++++       P   D+AL+  +N+E   +IC  KGW  VG F V+FE WN ++      IPSY GWIK+R +P+  W++
Subjt:  ---------------PISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDKWSI

Query:  ETFQRLGDACGGYIETTKKTLSRMDMMESS
        E+F ++GDACGG+IE  K+T    D++E+S
Subjt:  ETFQRLGDACGGYIETTKKTLSRMDMMESS

TrEMBL top hitse value%identityAlignment
A0A5A7TEK8 DUF4283 domain-containing protein5.4e-3633.7Show/hide
Query:  SIERKSYV--VDRRN---DIRIFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVNGGLNKIIVP
        +IE+K +V  VD R+    + I E    ++F+I++  ESL WL S F  LL+ P T +FF E R EE  LW +K   RKG+ AEI R+   G    I+VP
Subjt:  SIERKSYV--VDRRN---DIRIFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVNGGLNKIIVP

Query:  VGADKYGWKSFLSLLKDPTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENVPISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGW
         GA+K       S   + +      IK+      +     +   +   +  H   E + +  +++++       P + D+AL+  +N+E  +++C  KGW
Subjt:  VGADKYGWKSFLSLLKDPTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENVPISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGW

Query:  YKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDKWSIETFQRLGDACGGYIETTKKTLSRMDMMESS
          VG F V+FE W+ ++      IPSY GWIK+R +P+  W++E+F ++GDACGG++E  K+T    D+ E+S
Subjt:  YKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDKWSIETFQRLGDACGGYIETTKKTLSRMDMMESS

A0A5A7TFK7 DUF4283 domain-containing protein8.3e-3732.92Show/hide
Query:  SIERKSYVVDRRNDIRIF-----ESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVNGGLNKIIVP
        +IE+K +V+   N  R F     E    ++F+I++  ESL WL S F  LL+ P T +FF E R EE  LW +K   RKG+ AEI R+   G    I+VP
Subjt:  SIERKSYVVDRRNDIRIF-----ESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVNGGLNKIIVP

Query:  VGADKYGWKSFLSLL---KDP-TKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENVPISAMHKEVSDFRS------------------------
         GA+K GW  F+SLL   KDP +K N   I   +     ++D     +  ++  R SY E V   +   E S+ R+                        
Subjt:  VGADKYGWKSFLSLL---KDP-TKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENVPISAMHKEVSDFRS------------------------

Query:  ------------------------ISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDKWSIETFQR
                                  P   D+AL+  +N+E  +++C  KGW  VG F V+FE W+ ++      IPSY GWIK+R +P+  W++E+F +
Subjt:  ------------------------ISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDKWSIETFQR

Query:  LGDACGGYIETTKKTLSRMDMMESS
        +GDACGG++E  K+T    D+ E+S
Subjt:  LGDACGGYIETTKKTLSRMDMMESS

A0A5A7U495 DUF4283 domain-containing protein2.8e-3732.73Show/hide
Query:  RPSPTSSTFSIERKSYV--VDRRN---DIRIFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVN
        R  P S T  IE+K +V  VD R+    + I E    ++F+I++  ESL WL S F  LL+   T +FF E R E+Y LW +K   RKG+ AEI R+   
Subjt:  RPSPTSSTFSIERKSYV--VDRRN---DIRIFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVN

Query:  GGLNKIIVPVGADKYGWKSFLSLL---KDPTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENV------------------------------
        G    I+VP G++K GW  F+SLL   KD +K        +      +KD     E   +  R SY E V                              
Subjt:  GGLNKIIVPVGADKYGWKSFLSLL---KDPTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENV------------------------------

Query:  ------------------PISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDK
                           +  +++++       P   D+AL+  +N+E   +IC  KGW  VG F V+FE WN ++      IPSY GWIK+R +P+  
Subjt:  ------------------PISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDK

Query:  WSIETFQRLGDACGGYIETTKKTLSRMDMMESS
        W++E+F ++GDACGG+IE  K+T    D++E+S
Subjt:  WSIETFQRLGDACGGYIETTKKTLSRMDMMESS

A0A5A7VKI3 DUF4283 domain-containing protein7.2e-4139.45Show/hide
Query:  IFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVNGGLNKIIVPVGADKYGWKSFLSLL---KD-
        I E    ++F+I++  ESL WL S F  LL+ P T +FF + R EEY LW +K   RKG+ AEI R+   G    I+VP G++K GW  F+SLL   KD 
Subjt:  IFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVNGGLNKIIVPVGADKYGWKSFLSLL---KD-

Query:  PTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENVPISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVES
         TK N      K   G  +KD L   E   +K R SY + V   +   E ++F    P   D+AL+  ++ E   +IC  KGW  VG F V+FE WN + 
Subjt:  PTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENVPISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVES

Query:  FNKEPKIPSYSGWIKIRNLPVDKWSIETFQRLGDACGGYIETTKKTLSRMDMMESS
              +PSY GWIK+R +P+  W++E+F ++GD  GG++E  K+T    D++E+S
Subjt:  FNKEPKIPSYSGWIKIRNLPVDKWSIETFQRLGDACGGYIETTKKTLSRMDMMESS

A0A5D3CFS8 DUF4283 domain-containing protein8.8e-3932.42Show/hide
Query:  RPSPTSSTFSIERKSYVVDRRNDIR-----IFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVN
        R  P S T  IE+K +V+   N  R     I E    ++F+I++  ESL WL S F  LL+ P T +FF E R E+Y LW +K   RKG+ AEI R+   
Subjt:  RPSPTSSTFSIERKSYVVDRRNDIR-----IFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVN

Query:  GGLNKIIVPVGADKYGWKSFLSLLKDPTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENV---------------------------------
        G    I+VP G++K GW  F+SLL    K +      +      +KD     E   +  R SY E V                                 
Subjt:  GGLNKIIVPVGADKYGWKSFLSLLKDPTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENV---------------------------------

Query:  ---------------PISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDKWSI
                        +  +++++       P   D+AL+  +N+E   +IC  KGW  VG F V+FE WN ++      IPSY GWIK+R +P+  W++
Subjt:  ---------------PISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPWNVESFNKEPKIPSYSGWIKIRNLPVDKWSI

Query:  ETFQRLGDACGGYIETTKKTLSRMDMMESS
        E+F ++GDACGG+IE  K+T    D++E+S
Subjt:  ETFQRLGDACGGYIETTKKTLSRMDMMESS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.7e-1124.87Show/hide
Query:  VGDGKCTSFWRDNWMGISNLQQIYPSLYHLSSRKDAPIVEFWCQQTRSWSFYPRRHLLENEIEDWTSLLSLLQPLINLSRGDSWSWALEDNWAFSTSSLL
        VG G    FW DNW+G+  L ++   L   +       V     +  SW     R      I    +LL   Q L++    DS+ W  + +   +  S  
Subjt:  VGDGKCTSFWRDNWMGISNLQQIYPSLYHLSSRKDAPIVEFWCQQTRSWSFYPRRHLLENEIEDWTSLLSLLQPLINLSRGDSWSWALEDNWAFSTSSLL

Query:  KHLSTTSPNELSV-LYKHIWAGHYPKKVKFFLWEVSHSCINTQDKLHRRSSWLVISPSFCPMCYGDAESLMHIFSNCPYASKYWSFLQA
        +  S   P   +V  +K +W  ++  K  F  W V+ + ++T+D+L    +W +  P+ C +C    +S  H+F  C ++   W F  A
Subjt:  KHLSTTSPNELSV-LYKHIWAGHYPKKVKFFLWEVSHSCINTQDKLHRRSSWLVISPSFCPMCYGDAESLMHIFSNCPYASKYWSFLQA

AT5G16486.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.4e-0625.84Show/hide
Query:  RTRNTIMELLSQSVGDGKCTSFWRDNWMGISNLQQIYPSLYHLSS--RKDAPIVE------FWCQQTRSWSFYPRRHLLENEIEDWTSLLSLLQPLINL-
        + R    E +   VG G   +FW +NW  +  L  +   L    S   ++A + +      +W   +RS +  P   LL+N +         L  ++NL 
Subjt:  RTRNTIMELLSQSVGDGKCTSFWRDNWMGISNLQQIYPSLYHLSS--RKDAPIVE------FWCQQTRSWSFYPRRHLLENEIEDWTSLLSLLQPLINL-

Query:  --SRGDSWSWAL---EDNWAFSTSSLLKHLSTTSPNELSVLYKHIW-AGHYPKKVKFFLWEVSHSCINTQDKLHRRSSWLVISPSFCPMCYGDAESLMHI
          +  D + W +   E +  FS+++   HL+     E    +K IW  G  PK   F  W      + T+DKL    SW +  PS C +C    E+  H+
Subjt:  --SRGDSWSWAL---EDNWAFSTSSLLKHLSTTSPNELSVLYKHIW-AGHYPKKVKFFLWEVSHSCINTQDKLHRRSSWLVISPSFCPMCYGDAESLMHI

Query:  FSNCPYASK
        F +C +A +
Subjt:  FSNCPYASK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCACGACCCTCCCCAACCTCGTCAACCTTCTCTATCGAACGGAAATCTTATGTTGTTGATAGACGAAATGACATACGTATTTTCGAGTCTTCAAAGGACCGCAC
CTTCACCATCTCCCTCAAGAATGAATCTCTCGCCTGGTTAATCTCCTGCTTTGCTGACCTCTTGAATGCCCCTGTCACACAAAAGTTCTTCAAGGAATGTCGCACCGAAG
AATATGTATTATGGACAGAAAAGATTACAATTAGAAAGGGTCATTGTGCCGAAATTGCAAGACTGGGGGTAAATGGTGGGCTGAACAAGATTATTGTCCCTGTTGGAGCT
GATAAATATGGATGGAAAAGTTTCCTCTCTCTTCTTAAAGACCCCACCAAACCAAATCCCCCTGCCATTAAAGAAAAACAGGCACACGGGAACCTTATTAAAGACCACCT
TAAAGAGAAAGAAATAGAGAAGGAAAAGATCAGACACTCATACCAGGAGAACGTCCCTATCTCCGCTATGCATAAAGAAGTTTCAGACTTCAGATCTATCAGCCCTATTC
AACCGGACAGAGCATTATTGGCTGTTGAAAACAAAGAACTGGGACAAATCATCTGTAACATAAAGGGATGGTATAAGGTAGGAGCTTTTAAGGTAAGATTTGAACCATGG
AATGTAGAATCCTTTAACAAGGAACCAAAAATCCCTTCTTACAGTGGATGGATCAAGATTAGGAACTTACCAGTAGATAAATGGTCCATTGAGACTTTTCAAAGGCTTGG
TGATGCTTGCGGAGGCTACATAGAAACAACCAAAAAGACTCTCTCTCGTATGGACATGATGGAGTCATCCAATCACAGTCACGGTGGAACCTTTCTTTGTTGCAGAAAAC
TTCCTTTGATACTTGACCGATATCCATGGCAAAATCCAGCCAACGACAAAGAGGATGAAGATGACCTCTGTGTCAGACCTACAAAGAATTATGATATAAAGGTGCGAAGC
CCATCAGGATCAGTACAGGCCCAATCTCCTGATAATCAAACCATATCCACCTTCGCTCGTACAACCCAGAGTAACCAATCCAATCCGACCCACCCCACCCAACCTGACCT
AAAAGACCAACCCGACTCAAATCGACCCGGCCTGAAATATCACCTTGAACCAACTCAACCCAGCCTAACTCTCAAATCCGACCCAGCCAAATTCGTTCTTCAAAAAACCA
AATCAGCCTCTGTAACAGAGTCTACTTCTTCTCCACCTACCAATAAAGGACCTTCGGATATCCACGATTTGCCTCTTCCCCTGCCGCTGATTAAATCGGTTGTTATCAAC
AACAAGCCATCTTTGCTGATAACGGGAACCAAATTCTCTACTAATCCCAACCGAACATCCATCAATTCGGATGATTTTTTCTCATCTCCGGCCCCGTCAGGCCTTGATTT
TGAAGATTTGAACCCTATGTCCGACACTGACATCATAGTCAGACAAGAATACCCTTGTGAAGAGGTCGTAGAAGAAGAAAATACGGATGCCAAGGAAGAGTCGGGCTCAC
TGGATGATCACACGTCCAAGCGCAGGCTATCAATTAAAATTGACCTTTTAACCTTGGCAACCCGAGAGGATGCCCTGTGGAGACGAAGATGCAAGTTCAAATGGCTTACA
GAGGGAGATGAGAACACTGCCTTTTTCCACAATTATATGGCAGCCACTCGAACAAGAAACACCATTATGGAACTTTTATCCCAATCGGTGGGGGATGGCAAATGCACATC
TTTCTGGAGGGACAATTGGATGGGGATCTCTAACCTTCAACAAATATATCCTAGTCTGTATCATCTTTCTTCTAGAAAAGATGCTCCCATTGTAGAATTTTGGTGTCAAC
AGACTCGATCTTGGTCTTTTTATCCAAGAAGACACTTATTGGAGAATGAAATCGAGGATTGGACCTCCCTTCTATCTTTATTACAGCCGTTGATCAACCTATCTAGAGGA
GATTCATGGTCGTGGGCCCTCGAAGATAATTGGGCATTTTCCACCAGCTCACTTTTGAAGCATCTCTCCACCACAAGCCCTAACGAATTATCAGTCTTATATAAGCATAT
ATGGGCTGGTCATTATCCAAAGAAGGTGAAGTTCTTCCTCTGGGAGGTTAGCCACTCATGCATCAACACTCAAGACAAGCTCCACCGTAGATCTTCGTGGCTGGTTATTT
CTCCATCTTTCTGCCCAATGTGCTATGGAGATGCAGAATCCTTGATGCACATTTTCAGCAACTGTCCATATGCTTCTAAGTATTGGAGCTTCCTCCAAGCGGTTTTTGAA
TGGTCCTTCCCTAGACCGGGTGATATTCTCTCTCTTATCTCTTCGTCTTATGGGCCACCCCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCACGACCCTCCCCAACCTCGTCAACCTTCTCTATCGAACGGAAATCTTATGTTGTTGATAGACGAAATGACATACGTATTTTCGAGTCTTCAAAGGACCGCAC
CTTCACCATCTCCCTCAAGAATGAATCTCTCGCCTGGTTAATCTCCTGCTTTGCTGACCTCTTGAATGCCCCTGTCACACAAAAGTTCTTCAAGGAATGTCGCACCGAAG
AATATGTATTATGGACAGAAAAGATTACAATTAGAAAGGGTCATTGTGCCGAAATTGCAAGACTGGGGGTAAATGGTGGGCTGAACAAGATTATTGTCCCTGTTGGAGCT
GATAAATATGGATGGAAAAGTTTCCTCTCTCTTCTTAAAGACCCCACCAAACCAAATCCCCCTGCCATTAAAGAAAAACAGGCACACGGGAACCTTATTAAAGACCACCT
TAAAGAGAAAGAAATAGAGAAGGAAAAGATCAGACACTCATACCAGGAGAACGTCCCTATCTCCGCTATGCATAAAGAAGTTTCAGACTTCAGATCTATCAGCCCTATTC
AACCGGACAGAGCATTATTGGCTGTTGAAAACAAAGAACTGGGACAAATCATCTGTAACATAAAGGGATGGTATAAGGTAGGAGCTTTTAAGGTAAGATTTGAACCATGG
AATGTAGAATCCTTTAACAAGGAACCAAAAATCCCTTCTTACAGTGGATGGATCAAGATTAGGAACTTACCAGTAGATAAATGGTCCATTGAGACTTTTCAAAGGCTTGG
TGATGCTTGCGGAGGCTACATAGAAACAACCAAAAAGACTCTCTCTCGTATGGACATGATGGAGTCATCCAATCACAGTCACGGTGGAACCTTTCTTTGTTGCAGAAAAC
TTCCTTTGATACTTGACCGATATCCATGGCAAAATCCAGCCAACGACAAAGAGGATGAAGATGACCTCTGTGTCAGACCTACAAAGAATTATGATATAAAGGTGCGAAGC
CCATCAGGATCAGTACAGGCCCAATCTCCTGATAATCAAACCATATCCACCTTCGCTCGTACAACCCAGAGTAACCAATCCAATCCGACCCACCCCACCCAACCTGACCT
AAAAGACCAACCCGACTCAAATCGACCCGGCCTGAAATATCACCTTGAACCAACTCAACCCAGCCTAACTCTCAAATCCGACCCAGCCAAATTCGTTCTTCAAAAAACCA
AATCAGCCTCTGTAACAGAGTCTACTTCTTCTCCACCTACCAATAAAGGACCTTCGGATATCCACGATTTGCCTCTTCCCCTGCCGCTGATTAAATCGGTTGTTATCAAC
AACAAGCCATCTTTGCTGATAACGGGAACCAAATTCTCTACTAATCCCAACCGAACATCCATCAATTCGGATGATTTTTTCTCATCTCCGGCCCCGTCAGGCCTTGATTT
TGAAGATTTGAACCCTATGTCCGACACTGACATCATAGTCAGACAAGAATACCCTTGTGAAGAGGTCGTAGAAGAAGAAAATACGGATGCCAAGGAAGAGTCGGGCTCAC
TGGATGATCACACGTCCAAGCGCAGGCTATCAATTAAAATTGACCTTTTAACCTTGGCAACCCGAGAGGATGCCCTGTGGAGACGAAGATGCAAGTTCAAATGGCTTACA
GAGGGAGATGAGAACACTGCCTTTTTCCACAATTATATGGCAGCCACTCGAACAAGAAACACCATTATGGAACTTTTATCCCAATCGGTGGGGGATGGCAAATGCACATC
TTTCTGGAGGGACAATTGGATGGGGATCTCTAACCTTCAACAAATATATCCTAGTCTGTATCATCTTTCTTCTAGAAAAGATGCTCCCATTGTAGAATTTTGGTGTCAAC
AGACTCGATCTTGGTCTTTTTATCCAAGAAGACACTTATTGGAGAATGAAATCGAGGATTGGACCTCCCTTCTATCTTTATTACAGCCGTTGATCAACCTATCTAGAGGA
GATTCATGGTCGTGGGCCCTCGAAGATAATTGGGCATTTTCCACCAGCTCACTTTTGAAGCATCTCTCCACCACAAGCCCTAACGAATTATCAGTCTTATATAAGCATAT
ATGGGCTGGTCATTATCCAAAGAAGGTGAAGTTCTTCCTCTGGGAGGTTAGCCACTCATGCATCAACACTCAAGACAAGCTCCACCGTAGATCTTCGTGGCTGGTTATTT
CTCCATCTTTCTGCCCAATGTGCTATGGAGATGCAGAATCCTTGATGCACATTTTCAGCAACTGTCCATATGCTTCTAAGTATTGGAGCTTCCTCCAAGCGGTTTTTGAA
TGGTCCTTCCCTAGACCGGGTGATATTCTCTCTCTTATCTCTTCGTCTTATGGGCCACCCCTTTAA
Protein sequenceShow/hide protein sequence
MASRPSPTSSTFSIERKSYVVDRRNDIRIFESSKDRTFTISLKNESLAWLISCFADLLNAPVTQKFFKECRTEEYVLWTEKITIRKGHCAEIARLGVNGGLNKIIVPVGA
DKYGWKSFLSLLKDPTKPNPPAIKEKQAHGNLIKDHLKEKEIEKEKIRHSYQENVPISAMHKEVSDFRSISPIQPDRALLAVENKELGQIICNIKGWYKVGAFKVRFEPW
NVESFNKEPKIPSYSGWIKIRNLPVDKWSIETFQRLGDACGGYIETTKKTLSRMDMMESSNHSHGGTFLCCRKLPLILDRYPWQNPANDKEDEDDLCVRPTKNYDIKVRS
PSGSVQAQSPDNQTISTFARTTQSNQSNPTHPTQPDLKDQPDSNRPGLKYHLEPTQPSLTLKSDPAKFVLQKTKSASVTESTSSPPTNKGPSDIHDLPLPLPLIKSVVIN
NKPSLLITGTKFSTNPNRTSINSDDFFSSPAPSGLDFEDLNPMSDTDIIVRQEYPCEEVVEEENTDAKEESGSLDDHTSKRRLSIKIDLLTLATREDALWRRRCKFKWLT
EGDENTAFFHNYMAATRTRNTIMELLSQSVGDGKCTSFWRDNWMGISNLQQIYPSLYHLSSRKDAPIVEFWCQQTRSWSFYPRRHLLENEIEDWTSLLSLLQPLINLSRG
DSWSWALEDNWAFSTSSLLKHLSTTSPNELSVLYKHIWAGHYPKKVKFFLWEVSHSCINTQDKLHRRSSWLVISPSFCPMCYGDAESLMHIFSNCPYASKYWSFLQAVFE
WSFPRPGDILSLISSSYGPPL