CuGenDBv2

Lag0010857 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0010857
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Transposon TX1 uncharacterized 149 kDa protein
Genome location: chr1:8058849..8060703
RNA-Seq Expression: Lag0010857
Synteny: Lag0010857
Gene Ontology terms:
  GO:0006281 - DNA repair (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004518 - nuclease activity (molecular function)
InterPro domains:
  IPR004808 - AP endonuclease 1
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
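
The genome location listed above uses a `chrom:start..end` notation. As a minimal sketch, it can be split into its parts and measured as follows. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import Tuple

def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr1:8058849..8060703")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)  # chr1 8058849 8060703 1855
```

Under that assumption the locus spans 1,855 bp on chr1.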


Homology
GenBank top hits | e value | % identity (alignment shown below each hit)
RVW87607.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | 9.4e-29 | 29.97
Query:  RGLGDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVEAYGKSG-------------------GLIMWDESCKAFPNAPLECWL
        +G G ++KR  +K FL++  PD+V+IQETK+   D RF+ S+W+ +   W  + A G SG                      MW  +   F N     WL
Subjt:  RGLGDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVEAYGKSG-------------------GLIMWDESCKAFPNAPLECWL

Query:  SQMGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIRLGIKGEIMGLYMSDERNLIQK
              +   D     +  GW G     + + +KA++K+W        K K KS+L+ L  FDA  +   L+   I  R+  KGE+  L + +E +  QK
Subjt:  SQMGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIRLGIKGEIMGLYMSDERNLIQK

Query:  SKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAGQNTALSAPFSVEEIRAAL
        +K+ W+K GD N+ F+H+    ++ +  I  L +  G+ L +   I EEI+ +F  LY    G  +    L W  IS      L +PF+ EEI  A+
Subjt:  SKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAGQNTALSAPFSVEEIRAAL

RVW90071.1 hypothetical protein CK203_035926 [Vitis vinifera] | 3.0e-27 | 29.56
Query:  RGLGDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVEAYGKSGG-LIMWD----------------------ESCKAF-----
        RGLG  +KR  +K FL+   PD+V+IQETK+   + R + S+W+ +   W  + A G SGG L +WD                      E C        
Subjt:  RGLGDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVEAYGKSGG-LIMWD----------------------ESCKAF-----

Query:  --PNAPL----ECWLSQ------MGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIR
          PN+P      C+  +          ++I   L++    GW G     + + +KA++K+W          K KS+L  L  FDA  +   L+   +D R
Subjt:  --PNAPL----ECWLSQ------MGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIR

Query:  LGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAG
           KGE+  L + +E +  QK+++ W+K GD N+ FFH+    ++ +  I  L +  G+ L +   I EEI+ +F  LY    G  +    L W  IS  
Subjt:  LGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAG

Query:  QNTALSAPFSVEEIRAAL
           +L APF+ EEI  A+
Subjt:  QNTALSAPFSVEEIRAAL

XP_038884535.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X1 [Benincasa hispida] | 3.6e-44 | 44.04
Query:  GLIMWDESCKAFPNAPLECWLSQMGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIR
        G   W  S   F N+    WL    C ++I +S  +   Q W GF + S+ + +K  +K+W A+ E  +K +++SLL +++  D +A++      E D+R
Subjt:  GLIMWDESCKAFPNAPLECWLSQMGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIR

Query:  LGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAG
        + +K +++ LY ++ER+LIQKSKLNWL LGDENTSFFHRFLAAK+RKNLI  L +  G+   SFREIE  I+DFFS+LY K  G R +P N++W  +SA 
Subjt:  LGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAG

Query:  QNTALSAPFSVEEIRAAL
         N+ L A FS  EI  A+
Subjt:  QNTALSAPFSVEEIRAAL

XP_038884536.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X2 [Benincasa hispida] | 3.6e-44 | 44.04
Query:  GLIMWDESCKAFPNAPLECWLSQMGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIR
        G   W  S   F N+    WL    C ++I +S  +   Q W GF + S+ + +K  +K+W A+ E  +K +++SLL +++  D +A++      E D+R
Subjt:  GLIMWDESCKAFPNAPLECWLSQMGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIR

Query:  LGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAG
        + +K +++ LY ++ER+LIQKSKLNWL LGDENTSFFHRFLAAK+RKNLI  L +  G+   SFREIE  I+DFFS+LY K  G R +P N++W  +SA 
Subjt:  LGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAG

Query:  QNTALSAPFSVEEIRAAL
         N+ L A FS  EI  A+
Subjt:  QNTALSAPFSVEEIRAAL

XP_038884537.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X3 [Benincasa hispida] | 3.6e-44 | 44.04
Query:  GLIMWDESCKAFPNAPLECWLSQMGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIR
        G   W  S   F N+    WL    C ++I +S  +   Q W GF + S+ + +K  +K+W A+ E  +K +++SLL +++  D +A++      E D+R
Subjt:  GLIMWDESCKAFPNAPLECWLSQMGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIR

Query:  LGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAG
        + +K +++ LY ++ER+LIQKSKLNWL LGDENTSFFHRFLAAK+RKNLI  L +  G+   SFREIE  I+DFFS+LY K  G R +P N++W  +SA 
Subjt:  LGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAG

Query:  QNTALSAPFSVEEIRAAL
         N+ L A FS  EI  A+
Subjt:  QNTALSAPFSVEEIRAAL

TrEMBL top hits | e value | % identity (alignment shown below each hit)
A0A438BQB2 Transposon TX1 uncharacterized 149 kDa protein | 8.1e-26 | 26.84
Query:  RGLGDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVEAYGKSGG-LIMWDE-----------------------------SCK
        RGLG ++KR  +K FL++  PD+V+IQETK+   D RF+ S+W+ +   W  + A G SGG LI+WD                              S  
Subjt:  RGLGDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVEAYGKSGG-LIMWDE-----------------------------SCK

Query:  AFPNAPL----------------------------------------------ECWLSQMGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFAD
          PN+P                                                 WL      +   D     +  GW G     + + +KA++K+W   
Subjt:  AFPNAPL----------------------------------------------ECWLSQMGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFAD

Query:  FEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIRLGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSF
             K K KS+L+ L  FDA  +   L+   +  R   KGE+  L + +E +  QK+K+ W+K GD N+ F+H+    ++ +  I  L +  G+ L + 
Subjt:  FEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIRLGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSF

Query:  REIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAGQNTALSAPFSVEEIRAAL
          I EEI+ +F  LY    G  +    L W  IS      L +PF+ EEI  A+
Subjt:  REIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAGQNTALSAPFSVEEIRAAL

A0A438HT14 Transposon TX1 uncharacterized 149 kDa protein | 4.6e-29 | 29.97
Query:  RGLGDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVEAYGKSG-------------------GLIMWDESCKAFPNAPLECWL
        +G G ++KR  +K FL++  PD+V+IQETK+   D RF+ S+W+ +   W  + A G SG                      MW  +   F N     WL
Subjt:  RGLGDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVEAYGKSG-------------------GLIMWDESCKAFPNAPLECWL

Query:  SQMGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIRLGIKGEIMGLYMSDERNLIQK
              +   D     +  GW G     + + +KA++K+W        K K KS+L+ L  FDA  +   L+   I  R+  KGE+  L + +E +  QK
Subjt:  SQMGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIRLGIKGEIMGLYMSDERNLIQK

Query:  SKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAGQNTALSAPFSVEEIRAAL
        +K+ W+K GD N+ F+H+    ++ +  I  L +  G+ L +   I EEI+ +F  LY    G  +    L W  IS      L +PF+ EEI  A+
Subjt:  SKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAGQNTALSAPFSVEEIRAAL

A0A438I033 Uncharacterized protein | 1.5e-27 | 29.56
Query:  RGLGDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVEAYGKSGG-LIMWD----------------------ESCKAF-----
        RGLG  +KR  +K FL+   PD+V+IQETK+   + R + S+W+ +   W  + A G SGG L +WD                      E C        
Subjt:  RGLGDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVEAYGKSGG-LIMWD----------------------ESCKAF-----

Query:  --PNAPL----ECWLSQ------MGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIR
          PN+P      C+  +          ++I   L++    GW G     + + +KA++K+W          K KS+L  L  FDA  +   L+   +D R
Subjt:  --PNAPL----ECWLSQ------MGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIR

Query:  LGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAG
           KGE+  L + +E +  QK+++ W+K GD N+ FFH+    ++ +  I  L +  G+ L +   I EEI+ +F  LY    G  +    L W  IS  
Subjt:  LGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAG

Query:  QNTALSAPFSVEEIRAAL
           +L APF+ EEI  A+
Subjt:  QNTALSAPFSVEEIRAAL

A0A438I438 Uncharacterized protein | 8.9e-25 | 30.11
Query:  RGLGDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVEAYGKSGG-LIMWDESCKAFPNAPLECWLSQMGCDKVILDSLFLDRS
        RGLG ++KR  +K FL++  PD+V+IQETK+   D RF+ S+W+ +   W  + A G SGG LI+WD                     K+ ++  F    
Subjt:  RGLGDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVEAYGKSGG-LIMWDESCKAFPNAPLECWLSQMGCDKVILDSLFLDRS

Query:  QGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIRLGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHR
          W+      K  N K   + W++ F+ +     K  + +L+      E  + SD+ +  R   KGE+  L + +E +  QK+K+ W+K GD N+ F+H+
Subjt:  QGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIRLGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHR

Query:  FLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAGQNTALSAPFSVEEIRAAL
            ++ +  I  L +  G+ L +   I EEI+ +F  LY    G  +    L W  IS      L +PF+ EEI  A+
Subjt:  FLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAGQNTALSAPFSVEEIRAAL

A5BCE8 Reverse transcriptase domain-containing protein | 1.9e-27 | 28.44
Query:  GDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVEAYGKSGG-LIMWD----------------------ESCKAF-------P
        G ++KR  +K FL++  PD+V+IQETK    D RF+ S+W+ +   W  + A G SGG LI+WD                      + C          P
Subjt:  GDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVEAYGKSGG-LIMWD----------------------ESCKAF-------P

Query:  NAP-----------------LECWLSQMGCDKVILDSLFLD-----RSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSV
        N+P                    W   M          F D     +  GW G     + + +KA++K+W        K K KS+L+ L  FDA  +   
Subjt:  NAP-----------------LECWLSQMGCDKVILDSLFLD-----RSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSV

Query:  LSDIEIDIRLGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSN
        L+   +  R   KGE+  L + +E +  QK+K+ W+K GD N+ F+H+    ++ +  I  L +  G+ L +   I EEI+ +F  LY    G  +    
Subjt:  LSDIEIDIRLGIKGEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSN

Query:  LSWGTISAGQNTALSAPFSVEEIRAAL
        L W  IS      L +PF+ EEI  A+
Subjt:  LSWGTISAGQNTALSAPFSVEEIRAAL

SwissProt top hits | e value | % identity (alignment shown below each hit)
P0CT34 Transposon Tf2-1 polyprotein | 1.1e-06 | 43.86
Query:  MDFIEGLPQSRGYNSIFVVVDRLSKYAHFIPLSHPYNAKGVAIVFVKEIVRLHGFPR
        MDFI  LP+S GYN++FVVVDR SK A  +P +    A+  A +F + ++   G P+
Subjt:  MDFIEGLPQSRGYNSIFVVVDRLSKYAHFIPLSHPYNAKGVAIVFVKEIVRLHGFPR

P0CT35 Transposon Tf2-2 polyprotein | 1.1e-06 | 43.86
Query:  MDFIEGLPQSRGYNSIFVVVDRLSKYAHFIPLSHPYNAKGVAIVFVKEIVRLHGFPR
        MDFI  LP+S GYN++FVVVDR SK A  +P +    A+  A +F + ++   G P+
Subjt:  MDFIEGLPQSRGYNSIFVVVDRLSKYAHFIPLSHPYNAKGVAIVFVKEIVRLHGFPR

P0CT36 Transposon Tf2-3 polyprotein | 1.1e-06 | 43.86
Query:  MDFIEGLPQSRGYNSIFVVVDRLSKYAHFIPLSHPYNAKGVAIVFVKEIVRLHGFPR
        MDFI  LP+S GYN++FVVVDR SK A  +P +    A+  A +F + ++   G P+
Subjt:  MDFIEGLPQSRGYNSIFVVVDRLSKYAHFIPLSHPYNAKGVAIVFVKEIVRLHGFPR

P0CT41 Transposon Tf2-12 polyprotein | 1.1e-06 | 43.86
Query:  MDFIEGLPQSRGYNSIFVVVDRLSKYAHFIPLSHPYNAKGVAIVFVKEIVRLHGFPR
        MDFI  LP+S GYN++FVVVDR SK A  +P +    A+  A +F + ++   G P+
Subjt:  MDFIEGLPQSRGYNSIFVVVDRLSKYAHFIPLSHPYNAKGVAIVFVKEIVRLHGFPR

Q9UR07 Transposon Tf2-11 polyprotein | 1.1e-06 | 43.86
Query:  MDFIEGLPQSRGYNSIFVVVDRLSKYAHFIPLSHPYNAKGVAIVFVKEIVRLHGFPR
        MDFI  LP+S GYN++FVVVDR SK A  +P +    A+  A +F + ++   G P+
Subjt:  MDFIEGLPQSRGYNSIFVVVDRLSKYAHFIPLSHPYNAKGVAIVFVKEIVRLHGFPR

Arabidopsis top hits | e value | % identity (alignment shown below each hit)
AT1G43760.1 DNAse I-like superfamily protein | 9.2e-06 | 37.29
Query:  QKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSL
        QKS++ WL+ GD NT FFH+ + A + KNLI  L   + V + +  +++E I+ +++ L
Subjt:  QKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSL
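
The % identity values reported for each hit can be checked against the alignment blocks. In BLAST-style output the middle line repeats the residue letter at identical positions, marks conservative substitutions with `+`, and is blank elsewhere. The sketch below assumes that identity is computed as identical columns divided by alignment length (gap columns included); the function name `percent_identity` is hypothetical. It uses the single-block Arabidopsis alignment above as input.

```python
def percent_identity(query: str, midline: str) -> float:
    """Return identical columns / alignment length, as a percentage.

    Assumption: letters in the midline mark identical residues, while
    '+' and blanks mark non-identical columns (BLAST-style convention).
    """
    alignment_length = len(query)  # gap columns ('-') in the query count too
    identities = sum(ch.isalpha() for ch in midline)
    return 100.0 * identities / alignment_length

# Query line and midline of the AT1G43760.1 hit, with the "Query:" prefix removed.
query = "QKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSL"
midline = "QKS++ WL+ GD NT FFH+ + A + KNLI  L   + V + +  +++E I+ +++ L"
print(round(percent_identity(query, midline), 2))  # 37.29
```

Here 22 identical columns over 59 aligned positions give 37.29, matching the value listed for this hit. For hits whose alignment spans several blocks, the counts would need to be summed over all blocks before dividing.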


Sequences
CDS sequence
ATGGATTTTATCGAAGGATTACCTCAGTCACGGGGTTACAACTCCATCTTTGTGGTTGTGGACCGCCTTAGCAAGTATGCCCATTTTATTCCTTTAAGTCATCCGTACAA
TGCTAAGGGGGTAGCCATTGTGTTTGTTAAAGAGATAGTTCGCTTACATGGATTTCCCCGGGGTCTCGGTGATAAGTCTAAAAGGGTGTCTTTGAAGAAATTCCTTCAGA
ATAGTTGCCCTGATTTAGTCCTAATTCAGGAGACTAAACAAGCAGCAATTGATTTGAGATTCATTGAATCTTTATGGAGTTCCAAGGAAATCAGCTGGTCGTTTGTGGAA
GCCTATGGAAAATCGGGAGGTCTTATTATGTGGGATGAAAGTTGTAAGGCTTTCCCCAATGCTCCTTTAGAATGTTGGCTTTCTCAAATGGGGTGTGATAAGGTTATTTT
GGATTCTCTTTTTCTTGATCGTTCTCAAGGATGGGTTGGTTTTGTTATTAGCTCTAAATTCAAAAATTTAAAAGCAGAAATCAAGAAGTGGTTTGCAGATTTTGAAGCTA
GCAGAAAAAGAAAAGATAAAAGTTTGCTTTCTAAACTTGAATTCTTTGATGCAAAGGCTGAAAGTTCTGTTTTATCCGACATTGAGATTGATATTCGGCTGGGTATAAAA
GGGGAAATTATGGGATTATATATGTCTGATGAAAGAAATTTAATTCAGAAAAGTAAGCTTAATTGGTTAAAGCTTGGGGATGAGAACACTAGTTTTTTCCATCGATTTTT
GGCAGCCAAGAAGAGGAAGAATTTGATTACAAACTTAATTTCTAGCAATGGGGTTTCTTTAGTTTCCTTCAGGGAAATTGAAGAAGAAATTATGGATTTCTTTTCTTCTT
TATATCAGAAAATTCCAGGTCATCGGTTTCTACCCTCTAATTTATCTTGGGGTACTATTTCAGCTGGTCAAAACACAGCCCTTTCGGCTCCTTTTTCTGTTGAAGAAATT
AGGGCCGCTTTGTAG
mRNA sequence
ATGGATTTTATCGAAGGATTACCTCAGTCACGGGGTTACAACTCCATCTTTGTGGTTGTGGACCGCCTTAGCAAGTATGCCCATTTTATTCCTTTAAGTCATCCGTACAA
TGCTAAGGGGGTAGCCATTGTGTTTGTTAAAGAGATAGTTCGCTTACATGGATTTCCCCGGGGTCTCGGTGATAAGTCTAAAAGGGTGTCTTTGAAGAAATTCCTTCAGA
ATAGTTGCCCTGATTTAGTCCTAATTCAGGAGACTAAACAAGCAGCAATTGATTTGAGATTCATTGAATCTTTATGGAGTTCCAAGGAAATCAGCTGGTCGTTTGTGGAA
GCCTATGGAAAATCGGGAGGTCTTATTATGTGGGATGAAAGTTGTAAGGCTTTCCCCAATGCTCCTTTAGAATGTTGGCTTTCTCAAATGGGGTGTGATAAGGTTATTTT
GGATTCTCTTTTTCTTGATCGTTCTCAAGGATGGGTTGGTTTTGTTATTAGCTCTAAATTCAAAAATTTAAAAGCAGAAATCAAGAAGTGGTTTGCAGATTTTGAAGCTA
GCAGAAAAAGAAAAGATAAAAGTTTGCTTTCTAAACTTGAATTCTTTGATGCAAAGGCTGAAAGTTCTGTTTTATCCGACATTGAGATTGATATTCGGCTGGGTATAAAA
GGGGAAATTATGGGATTATATATGTCTGATGAAAGAAATTTAATTCAGAAAAGTAAGCTTAATTGGTTAAAGCTTGGGGATGAGAACACTAGTTTTTTCCATCGATTTTT
GGCAGCCAAGAAGAGGAAGAATTTGATTACAAACTTAATTTCTAGCAATGGGGTTTCTTTAGTTTCCTTCAGGGAAATTGAAGAAGAAATTATGGATTTCTTTTCTTCTT
TATATCAGAAAATTCCAGGTCATCGGTTTCTACCCTCTAATTTATCTTGGGGTACTATTTCAGCTGGTCAAAACACAGCCCTTTCGGCTCCTTTTTCTGTTGAAGAAATT
AGGGCCGCTTTGTAG
Protein sequence
MDFIEGLPQSRGYNSIFVVVDRLSKYAHFIPLSHPYNAKGVAIVFVKEIVRLHGFPRGLGDKSKRVSLKKFLQNSCPDLVLIQETKQAAIDLRFIESLWSSKEISWSFVE
AYGKSGGLIMWDESCKAFPNAPLECWLSQMGCDKVILDSLFLDRSQGWVGFVISSKFKNLKAEIKKWFADFEASRKRKDKSLLSKLEFFDAKAESSVLSDIEIDIRLGIK
GEIMGLYMSDERNLIQKSKLNWLKLGDENTSFFHRFLAAKKRKNLITNLISSNGVSLVSFREIEEEIMDFFSSLYQKIPGHRFLPSNLSWGTISAGQNTALSAPFSVEEI
RAAL
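
The protein sequence above should correspond to a translation of the listed CDS. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical. It is illustrated on the first and last codons of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

# First nine codons and last five codons of the CDS listed above.
print(translate("ATGGATTTTATCGAAGGATTACCTCAG"))  # MDFIEGLPQ
print(translate("AGGGCCGCTTTGTAG"))              # RAAL
```

Both fragments agree with the start (`MDFIEGLPQ`) and end (`RAAL`) of the protein sequence; passing the full CDS to `translate` would allow the same comparison over the whole record.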