CuGenDBv2

Spg010617 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg010617
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: hAT transposon superfamily protein
Genome location: scaffold5:12374116..12377562
RNA-Seq Expression: Spg010617
Synteny: Spg010617
Gene Ontology terms: NA
InterPro domains:
  IPR007021 - Domain of unknown function DUF659
  IPR012337 - Ribonuclease H-like superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA8546963.1 hypothetical protein F0562_003392 [Nyssa sinensis] (e value: 4.1e-60; % identity: 59.8)
Query:  MEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSK-----------------------------AGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRS
        M KRK+LF S CAAH LDLIL+ IG++PLH D +SK                             A VTRFAT YLTL+ L + KI LRAMF SK+WQ S
Subjt:  MEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSK-----------------------------AGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRS

Query:  NYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYY
         Y+KK +    M IILA  +FW+SIKYCLKCV+PLVKVLRLVDGD +PAMGYIY+AMD+A EQIEKNF+ ++KHYKP+WDIID RW MQLH+PL+A  YY
Subjt:  NYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYY

Query:  LNPR
        LNPR
Subjt:  LNPR

RWR88638.1 hypothetical protein CKAN_01766500 [Cinnamomum micranthum f. kanehirae] (e value: 2.0e-59; % identity: 51.1)
Query:  VQKSSSITLNTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSK-----------------------------AGVTRFATFYLTLQSLAENK
        V  S++  +  GE LMEKRK++FWS C+AH LDL+L DIG +P H   V+K                             A VTRFAT +LTL+S+ E +
Subjt:  VQKSSSITLNTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSK-----------------------------AGVTRFATFYLTLQSLAENK

Query:  IGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKR
        + LR+MF SK+W+ S+Y+K  DG  V  IIL +  FW  IKYC+KCV PLVKVLRLVD D RP MGYIY+AMDRAKE I KNF+ ++K Y PIW I+D R
Subjt:  IGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKR

Query:  WTMQLHRPLHATTYYLNPRLSFIPNFH
        W +QLHRPLHA  Y+LNP   +   FH
Subjt:  WTMQLHRPLHATTYYLNPRLSFIPNFH

RWR88638.1 hypothetical protein CKAN_01766500 [Cinnamomum micranthum f. kanehirae] (e value: 1.0e-02; % identity: 46)
Query:  GIYRIKHHLAHTRKNVAPCCKVSDEVRDIFKNLLNGNK--RNKDGDDFND
        G++R+KHHLA T++N+ PC +V D+V ++ K LL  NK  + K  DD  D
Subjt:  GIYRIKHHLAHTRKNVAPCCKVSDEVRDIFKNLLNGNK--RNKDGDDFND

XP_030945716.1 uncharacterized protein LOC115970195 [Quercus lobata] (e value: 1.4e-60; % identity: 52.21)
Query:  VQKSSSITLNTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKA-----------------------------GVTRFATFYLTLQSLAENK
        V  S+   ++ GE LMEK++++FW+ C AH +DL+L DIGD+P+H + + KA                             GVTRFAT Y TL+S+ + K
Subjt:  VQKSSSITLNTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKA-----------------------------GVTRFATFYLTLQSLAENK

Query:  IGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKR
        IGLR+MF S++W +S YAKKSD   V  I L++  FW +IK+CLKCVIPLVKVLRLVDGDA+PAMGYIY+AMDRAKEQI K  N +Q+ Y+PI  II+ R
Subjt:  IGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKR

Query:  WTMQLHRPLHATTYYLNPRLSFIPNF
        W +QLHRPLHA  Y+LNP+  + PNF
Subjt:  WTMQLHRPLHATTYYLNPRLSFIPNF

XP_030959124.1 uncharacterized protein LOC115981079 [Quercus lobata] (e value: 4.4e-62; % identity: 59.9)
Query:  VQKSSSITLNTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKAGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRSNYAKKSDGATVMKI
        V  S+S  ++TGE LMEKR ++FW+ CAAH +DL+L DIG        +++ GV RFAT YLTL+++ + KIGLR+MF S++W +S YAKKSDG  V  I
Subjt:  VQKSSSITLNTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKAGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRSNYAKKSDGATVMKI

Query:  ILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYYLNPRLSFIPNF
        +L++  FW +IK+CLKCVIPLVKVLRLVDGDA+PAMGYIY+AMDRAKEQI KNFN +Q+ Y+PI  II+ RW +QLHRPLH   Y+LNP+  + PNF
Subjt:  ILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYYLNPRLSFIPNF

XP_030969699.1 uncharacterized protein LOC115989976 [Quercus lobata] (e value: 1.2e-64; % identity: 54.42)
Query:  VQKSSSITLNTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKA-----------------------------GVTRFATFYLTLQSLAENK
        V  S+S  ++ GE LMEKR+++FW+ C AH +DL+L DIGD+P+H + + KA                             GVTRFAT YLTL+S+ + K
Subjt:  VQKSSSITLNTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKA-----------------------------GVTRFATFYLTLQSLAENK

Query:  IGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKR
        IGLR+MF S++W +S YAKKSDG  V  I+L++  FW +IK+CLKCVIPLVKVLRLVDGDA+PAMGYIY+AMDRAKEQ+ KNFN +Q+ Y+PI  II+ R
Subjt:  IGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKR

Query:  WTMQLHRPLHATTYYLNPRLSFIPNF
        W +QLHRPLHA  Y+LNP+  + PNF
Subjt:  WTMQLHRPLHATTYYLNPRLSFIPNF

TrEMBL top hits (e value, % identity, alignment)
A0A151QZQ8 DUF659 domain-containing protein (e value: 2.8e-54; % identity: 51.4)
Query:  MLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKA-----------------------------GVTRFATFYLTLQSLAENKIGLRAMFGSKDWQ
        MLMEKR +LFWS CAAH LDLIL DIG++P+  + ++ A                              VTRFAT YLTL  + + K  LR+MF S++W 
Subjt:  MLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKA-----------------------------GVTRFATFYLTLQSLAENKIGLRAMFGSKDWQ

Query:  RSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATT
         S +A KS+   VM ++L++  FW SI YCLKCVIPLVKVLRLVDGD++PA  YIY+AMDRAKE+I +NF   +  YK +W IID RW +QLHRPLHA  
Subjt:  RSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATT

Query:  YYLNPRLSFIPNFH
        YYLNPR  +  NF+
Subjt:  YYLNPRLSFIPNFH

A0A2R6PGA0 Myosin-3 like (e value: 2.8e-54; % identity: 49.12)
Query:  VQKSSSITLNTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKA-----------------------------GVTRFATFYLTLQSLAENK
        V  S+S  +  G+ L EKR  LFWS CAAH LDL+LSDIG++P+  D + KA                              +TRFAT YLTL+S  E +
Subjt:  VQKSSSITLNTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKA-----------------------------GVTRFATFYLTLQSLAENK

Query:  IGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKR
        + LRAMF S  W  S+Y+K   G  V  II+ +  FW +IKYC+K V PLVKVLRLVDGDA+PAMGYIY+A+DRAKE+I KN + +++ Y+ IW I+D R
Subjt:  IGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKR

Query:  WTMQLHRPLHATTYYLNPRLSFIPNF
        W +QLHRPLHA  +YLNP+  +  NF
Subjt:  WTMQLHRPLHATTYYLNPRLSFIPNF

A0A3S3NB18 Uncharacterized protein (e value: 9.9e-60; % identity: 51.1)
Query:  VQKSSSITLNTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSK-----------------------------AGVTRFATFYLTLQSLAENK
        V  S++  +  GE LMEKRK++FWS C+AH LDL+L DIG +P H   V+K                             A VTRFAT +LTL+S+ E +
Subjt:  VQKSSSITLNTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSK-----------------------------AGVTRFATFYLTLQSLAENK

Query:  IGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKR
        + LR+MF SK+W+ S+Y+K  DG  V  IIL +  FW  IKYC+KCV PLVKVLRLVD D RP MGYIY+AMDRAKE I KNF+ ++K Y PIW I+D R
Subjt:  IGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKR

Query:  WTMQLHRPLHATTYYLNPRLSFIPNFH
        W +QLHRPLHA  Y+LNP   +   FH
Subjt:  WTMQLHRPLHATTYYLNPRLSFIPNFH

A0A3S3NB18 Uncharacterized protein (e value: 5.0e-03; % identity: 46)
Query:  GIYRIKHHLAHTRKNVAPCCKVSDEVRDIFKNLLNGNK--RNKDGDDFND
        G++R+KHHLA T++N+ PC +V D+V ++ K LL  NK  + K  DD  D
Subjt:  GIYRIKHHLAHTRKNVAPCCKVSDEVRDIFKNLLNGNK--RNKDGDDFND

A0A3S3NB18 Uncharacterized protein (e value: 1.9e-55; % identity: 49.78)
Query:  VQKSSSITLNTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKA-----------------------------GVTRFATFYLTLQSLAENK
        V   +S  +  G+MLMEKR +LFWS CAAH LDLIL DIG++P+  + ++ A                              VTRFAT YLTL  + + K
Subjt:  VQKSSSITLNTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKA-----------------------------GVTRFATFYLTLQSLAENK

Query:  IGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKR
          LR+MF S++W  S +A KS+   VM ++L++  FW SI YCLKCVIPLVKVLRLVDGD++PA  YIY+AMDRAKE+I +NF   +  YK +W IID R
Subjt:  IGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKR

Query:  WTMQLHRPLHATTYYLNPRLSFIPNFH
        W +QLHRPLHA  YYLNPR  +  NF+
Subjt:  WTMQLHRPLHATTYYLNPRLSFIPNFH

A0A5J5BV14 Uncharacterized protein (e value: 2.0e-60; % identity: 59.8)
Query:  MEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSK-----------------------------AGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRS
        M KRK+LF S CAAH LDLIL+ IG++PLH D +SK                             A VTRFAT YLTL+ L + KI LRAMF SK+WQ S
Subjt:  MEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSK-----------------------------AGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRS

Query:  NYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYY
         Y+KK +    M IILA  +FW+SIKYCLKCV+PLVKVLRLVDGD +PAMGYIY+AMD+A EQIEKNF+ ++KHYKP+WDIID RW MQLH+PL+A  YY
Subjt:  NYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYY

Query:  LNPR
        LNPR
Subjt:  LNPR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G13020.1 hAT transposon superfamily protein (e value: 1.2e-28; % identity: 33.94)
Query:  GEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKAGVT------------------------------RFATFYLTLQSLAENKIGLRAMFGSK
        G++     +++FWS+  +H  +L+L  IG M    DI+ K                                   F   YL L+S+ + K  L AMF S 
Subjt:  GEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKAGVT------------------------------RFATFYLTLQSLAENKIGLRAMFGSK

Query:  DWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLV-DGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPL
         W      KK +G +V  ++  +SSFW +++  LKC  PL   LRL  + D    +GYIYD +D  K  I+K FN  +KHY  +WD+ID  W   LH PL
Subjt:  DWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLV-DGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPL

Query:  HATTYYLNPRLSFIPNFH
        HA  YYLNP   +  +FH
Subjt:  HATTYYLNPRLSFIPNFH

AT3G13030.1 hAT transposon superfamily protein (e value: 3.4e-28; % identity: 30.15)
Query:  DGDDFNDEDDVESKKKLMMGAFGK-GVQKSSSITL--------NTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSK---------------
        D  DF  +DDV +   L+ G   + GV+  + I            GE+     +++FWS+  +H  +L+L  I  +    DI  K               
Subjt:  DGDDFNDEDDVESKKKLMMGAFGK-GVQKSSSITL--------NTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSK---------------

Query:  ---------------AGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAM
                       +    F T YL L+S+ + K  L AMF S +W        ++    +  ++++SSFW +++  LKC  PL+  L L        +
Subjt:  ---------------AGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAM

Query:  GYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYYLNPRLSFIPNFH
        GY+YD MD  KE I + FN   + YKP+WD+ID  W   LH PLHA  Y+LNP   +  NFH
Subjt:  GYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYYLNPRLSFIPNFH

AT3G13030.2 hAT transposon superfamily protein (e value: 3.4e-28; % identity: 30.15)
Query:  DGDDFNDEDDVESKKKLMMGAFGK-GVQKSSSITL--------NTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSK---------------
        D  DF  +DDV +   L+ G   + GV+  + I            GE+     +++FWS+  +H  +L+L  I  +    DI  K               
Subjt:  DGDDFNDEDDVESKKKLMMGAFGK-GVQKSSSITL--------NTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSK---------------

Query:  ---------------AGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAM
                       +    F T YL L+S+ + K  L AMF S +W        ++    +  ++++SSFW +++  LKC  PL+  L L        +
Subjt:  ---------------AGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAM

Query:  GYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYYLNPRLSFIPNFH
        GY+YD MD  KE I + FN   + YKP+WD+ID  W   LH PLHA  Y+LNP   +  NFH
Subjt:  GYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYYLNPRLSFIPNFH

AT3G13030.3 hAT transposon superfamily protein (e value: 3.4e-28; % identity: 30.15)
Query:  DGDDFNDEDDVESKKKLMMGAFGK-GVQKSSSITL--------NTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSK---------------
        D  DF  +DDV +   L+ G   + GV+  + I            GE+     +++FWS+  +H  +L+L  I  +    DI  K               
Subjt:  DGDDFNDEDDVESKKKLMMGAFGK-GVQKSSSITL--------NTGEMLMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSK---------------

Query:  ---------------AGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAM
                       +    F T YL L+S+ + K  L AMF S +W        ++    +  ++++SSFW +++  LKC  PL+  L L        +
Subjt:  ---------------AGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAM

Query:  GYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYYLNPRLSFIPNFH
        GY+YD MD  KE I + FN   + YKP+WD+ID  W   LH PLHA  Y+LNP   +  NFH
Subjt:  GYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYYLNPRLSFIPNFH

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related (e value: 1.5e-31; % identity: 44.76)
Query:  KAGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQI
        +  +TR AT ++TL      K  LR M  S +W  S + K++ G  + K      SFW ++ + LK   PL++VLR+VDG+ +P MGYIY AMD+AKE I
Subjt:  KAGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKVLRLVDGDARPAMGYIYDAMDRAKEQI

Query:  EKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYYLNPRLSF
         K+F   +++YK  ++IID+RW +QLHRPLHA  YYLNP   +
Subjt:  EKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYYLNPRLSF
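
The % identity reported for each hit can be cross-checked against the alignment midline (the unlabelled line between Query and Subjt), where a letter marks an identical residue, "+" a conservative substitution and a space a mismatch or gap. The sketch below is illustrative only: the function name is hypothetical, and it assumes identity is taken as identical columns divided by total alignment columns, gaps included.

```python
from typing import Optional


def percent_identity_from_midline(midline: str,
                                  alignment_length: Optional[int] = None) -> float:
    """Estimate % identity from a BLAST-style midline.

    Assumption: letters in the midline mark identical columns; '+' and
    spaces are non-identical. Scraped text may lose trailing spaces, so
    the true column count can be passed explicitly (for example, the
    length of the Query line of the same block).
    """
    length = len(midline) if alignment_length is None else alignment_length
    if length <= 0:
        raise ValueError("alignment length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / length


if __name__ == "__main__":
    # Toy midline: three identical columns out of five.
    print(percent_identity_from_midline("AB+ D"))
```

Applied to the second RWR88638.1 alignment above, whose midline has 23 letters across 50 columns, this convention gives 46, which agrees with the value listed for that hit. For alignments split over several blocks, the midlines would need to be concatenated first.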


Sequences
CDS sequence
ATGTACGACCAGATCCTCCTCTCAAGCTTGGGTGAAGAGTTGGCAATTGGAAAACTACTCGCAAATTGCAAATCTTGGAGTTTCAATCCAGGTATATATAGGATTAAACA
TCATCTAGCACATACTAGAAAAAATGTCGCTCCATGTTGTAAAGTTTCGGATGAGGTTCGAGATATTTTCAAGAACTTATTGAATGGAAATAAAAGAAATAAAGATGGAG
ATGATTTTAATGATGAAGATGATGTTGAGAGCAAGAAGAAACTTATGATGGGTGCATTTGGTAAAGGCGTACAGAAATCTAGTTCTATTACCCTAAATACTGGAGAAATG
TTAATGGAGAAGCGTAAACAACTATTCTGGTCTCTTTGTGCGGCACATCGCCTAGACTTGATTTTATCTGATATTGGAGATATGCCCTTACATAATGATATAGTGTCTAA
AGCTGGTGTGACTCGATTTGCAACTTTCTACTTGACTTTGCAAAGCTTGGCAGAAAATAAAATAGGTTTGAGAGCGATGTTTGGGTCTAAAGATTGGCAAAGAAGTAACT
ATGCAAAGAAGTCTGATGGGGCTACGGTAATGAAAATTATTCTTGCTAATTCAAGCTTTTGGTCATCAATAAAGTATTGTTTAAAATGTGTTATCCCTTTAGTAAAAGTT
TTAAGACTTGTTGATGGTGATGCGAGACCTGCAATGGGATACATATACGATGCAATGGACAGGGCAAAGGAACAAATTGAGAAGAATTTCAACAGAATTCAGAAACATTA
TAAGCCTATTTGGGATATTATTGATAAAAGATGGACAATGCAACTTCACAGGCCTCTTCATGCAACAACATATTATCTTAATCCAAGGTTAAGTTTCATTCCTAATTTTC
ATCATTAG
mRNA sequence
ATGTACGACCAGATCCTCCTCTCAAGCTTGGGTGAAGAGTTGGCAATTGGAAAACTACTCGCAAATTGCAAATCTTGGAGTTTCAATCCAGGTATATATAGGATTAAACA
TCATCTAGCACATACTAGAAAAAATGTCGCTCCATGTTGTAAAGTTTCGGATGAGGTTCGAGATATTTTCAAGAACTTATTGAATGGAAATAAAAGAAATAAAGATGGAG
ATGATTTTAATGATGAAGATGATGTTGAGAGCAAGAAGAAACTTATGATGGGTGCATTTGGTAAAGGCGTACAGAAATCTAGTTCTATTACCCTAAATACTGGAGAAATG
TTAATGGAGAAGCGTAAACAACTATTCTGGTCTCTTTGTGCGGCACATCGCCTAGACTTGATTTTATCTGATATTGGAGATATGCCCTTACATAATGATATAGTGTCTAA
AGCTGGTGTGACTCGATTTGCAACTTTCTACTTGACTTTGCAAAGCTTGGCAGAAAATAAAATAGGTTTGAGAGCGATGTTTGGGTCTAAAGATTGGCAAAGAAGTAACT
ATGCAAAGAAGTCTGATGGGGCTACGGTAATGAAAATTATTCTTGCTAATTCAAGCTTTTGGTCATCAATAAAGTATTGTTTAAAATGTGTTATCCCTTTAGTAAAAGTT
TTAAGACTTGTTGATGGTGATGCGAGACCTGCAATGGGATACATATACGATGCAATGGACAGGGCAAAGGAACAAATTGAGAAGAATTTCAACAGAATTCAGAAACATTA
TAAGCCTATTTGGGATATTATTGATAAAAGATGGACAATGCAACTTCACAGGCCTCTTCATGCAACAACATATTATCTTAATCCAAGGTTAAGTTTCATTCCTAATTTTC
ATCATTAG
Protein sequence
MYDQILLSSLGEELAIGKLLANCKSWSFNPGIYRIKHHLAHTRKNVAPCCKVSDEVRDIFKNLLNGNKRNKDGDDFNDEDDVESKKKLMMGAFGKGVQKSSSITLNTGEM
LMEKRKQLFWSLCAAHRLDLILSDIGDMPLHNDIVSKAGVTRFATFYLTLQSLAENKIGLRAMFGSKDWQRSNYAKKSDGATVMKIILANSSFWSSIKYCLKCVIPLVKV
LRLVDGDARPAMGYIYDAMDRAKEQIEKNFNRIQKHYKPIWDIIDKRWTMQLHRPLHATTYYLNPRLSFIPNFHH
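
As a sanity check on a record like this one, the genome location string and the CDS/protein pair can be validated with a few lines of Python. This is a minimal sketch under stated assumptions: the helper names are hypothetical, coordinates are assumed to be 1-based and inclusive, and the CDS is assumed to start with ATG and end with a single standard stop codon that is not represented in the protein sequence.

```python
import re
from typing import Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def cds_consistent_with_protein(cds: str, protein: str) -> bool:
    """Check that the CDS length matches the protein length plus one stop codon."""
    cds = "".join(cds.split()).upper()      # drop line breaks from the page
    protein = "".join(protein.split())
    return (
        len(cds) % 3 == 0
        and cds.startswith("ATG")
        and cds[-3:] in STOP_CODONS
        and len(cds) // 3 == len(protein) + 1
    )


if __name__ == "__main__":
    seqid, start, end = parse_location("scaffold5:12374116..12377562")
    print(seqid, span_length(start, end))
```

To check this record, paste the CDS and protein sequences above into `cds_consistent_with_protein`; line breaks are ignored, so the wrapped text can be used as-is.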