; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000754 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000754
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr4:15271020..15275414
RNA-Seq ExpressionLag0000754
SyntenyLag0000754
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.9e-16751.31Show/hide
Query:  KTSSMVAVKIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTDDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHIN
        K +S  +    SY+G            ++++  ++  +             +++ NPLF            E   +VV+VMM +  T E  M EM+  IN
Subjt:  KTSSMVAVKIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTDDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHIN

Query:  NLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLRTP
         LMK +EE+D +I  LK Q++    +ESSQT VVK  DKGK++V+++ P Q SVSVASLS+QQLQDMI N IRAQY GP Q S +YSKPYTKRIDNLR P
Subjt:  NLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLRTP

Query:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFL-----NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELS
        +GYQP KFQQFDGKGNPKQHI HFVETCENA        ++L R+F+     N F  TRR VSM ELTNT QRKGE V++YINRWRA+ LDCKD+LTELS
Subjt:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFL-----NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELS

Query:  SVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH----
        +VEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+LSIA+R  +D L+   R +     +T       + ESM+V  T  KS SK K   +  +H    
Subjt:  SVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH----

Query:  ---LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKG
            TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+LI KLA+E KIELD+DEVAQ+N   +  
Subjt:  ---LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKG

Query:  NNK--------HQRKK-------DPKKLQPKRKRSKKFSQHQQ----------LVMLNKSFSKTFHKKKKENLA--TSYCIDV-------EEIDNSKKSE
         +          QRK        +P  ++ ++K     SQ+++          +  L +SF +   ++  E  A  T+  ++V       EE+DNS + +
Subjt:  NNK--------HQRKK-------DPKKLQPKRKRSKKFSQHQQ----------LVMLNKSFSKTFHKKKKENLA--TSYCIDV-------EEIDNSKKSE

Query:  QRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSR
        QRTSVFD IKP TTR SVFQR+SMA  +EENQC   T  + SAF+RLS+S SKK +PST  FDRLK+T+DQ +R+M  L+ K F E N D K+HS VPSR
Subjt:  QRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSR

Query:  MKRKLSVLITTEGSL--KPQVRCFTHP
        MKRKLSV I TEGSL  KP+   FT+P
Subjt:  MKRKLSVLITTEGSL--KPQVRCFTHP

KAA0033746.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.7e-15746.32Show/hide
Query:  KTSSMVAVKIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTDDK--FLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEH
        K +S  +    SY+G            ++++  ++  ++ +L+      K   +++ NPLF            E   +VV+VMM +  T E  M EM+  
Subjt:  KTSSMVAVKIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTDDK--FLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEH

Query:  INNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLR
        IN LMK  EE+D +I  LK Q++     ESSQT VVK  DKGK++VQ++ P Q SVSVASLS+QQLQDMI N IRAQY GP Q S +YSK YTKRIDNLR
Subjt:  INNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLR

Query:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELSSVE
         P+GYQPPKFQQFDGKGNPKQHIAHFVETCENA        ++L R+F+          + FE     + +GE V++YINRWRA+ LDCKD+LTELS+VE
Subjt:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELSSVE

Query:  MCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHH------
        MC QGMHWELLYIL+GIKPRTFEEL+TRAHDMELSIA+   +D L+   ++  +N+           + ESM+V  T  KS SK K   +  ++      
Subjt:  MCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHH------

Query:  -LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI----
          TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N   +    
Subjt:  -LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI----

Query:  ----------------------------------------------------------------------------KGNNKHQRK----KDPKKLQPKRK
                                                                                    KGN  H++K    K   K +P + 
Subjt:  ----------------------------------------------------------------------------KGNNKHQRK----KDPKKLQPKRK

Query:  RSKKFSQHQQLVMLNKSFSKTFHKKKKENLA------TSYCIDV-------EEIDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTST
        + K F Q ++ + L +   ++F +   E +       T+  ++V       EE+DNS + +QRT VF RIKP T R SVFQR+SMA  EEENQC  ST  
Subjt:  RSKKFSQHQQLVMLNKSFSKTFHKKKKENLA------TSYCIDV-------EEIDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTST

Query:  RPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKLSVLITTEGSL--KPQVRCFTHP
        R SAF+RLS+ST KK +PSTS FDRLK+ +DQ +R+M +L+ K F E N D K+HS VPSRMKRKLSV I TEGSL  KP+   FT+P
Subjt:  RPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKLSVLITTEGSL--KPQVRCFTHP

KAA0045440.1 retrotransposon gag protein [Cucumis melo var. makuwa]8.8e-15947.82Show/hide
Query:  KASIIASEETTLQGAYTDDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKG
        K  I+  E   +   Y+  K      P  E  P  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ IE+   AESS    VKN DKG
Subjt:  KASIIASEETTLQGAYTDDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKG

Query:  KSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEE
        K+++Q+  P+   S+ASLS+QQLQ+MIT+ I+ QY G  Q   LYSKPYTKRIDNLR P GYQPPKFQQ+  K      +    +  +   PES+D+WE+
Subjt:  KSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEE

Query:  LEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQD
        LER+FLNRFYSTRR +SM ELTNT+Q+KGELV++YINRWRA+ LDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTF+ELATRAHDMEL++A++  +D
Subjt:  LEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQD

Query:  LLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEME
         L+P +R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFP++D+ DMLEQLLE QLI+L +CKR E+  
Subjt:  LLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEME

Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGNNK----
        KVDDP YCKYHRVI HP+E+CFVLK+LILKLA+E KIELD+DEVAQ+N                                       + TI   NK    
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGNNK----

Query:  -----------HQRKKDPKKLQ-----------------PKRKRSKK-------------FSQHQQLVMLNKSFSKTF-HKKKKENLATSYC-----IDV
                   HQ+ + P  +Q                  K +R+KK             F Q ++ + L +   ++F     +E L  + C     ++V
Subjt:  -----------HQRKKDPKKLQ-----------------PKRKRSKK-------------FSQHQQLVMLNKSFSKTF-HKKKKENLATSYC-----IDV

Query:  -------EEIDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKL
               +E++NS +  QRTSVFDRIKP TTR SVFQR+SMA  EE+NQC     TR S F+RLS+STSKK++PSTS FDRLK+T+DQ +R+M +L+ K 
Subjt:  -------EEIDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKL

Query:  FDEVNRDKKLHSSVPSRMKRKLSVLITTEGSLKP
        F E N D K+HS +PSRMKRKL V I TEG + P
Subjt:  FDEVNRDKKLHSSVPSRMKRKLSVLITTEGSLKP

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.0e-18054.23Show/hide
Query:  KASIIASEETTLQGAYTDDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKG
        K  I+  E   +   Y+  K      P  E  P  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKG
Subjt:  KASIIASEETTLQGAYTDDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKG

Query:  KSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------
        K+++Q+  PQ S S+ASLS+QQLQ+MI + I+ QY GP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE A          
Subjt:  KSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------

Query:  ----------------EPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELSSVEMCIQGMHWELLYILKGI
                        EPES+D+WE+LER+FLNRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+ LDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  ----------------EPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDAD
        KPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D
Subjt:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDAD

Query:  IPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRS
        + DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A    +N            P + + 
Subjt:  IPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRS

Query:  KKFSQHQQLVMLNKSFSKTF-HKKKKENLATSYC-----IDVE-------EIDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRP
        + F Q ++ + L +   ++F     +E L  + C     ++V+       E++NS +  QRTSVFDRIKP TTR SVFQR+S+A  EEENQC     TR 
Subjt:  KKFSQHQQLVMLNKSFSKTF-HKKKKENLATSYC-----IDVE-------EIDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRP

Query:  SAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKLSVLITTEGSL--KPQVRCFTHP
        S  +RLS+ST KK +PSTS FDRLK+T+DQ +R+M + + K F E N D K+HS VPSRMKRKL V I TEGSL  KP+   FT+P
Subjt:  SAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKLSVLITTEGSL--KPQVRCFTHP

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]4.8e-17346.69Show/hide
Query:  TSSMVAVKIKSYMGS--TAHCCFNELRLQEDKASIIASEETTLQ-GAYTDDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMTEMQE
        TS     K  SY+G     H C  E++  ED      ++    Q         ++K NP  +             P  ++++VM+T   T E RM E+++
Subjt:  TSSMVAVKIKSYMGS--TAHCCFNELRLQEDKASIIASEETTLQ-GAYTDDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMTEMQE

Query:  HINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLR
         +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+ QY GP Q   LYSKPYTKRIDNLR
Subjt:  HINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLR

Query:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL
         P GYQPPKFQQFDGKGNPKQH+AHF+ETCE A                          EPES+D+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGEL
Subjt:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMRLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVN
        V++YINRWRA+ LDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  R +     +T       I+ESMVV+
Subjt:  VVNYINRWRAMRLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVN

Query:  TTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKL
         T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI HPVE+CFVLK+LILKL
Subjt:  TTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKL

Query:  AKEGKIELDLDEVAQSN-----------------------------------LATIKGNNKHQRKKD---------------PKKLQ-------------
        A+E KIEL++DEVAQ+N                                   + TI   NK    KD               P  +Q             
Subjt:  AKEGKIELDLDEVAQSN-----------------------------------LATIKGNNKHQRKKD---------------PKKLQ-------------

Query:  ----PKRKRSKK-------------FSQHQQLVMLNKSFSKTF---HKKKKENLATSYCIDVEEIDNSKKS----------EQRTSVFDRIKPPTTRPSV
             K +R+KK             F Q ++ + L +   ++F   H ++   + T +   + E++N+  S           QRTSVFDRIKP TTR SV
Subjt:  ----PKRKRSKK-------------FSQHQQLVMLNKSFSKTF---HKKKKENLATSYCIDVEEIDNSKKS----------EQRTSVFDRIKPPTTRPSV

Query:  FQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKLSVLITTEGSLKPQ
        FQR+SMA  EEENQC     TR S F+RLS+S SKK++PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+HS VPSRMKRKL V I T+   +  
Subjt:  FQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKLSVLITTEGSLKPQ

Query:  VRCFTHPKFVHSFPPSSKVLTALRHSSFLQV
        +  F+H   +     SS  +   R    L +
Subjt:  VRCFTHPKFVHSFPPSSKVLTALRHSSFLQV

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein1.9e-16751.31Show/hide
Query:  KTSSMVAVKIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTDDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHIN
        K +S  +    SY+G            ++++  ++  +             +++ NPLF            E   +VV+VMM +  T E  M EM+  IN
Subjt:  KTSSMVAVKIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTDDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHIN

Query:  NLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLRTP
         LMK +EE+D +I  LK Q++    +ESSQT VVK  DKGK++V+++ P Q SVSVASLS+QQLQDMI N IRAQY GP Q S +YSKPYTKRIDNLR P
Subjt:  NLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLRTP

Query:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFL-----NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELS
        +GYQP KFQQFDGKGNPKQHI HFVETCENA        ++L R+F+     N F  TRR VSM ELTNT QRKGE V++YINRWRA+ LDCKD+LTELS
Subjt:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFL-----NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELS

Query:  SVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH----
        +VEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+LSIA+R  +D L+   R +     +T       + ESM+V  T  KS SK K   +  +H    
Subjt:  SVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH----

Query:  ---LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKG
            TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+LI KLA+E KIELD+DEVAQ+N   +  
Subjt:  ---LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKG

Query:  NNK--------HQRKK-------DPKKLQPKRKRSKKFSQHQQ----------LVMLNKSFSKTFHKKKKENLA--TSYCIDV-------EEIDNSKKSE
         +          QRK        +P  ++ ++K     SQ+++          +  L +SF +   ++  E  A  T+  ++V       EE+DNS + +
Subjt:  NNK--------HQRKK-------DPKKLQPKRKRSKKFSQHQQ----------LVMLNKSFSKTFHKKKKENLA--TSYCIDV-------EEIDNSKKSE

Query:  QRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSR
        QRTSVFD IKP TTR SVFQR+SMA  +EENQC   T  + SAF+RLS+S SKK +PST  FDRLK+T+DQ +R+M  L+ K F E N D K+HS VPSR
Subjt:  QRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSR

Query:  MKRKLSVLITTEGSL--KPQVRCFTHP
        MKRKLSV I TEGSL  KP+   FT+P
Subjt:  MKRKLSVLITTEGSL--KPQVRCFTHP

A0A5A7SUW1 Retrotransposon gag protein8.0e-15846.32Show/hide
Query:  KTSSMVAVKIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTDDK--FLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEH
        K +S  +    SY+G            ++++  ++  ++ +L+      K   +++ NPLF            E   +VV+VMM +  T E  M EM+  
Subjt:  KTSSMVAVKIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTDDK--FLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEH

Query:  INNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLR
        IN LMK  EE+D +I  LK Q++     ESSQT VVK  DKGK++VQ++ P Q SVSVASLS+QQLQDMI N IRAQY GP Q S +YSK YTKRIDNLR
Subjt:  INNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLR

Query:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELSSVE
         P+GYQPPKFQQFDGKGNPKQHIAHFVETCENA        ++L R+F+          + FE     + +GE V++YINRWRA+ LDCKD+LTELS+VE
Subjt:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELSSVE

Query:  MCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHH------
        MC QGMHWELLYIL+GIKPRTFEEL+TRAHDMELSIA+   +D L+   ++  +N+           + ESM+V  T  KS SK K   +  ++      
Subjt:  MCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHH------

Query:  -LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI----
          TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N   +    
Subjt:  -LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI----

Query:  ----------------------------------------------------------------------------KGNNKHQRK----KDPKKLQPKRK
                                                                                    KGN  H++K    K   K +P + 
Subjt:  ----------------------------------------------------------------------------KGNNKHQRK----KDPKKLQPKRK

Query:  RSKKFSQHQQLVMLNKSFSKTFHKKKKENLA------TSYCIDV-------EEIDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTST
        + K F Q ++ + L +   ++F +   E +       T+  ++V       EE+DNS + +QRT VF RIKP T R SVFQR+SMA  EEENQC  ST  
Subjt:  RSKKFSQHQQLVMLNKSFSKTFHKKKKENLA------TSYCIDV-------EEIDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTST

Query:  RPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKLSVLITTEGSL--KPQVRCFTHP
        R SAF+RLS+ST KK +PSTS FDRLK+ +DQ +R+M +L+ K F E N D K+HS VPSRMKRKLSV I TEGSL  KP+   FT+P
Subjt:  RPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKLSVLITTEGSL--KPQVRCFTHP

A0A5A7TQK6 Retrotransposon gag protein4.3e-15947.82Show/hide
Query:  KASIIASEETTLQGAYTDDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKG
        K  I+  E   +   Y+  K      P  E  P  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ IE+   AESS    VKN DKG
Subjt:  KASIIASEETTLQGAYTDDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKG

Query:  KSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEE
        K+++Q+  P+   S+ASLS+QQLQ+MIT+ I+ QY G  Q   LYSKPYTKRIDNLR P GYQPPKFQQ+  K      +    +  +   PES+D+WE+
Subjt:  KSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEE

Query:  LEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQD
        LER+FLNRFYSTRR +SM ELTNT+Q+KGELV++YINRWRA+ LDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTF+ELATRAHDMEL++A++  +D
Subjt:  LEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQD

Query:  LLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEME
         L+P +R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFP++D+ DMLEQLLE QLI+L +CKR E+  
Subjt:  LLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEME

Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGNNK----
        KVDDP YCKYHRVI HP+E+CFVLK+LILKLA+E KIELD+DEVAQ+N                                       + TI   NK    
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGNNK----

Query:  -----------HQRKKDPKKLQ-----------------PKRKRSKK-------------FSQHQQLVMLNKSFSKTF-HKKKKENLATSYC-----IDV
                   HQ+ + P  +Q                  K +R+KK             F Q ++ + L +   ++F     +E L  + C     ++V
Subjt:  -----------HQRKKDPKKLQ-----------------PKRKRSKK-------------FSQHQQLVMLNKSFSKTF-HKKKKENLATSYC-----IDV

Query:  -------EEIDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKL
               +E++NS +  QRTSVFDRIKP TTR SVFQR+SMA  EE+NQC     TR S F+RLS+STSKK++PSTS FDRLK+T+DQ +R+M +L+ K 
Subjt:  -------EEIDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKL

Query:  FDEVNRDKKLHSSVPSRMKRKLSVLITTEGSLKP
        F E N D K+HS +PSRMKRKL V I TEG + P
Subjt:  FDEVNRDKKLHSSVPSRMKRKLSVLITTEGSLKP

A0A5A7URH1 Ty3-gypsy retrotransposon protein4.4e-18054.23Show/hide
Query:  KASIIASEETTLQGAYTDDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKG
        K  I+  E   +   Y+  K      P  E  P  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKG
Subjt:  KASIIASEETTLQGAYTDDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKG

Query:  KSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------
        K+++Q+  PQ S S+ASLS+QQLQ+MI + I+ QY GP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE A          
Subjt:  KSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------

Query:  ----------------EPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELSSVEMCIQGMHWELLYILKGI
                        EPES+D+WE+LER+FLNRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+ LDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  ----------------EPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDAD
        KPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D
Subjt:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDAD

Query:  IPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRS
        + DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A    +N            P + + 
Subjt:  IPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRS

Query:  KKFSQHQQLVMLNKSFSKTF-HKKKKENLATSYC-----IDVE-------EIDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRP
        + F Q ++ + L +   ++F     +E L  + C     ++V+       E++NS +  QRTSVFDRIKP TTR SVFQR+S+A  EEENQC     TR 
Subjt:  KKFSQHQQLVMLNKSFSKTF-HKKKKENLATSYC-----IDVE-------EIDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRP

Query:  SAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKLSVLITTEGSL--KPQVRCFTHP
        S  +RLS+ST KK +PSTS FDRLK+T+DQ +R+M + + K F E N D K+HS VPSRMKRKL V I TEGSL  KP+   FT+P
Subjt:  SAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKLSVLITTEGSL--KPQVRCFTHP

A0A5D3BX77 Retrotransposon gag protein2.3e-17346.69Show/hide
Query:  TSSMVAVKIKSYMGS--TAHCCFNELRLQEDKASIIASEETTLQ-GAYTDDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMTEMQE
        TS     K  SY+G     H C  E++  ED      ++    Q         ++K NP  +             P  ++++VM+T   T E RM E+++
Subjt:  TSSMVAVKIKSYMGS--TAHCCFNELRLQEDKASIIASEETTLQ-GAYTDDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMTEMQE

Query:  HINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLR
         +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+ QY GP Q   LYSKPYTKRIDNLR
Subjt:  HINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLR

Query:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL
         P GYQPPKFQQFDGKGNPKQH+AHF+ETCE A                          EPES+D+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGEL
Subjt:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMRLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVN
        V++YINRWRA+ LDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  R +     +T       I+ESMVV+
Subjt:  VVNYINRWRAMRLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVN

Query:  TTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKL
         T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI HPVE+CFVLK+LILKL
Subjt:  TTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKL

Query:  AKEGKIELDLDEVAQSN-----------------------------------LATIKGNNKHQRKKD---------------PKKLQ-------------
        A+E KIEL++DEVAQ+N                                   + TI   NK    KD               P  +Q             
Subjt:  AKEGKIELDLDEVAQSN-----------------------------------LATIKGNNKHQRKKD---------------PKKLQ-------------

Query:  ----PKRKRSKK-------------FSQHQQLVMLNKSFSKTF---HKKKKENLATSYCIDVEEIDNSKKS----------EQRTSVFDRIKPPTTRPSV
             K +R+KK             F Q ++ + L +   ++F   H ++   + T +   + E++N+  S           QRTSVFDRIKP TTR SV
Subjt:  ----PKRKRSKK-------------FSQHQQLVMLNKSFSKTF---HKKKKENLATSYCIDVEEIDNSKKS----------EQRTSVFDRIKPPTTRPSV

Query:  FQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKLSVLITTEGSLKPQ
        FQR+SMA  EEENQC     TR S F+RLS+S SKK++PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+HS VPSRMKRKL V I T+   +  
Subjt:  FQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKLSVLITTEGSLKPQ

Query:  VRCFTHPKFVHSFPPSSKVLTALRHSSFLQV
        +  F+H   +     SS  +   R    L +
Subjt:  VRCFTHPKFVHSFPPSSKVLTALRHSSFLQV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTCAAGACTTCTTCGATGGTCGCTGTCAAGATCAAGTCTTACATGGGTTCAACTGCCCATTGTTGCTTCAATGAACTGAGGTTGCAAGAAGATAAAGCTTCTAT
CATTGCAAGCGAAGAAACAACCTTGCAGGGGGCATATACTGATGACAAGTTTCTTGTGAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGA
CTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAAGAACACATCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGTCTATAGTGCAAGATGATCATCCACAATGTTCTGTTTCGGTCGC
TTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGATGGACCTACTCAAGATTCCCTCTTGTACTCCAAACCTTATACTAAGAGGA
TTGATAATTTGAGAACGCCAATCGGGTATCAGCCACCGAAATTTCAACAATTTGATGGAAAGGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAAC
GCTGAACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACA
ACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGGCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCA
TGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAA
GACCTTCTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCG
ACAAACTAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAAC
TGATAAAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTC
CTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATCGAGCTCGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACAATTAAAGGAAATAACAAACATCAAAG
AAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACATCAACAACTGGTGATGTTGAATAAATCCTTCTCCAAAACTTTCCACAAAA
AGAAAAAAGAGAACCTTGCAACTTCTTACTGCATCGACGTAGAAGAAATTGACAATTCCAAGAAAAGTGAACAAAGGACTTCTGTCTTCGATCGCATCAAGCCTCCAACT
ACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTC
CACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAAATGGACAACTTGGAGGTGAAACTTTTCGATGAAG
TAAACCGCGACAAGAAGCTTCATAGTAGCGTCCCGTCACGTATGAAGAGGAAATTATCTGTTCTCATAACTACAGAAGGTTCCTTGAAGCCTCAAGTTCGGTGTTTCACT
CACCCTAAGTTCGTGCATTCCTTCCCCCCAAGTTCGAAGGTTCTCACGGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCT
TCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCA
TCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCCCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCCATTCCTCCAA
GTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCCTCCAAGTTCGAAGGGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCA
TTGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGT
TCGAAGGTTCTCACGCGCATTGCCACAGTTCCTTCCTCACGCGCATTGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCCACAGTTCCTTCCTCCAAGT
TCGAAGGTTCTCGCGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCGCGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCCCATCGCC
ACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCCATTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCCTCCAAGTTC
GAAGGGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGACCCAAGACCAATAAAGCTCAAGCCCAAGTTGTTAGGCCCAAAAGTCACCAGGGCCTATCC
AGCGAGAACTCTATAAATAGAGGGGTTCTCCATCATTTCAAGGGTTCAGAAATTCTACACTCTCACAAAGACAAGAGTTCAGAGTTTTCAAAGCTCTCAAGCAGAACCAG
AGAATTCAGAGAGACTCCACAAAGTCTGAAGACCGAAGACTCTCTGCAATCCACAAGTCCAAGTGTTGAACACTTCTTGAAGACCAAACACTCTTCAAGACTTCAACACT
CCTTGAAGATCAAAGACTCTTCAGGACATCGACACTTCTTGAAGACTGAAGACTCCTTCAAGACTAGAAGATCAACAAGCCGATCATCCAAGAAGATCAACAAGCCAACA
GGCCGATCATCCAAGAAGATCAACAAGTCACAACAGGCCGATCATCCAAGAAGATCATCAAGCCAACAGGCCGATCCAAGATATCATCAACCTAGCAAGCCGATCATCCA
AGAAGCTCAACAAACCAGCCCAAGAAGATCAAGAAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCATTCAAGACTTCTTCGATGGTCGCTGTCAAGATCAAGTCTTACATGGGTTCAACTGCCCATTGTTGCTTCAATGAACTGAGGTTGCAAGAAGATAAAGCTTCTAT
CATTGCAAGCGAAGAAACAACCTTGCAGGGGGCATATACTGATGACAAGTTTCTTGTGAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGA
CTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAAGAACACATCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGTCTATAGTGCAAGATGATCATCCACAATGTTCTGTTTCGGTCGC
TTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGATGGACCTACTCAAGATTCCCTCTTGTACTCCAAACCTTATACTAAGAGGA
TTGATAATTTGAGAACGCCAATCGGGTATCAGCCACCGAAATTTCAACAATTTGATGGAAAGGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAAC
GCTGAACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACA
ACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGGCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCA
TGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAA
GACCTTCTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCG
ACAAACTAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAAC
TGATAAAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTC
CTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATCGAGCTCGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACAATTAAAGGAAATAACAAACATCAAAG
AAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACATCAACAACTGGTGATGTTGAATAAATCCTTCTCCAAAACTTTCCACAAAA
AGAAAAAAGAGAACCTTGCAACTTCTTACTGCATCGACGTAGAAGAAATTGACAATTCCAAGAAAAGTGAACAAAGGACTTCTGTCTTCGATCGCATCAAGCCTCCAACT
ACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTC
CACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAAATGGACAACTTGGAGGTGAAACTTTTCGATGAAG
TAAACCGCGACAAGAAGCTTCATAGTAGCGTCCCGTCACGTATGAAGAGGAAATTATCTGTTCTCATAACTACAGAAGGTTCCTTGAAGCCTCAAGTTCGGTGTTTCACT
CACCCTAAGTTCGTGCATTCCTTCCCCCCAAGTTCGAAGGTTCTCACGGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCT
TCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCA
TCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCCCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCCATTCCTCCAA
GTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCCTCCAAGTTCGAAGGGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCA
TTGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGT
TCGAAGGTTCTCACGCGCATTGCCACAGTTCCTTCCTCACGCGCATTGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCCACAGTTCCTTCCTCCAAGT
TCGAAGGTTCTCGCGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCGCGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCCCATCGCC
ACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCCATTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCCTCCAAGTTC
GAAGGGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGACCCAAGACCAATAAAGCTCAAGCCCAAGTTGTTAGGCCCAAAAGTCACCAGGGCCTATCC
AGCGAGAACTCTATAAATAGAGGGGTTCTCCATCATTTCAAGGGTTCAGAAATTCTACACTCTCACAAAGACAAGAGTTCAGAGTTTTCAAAGCTCTCAAGCAGAACCAG
AGAATTCAGAGAGACTCCACAAAGTCTGAAGACCGAAGACTCTCTGCAATCCACAAGTCCAAGTGTTGAACACTTCTTGAAGACCAAACACTCTTCAAGACTTCAACACT
CCTTGAAGATCAAAGACTCTTCAGGACATCGACACTTCTTGAAGACTGAAGACTCCTTCAAGACTAGAAGATCAACAAGCCGATCATCCAAGAAGATCAACAAGCCAACA
GGCCGATCATCCAAGAAGATCAACAAGTCACAACAGGCCGATCATCCAAGAAGATCATCAAGCCAACAGGCCGATCCAAGATATCATCAACCTAGCAAGCCGATCATCCA
AGAAGCTCAACAAACCAGCCCAAGAAGATCAAGAAGCTAG
Protein sequenceShow/hide protein sequence
MSFKTSSMVAVKIKSYMGSTAHCCFNELRLQEDKASIIASEETTLQGAYTDDKFLVKYNPLFEPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQ
IENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYDGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCEN
AEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMRLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQ
DLLLPNMRKEGRNDEETIEESMVVNTTLPKSSSKEKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
LKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKFSQHQQLVMLNKSFSKTFHKKKKENLATSYCIDVEEIDNSKKSEQRTSVFDRIKPPT
TRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVPSRMKRKLSVLITTEGSLKPQVRCFT
HPKFVHSFPPSSKVLTALRHSSFLQVRRFTRASPQFLPPSSKVHTRIATVPSSKFEGSHAHRLLPPSSKVLTCIATVPSSKFEGSHAHRHSSFLQVRRFSRASPQFPFLQ
VRRFSRASPQFLPSKFEGVLTRIATVPSSKFEGSHAHCHSSFLQVRRFTRASPQFLPPSSKVHTRIATVPSSKFEGSHAHCHSSFLTRIATVPSSKFEGSHAHPQFLPPS
SKVLARIATVPSSKFEGSRAHRHSSFLQVRRFSRPSPQFLPPSSKVLTRIATVPIPPSSKVLTRIATVPSLQVRRGSHAHRHSSFLQVRRPKTNKAQAQVVRPKSHQGLS
SENSINRGVLHHFKGSEILHSHKDKSSEFSKLSSRTREFRETPQSLKTEDSLQSTSPSVEHFLKTKHSSRLQHSLKIKDSSGHRHFLKTEDSFKTRRSTSRSSKKINKPT
GRSSKKINKSQQADHPRRSSSQQADPRYHQPSKPIIQEAQQTSPRRSRS