; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G26990 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G26990
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationChr3:24404782..24409491
RNA-Seq ExpressionCSPI03G26990
SyntenyCSPI03G26990
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN78803.1 hypothetical protein VITISV_032700 [Vitis vinifera]8.9e-7930.48Show/hide
Query:  PESLAKTCI--FSNHFPILLEAGAIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVG-------------------FILHEEQRKMK--------
        P+ L +  I   S+H+PI+++    +WGP+PF+F N WL   +      +  S F  + W G                   F   E + K K        
Subjt:  PESLAKTCI--FSNHFPILLEAGAIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVG-------------------FILHEEQRKMK--------

Query:  ------EAGLNEDECVFRSALQAELLNIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKR-KNLISELKDTNGMLHKKS------------------
              E GLN D    R++ + EL  +   EE +  QK+K+ W+  G  NS F+H+  + +R +  I EL++  G++ K +                  
Subjt:  ------EAGLNEDECVFRSALQAELLNIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKR-KNLISELKDTNGMLHKKS------------------

Query:  --SIEEIRLIVKALGRNKAPGLD---------------------------------------------EELALEMKDFRPISLTTLSYKVVAKVLAERLK
          + EEI   +  L R+KAPG D                                             + L+  + DFRPISL T  YK++AKVL+ RL+
Subjt:  --SIEEIRLIVKALGRNKAPGLD---------------------------------------------EELALEMKDFRPISLTTLSYKVVAKVLAERLK

Query:  SVMDSIISPYQSAFIEGRQILDLILIASEVVEDYRAKKKKGWILKLELEKASD---------------------QWIGTSL------------RRGRIIA
         V+   I   Q AF++GRQILD +LIA+E+V++ R   ++G + K++ EKA D                     +W+   L             +G + A
Subjt:  SVMDSIISPYQSAFIEGRQILDLILIASEVVEDYRAKKKKGWILKLELEKASD---------------------QWIGTSL------------RRGRIIA

Query:  SRGIRQGDPLSPFLFLL-------------------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEWCSRQKINWEKSALSGVNINDD
        SRG+RQGDPLSPFLF L                         +   +   QFADDT+ F    ++ L  LK  + +F   S  K+N  KS++ G+N++  
Subjt:  SRGIRQGDPLSPFLFLL-------------------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEWCSRQKINWEKSALSGVNINDD

Query:  DLFQTAARLGCKAEKLPILYLGLPLGGYPRRREFWQPFMWQLHW----KKIMRNFFWEGFAGGKINHLVKWRLISLPLKNVGLGLEGIKTQNSVLLAKWG
         L + A  L  KA   PILYLGLPLGG P+   FW P + ++      +++ R+F W GF  GK +HLV+W ++  P    GLGL  I  +N  LL KW 
Subjt:  DLFQTAARLGCKAEKLPILYLGLPLGGYPRRREFWQPFMWQLHW----KKIMRNFFWEGFAGGKINHLVKWRLISLPLKNVGLGLEGIKTQNSVLLAKWG

Query:  WRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRIGFWTDFWVGPSTLKKHFPSLFRIAL
        WRY +E SALW + I SI+G  +  W        S R PW  I++++     +    +G G RI FW D W G   L   +P LFR+ +
Subjt:  WRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRIGFWTDFWVGPSTLKKHFPSLFRIAL

KAA0056660.1 GTP-binding protein [Cucumis melo var. makuwa]2.6e-8639.5Show/hide
Query:  KTCIFSNHFPILLEAGAIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVGFILHEEQRKMKEAGLNEDECVFRSALQAELLNIYHLEERNLMQKS
        K  IFS+HFP+LLEAGAI+WGPS F+FCNSWLL  ECN  IEE + +     W GFILHE+  K+K A  N     + +A + +L     +EERN +QKS
Subjt:  KTCIFSNHFPILLEAGAIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVGFILHEEQRKMKEAGLNEDECVFRSALQAELLNIYHLEERNLMQKS

Query:  KLNWLSLGDENSGFFHRFLHAKRKNLISELKDTNGMLHKKSSIEEIRLIVKALGRNKAPGLDEELALEMKDFRPISLTTLSYKVVAKVLAERLKSVMDSI
        KLNWL+                                                        +E A+ ++DFRPISLTT++YKVVA VLA+RLK VM S 
Subjt:  KLNWLSLGDENSGFFHRFLHAKRKNLISELKDTNGMLHKKSSIEEIRLIVKALGRNKAPGLDEELALEMKDFRPISLTTLSYKVVAKVLAERLKSVMDSI

Query:  ISPYQSAFIEG-RQILDLI---LIASEVVEDYRAKKKKGWILKLELEKASDQWIGTSLRRGRIIASRGIRQGDPLSPFLFLLDSIHLPLCQFADDTLLFC
        I+P+QSAFIE   +  D +   ++   +     + K   WI++   +     +I  S  RGRI+ASR I+Q           D IH+ + Q+ADDT+L C
Subjt:  ISPYQSAFIEG-RQILDLI---LIASEVVEDYRAKKKKGWILKLELEKASDQWIGTSLRRGRIIASRGIRQGDPLSPFLFLLDSIHLPLCQFADDTLLFC

Query:  KYDDQMLLKLKDAIRLFEWCSRQKINWEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGYPRRREFWQPFMWQLHWKKIMRNFFWEGFAGGK
        K D+ ML KLK+AI+ FEWCS QK                                                                     EG +G K
Subjt:  KYDDQMLLKLKDAIRLFEWCSRQKINWEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGYPRRREFWQPFMWQLHWKKIMRNFFWEGFAGGK

Query:  INHLVKWRLISLPLKNVGLGLEGIKTQNSVLLAKWGWRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRI
        INHL  W+ +     + GLGL G+K +N  LLAKWGW + KEDSALWR+ I SIHG   F+W TL KSGNSLRS W+NIS     VE LA  K+G G+R+
Subjt:  INHLVKWRLISLPLKNVGLGLEGIKTQNSVLLAKWGWRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRI

Query:  GFWTDFWVGPSTLKKHFPSLFRIA
         FWT+ WVG   LK  FPSLFRIA
Subjt:  GFWTDFWVGPSTLKKHFPSLFRIA

RVW13148.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]4.0e-7929.21Show/hide
Query:  GASGGILTMWDSSITSVIEVIKGRFSLSIKCLSLSNQIFWFTNVYG----------WITNVYGPCGLQIGMTFLKTPESLAKTCIFSNHFPILLEAGAII
        GASGGIL +WDS      EV+ G FS+S+K         W + VYG          W+  ++   GLQ         E+L +    S+H+PI+++    +
Subjt:  GASGGILTMWDSSITSVIEVIKGRFSLSIKCLSLSNQIFWFTNVYG----------WITNVYGPCGLQIGMTFLKTPESLAKTCIFSNHFPILLEAGAII

Query:  WGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVG-------------------FILHEEQRKMK--------------EAGLNEDECVFRSALQAELL
        WGP+PF+F N WL          +  S F  + W G                   F   E + K K              E GLN D    R++ + EL 
Subjt:  WGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVG-------------------FILHEEQRKMK--------------EAGLNEDECVFRSALQAELL

Query:  NIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKR-KNLISELKDTNGMLHKKS--------------------------------------------
         +   EE +  QK+K+ W+  GD NS F+H+  + +R +  I EL++  G++ K +                                            
Subjt:  NIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKR-KNLISELKDTNGMLHKKS--------------------------------------------

Query:  -SIEEIRLIVKALGRNKAPGLD---------------------------------------------EELALEMKDFRPISLTTLSYKVVAKVLAERLKS
         + EEI   +  L R+KAPG D                                             + L+  + DFRPISL T  YK++AKVL+ RL+ 
Subjt:  -SIEEIRLIVKALGRNKAPGLD---------------------------------------------EELALEMKDFRPISLTTLSYKVVAKVLAERLKS

Query:  VMDSIISPYQSAFIEGRQILDLILIASEVVEDYRAKKKKGWILKLE------LEKASDQWIGTSLRRGRIIASRGIRQGDPLSPFLFLL-----------
        V+   I   Q AF++GRQILD +LIA+E+V+  R ++KKG+  +        L   S   +     +G + ASRG+RQGDPLSPFLF L           
Subjt:  VMDSIISPYQSAFIEGRQILDLILIASEVVEDYRAKKKKGWILKLE------LEKASDQWIGTSLRRGRIIASRGIRQGDPLSPFLFLL-----------

Query:  --------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEWCSRQKINWEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGY
                      +   +   QFADDT+ F    ++ L  LK  + +F   S  K+N  KS++ G+N++   L + A  L CKA   PILYLGLPLGG 
Subjt:  --------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEWCSRQKINWEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGY

Query:  PRRREFWQPFM---------WQLHW-----------------------------------KKIMRNFFWEGFAGGKINHLVKWRLISLPLKNVGLGLEGI
        P+   FW P +         WQ  +                                   +++ R+F W G   GK +HLV+W ++  P    GLGL  I
Subjt:  PRRREFWQPFM---------WQLHW-----------------------------------KKIMRNFFWEGFAGGKINHLVKWRLISLPLKNVGLGLEGI

Query:  KTQNSVLLAKWGWRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRIGFWTDFWVGPSTLKKHFPSLFRIA
          +N  LL KW WRY +E SALW + I SI+G  +  W        S R PW  I++++     +    +G G RI FW D W G   L   +P LFR+ 
Subjt:  KTQNSVLLAKWGWRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRIGFWTDFWVGPSTLKKHFPSLFRIA

Query:  L
        +
Subjt:  L

RVW89552.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.1e-7930.32Show/hide
Query:  GASGGILTMWDSSITSVIEVIKGRFSLSIKCLSLSNQIFWFTNVYG----------WI--TNVYGPCGLQIG-MTFLKTPESLAKTCIFSNHFPILLEAG
        GASGG L +WDS      EV+ G FS+SIK      +  W + VYG          W+  +++ G    + G   FLK  +   +  +      + LE  
Subjt:  GASGGILTMWDSSITSVIEVIKGRFSLSIKCLSLSNQIFWFTNVYG----------WI--TNVYGPCGLQIG-MTFLKTPESLAKTCIFSNHFPILLEAG

Query:  AIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVGFILHEEQRKM------------------------------------KEAGLNEDECVFRSA
           WGP+PF+F N WL  +          S F  + W G   H+  RK+                                    +E GL+ +  V R+ 
Subjt:  AIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVGFILHEEQRKM------------------------------------KEAGLNEDECVFRSA

Query:  LQAELLNIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKRKNLISELKDTNGMLHKKSSIEEIRLIVKALGRNKAPGLDEELALEMKDFRPISLTTL
         + EL  +   EE +  QK+++ W+  GD NS FF R+       + +E    +G++++ ++   I L+ K           + ++  + DFRPISL T 
Subjt:  LQAELLNIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKRKNLISELKDTNGMLHKKSSIEEIRLIVKALGRNKAPGLDEELALEMKDFRPISLTTL

Query:  SYKVVAKVLAERLKSVMDSIISPYQSAFIEGRQILDLILIASEVVEDYRAKKKKGWILKLELEKASD-----------QWIGTSLR--------------
         YK++AKVLA RL+ V+   I   Q AF++GRQILD +LIA+E+V++ R   ++G + K++ EKA D           +  G SLR              
Subjt:  SYKVVAKVLAERLKSVMDSIISPYQSAFIEGRQILDLILIASEVVEDYRAKKKKGWILKLELEKASD-----------QWIGTSLR--------------

Query:  --------RGRIIASRGIRQGDPLSPFLFLL-------------------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEWCSRQKIN
                +G + ASRG+RQGDPLSPFLF +                         +   +   QFADDT+ F    ++ L+ LK  + +F   S  K+N
Subjt:  --------RGRIIASRGIRQGDPLSPFLFLL-------------------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEWCSRQKIN

Query:  WEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGYPRRREFWQPFMWQLHWK----------------------KIMRNFFWEGFAGGKINHL
         +KS + G+N+  + L + A  L CKA   PILYLGLPLGG P+   FW P + ++  +                      ++ R F W G   GK +HL
Subjt:  WEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGYPRRREFWQPFMWQLHWK----------------------KIMRNFFWEGFAGGKINHL

Query:  VKWRLISLPLKNVGLGLEGIKTQNSVLLAKWGWRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRIGFWT
        V W ++  P    GLG   I  +N  LL KW WRY +E SALW + I SI+G  +  W        S R PW  I+ ++          +G G RI FW 
Subjt:  VKWRLISLPLKNVGLGLEGIKTQNSVLLAKWGWRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRIGFWT

Query:  DFWVGPSTLKKHFPSLFRI
        D W G   L   +P L  +
Subjt:  DFWVGPSTLKKHFPSLFRI

TYK31266.1 hypothetical protein E5676_scaffold455G005560 [Cucumis melo var. makuwa]6.2e-8038.3Show/hide
Query:  PSISANNPDLLEQTHALLSSISNTQNSPNIKSKVNLLRGSPCHAAIDKPTSKTMDSP--------FSISSEESLGFPNGFNFKSNEEIEGADLSSLFDEA
        PSIS N  + L +  A +  ++       +   +NL R SP    I +      DS           +SS+ES+   N  + K + EIEG DL++LF++ 
Subjt:  PSISANNPDLLEQTHALLSSISNTQNSPNIKSKVNLLRGSPCHAAIDKPTSKTMDSP--------FSISSEESLGFPNGFNFKSNEEIEGADLSSLFDEA

Query:  SVVPIKPNEDNLLENWGLNDITK--RAALK-KFIKIIIWILDINREFVESLGASGGILTMWDSSITSVIEVIKGRFSLSIKCLSLSNQ----IFWFTNVY
             K   D   + +G +  ++     L+  FIK +    DI  +FV S+G+ GGILTMWDSS  SV EVIK RFSLSIKCLSL  +    I+W    +
Subjt:  SVVPIKPNEDNLLENWGLNDITK--RAALK-KFIKIIIWILDINREFVESLGASGGILTMWDSSITSVIEVIKGRFSLSIKCLSLSNQ----IFWFTNVY

Query:  GWITNVYGPCGLQI-GMT-FLKTPESLAKTCIFSNHFPILLEAGAIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVGFILHEEQRKMKEAGLNE
                P G    GM+ F K  +S+           I L+ G   W     +   + +   E  LL+E               L +     +  GLNE
Subjt:  GWITNVYGPCGLQI-GMT-FLKTPESLAKTCIFSNHFPILLEAGAIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVGFILHEEQRKMKEAGLNE

Query:  DECVFRSALQAELLNIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKRKNLISELKDTNGMLHKKSSIEEIRLIVKALGRNKAPGLDEELALEMKDF
        +E   R+ALQAELL IY  EE N MQK                                           +EEI   +KALG N APG D   A      
Subjt:  DECVFRSALQAELLNIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKRKNLISELKDTNGMLHKKSSIEEIRLIVKALGRNKAPGLDEELALEMKDF

Query:  RPISLTTLSYKVVAKVLAERLKSVMDSIISPYQSAF--IEGRQILDLILIASEVVEDYRAKKKKGWILKLELEKASDQ---------------------W
                  + + K  AE+ +        P QS +  I+GRQILD ILIA+E VEDYR K KK WILKL+LEKA D+                     W
Subjt:  RPISLTTLSYKVVAKVLAERLKSVMDSIISPYQSAF--IEGRQILDLILIASEVVEDYRAKKKKGWILKLELEKASDQ---------------------W

Query:  IGTSLR------------RGRIIASRGIRQGDPLSPFLFLL-------------------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRL
        I   L+            RGRI ASRGIRQGDP SPFLFLL                         D IHLPL QF++DTLLFCKYD QM LKLKDAIRL
Subjt:  IGTSLR------------RGRIIASRGIRQGDPLSPFLFLL-------------------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRL

Query:  FEWCSRQKINWEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGYPRRREF
        FEWCS QK+NWEKSA+SGVN+  DDL QTA  LGCK EKLPI+YLGLPLGGYP ++ F
Subjt:  FEWCSRQKINWEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGYPRRREF

TrEMBL top hitse value%identityAlignment
A0A438BQB2 Transposon TX1 uncharacterized 149 kDa protein1.9e-7929.21Show/hide
Query:  GASGGILTMWDSSITSVIEVIKGRFSLSIKCLSLSNQIFWFTNVYG----------WITNVYGPCGLQIGMTFLKTPESLAKTCIFSNHFPILLEAGAII
        GASGGIL +WDS      EV+ G FS+S+K         W + VYG          W+  ++   GLQ         E+L +    S+H+PI+++    +
Subjt:  GASGGILTMWDSSITSVIEVIKGRFSLSIKCLSLSNQIFWFTNVYG----------WITNVYGPCGLQIGMTFLKTPESLAKTCIFSNHFPILLEAGAII

Query:  WGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVG-------------------FILHEEQRKMK--------------EAGLNEDECVFRSALQAELL
        WGP+PF+F N WL          +  S F  + W G                   F   E + K K              E GLN D    R++ + EL 
Subjt:  WGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVG-------------------FILHEEQRKMK--------------EAGLNEDECVFRSALQAELL

Query:  NIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKR-KNLISELKDTNGMLHKKS--------------------------------------------
         +   EE +  QK+K+ W+  GD NS F+H+  + +R +  I EL++  G++ K +                                            
Subjt:  NIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKR-KNLISELKDTNGMLHKKS--------------------------------------------

Query:  -SIEEIRLIVKALGRNKAPGLD---------------------------------------------EELALEMKDFRPISLTTLSYKVVAKVLAERLKS
         + EEI   +  L R+KAPG D                                             + L+  + DFRPISL T  YK++AKVL+ RL+ 
Subjt:  -SIEEIRLIVKALGRNKAPGLD---------------------------------------------EELALEMKDFRPISLTTLSYKVVAKVLAERLKS

Query:  VMDSIISPYQSAFIEGRQILDLILIASEVVEDYRAKKKKGWILKLE------LEKASDQWIGTSLRRGRIIASRGIRQGDPLSPFLFLL-----------
        V+   I   Q AF++GRQILD +LIA+E+V+  R ++KKG+  +        L   S   +     +G + ASRG+RQGDPLSPFLF L           
Subjt:  VMDSIISPYQSAFIEGRQILDLILIASEVVEDYRAKKKKGWILKLE------LEKASDQWIGTSLRRGRIIASRGIRQGDPLSPFLFLL-----------

Query:  --------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEWCSRQKINWEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGY
                      +   +   QFADDT+ F    ++ L  LK  + +F   S  K+N  KS++ G+N++   L + A  L CKA   PILYLGLPLGG 
Subjt:  --------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEWCSRQKINWEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGY

Query:  PRRREFWQPFM---------WQLHW-----------------------------------KKIMRNFFWEGFAGGKINHLVKWRLISLPLKNVGLGLEGI
        P+   FW P +         WQ  +                                   +++ R+F W G   GK +HLV+W ++  P    GLGL  I
Subjt:  PRRREFWQPFM---------WQLHW-----------------------------------KKIMRNFFWEGFAGGKINHLVKWRLISLPLKNVGLGLEGI

Query:  KTQNSVLLAKWGWRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRIGFWTDFWVGPSTLKKHFPSLFRIA
          +N  LL KW WRY +E SALW + I SI+G  +  W        S R PW  I++++     +    +G G RI FW D W G   L   +P LFR+ 
Subjt:  KTQNSVLLAKWGWRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRIGFWTDFWVGPSTLKKHFPSLFRIA

Query:  L
        +
Subjt:  L

A0A438HYP2 LINE-1 retrotransposable element ORF2 protein5.1e-8030.32Show/hide
Query:  GASGGILTMWDSSITSVIEVIKGRFSLSIKCLSLSNQIFWFTNVYG----------WI--TNVYGPCGLQIG-MTFLKTPESLAKTCIFSNHFPILLEAG
        GASGG L +WDS      EV+ G FS+SIK      +  W + VYG          W+  +++ G    + G   FLK  +   +  +      + LE  
Subjt:  GASGGILTMWDSSITSVIEVIKGRFSLSIKCLSLSNQIFWFTNVYG----------WI--TNVYGPCGLQIG-MTFLKTPESLAKTCIFSNHFPILLEAG

Query:  AIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVGFILHEEQRKM------------------------------------KEAGLNEDECVFRSA
           WGP+PF+F N WL  +          S F  + W G   H+  RK+                                    +E GL+ +  V R+ 
Subjt:  AIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVGFILHEEQRKM------------------------------------KEAGLNEDECVFRSA

Query:  LQAELLNIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKRKNLISELKDTNGMLHKKSSIEEIRLIVKALGRNKAPGLDEELALEMKDFRPISLTTL
         + EL  +   EE +  QK+++ W+  GD NS FF R+       + +E    +G++++ ++   I L+ K           + ++  + DFRPISL T 
Subjt:  LQAELLNIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKRKNLISELKDTNGMLHKKSSIEEIRLIVKALGRNKAPGLDEELALEMKDFRPISLTTL

Query:  SYKVVAKVLAERLKSVMDSIISPYQSAFIEGRQILDLILIASEVVEDYRAKKKKGWILKLELEKASD-----------QWIGTSLR--------------
         YK++AKVLA RL+ V+   I   Q AF++GRQILD +LIA+E+V++ R   ++G + K++ EKA D           +  G SLR              
Subjt:  SYKVVAKVLAERLKSVMDSIISPYQSAFIEGRQILDLILIASEVVEDYRAKKKKGWILKLELEKASD-----------QWIGTSLR--------------

Query:  --------RGRIIASRGIRQGDPLSPFLFLL-------------------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEWCSRQKIN
                +G + ASRG+RQGDPLSPFLF +                         +   +   QFADDT+ F    ++ L+ LK  + +F   S  K+N
Subjt:  --------RGRIIASRGIRQGDPLSPFLFLL-------------------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEWCSRQKIN

Query:  WEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGYPRRREFWQPFMWQLHWK----------------------KIMRNFFWEGFAGGKINHL
         +KS + G+N+  + L + A  L CKA   PILYLGLPLGG P+   FW P + ++  +                      ++ R F W G   GK +HL
Subjt:  WEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGYPRRREFWQPFMWQLHWK----------------------KIMRNFFWEGFAGGKINHL

Query:  VKWRLISLPLKNVGLGLEGIKTQNSVLLAKWGWRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRIGFWT
        V W ++  P    GLG   I  +N  LL KW WRY +E SALW + I SI+G  +  W        S R PW  I+ ++          +G G RI FW 
Subjt:  VKWRLISLPLKNVGLGLEGIKTQNSVLLAKWGWRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRIGFWT

Query:  DFWVGPSTLKKHFPSLFRI
        D W G   L   +P L  +
Subjt:  DFWVGPSTLKKHFPSLFRI

A0A5A7UNL3 GTP-binding protein1.3e-8639.5Show/hide
Query:  KTCIFSNHFPILLEAGAIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVGFILHEEQRKMKEAGLNEDECVFRSALQAELLNIYHLEERNLMQKS
        K  IFS+HFP+LLEAGAI+WGPS F+FCNSWLL  ECN  IEE + +     W GFILHE+  K+K A  N     + +A + +L     +EERN +QKS
Subjt:  KTCIFSNHFPILLEAGAIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVGFILHEEQRKMKEAGLNEDECVFRSALQAELLNIYHLEERNLMQKS

Query:  KLNWLSLGDENSGFFHRFLHAKRKNLISELKDTNGMLHKKSSIEEIRLIVKALGRNKAPGLDEELALEMKDFRPISLTTLSYKVVAKVLAERLKSVMDSI
        KLNWL+                                                        +E A+ ++DFRPISLTT++YKVVA VLA+RLK VM S 
Subjt:  KLNWLSLGDENSGFFHRFLHAKRKNLISELKDTNGMLHKKSSIEEIRLIVKALGRNKAPGLDEELALEMKDFRPISLTTLSYKVVAKVLAERLKSVMDSI

Query:  ISPYQSAFIEG-RQILDLI---LIASEVVEDYRAKKKKGWILKLELEKASDQWIGTSLRRGRIIASRGIRQGDPLSPFLFLLDSIHLPLCQFADDTLLFC
        I+P+QSAFIE   +  D +   ++   +     + K   WI++   +     +I  S  RGRI+ASR I+Q           D IH+ + Q+ADDT+L C
Subjt:  ISPYQSAFIEG-RQILDLI---LIASEVVEDYRAKKKKGWILKLELEKASDQWIGTSLRRGRIIASRGIRQGDPLSPFLFLLDSIHLPLCQFADDTLLFC

Query:  KYDDQMLLKLKDAIRLFEWCSRQKINWEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGYPRRREFWQPFMWQLHWKKIMRNFFWEGFAGGK
        K D+ ML KLK+AI+ FEWCS QK                                                                     EG +G K
Subjt:  KYDDQMLLKLKDAIRLFEWCSRQKINWEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGYPRRREFWQPFMWQLHWKKIMRNFFWEGFAGGK

Query:  INHLVKWRLISLPLKNVGLGLEGIKTQNSVLLAKWGWRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRI
        INHL  W+ +     + GLGL G+K +N  LLAKWGW + KEDSALWR+ I SIHG   F+W TL KSGNSLRS W+NIS     VE LA  K+G G+R+
Subjt:  INHLVKWRLISLPLKNVGLGLEGIKTQNSVLLAKWGWRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRI

Query:  GFWTDFWVGPSTLKKHFPSLFRIA
         FWT+ WVG   LK  FPSLFRIA
Subjt:  GFWTDFWVGPSTLKKHFPSLFRIA

A0A5D3E6J9 Reverse transcriptase domain-containing protein3.0e-8038.3Show/hide
Query:  PSISANNPDLLEQTHALLSSISNTQNSPNIKSKVNLLRGSPCHAAIDKPTSKTMDSP--------FSISSEESLGFPNGFNFKSNEEIEGADLSSLFDEA
        PSIS N  + L +  A +  ++       +   +NL R SP    I +      DS           +SS+ES+   N  + K + EIEG DL++LF++ 
Subjt:  PSISANNPDLLEQTHALLSSISNTQNSPNIKSKVNLLRGSPCHAAIDKPTSKTMDSP--------FSISSEESLGFPNGFNFKSNEEIEGADLSSLFDEA

Query:  SVVPIKPNEDNLLENWGLNDITK--RAALK-KFIKIIIWILDINREFVESLGASGGILTMWDSSITSVIEVIKGRFSLSIKCLSLSNQ----IFWFTNVY
             K   D   + +G +  ++     L+  FIK +    DI  +FV S+G+ GGILTMWDSS  SV EVIK RFSLSIKCLSL  +    I+W    +
Subjt:  SVVPIKPNEDNLLENWGLNDITK--RAALK-KFIKIIIWILDINREFVESLGASGGILTMWDSSITSVIEVIKGRFSLSIKCLSLSNQ----IFWFTNVY

Query:  GWITNVYGPCGLQI-GMT-FLKTPESLAKTCIFSNHFPILLEAGAIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVGFILHEEQRKMKEAGLNE
                P G    GM+ F K  +S+           I L+ G   W     +   + +   E  LL+E               L +     +  GLNE
Subjt:  GWITNVYGPCGLQI-GMT-FLKTPESLAKTCIFSNHFPILLEAGAIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVGFILHEEQRKMKEAGLNE

Query:  DECVFRSALQAELLNIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKRKNLISELKDTNGMLHKKSSIEEIRLIVKALGRNKAPGLDEELALEMKDF
        +E   R+ALQAELL IY  EE N MQK                                           +EEI   +KALG N APG D   A      
Subjt:  DECVFRSALQAELLNIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKRKNLISELKDTNGMLHKKSSIEEIRLIVKALGRNKAPGLDEELALEMKDF

Query:  RPISLTTLSYKVVAKVLAERLKSVMDSIISPYQSAF--IEGRQILDLILIASEVVEDYRAKKKKGWILKLELEKASDQ---------------------W
                  + + K  AE+ +        P QS +  I+GRQILD ILIA+E VEDYR K KK WILKL+LEKA D+                     W
Subjt:  RPISLTTLSYKVVAKVLAERLKSVMDSIISPYQSAF--IEGRQILDLILIASEVVEDYRAKKKKGWILKLELEKASDQ---------------------W

Query:  IGTSLR------------RGRIIASRGIRQGDPLSPFLFLL-------------------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRL
        I   L+            RGRI ASRGIRQGDP SPFLFLL                         D IHLPL QF++DTLLFCKYD QM LKLKDAIRL
Subjt:  IGTSLR------------RGRIIASRGIRQGDPLSPFLFLL-------------------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRL

Query:  FEWCSRQKINWEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGYPRRREF
        FEWCS QK+NWEKSA+SGVN+  DDL QTA  LGCK EKLPI+YLGLPLGGYP ++ F
Subjt:  FEWCSRQKINWEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGYPRRREF

A5BA26 Reverse transcriptase domain-containing protein4.3e-7930.48Show/hide
Query:  PESLAKTCI--FSNHFPILLEAGAIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVG-------------------FILHEEQRKMK--------
        P+ L +  I   S+H+PI+++    +WGP+PF+F N WL   +      +  S F  + W G                   F   E + K K        
Subjt:  PESLAKTCI--FSNHFPILLEAGAIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVG-------------------FILHEEQRKMK--------

Query:  ------EAGLNEDECVFRSALQAELLNIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKR-KNLISELKDTNGMLHKKS------------------
              E GLN D    R++ + EL  +   EE +  QK+K+ W+  G  NS F+H+  + +R +  I EL++  G++ K +                  
Subjt:  ------EAGLNEDECVFRSALQAELLNIYHLEERNLMQKSKLNWLSLGDENSGFFHRFLHAKR-KNLISELKDTNGMLHKKS------------------

Query:  --SIEEIRLIVKALGRNKAPGLD---------------------------------------------EELALEMKDFRPISLTTLSYKVVAKVLAERLK
          + EEI   +  L R+KAPG D                                             + L+  + DFRPISL T  YK++AKVL+ RL+
Subjt:  --SIEEIRLIVKALGRNKAPGLD---------------------------------------------EELALEMKDFRPISLTTLSYKVVAKVLAERLK

Query:  SVMDSIISPYQSAFIEGRQILDLILIASEVVEDYRAKKKKGWILKLELEKASD---------------------QWIGTSL------------RRGRIIA
         V+   I   Q AF++GRQILD +LIA+E+V++ R   ++G + K++ EKA D                     +W+   L             +G + A
Subjt:  SVMDSIISPYQSAFIEGRQILDLILIASEVVEDYRAKKKKGWILKLELEKASD---------------------QWIGTSL------------RRGRIIA

Query:  SRGIRQGDPLSPFLFLL-------------------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEWCSRQKINWEKSALSGVNINDD
        SRG+RQGDPLSPFLF L                         +   +   QFADDT+ F    ++ L  LK  + +F   S  K+N  KS++ G+N++  
Subjt:  SRGIRQGDPLSPFLFLL-------------------------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEWCSRQKINWEKSALSGVNINDD

Query:  DLFQTAARLGCKAEKLPILYLGLPLGGYPRRREFWQPFMWQLHW----KKIMRNFFWEGFAGGKINHLVKWRLISLPLKNVGLGLEGIKTQNSVLLAKWG
         L + A  L  KA   PILYLGLPLGG P+   FW P + ++      +++ R+F W GF  GK +HLV+W ++  P    GLGL  I  +N  LL KW 
Subjt:  DLFQTAARLGCKAEKLPILYLGLPLGGYPRRREFWQPFMWQLHW----KKIMRNFFWEGFAGGKINHLVKWRLISLPLKNVGLGLEGIKTQNSVLLAKWG

Query:  WRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRIGFWTDFWVGPSTLKKHFPSLFRIAL
        WRY +E SALW + I SI+G  +  W        S R PW  I++++     +    +G G RI FW D W G   L   +P LFR+ +
Subjt:  WRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRIGFWTDFWVGPSTLKKHFPSLFRIAL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein8.0e-0628.08Show/hide
Query:  PGLDEELALEMKDFRPISLTTLSYKVVAKVLAERLKSVMDSIISPYQSAFIEGRQILDLILIASEVVEDY-RAKKKKGWILKLELEKASDQ---------
        PG D     + ++FRPISL  +  K++ K+LA R++  +  +I   Q  FI G Q    I  +  V++   RAK K   I+ ++ EKA D+         
Subjt:  PGLDEELALEMKDFRPISLTTLSYKVVAKVLAERLKSVMDSIISPYQSAFIEGRQILDLILIASEVVEDY-RAKKKKGWILKLELEKASDQ---------

Query:  -----WIGTSLRRGRIIASR-------------------GIRQGDPLSPFLF--------------------LLDSIHLPLCQFADDTLLFCK---YDDQ
               G  L+  R I  +                   G RQG PLSP LF                     L    + L  FADD +++ +      Q
Subjt:  -----WIGTSLRRGRIIASR-------------------GIRQGDPLSPFLF--------------------LLDSIHLPLCQFADDTLLFCK---YDDQ

Query:  MLLKLKDAIRLFEWCSRQKINWEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPL
         LLKL   I  F   S  KIN +KS     N N     Q    L        I YLG+ L
Subjt:  MLLKLKDAIRLFEWCSRQKINWEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPL

P08548 LINE-1 reverse transcriptase homolog6.8e-0527.01Show/hide
Query:  KDFRPISLTTLSYKVVAKVLAERLKSVMDSIISPYQSAFIEGRQILDLILIASEVVEDY-RAKKKKGWILKLELEKASDQ--------------WIGTSL
        +++RPISL  +  K++ K+L  R++  +  II   Q  FI G Q    I  +  V++   + K K   IL ++ EKA D                 GT L
Subjt:  KDFRPISLTTLSYKVVAKVLAERLKSVMDSIISPYQSAFIEGRQILDLILIASEVVEDY-RAKKKKGWILKLELEKASDQ--------------WIGTSL

Query:  RRGRIIASR-------------------GIRQGDPLSPFLF---------------LLDSIH-----LPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEW
        +    I S+                   G RQG PLSP LF                +  IH     + L  FADD +++ +       KL + I+ +  
Subjt:  RRGRIIASR-------------------GIRQGDPLSPFLF---------------LLDSIH-----LPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEW

Query:  CSRQKINWEKS
         S  KIN  KS
Subjt:  CSRQKINWEKS

P0C2F6 Putative ribonuclease H protein At1g657506.3e-1134.62Show/hide
Query:  KIMRNFFWEGFAGGKINHLVKWRLISLPLKNVGLGLEGIKTQNSVLLAKWGWRYSKEDSALWRKNI-CSIHGKDTFDWFTLEKSGNSLRSPWVNIS-RMW
        ++ R F W   A  K  HLVKW  +  P K  GLG+   K+ N  L++K GWR  +E ++LW   +    H  +  D   L   G S  S W +I+  + 
Subjt:  KIMRNFFWEGFAGGKINHLVKWRLISLPLKNVGLGLEGIKTQNSVLLAKWGWRYSKEDSALWRKNI-CSIHGKDTFDWFTLEKSGNSLRSPWVNIS-RMW

Query:  MLVEHLAHLKLGCGSRIGFWTDFWVGPSTL
         +V H      G G +I FWTD WV    L
Subjt:  MLVEHLAHLKLGCGSRIGFWTDFWVGPSTL

P11369 LINE-1 retrotransposable element ORF2 protein7.5e-0430.07Show/hide
Query:  EMKDFRPISLTTLSYKVVAKVLAERLKSVMDSIISPYQSAFIEGRQILDLILIASEVVEDY-RAKKKKGWILKLELEKASDQ--------------WIGT
        ++++FRPISL  +  K++ K+LA R++  + +II P Q  FI G Q    I  +  V+    + K K   I+ L+ EKA D+                G 
Subjt:  EMKDFRPISLTTLSYKVVAKVLAERLKSVMDSIISPYQSAFIEGRQILDLILIASEVVEDY-RAKKKKGWILKLELEKASDQ--------------WIGT

Query:  SLRRGRIIASR-------------------GIRQGDPLSPFLF
         L   + I S+                   G RQG PLSP+LF
Subjt:  SLRRGRIIASR-------------------GIRQGDPLSPFLF

P14381 Transposon TX1 uncharacterized 149 kDa protein1.6e-0927.76Show/hide
Query:  MKDFRPISLTTLSYKVVAKVLAERLKSVMDSIISPYQSAFIEGRQILDLILIASEVVEDYRAKKKKGWILKLELEKASD------------------QWI
        +K++RP+SL +  YK+VAK ++ RLKSV+  +I P QS  + GR I D + +  +++   R        L L+ EKA D                  Q++
Subjt:  MKDFRPISLTTLSYKVVAKVLAERLKSVMDSIISPYQSAFIEGRQILDLILIASEVVEDYRAKKKKGWILKLELEKASD------------------QWI

Query:  G---TSLRRGRIIA------------SRGIRQGDPLS---------PFLFLL-----------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEW
        G   T       +              RG+RQG PLS         PFL LL             + + L  +ADD +L  + D   L + ++   ++  
Subjt:  G---TSLRRGRIIA------------SRGIRQGDPLS---------PFLFLL-----------DSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEW

Query:  CSRQKINWEKSA-LSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGG--YPRRREFWQPFMWQLHWKKIMRNFFWEGFA
         S  +INW KS+ L   ++  D  F   A      E   I YLG+ L    YP  + F      +L    + R   W+GFA
Subjt:  CSRQKINWEKSA-LSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGG--YPRRREFWQPFMWQLHWKKIMRNFFWEGFA

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCTATAAATTTTGCTTTTATAAAATCTCTATGGAGCATCAAAGATGTTGGTCGGGATTTTGTGCAATCAGTGGGTTCTTCAAGGGGAATTTTGTCTATGTGGGA
CAGCAGCTTATTTGTGATATTAGAGCTGCTTGATTTGCGAGAAAGTAAGGAGGCTGTTTATGGTGCTCTTGATGCCTGGGTTGCAGGACATGGAACACCGAGCGGAAGCT
CACAAATTTTGGGTCATGATGGCTGGTTCGGATCTTCATTTAGTTCCTTGGCAATTTTTATGGATTCTGCAACAAGACTAATTCTTCACAAGTATGAATTGAATGGAGGT
TGTGACATTAAAATTCCATCTTCAGCCCCCTCTCGAAAGCACAACTTAAAGCCTTCTATTTCAGCAAACAACCCTGACCTTTTGGAGCAAACTCATGCTCTCCTATCAAG
CATTTCAAATACTCAAAACTCCCCAAATATTAAGTCTAAAGTTAATTTGTTGCGAGGCTCCCCTTGTCATGCCGCCATTGACAAACCGACCTCCAAAACCATGGATTCTC
CATTTAGTATCAGTAGTGAGGAATCCTTGGGGTTCCCAAATGGTTTCAATTTCAAGAGTAATGAAGAAATTGAAGGGGCTGACCTATCTTCCCTATTTGATGAAGCAAGT
GTTGTTCCAATTAAACCTAATGAAGATAATCTCTTGGAAAACTGGGGTTTAAATGACATCACAAAAAGAGCAGCATTGAAGAAATTCATAAAAATCATAATCTGGATATT
AGACATCAACCGGGAATTTGTGGAATCATTAGGTGCCTCCGGAGGAATTTTAACAATGTGGGACAGCAGTATAACTTCTGTTATTGAGGTGATTAAAGGGAGATTTTCCT
TATCAATAAAATGTCTCTCCTTAAGTAATCAAATCTTCTGGTTTACTAATGTTTATGGCTGGATTACTAATGTTTATGGACCCTGCGGGTTGCAGATTGGGATGACCTTC
TTGAAAACTCCCGAGTCTCTCGCAAAAACCTGCATCTTCTCAAACCATTTCCCCATTTTATTAGAGGCTGGTGCCATCATTTGGGGTCCCTCTCCCTTTCAGTTTTGCAA
CAGCTGGCTGTTGTCCAATGAGTGCAACCTTTTAATTGAAGAAACAGTATCGAACTTCACTCACCATAGATGGGTTGGTTTTATTCTACACGAGGAACAGAGAAAGATGA
AGGAAGCTGGCCTAAATGAGGATGAGTGTGTATTCAGATCAGCATTACAAGCAGAATTACTCAATATTTACCATCTTGAAGAGCGCAACCTTATGCAGAAAAGCAAGCTT
AATTGGCTGTCACTTGGGGATGAAAATTCTGGGTTTTTCCATCGTTTTCTGCATGCAAAGAGGAAAAATTTAATTTCAGAATTGAAAGATACTAATGGAATGCTGCACAA
AAAGAGCAGCATTGAAGAAATACGTTTGATAGTAAAAGCTCTTGGCAGGAACAAAGCCCCGGGCCTTGATGAAGAATTGGCCTTGGAAATGAAAGATTTTCGCCCGATCA
GTCTCACTACCCTATCTTATAAGGTAGTAGCTAAAGTCCTAGCAGAACGGTTAAAATCTGTAATGGACTCAATCATCAGCCCTTACCAAAGTGCTTTTATAGAAGGAAGA
CAAATACTTGACCTGATATTAATTGCTAGTGAGGTCGTTGAAGATTATAGGGCAAAAAAGAAAAAGGGTTGGATCTTGAAGCTTGAGCTTGAAAAGGCCTCTGATCAGTG
GATTGGAACTTCCTTGAGAAGGGGCCGCATTATAGCTTCAAGAGGCATCCGCCAAGGGGACCCCCTTTCTCCCTTTCTTTTCCTTCTAGATTCAATCCATCTTCCCCTAT
GCCAATTTGCTGATGACACCCTACTTTTTTGTAAGTATGATGATCAAATGCTTCTCAAGTTGAAGGATGCCATTAGACTTTTTGAATGGTGTTCAAGGCAGAAGATTAAT
TGGGAGAAATCAGCCCTTAGTGGTGTAAATATAAATGATGATGACTTGTTTCAAACAGCAGCCCGTCTTGGTTGCAAGGCAGAAAAATTACCTATTCTCTACTTAGGTCT
CCCCCTAGGAGGCTACCCTCGGCGAAGGGAGTTTTGGCAGCCATTCATGTGGCAACTTCATTGGAAAAAAATCATGAGGAACTTCTTTTGGGAAGGATTTGCAGGCGGCA
AAATCAATCATCTAGTAAAATGGAGATTGATTTCCCTTCCTTTAAAAAATGTAGGTCTCGGTCTAGAGGGCATTAAAACCCAAAATTCAGTTTTGCTTGCTAAGTGGGGC
TGGAGATATTCTAAGGAAGATTCAGCATTATGGAGGAAAAATATTTGTAGTATCCATGGCAAAGATACGTTTGATTGGTTCACTTTGGAAAAATCTGGAAATAGCCTAAG
GAGCCCTTGGGTTAACATTTCTAGAATGTGGATGTTAGTGGAGCATTTAGCGCACCTTAAACTCGGCTGTGGCAGCAGAATCGGTTTTTGGACAGATTTCTGGGTTGGTC
CCTCTACTCTAAAGAAACATTTTCCTTCGCTTTTCAGAATTGCTTTATTACCACAAGGCTCAGTTGTTGATCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCTATAAATTTTGCTTTTATAAAATCTCTATGGAGCATCAAAGATGTTGGTCGGGATTTTGTGCAATCAGTGGGTTCTTCAAGGGGAATTTTGTCTATGTGGGA
CAGCAGCTTATTTGTGATATTAGAGCTGCTTGATTTGCGAGAAAGTAAGGAGGCTGTTTATGGTGCTCTTGATGCCTGGGTTGCAGGACATGGAACACCGAGCGGAAGCT
CACAAATTTTGGGTCATGATGGCTGGTTCGGATCTTCATTTAGTTCCTTGGCAATTTTTATGGATTCTGCAACAAGACTAATTCTTCACAAGTATGAATTGAATGGAGGT
TGTGACATTAAAATTCCATCTTCAGCCCCCTCTCGAAAGCACAACTTAAAGCCTTCTATTTCAGCAAACAACCCTGACCTTTTGGAGCAAACTCATGCTCTCCTATCAAG
CATTTCAAATACTCAAAACTCCCCAAATATTAAGTCTAAAGTTAATTTGTTGCGAGGCTCCCCTTGTCATGCCGCCATTGACAAACCGACCTCCAAAACCATGGATTCTC
CATTTAGTATCAGTAGTGAGGAATCCTTGGGGTTCCCAAATGGTTTCAATTTCAAGAGTAATGAAGAAATTGAAGGGGCTGACCTATCTTCCCTATTTGATGAAGCAAGT
GTTGTTCCAATTAAACCTAATGAAGATAATCTCTTGGAAAACTGGGGTTTAAATGACATCACAAAAAGAGCAGCATTGAAGAAATTCATAAAAATCATAATCTGGATATT
AGACATCAACCGGGAATTTGTGGAATCATTAGGTGCCTCCGGAGGAATTTTAACAATGTGGGACAGCAGTATAACTTCTGTTATTGAGGTGATTAAAGGGAGATTTTCCT
TATCAATAAAATGTCTCTCCTTAAGTAATCAAATCTTCTGGTTTACTAATGTTTATGGCTGGATTACTAATGTTTATGGACCCTGCGGGTTGCAGATTGGGATGACCTTC
TTGAAAACTCCCGAGTCTCTCGCAAAAACCTGCATCTTCTCAAACCATTTCCCCATTTTATTAGAGGCTGGTGCCATCATTTGGGGTCCCTCTCCCTTTCAGTTTTGCAA
CAGCTGGCTGTTGTCCAATGAGTGCAACCTTTTAATTGAAGAAACAGTATCGAACTTCACTCACCATAGATGGGTTGGTTTTATTCTACACGAGGAACAGAGAAAGATGA
AGGAAGCTGGCCTAAATGAGGATGAGTGTGTATTCAGATCAGCATTACAAGCAGAATTACTCAATATTTACCATCTTGAAGAGCGCAACCTTATGCAGAAAAGCAAGCTT
AATTGGCTGTCACTTGGGGATGAAAATTCTGGGTTTTTCCATCGTTTTCTGCATGCAAAGAGGAAAAATTTAATTTCAGAATTGAAAGATACTAATGGAATGCTGCACAA
AAAGAGCAGCATTGAAGAAATACGTTTGATAGTAAAAGCTCTTGGCAGGAACAAAGCCCCGGGCCTTGATGAAGAATTGGCCTTGGAAATGAAAGATTTTCGCCCGATCA
GTCTCACTACCCTATCTTATAAGGTAGTAGCTAAAGTCCTAGCAGAACGGTTAAAATCTGTAATGGACTCAATCATCAGCCCTTACCAAAGTGCTTTTATAGAAGGAAGA
CAAATACTTGACCTGATATTAATTGCTAGTGAGGTCGTTGAAGATTATAGGGCAAAAAAGAAAAAGGGTTGGATCTTGAAGCTTGAGCTTGAAAAGGCCTCTGATCAGTG
GATTGGAACTTCCTTGAGAAGGGGCCGCATTATAGCTTCAAGAGGCATCCGCCAAGGGGACCCCCTTTCTCCCTTTCTTTTCCTTCTAGATTCAATCCATCTTCCCCTAT
GCCAATTTGCTGATGACACCCTACTTTTTTGTAAGTATGATGATCAAATGCTTCTCAAGTTGAAGGATGCCATTAGACTTTTTGAATGGTGTTCAAGGCAGAAGATTAAT
TGGGAGAAATCAGCCCTTAGTGGTGTAAATATAAATGATGATGACTTGTTTCAAACAGCAGCCCGTCTTGGTTGCAAGGCAGAAAAATTACCTATTCTCTACTTAGGTCT
CCCCCTAGGAGGCTACCCTCGGCGAAGGGAGTTTTGGCAGCCATTCATGTGGCAACTTCATTGGAAAAAAATCATGAGGAACTTCTTTTGGGAAGGATTTGCAGGCGGCA
AAATCAATCATCTAGTAAAATGGAGATTGATTTCCCTTCCTTTAAAAAATGTAGGTCTCGGTCTAGAGGGCATTAAAACCCAAAATTCAGTTTTGCTTGCTAAGTGGGGC
TGGAGATATTCTAAGGAAGATTCAGCATTATGGAGGAAAAATATTTGTAGTATCCATGGCAAAGATACGTTTGATTGGTTCACTTTGGAAAAATCTGGAAATAGCCTAAG
GAGCCCTTGGGTTAACATTTCTAGAATGTGGATGTTAGTGGAGCATTTAGCGCACCTTAAACTCGGCTGTGGCAGCAGAATCGGTTTTTGGACAGATTTCTGGGTTGGTC
CCTCTACTCTAAAGAAACATTTTCCTTCGCTTTTCAGAATTGCTTTATTACCACAAGGCTCAGTTGTTGATCATTGA
Protein sequenceShow/hide protein sequence
MEAINFAFIKSLWSIKDVGRDFVQSVGSSRGILSMWDSSLFVILELLDLRESKEAVYGALDAWVAGHGTPSGSSQILGHDGWFGSSFSSLAIFMDSATRLILHKYELNGG
CDIKIPSSAPSRKHNLKPSISANNPDLLEQTHALLSSISNTQNSPNIKSKVNLLRGSPCHAAIDKPTSKTMDSPFSISSEESLGFPNGFNFKSNEEIEGADLSSLFDEAS
VVPIKPNEDNLLENWGLNDITKRAALKKFIKIIIWILDINREFVESLGASGGILTMWDSSITSVIEVIKGRFSLSIKCLSLSNQIFWFTNVYGWITNVYGPCGLQIGMTF
LKTPESLAKTCIFSNHFPILLEAGAIIWGPSPFQFCNSWLLSNECNLLIEETVSNFTHHRWVGFILHEEQRKMKEAGLNEDECVFRSALQAELLNIYHLEERNLMQKSKL
NWLSLGDENSGFFHRFLHAKRKNLISELKDTNGMLHKKSSIEEIRLIVKALGRNKAPGLDEELALEMKDFRPISLTTLSYKVVAKVLAERLKSVMDSIISPYQSAFIEGR
QILDLILIASEVVEDYRAKKKKGWILKLELEKASDQWIGTSLRRGRIIASRGIRQGDPLSPFLFLLDSIHLPLCQFADDTLLFCKYDDQMLLKLKDAIRLFEWCSRQKIN
WEKSALSGVNINDDDLFQTAARLGCKAEKLPILYLGLPLGGYPRRREFWQPFMWQLHWKKIMRNFFWEGFAGGKINHLVKWRLISLPLKNVGLGLEGIKTQNSVLLAKWG
WRYSKEDSALWRKNICSIHGKDTFDWFTLEKSGNSLRSPWVNISRMWMLVEHLAHLKLGCGSRIGFWTDFWVGPSTLKKHFPSLFRIALLPQGSVVDH