; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011643 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011643
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr1:29825861..29834784
RNA-Seq ExpressionLag0011643
SyntenyLag0011643
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043220.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.6e-7335.6Show/hide
Query:  ENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTVRGRSETRSNQSKN-------------------------
        E + DENQA+ILLNSLPE+Y+E++ AIKYGRDSL+MS+VL+AL+++ LE + + K  E L  RGRSE +S + K+                         
Subjt:  ENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTVRGRSETRSNQSKN-------------------------

Query:  TKKDHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGY
        T  +  EVL V+  D  + WI+DSGC++HMTP++ +  + ++ + GKVL+G+N  C +KG GS++++  DG  +IL++  YV  LKRNLISL  LD++G 
Subjt:  TKKDHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGY

Query:  KYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISE------RGLVELSKQGEIRCIISRDVKFNEEE-----
          KSE G+MK+ KGSL+K+  T ++G+Y L GTT  G V IA + +  + +++WHKRLAH  +      R +  L     +  + ++   F + E     
Subjt:  KYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISE------RGLVELSKQGEIRCIISRDVKFNEEE-----

Query:  -----MPWESEGSFNYQKTKLEIE------------TSF----DIELENEDQNISPEGSSQATN--EQFDHNTGNTEVLQEIDSVNQN--------IQAQ
              P ++  +  + +T +E +            T++    D +L          G  Q     + +    G  + +   DS  +         I+  
Subjt:  -----MPWESEGSFNYQKTKLEIE------------TSF----DIELENEDQNISPEGSSQATN--EQFDHNTGNTEVLQEIDSVNQN--------IQAQ

Query:  NTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDC
        ++  +L NYQLT D+ +R      R+ YA       D  AYAL        +EP ++ +A+ S+ K +W +AM  E  SL KN+ W LV  P   K++  
Subjt:  NTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDC

Query:  KWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSPWLDI
        K +YKIK S  G++KPR+KARLVAK +TQ+ GVD+ EIFSP +D+
Subjt:  KWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSPWLDI

KAA0067607.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]6.6e-7532.06Show/hide
Query:  EVLSDAEEFYLLHFLIGDERRGREFVIVRLKLLRINSLLASFPLVRFNLNKRYGVKTRENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNA
        E+    E  YL  FL   E+  R + + + K L  N  L  F  +  +L+        E + DENQA+ILLNSLPE+Y+E++AAIKYGRDSL+MS+VL+A
Subjt:  EVLSDAEEFYLLHFLIGDERRGREFVIVRLKLLRINSLLASFPLVRFNLNKRYGVKTRENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNA

Query:  LRSKELESRPQKKETEALTVRGRSE-------TRSNQSKNTKK--------------------------------------------DHTEVLAVTEIDP
        L+++ LE + ++K+ E L  RGRSE        RS++SK+  K                                            +  EVL V+  D 
Subjt:  LRSKELESRPQKKETEALTVRGRSE-------TRSNQSKNTKK--------------------------------------------DHTEVLAVTEIDP

Query:  TEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGYKYKSEGGIMKIYKGSL
         + WI+DSGC++HMTPH+ +  + +  +GGKVL+G+N  C++K  GS++++  D   +IL++VRYV  LKRNLISLG LD++G   KSE G+MK+ KGSL
Subjt:  TEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGYKYKSEGGIMKIYKGSL

Query:  LKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGLVELSKQGEIRCIISRDVKFNE-----------------------------
        +K+ GTL++G+Y L GTT  G   IA+  +     ++WH RLAH+SERGL  LS+QG +  + + ++ F E                             
Subjt:  LKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGLVELSKQGEIRCIISRDVKFNE-----------------------------

Query:  -----------------------EEMPW--------ESEGSFNYQKTKLEIETSFDIELEN------------EDQNIS-----------PEGSSQATNE
                                   W        E+ G F   K ++E +T     L++            +D  ++           P+G  +   +
Subjt:  -----------------------EEMPW--------ESEGSFNYQKTKLEIETSFDIELEN------------EDQNIS-----------PEGSSQATNE

Query:  Q-FDHNTGNTEVLQE-------------------IDSVNQN-------------------IQAQNTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAF
        Q  DH      +  E                   I+   Q+                   I+  ++  +L NYQLTRDR +R      R+ YA       
Subjt:  Q-FDHNTGNTEVLQE-------------------IDSVNQN-------------------IQAQNTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAF

Query:  DAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYT
        D  AYAL         +P ++ +A+ S+ K  W + M  E+ SL KN+TW LV  P   K++  KW+YKIK    G++KPR+KARLVAK +TQ+ GVD+ 
Subjt:  DAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYT

Query:  EIFSP
        E+FSP
Subjt:  EIFSP

RVW35472.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]5.9e-7637.31Show/hide
Query:  ENID----DENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKET--EALTVRGRS--------ETRSNQSKNTKK--------
        ENID    DE++AI+LL SL  SY  ++ AI YGRD+L+ + V + L ++EL+ +   KE   E L +RG+         + R N  K T          
Subjt:  ENID----DENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKET--EALTVRGRS--------ETRSNQSKNTKK--------

Query:  ---DHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGY
           D  EVL V E+D ++EWILDS CS+HM P K +F D K+++GG VL+GNN+ C+I GIG++R+   DG  ++L  VRY+  LKRNLISLG LDK GY
Subjt:  ---DHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGY

Query:  KYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGLVELSKQGEI------------RCIISRDVKFNEE
         +KSE   +++ + SL  M  T+KNG+YTL G T  G V+I    +      +WH+RL HIS RGL EL KQ  +             C+  +  +    
Subjt:  KYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGLVELSKQGEI------------RCIISRDVKFNEE

Query:  EMPWESEGSFNYQKTKL-------EIETSFDIELENEDQNISP-----EGSSQATNEQFDHNTGNTEVLQEIDS--VNQNIQAQNTEE---NLANYQLTR
        +   E++   +Y  + L        I  + D+    +D +        EG  Q   E  +H T   E  +E  S  V + I  +  +E    L +Y L R
Subjt:  EMPWESEGSFNYQKTKL-------EIETSFDIELENEDQNISP-----EGSSQATNEQFDHNTGNTEVLQEIDS--VNQNIQAQNTEE---NLANYQLTR

Query:  DRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGD
        DRQKR +KP  R+          +   +AL   +   + E  +Y +A+NS   ++W +A+ +EM SL KN+TW+LV  P+   VV  KW+++ K    G+
Subjt:  DRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGD

Query:  NKPRFKARLVAKWFTQEYGVDYTEIFSP
          PR+KARLVAK F+Q+  V Y EIFSP
Subjt:  NKPRFKARLVAKWFTQEYGVDYTEIFSP

TYK21344.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]3.6e-7335.6Show/hide
Query:  ENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTVRGRSETRSNQSKN-------------------------
        E + DENQA+ILLNSLPE+Y+E++ AIKYGRDSL+MS+VL+AL+++ LE + + K  E L  RGRSE +S + K+                         
Subjt:  ENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTVRGRSETRSNQSKN-------------------------

Query:  TKKDHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGY
        T  +  EVL V+  D  + WI+DSGC++HMTP++ +  + ++ + GKVL+G+N  C +KG GS++++  DG  +IL++  YV  LKRNLISL  LD++G 
Subjt:  TKKDHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGY

Query:  KYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISE------RGLVELSKQGEIRCIISRDVKFNEEE-----
          KSE G+MK+ KGSL+K+  T ++G+Y L GTT  G V IA + +  + +++WHKRLAH  +      R +  L     +  + ++   F + E     
Subjt:  KYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISE------RGLVELSKQGEIRCIISRDVKFNEEE-----

Query:  -----MPWESEGSFNYQKTKLEIE------------TSF----DIELENEDQNISPEGSSQATN--EQFDHNTGNTEVLQEIDSVNQN--------IQAQ
              P ++  +  + +T +E +            T++    D +L          G  Q     + +    G  + +   DS  +         I+  
Subjt:  -----MPWESEGSFNYQKTKLEIE------------TSF----DIELENEDQNISPEGSSQATN--EQFDHNTGNTEVLQEIDSVNQN--------IQAQ

Query:  NTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDC
        ++  +L NYQLT D+ +R      R+ YA       D  AYAL        +EP ++ +A+ S+ K +W +AM  E  SL KN+ W LV  P   K++  
Subjt:  NTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDC

Query:  KWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSPWLDI
        K +YKIK S  G++KPR+KARLVAK +TQ+ GVD+ EIFSP +D+
Subjt:  KWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSPWLDI

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]4.9e-7028.34Show/hide
Query:  ENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTVRGRSETRS------------------------------
        E + DENQA+ILLNSLPE+Y+E++AAIKYGRDSL+MS+VL+AL+++ LE + ++K+ E L  RGRSE +S                              
Subjt:  ENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTVRGRSETRS------------------------------

Query:  ----NQSK-------------------------NTKKDHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRL
            N+S+                          T  +  EVL V+  D  + WI+DSGC++HMTPH+ +  + + ++GGKVL+G+N  C++KG GS+++
Subjt:  ----NQSK-------------------------NTKKDHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRL

Query:  SLTDGSYKILSSVRYVQALKRNLISLGTLDKAGYKYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGL
        +  DG  +IL++VRYV  LKRNLISLG LD++G   KSE G+MK+ KGSL+K+ GTL++G+Y L GTT  G   IA+  +  + +++WHKRLAH+SERGL
Subjt:  SLTDGSYKILSSVRYVQALKRNLISLGTLDKAGYKYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGL

Query:  VELSKQGEI-------------------------------------------------------------------------------------------
          LS+QG +                                                                                           
Subjt:  VELSKQGEI-------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------RCIISRDVKFNEEEMPW-------ESEGSFNYQKTKLEIETSFD
                                                                +CIISRDV FNE EMP+       +  G     + ++  E    
Subjt:  --------------------------------------------------------RCIISRDVKFNEEEMPW-------ESEGSFNYQKTKLEIETSFD

Query:  IELENEDQNISPEGSSQATNEQFDHNTGNTEVLQEIDSVNQNIQAQNTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEP
        I+L+N+   +S    +Q    +FD      E +  +      I+  ++  +L NYQLTRDR +R      R+ YAD    A   AA      DS E +EP
Subjt:  IELENEDQNISPEGSSQATNEQFDHNTGNTEVLQEIDSVNQNIQAQNTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEP

Query:  TSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSP
         ++ +A+ S+ K +W +AM +E+ SL KN+TW LV  P   K++  KW+YKIK    G++KPR+KARLVAK +TQ+ GVD+ EIFSP
Subjt:  TSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSP

TrEMBL top hitse value%identityAlignment
A0A438DJ20 Retrovirus-related Pol polyprotein from transposon TNT 1-942.9e-7637.31Show/hide
Query:  ENID----DENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKET--EALTVRGRS--------ETRSNQSKNTKK--------
        ENID    DE++AI+LL SL  SY  ++ AI YGRD+L+ + V + L ++EL+ +   KE   E L +RG+         + R N  K T          
Subjt:  ENID----DENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKET--EALTVRGRS--------ETRSNQSKNTKK--------

Query:  ---DHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGY
           D  EVL V E+D ++EWILDS CS+HM P K +F D K+++GG VL+GNN+ C+I GIG++R+   DG  ++L  VRY+  LKRNLISLG LDK GY
Subjt:  ---DHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGY

Query:  KYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGLVELSKQGEI------------RCIISRDVKFNEE
         +KSE   +++ + SL  M  T+KNG+YTL G T  G V+I    +      +WH+RL HIS RGL EL KQ  +             C+  +  +    
Subjt:  KYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGLVELSKQGEI------------RCIISRDVKFNEE

Query:  EMPWESEGSFNYQKTKL-------EIETSFDIELENEDQNISP-----EGSSQATNEQFDHNTGNTEVLQEIDS--VNQNIQAQNTEE---NLANYQLTR
        +   E++   +Y  + L        I  + D+    +D +        EG  Q   E  +H T   E  +E  S  V + I  +  +E    L +Y L R
Subjt:  EMPWESEGSFNYQKTKL-------EIETSFDIELENEDQNISP-----EGSSQATNEQFDHNTGNTEVLQEIDS--VNQNIQAQNTEE---NLANYQLTR

Query:  DRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGD
        DRQKR +KP  R+          +   +AL   +   + E  +Y +A+NS   ++W +A+ +EM SL KN+TW+LV  P+   VV  KW+++ K    G+
Subjt:  DRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGD

Query:  NKPRFKARLVAKWFTQEYGVDYTEIFSP
          PR+KARLVAK F+Q+  V Y EIFSP
Subjt:  NKPRFKARLVAKWFTQEYGVDYTEIFSP

A0A5A7TIS5 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-7335.6Show/hide
Query:  ENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTVRGRSETRSNQSKN-------------------------
        E + DENQA+ILLNSLPE+Y+E++ AIKYGRDSL+MS+VL+AL+++ LE + + K  E L  RGRSE +S + K+                         
Subjt:  ENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTVRGRSETRSNQSKN-------------------------

Query:  TKKDHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGY
        T  +  EVL V+  D  + WI+DSGC++HMTP++ +  + ++ + GKVL+G+N  C +KG GS++++  DG  +IL++  YV  LKRNLISL  LD++G 
Subjt:  TKKDHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGY

Query:  KYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISE------RGLVELSKQGEIRCIISRDVKFNEEE-----
          KSE G+MK+ KGSL+K+  T ++G+Y L GTT  G V IA + +  + +++WHKRLAH  +      R +  L     +  + ++   F + E     
Subjt:  KYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISE------RGLVELSKQGEIRCIISRDVKFNEEE-----

Query:  -----MPWESEGSFNYQKTKLEIE------------TSF----DIELENEDQNISPEGSSQATN--EQFDHNTGNTEVLQEIDSVNQN--------IQAQ
              P ++  +  + +T +E +            T++    D +L          G  Q     + +    G  + +   DS  +         I+  
Subjt:  -----MPWESEGSFNYQKTKLEIE------------TSF----DIELENEDQNISPEGSSQATN--EQFDHNTGNTEVLQEIDSVNQN--------IQAQ

Query:  NTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDC
        ++  +L NYQLT D+ +R      R+ YA       D  AYAL        +EP ++ +A+ S+ K +W +AM  E  SL KN+ W LV  P   K++  
Subjt:  NTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDC

Query:  KWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSPWLDI
        K +YKIK S  G++KPR+KARLVAK +TQ+ GVD+ EIFSP +D+
Subjt:  KWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSPWLDI

A0A5A7VKC2 Retrotransposon protein, putative, Ty1-copia subclass3.2e-7532.06Show/hide
Query:  EVLSDAEEFYLLHFLIGDERRGREFVIVRLKLLRINSLLASFPLVRFNLNKRYGVKTRENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNA
        E+    E  YL  FL   E+  R + + + K L  N  L  F  +  +L+        E + DENQA+ILLNSLPE+Y+E++AAIKYGRDSL+MS+VL+A
Subjt:  EVLSDAEEFYLLHFLIGDERRGREFVIVRLKLLRINSLLASFPLVRFNLNKRYGVKTRENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNA

Query:  LRSKELESRPQKKETEALTVRGRSE-------TRSNQSKNTKK--------------------------------------------DHTEVLAVTEIDP
        L+++ LE + ++K+ E L  RGRSE        RS++SK+  K                                            +  EVL V+  D 
Subjt:  LRSKELESRPQKKETEALTVRGRSE-------TRSNQSKNTKK--------------------------------------------DHTEVLAVTEIDP

Query:  TEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGYKYKSEGGIMKIYKGSL
         + WI+DSGC++HMTPH+ +  + +  +GGKVL+G+N  C++K  GS++++  D   +IL++VRYV  LKRNLISLG LD++G   KSE G+MK+ KGSL
Subjt:  TEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGYKYKSEGGIMKIYKGSL

Query:  LKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGLVELSKQGEIRCIISRDVKFNE-----------------------------
        +K+ GTL++G+Y L GTT  G   IA+  +     ++WH RLAH+SERGL  LS+QG +  + + ++ F E                             
Subjt:  LKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGLVELSKQGEIRCIISRDVKFNE-----------------------------

Query:  -----------------------EEMPW--------ESEGSFNYQKTKLEIETSFDIELEN------------EDQNIS-----------PEGSSQATNE
                                   W        E+ G F   K ++E +T     L++            +D  ++           P+G  +   +
Subjt:  -----------------------EEMPW--------ESEGSFNYQKTKLEIETSFDIELEN------------EDQNIS-----------PEGSSQATNE

Query:  Q-FDHNTGNTEVLQE-------------------IDSVNQN-------------------IQAQNTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAF
        Q  DH      +  E                   I+   Q+                   I+  ++  +L NYQLTRDR +R      R+ YA       
Subjt:  Q-FDHNTGNTEVLQE-------------------IDSVNQN-------------------IQAQNTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAF

Query:  DAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYT
        D  AYAL         +P ++ +A+ S+ K  W + M  E+ SL KN+TW LV  P   K++  KW+YKIK    G++KPR+KARLVAK +TQ+ GVD+ 
Subjt:  DAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYT

Query:  EIFSP
        E+FSP
Subjt:  EIFSP

A0A5D3DCM9 Retrotransposon protein, putative, Ty1-copia subclass1.7e-7335.6Show/hide
Query:  ENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTVRGRSETRSNQSKN-------------------------
        E + DENQA+ILLNSLPE+Y+E++ AIKYGRDSL+MS+VL+AL+++ LE + + K  E L  RGRSE +S + K+                         
Subjt:  ENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTVRGRSETRSNQSKN-------------------------

Query:  TKKDHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGY
        T  +  EVL V+  D  + WI+DSGC++HMTP++ +  + ++ + GKVL+G+N  C +KG GS++++  DG  +IL++  YV  LKRNLISL  LD++G 
Subjt:  TKKDHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGY

Query:  KYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISE------RGLVELSKQGEIRCIISRDVKFNEEE-----
          KSE G+MK+ KGSL+K+  T ++G+Y L GTT  G V IA + +  + +++WHKRLAH  +      R +  L     +  + ++   F + E     
Subjt:  KYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISE------RGLVELSKQGEIRCIISRDVKFNEEE-----

Query:  -----MPWESEGSFNYQKTKLEIE------------TSF----DIELENEDQNISPEGSSQATN--EQFDHNTGNTEVLQEIDSVNQN--------IQAQ
              P ++  +  + +T +E +            T++    D +L          G  Q     + +    G  + +   DS  +         I+  
Subjt:  -----MPWESEGSFNYQKTKLEIE------------TSF----DIELENEDQNISPEGSSQATN--EQFDHNTGNTEVLQEIDSVNQN--------IQAQ

Query:  NTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDC
        ++  +L NYQLT D+ +R      R+ YA       D  AYAL        +EP ++ +A+ S+ K +W +AM  E  SL KN+ W LV  P   K++  
Subjt:  NTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDC

Query:  KWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSPWLDI
        K +YKIK S  G++KPR+KARLVAK +TQ+ GVD+ EIFSP +D+
Subjt:  KWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSPWLDI

A0A5D3DNU1 Putative gag-pol polyprotein2.4e-7028.34Show/hide
Query:  ENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTVRGRSETRS------------------------------
        E + DENQA+ILLNSLPE+Y+E++AAIKYGRDSL+MS+VL+AL+++ LE + ++K+ E L  RGRSE +S                              
Subjt:  ENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTVRGRSETRS------------------------------

Query:  ----NQSK-------------------------NTKKDHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRL
            N+S+                          T  +  EVL V+  D  + WI+DSGC++HMTPH+ +  + + ++GGKVL+G+N  C++KG GS+++
Subjt:  ----NQSK-------------------------NTKKDHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRL

Query:  SLTDGSYKILSSVRYVQALKRNLISLGTLDKAGYKYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGL
        +  DG  +IL++VRYV  LKRNLISLG LD++G   KSE G+MK+ KGSL+K+ GTL++G+Y L GTT  G   IA+  +  + +++WHKRLAH+SERGL
Subjt:  SLTDGSYKILSSVRYVQALKRNLISLGTLDKAGYKYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGL

Query:  VELSKQGEI-------------------------------------------------------------------------------------------
          LS+QG +                                                                                           
Subjt:  VELSKQGEI-------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------RCIISRDVKFNEEEMPW-------ESEGSFNYQKTKLEIETSFD
                                                                +CIISRDV FNE EMP+       +  G     + ++  E    
Subjt:  --------------------------------------------------------RCIISRDVKFNEEEMPW-------ESEGSFNYQKTKLEIETSFD

Query:  IELENEDQNISPEGSSQATNEQFDHNTGNTEVLQEIDSVNQNIQAQNTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEP
        I+L+N+   +S    +Q    +FD      E +  +      I+  ++  +L NYQLTRDR +R      R+ YAD    A   AA      DS E +EP
Subjt:  IELENEDQNISPEGSSQATNEQFDHNTGNTEVLQEIDSVNQNIQAQNTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEP

Query:  TSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSP
         ++ +A+ S+ K +W +AM +E+ SL KN+TW LV  P   K++  KW+YKIK    G++KPR+KARLVAK +TQ+ GVD+ EIFSP
Subjt:  TSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSP

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.7e-1327.39Show/hide
Query:  SKQGEIRCIISRDVKFNEEEMPWESEGSFNYQKTKLEIETS----FDIELENEDQNISPEGSSQATNEQFDHNTGNTEVLQEIDSVNQNIQAQNTEENLA
        SK+ E +   +   K  + E P ES+   N Q  K   E++     + +    D +++    S   NE  +  T   E L+EI          N  +N  
Subjt:  SKQGEIRCIISRDVKFNEEEMPWESEGSFNYQKTKLEIETS----FDIELENEDQNISPEGSSQATNEQFDHNTGNTEVLQEIDSVNQNIQAQNTEENLA

Query:  NYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIK
           + R  ++   KP   ++  D +++     A+ +F      N  P S+++    + K+ W EA+  E+++ + N TW + + PE   +VD +W++ +K
Subjt:  NYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIK

Query:  ASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSPWLDILQF
         +  G N  R+KARLVA+ FTQ+Y +DY E F+P   I  F
Subjt:  ASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSPWLDILQF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-2028.62Show/hide
Query:  IDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTV--RGRSETRSN----------QSKNTKK------------
        I++E++AI+LLNSLP SY  +   I +G+ ++ +  V +AL   E   +  + + +AL    RGRS  RS+          +SKN  K            
Subjt:  IDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTV--RGRSETRSN----------QSKNTKK------------

Query:  -----------------------DHTE---------VLAVTEID-------PTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSI
                               D+T          VL + E +       P  EW++D+  S+H TP +  F      + G V MGN    +I GIG I
Subjt:  -----------------------DHTE---------VLAVTEID-------PTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSI

Query:  RLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGYKYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISER
         +    G   +L  VR+V  L+ NLIS   LD+ GY+        ++ KGSL+   G  +  +Y  N     G++  A   +      +WHKR+ H+SE+
Subjt:  RLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGYKYKSEGGIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISER

Query:  GLVELSKQGEI
        GL  L+K+  I
Subjt:  GLVELSKQGEI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.5e-1731.86Show/hide
Query:  RCIISRDVKFNEEEMPWESEGSFNYQKTKLEIETSFDIELENEDQNISPEGSSQATNEQFDHNTGNTEVLQEIDSVNQNI-QAQNTEENLANYQLTRDRQ
        + I SRDV F E E+   ++ S   +K K  I  +F + + +   N  P  +   T+E  +      EV+++ + +++ + + ++  +    +Q      
Subjt:  RCIISRDVKFNEEEMPWESEGSFNYQKTKLEIETSFDIELENEDQNISPEGSSQATNEQFDHNTGNTEVLQEIDSVNQNI-QAQNTEENLANYQLTRDRQ

Query:  KRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNK-
             PL R +        + +  Y L  DD     EP S  + ++   KN+  +AM +EM SL+KN T+KLVELP+G + + CKW++K+K    GD K 
Subjt:  KRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNK-

Query:  PRFKARLVAKWFTQEYGVDYTEIFSP
         R+KARLV K F Q+ G+D+ EIFSP
Subjt:  PRFKARLVAKWFTQEYGVDYTEIFSP

P92520 Uncharacterized mitochondrial protein AtMg008202.0e-1045.21Show/hide
Query:  WFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSP
        W +AM +E+ +L +NKTW LV  P    ++ CKW++K K    G    R KARLVAK F QE G+ + E +SP
Subjt:  WFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.5e-0941.76Show/hide
Query:  SEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGH-KVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSP
        SEP +  QA+  E    W  AM  E+++   N TW LV  P  H  +V C+W++  K +  G +  R+KARLVAK + Q  G+DY E FSP
Subjt:  SEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGH-KVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.4e-0841.3Show/hide
Query:  NSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLV-ELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSP
        NSEP +  QAM  +    W +AM  E+++   N TW LV   P    +V C+W++  K +  G +  R+KARLVAK + Q  G+DY E FSP
Subjt:  NSEPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLV-ELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.8e-1444.94Show/hide
Query:  EPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSP
        EP++YN+A   +    W  AM DE+ ++E   TW++  LP   K + CKW+YKIK +  G  + R+KARLVAK +TQ+ G+D+ E FSP
Subjt:  EPTSYNQAMNSEHKNEWFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSP

ATMG00300.1 Gag-Pol-related retrotransposon family protein5.3e-0632.93Show/hide
Query:  GIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGLVELSKQGEIRCIISRDVKFNEE
        G++K+ KG    + G   + +Y L G+   G+  +A T +  +E  +WH RLAH+S+RG+  L K+G +       +KF E+
Subjt:  GIMKIYKGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGLVELSKQGEIRCIISRDVKFNEE

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.4e-1145.21Show/hide
Query:  WFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSP
        W +AM +E+ +L +NKTW LV  P    ++ CKW++K K    G    R KARLVAK F QE G+ + E +SP
Subjt:  WFEAMVDEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAAGCCATCTCCACACACATCTTTGTATTTTTTTCCCCTTCACTACACTGCCCCCCCTCACCTCCGCTTGCTCCGACCCCTACATCTCGTTGCTGTCGTGGCCGCC
GTTGTCTTCTGGTAAGGAGAAGCAGCCGTATCTTACAGATAATGTTACTCCAAGAGGTCGAGTTGATGGAGGAGAGACTATTCTTCCATTGTTGCACCATTCCAATCTCT
TTTCTCTTTCACCTCCTCAGTGTTTAAGCAATGGCGACCGCTGTAAGGACAATGTTGCTGCACCTCGTTGGACCTTCCAAGCACTGATGAAAATTATTGCAAGAGGGGAA
ACAAGCATGGCTACCGACATTTTAGGGAACTGTTTACTGGATTATTTTGGACGTTTTTACCCCTTGTATTTACGGTTATTTACAGACGCCGTTTACGTCATTTCCGAGGA
CAGAGATGTGTTTTGGATGGTCAACGAATTTCAAAACGGTACATACCGGGGGTGCTGTGTAGTCGTTGACATCCCGAGTGTCAATACTGTCGTCGCGTGGGGGGAGAAAC
GACCGAGCATGCAAGAGATGGTTGTCGAAGCGGCGGATGAAGGTCGTTGGCACGGTGGAGATGGTTCGTCGGCGTGGTATGAGAGGGTTTCTACTTTTGGAGGAAGAAGA
TGGACGGGTGGGCTGCTGCATGAAGGCTATTCTGGTGGTGAGGACGCCGTTTACGTCATTTCCGAGGACAAAGATGTGTTTTGGATGGTGAACGAATTTCAAAACGGTAC
ATACAAGGGATGTTGTGTAGTCGTTGACATCCCGAGTGTCAATACTAGTATACACCCCATGGGGACAGTTTCAAAGGGAGGGGAGAAACGATCATGCAAGAGATGGTTGT
CGACGCGGCTTATGAAGGTCGTTGACACGGCGGAGATGGTTCGTCGGCGTGGTGTGAGAGGGTTTCTACTTTTGGAGGAAGAAGATGAACGGGTGGGCTGCTGCATGAAG
GCTATTCTGGTGGTGAGGGTGGGTGGTGACATGACGTTTTTACCCCTTGTATTTACGGTTATTTACAGACGTCGTTTACGCCATTTCCGAGGACAGAGATGTGTTTTGCA
GTATTTAATTGCAATTATTCATTCTGGTGGTGAGGGTGGGCGGTATTTTTCTCTACTGAGGGAGGTTGAACTTTGTTATCCTTCCCCAATAAAGAGTGTGCTGGGGAGAG
AAAGAGTGTCTGTGAGCTTGGGAAATCAGGGAAGTGAGGTTTTGAGTGATGCTGAGGAGTTCTATCTCCTCCATTTCTTAATCGGTGATGAAAGGAGAGGCAGAGAGTTC
GTAATCGTTCGATTGAAGCTGCTAAGGATCAATTCGCTGCTTGCTTCTTTTCCTTTGGTGAGATTTAATCTCAATAAGAGATACGGTGTTAAGACAAGGGAAAATATTGA
TGATGAAAATCAAGCCATTATATTACTAAACTCACTACCAGAGAGTTATAAAGAAATAAGGGCAGCCATAAAATATGGGAGGGATTCTCTTTCGATGAGCCTAGTTTTAA
ATGCTCTTCGATCAAAAGAGCTAGAATCCAGACCTCAGAAAAAGGAAACAGAAGCTTTGACAGTAAGAGGAAGATCAGAAACTAGGTCAAATCAGTCTAAAAATACAAAG
AAAGATCACACCGAAGTCCTTGCTGTAACTGAAATTGATCCAACTGAAGAATGGATATTAGATTCAGGATGTTCCTATCATATGACCCCACATAAACACTATTTTATGGA
CTTAAAAGACTTAAATGGTGGTAAAGTTCTAATGGGAAACAATCAACAATGTGAGATTAAGGGCATTGGATCGATAAGATTGAGTTTAACTGATGGTTCATATAAAATAT
TATCTTCTGTGAGATATGTACAAGCTTTAAAACGCAATCTTATTTCCCTAGGAACTCTTGATAAAGCTGGATACAAGTATAAATCAGAGGGAGGTATTATGAAAATATAT
AAGGGGTCTTTGCTGAAAATGACGGGAACCTTAAAAAATGGCGTGTATACTTTAAATGGTACAACAAGTATAGGAGATGTGACCATAGCTACTACTGTTGAAACTAATAA
TGAAGCAATCATATGGCATAAGCGATTAGCCCACATCAGTGAAAGAGGCCTAGTTGAATTAAGTAAGCAAGGAGAAATAAGATGTATAATCAGTAGGGATGTTAAATTTA
ATGAAGAAGAAATGCCATGGGAATCTGAGGGAAGCTTCAATTATCAGAAAACTAAATTAGAAATAGAGACCTCTTTTGATATTGAACTAGAAAATGAAGATCAGAACATT
AGTCCAGAAGGTTCAAGTCAAGCTACTAATGAACAATTTGATCATAATACTGGAAATACTGAAGTTTTGCAAGAAATAGACAGTGTAAATCAGAACATTCAGGCCCAAAA
CACTGAAGAAAACTTAGCAAATTACCAATTGACCCGAGATAGACAGAAAAGAGTCATTAAACCTCTTGCTAGGTTTGATTATGCTGACTGCAATATGGATGCTTTTGATG
CTGCTGCATATGCCTTGTTCTATGATGATTCTTTTGAAAATTCAGAACCAACATCTTACAACCAAGCCATGAATTCTGAACATAAAAATGAGTGGTTTGAAGCCATGGTT
GATGAGATGAGTTCGCTTGAGAAAAACAAAACTTGGAAGTTAGTAGAATTACCTGAGGGTCACAAGGTAGTTGATTGCAAGTGGCTGTATAAAATAAAAGCTAGTCTAAA
AGGTGATAATAAACCTAGATTTAAGGCTAGACTAGTAGCCAAATGGTTTACTCAAGAGTATGGTGTTGATTACACCGAGATATTCTCCCCGTGGTTAGATATTCTTCAAT
TCGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTAAGCCATCTCCACACACATCTTTGTATTTTTTTCCCCTTCACTACACTGCCCCCCCTCACCTCCGCTTGCTCCGACCCCTACATCTCGTTGCTGTCGTGGCCGCC
GTTGTCTTCTGGTAAGGAGAAGCAGCCGTATCTTACAGATAATGTTACTCCAAGAGGTCGAGTTGATGGAGGAGAGACTATTCTTCCATTGTTGCACCATTCCAATCTCT
TTTCTCTTTCACCTCCTCAGTGTTTAAGCAATGGCGACCGCTGTAAGGACAATGTTGCTGCACCTCGTTGGACCTTCCAAGCACTGATGAAAATTATTGCAAGAGGGGAA
ACAAGCATGGCTACCGACATTTTAGGGAACTGTTTACTGGATTATTTTGGACGTTTTTACCCCTTGTATTTACGGTTATTTACAGACGCCGTTTACGTCATTTCCGAGGA
CAGAGATGTGTTTTGGATGGTCAACGAATTTCAAAACGGTACATACCGGGGGTGCTGTGTAGTCGTTGACATCCCGAGTGTCAATACTGTCGTCGCGTGGGGGGAGAAAC
GACCGAGCATGCAAGAGATGGTTGTCGAAGCGGCGGATGAAGGTCGTTGGCACGGTGGAGATGGTTCGTCGGCGTGGTATGAGAGGGTTTCTACTTTTGGAGGAAGAAGA
TGGACGGGTGGGCTGCTGCATGAAGGCTATTCTGGTGGTGAGGACGCCGTTTACGTCATTTCCGAGGACAAAGATGTGTTTTGGATGGTGAACGAATTTCAAAACGGTAC
ATACAAGGGATGTTGTGTAGTCGTTGACATCCCGAGTGTCAATACTAGTATACACCCCATGGGGACAGTTTCAAAGGGAGGGGAGAAACGATCATGCAAGAGATGGTTGT
CGACGCGGCTTATGAAGGTCGTTGACACGGCGGAGATGGTTCGTCGGCGTGGTGTGAGAGGGTTTCTACTTTTGGAGGAAGAAGATGAACGGGTGGGCTGCTGCATGAAG
GCTATTCTGGTGGTGAGGGTGGGTGGTGACATGACGTTTTTACCCCTTGTATTTACGGTTATTTACAGACGTCGTTTACGCCATTTCCGAGGACAGAGATGTGTTTTGCA
GTATTTAATTGCAATTATTCATTCTGGTGGTGAGGGTGGGCGGTATTTTTCTCTACTGAGGGAGGTTGAACTTTGTTATCCTTCCCCAATAAAGAGTGTGCTGGGGAGAG
AAAGAGTGTCTGTGAGCTTGGGAAATCAGGGAAGTGAGGTTTTGAGTGATGCTGAGGAGTTCTATCTCCTCCATTTCTTAATCGGTGATGAAAGGAGAGGCAGAGAGTTC
GTAATCGTTCGATTGAAGCTGCTAAGGATCAATTCGCTGCTTGCTTCTTTTCCTTTGGTGAGATTTAATCTCAATAAGAGATACGGTGTTAAGACAAGGGAAAATATTGA
TGATGAAAATCAAGCCATTATATTACTAAACTCACTACCAGAGAGTTATAAAGAAATAAGGGCAGCCATAAAATATGGGAGGGATTCTCTTTCGATGAGCCTAGTTTTAA
ATGCTCTTCGATCAAAAGAGCTAGAATCCAGACCTCAGAAAAAGGAAACAGAAGCTTTGACAGTAAGAGGAAGATCAGAAACTAGGTCAAATCAGTCTAAAAATACAAAG
AAAGATCACACCGAAGTCCTTGCTGTAACTGAAATTGATCCAACTGAAGAATGGATATTAGATTCAGGATGTTCCTATCATATGACCCCACATAAACACTATTTTATGGA
CTTAAAAGACTTAAATGGTGGTAAAGTTCTAATGGGAAACAATCAACAATGTGAGATTAAGGGCATTGGATCGATAAGATTGAGTTTAACTGATGGTTCATATAAAATAT
TATCTTCTGTGAGATATGTACAAGCTTTAAAACGCAATCTTATTTCCCTAGGAACTCTTGATAAAGCTGGATACAAGTATAAATCAGAGGGAGGTATTATGAAAATATAT
AAGGGGTCTTTGCTGAAAATGACGGGAACCTTAAAAAATGGCGTGTATACTTTAAATGGTACAACAAGTATAGGAGATGTGACCATAGCTACTACTGTTGAAACTAATAA
TGAAGCAATCATATGGCATAAGCGATTAGCCCACATCAGTGAAAGAGGCCTAGTTGAATTAAGTAAGCAAGGAGAAATAAGATGTATAATCAGTAGGGATGTTAAATTTA
ATGAAGAAGAAATGCCATGGGAATCTGAGGGAAGCTTCAATTATCAGAAAACTAAATTAGAAATAGAGACCTCTTTTGATATTGAACTAGAAAATGAAGATCAGAACATT
AGTCCAGAAGGTTCAAGTCAAGCTACTAATGAACAATTTGATCATAATACTGGAAATACTGAAGTTTTGCAAGAAATAGACAGTGTAAATCAGAACATTCAGGCCCAAAA
CACTGAAGAAAACTTAGCAAATTACCAATTGACCCGAGATAGACAGAAAAGAGTCATTAAACCTCTTGCTAGGTTTGATTATGCTGACTGCAATATGGATGCTTTTGATG
CTGCTGCATATGCCTTGTTCTATGATGATTCTTTTGAAAATTCAGAACCAACATCTTACAACCAAGCCATGAATTCTGAACATAAAAATGAGTGGTTTGAAGCCATGGTT
GATGAGATGAGTTCGCTTGAGAAAAACAAAACTTGGAAGTTAGTAGAATTACCTGAGGGTCACAAGGTAGTTGATTGCAAGTGGCTGTATAAAATAAAAGCTAGTCTAAA
AGGTGATAATAAACCTAGATTTAAGGCTAGACTAGTAGCCAAATGGTTTACTCAAGAGTATGGTGTTGATTACACCGAGATATTCTCCCCGTGGTTAGATATTCTTCAAT
TCGTGTGA
Protein sequenceShow/hide protein sequence
MLSHLHTHLCIFFPFTTLPPLTSACSDPYISLLSWPPLSSGKEKQPYLTDNVTPRGRVDGGETILPLLHHSNLFSLSPPQCLSNGDRCKDNVAAPRWTFQALMKIIARGE
TSMATDILGNCLLDYFGRFYPLYLRLFTDAVYVISEDRDVFWMVNEFQNGTYRGCCVVVDIPSVNTVVAWGEKRPSMQEMVVEAADEGRWHGGDGSSAWYERVSTFGGRR
WTGGLLHEGYSGGEDAVYVISEDKDVFWMVNEFQNGTYKGCCVVVDIPSVNTSIHPMGTVSKGGEKRSCKRWLSTRLMKVVDTAEMVRRRGVRGFLLLEEEDERVGCCMK
AILVVRVGGDMTFLPLVFTVIYRRRLRHFRGQRCVLQYLIAIIHSGGEGGRYFSLLREVELCYPSPIKSVLGRERVSVSLGNQGSEVLSDAEEFYLLHFLIGDERRGREF
VIVRLKLLRINSLLASFPLVRFNLNKRYGVKTRENIDDENQAIILLNSLPESYKEIRAAIKYGRDSLSMSLVLNALRSKELESRPQKKETEALTVRGRSETRSNQSKNTK
KDHTEVLAVTEIDPTEEWILDSGCSYHMTPHKHYFMDLKDLNGGKVLMGNNQQCEIKGIGSIRLSLTDGSYKILSSVRYVQALKRNLISLGTLDKAGYKYKSEGGIMKIY
KGSLLKMTGTLKNGVYTLNGTTSIGDVTIATTVETNNEAIIWHKRLAHISERGLVELSKQGEIRCIISRDVKFNEEEMPWESEGSFNYQKTKLEIETSFDIELENEDQNI
SPEGSSQATNEQFDHNTGNTEVLQEIDSVNQNIQAQNTEENLANYQLTRDRQKRVIKPLARFDYADCNMDAFDAAAYALFYDDSFENSEPTSYNQAMNSEHKNEWFEAMV
DEMSSLEKNKTWKLVELPEGHKVVDCKWLYKIKASLKGDNKPRFKARLVAKWFTQEYGVDYTEIFSPWLDILQFV