CuGenDBv2

Lag0006827 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0006827
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr6:46201111..46204537
RNA-Seq Expression: Lag0006827
Synteny: Lag0006827
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR001878 - Zinc finger, CCHC-type
  IPR025558 - Domain of unknown function DUF4283
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
  IPR043502 - DNA/RNA polymerase superfamily
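
The genome location in the record above uses a `seqid:start..end` notation. As a minimal sketch, the following Python snippet splits such a string into its parts and derives the span length. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced only for illustration, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'chr6:46201111..46204537'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


if __name__ == "__main__":
    loc = parse_location("chr6:46201111..46204537")
    print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene Lag0006827 spans 3427 bp on chr6.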


Homology
GenBank top hits (e value, % identity, alignment)
XP_010673168.1 PREDICTED: uncharacterized protein LOC104889608 [Beta vulgaris subsp. vulgaris]; e value 6.0e-95; % identity 28.34
Query:  FKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLK
        F   RDK ++  G  WS+D  +++F+E +G+    ++      FW+  + LP           +G+ +GT  + ++D    +  +S RVK+ ++V +PL+
Subjt:  FKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLK

Query:  RGTNV--KIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECVMVGASN-SKERPFGIELRETKGSKGIYKSWKYDNRDFSFWGRGRGRGRNARGAHQFS
        R   +  K G++A    + + YE++P+FCY CG LGH+ ++C+ V   + ++ER +G  LR +   +G  K  + + ++F          R+      FS
Subjt:  RGTNV--KIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECVMVGASN-SKERPFGIELRETKGSKGIYKSWKYDNRDFSFWGRGRGRGRNARGAHQFS

Query:  NNHVGEEEPIPGNSGKNYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDRCRQLDTTAVNPDQMMRMPQIKVNQNGAKGPLDLENDKNN
         +    EE +  +S       +N +     DR   KT        +P    E  +S++     + ++T +N    + +P +                   
Subjt:  NNHVGEEEPIPGNSGKNYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDRCRQLDTTAVNPDQMMRMPQIKVNQNGAKGPLDLENDKNN

Query:  NKDMDWERNKQAKGLGQTEKEAQELIQRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQI---VHSKEGRASKETKDNNAKTSSG
                                         D  S+  +K     V P +KN   +       LP  V+N  +   +++  G   K++ D     S  
Subjt:  NKDMDWERNKQAKGLGQTEKEAQELIQRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQI---VHSKEGRASKETKDNNAKTSSG

Query:  PHGKDKNACKGGGKGVKKKDTDEQRHNVKSWKRIARSHKEESGVLDQNSQNQRKRAKEQEEDLTPVKDNKKQCTYPLDLSERGSTEAVNQPPCVPSIGLS
                      G KK   D+   N  S   +     E  G++ +N +++                    C + +D                 S+G S
Subjt:  PHGKDKNACKGGGKGVKKKDTDEQRHNVKSWKRIARSHKEESGVLDQNSQNQRKRAKEQEEDLTPVKDNKKQCTYPLDLSERGSTEAVNQPPCVPSIGLS

Query:  GGLILLWKD-KLTVSIKSFSKG--CIDSIIQDGIEQWRFTAL--------------------DFG-----GGDFNETLSISEKKGGRPKSQKQMDDFSST
        GGL + WK   L  S+ SFS    C D ++ +G+ +WRF  +                    D+      GGDFNE LS+SE +GGR   ++ M DF   
Subjt:  GGLILLWKD-KLTVSIKSFSKG--CIDSIIQDGIEQWRFTAL--------------------DFG-----GGDFNETLSISEKKGGRPKSQKQMDDFSST

Query:  LSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWLQFEECKNIVK
        +   HL D+ F G  +TW+R       I+ERLDRF+A+ +       + VEH+  + S H PI+V     +     K K+++ RF  +WL  + C+++V+
Subjt:  LSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWLQFEECKNIVK

Query:  DAWGKESHSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDNNIDKAEKELDQLLEEEEQYWRIRVREDWLNWGDRNTK--
         AW    HS      ++I    + L  W SK     L   I    +EI  L  +S      ++ +   +LD LLE++E YW +R R   +  GD+NTK  
Subjt:  DAWGKESHSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDNNIDKAEKELDQLLEEEEQYWRIRVREDWLNWGDRNTK--

Query:  ---------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHA
                       L D    W DD+E I  V   Y+K LF+S+ P+   ++  L  +   I+E+ N  L R   + ++  AL+ ++PSKAPGPDG HA
Subjt:  ---------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHA

Query:  MFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIG
        +F+QRFW IVG D++ V   I++       LN T I+LIPKV  P  +  FRPISLC V++K+++K LANRLK +L  V+S +Q+AF+PGRLITDN LI 
Subjt:  MFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIG

Query:  FESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIME
         E  H++  +  G  G++A+KLDMSKA+DRVEW ++  +++
Subjt:  FESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIME

XP_023896927.1 uncharacterized protein LOC112008817 [Quercus suber]; e value 1.0e-94; % identity 36.55
Query:  VPSIGLSGGLILLWKDKLTVSIKSFSKGCIDSIIQDGIEQ-WRFTALDFGG---------------------------GDFNETLSISEKKGGRPKSQKQ
        VP   L GGL LLW + L + I++FS   ID++I  GI+  WRFT   +G                            GDFNE   + EK GG  + +KQ
Subjt:  VPSIGLSGGLILLWKDKLTVSIKSFSKGCIDSIIQDGIEQ-WRFTALDFGG---------------------------GDFNETLSISEKKGGRPKSQKQ

Query:  MDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWLQFE
        M DF   L +C L D+ + G  FTW        L+  RLDR VA ++ I +     + HL   +S H+P+ +    +++  +    ++  RFE  WL+ E
Subjt:  MDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWLQFE

Query:  ECKNIVKDAWGKESHSE-AGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDN-NIDKAEKELDQLLEEEEQYWRIRVREDWLN
         C+ +V   W K S  +    ++ K+EE   +L  W+ K + G ++ A+ R  + +      S    ++  +    +E+ +L++ EE+ W  R + +WL 
Subjt:  ECKNIVKDAWGKESHSE-AGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDN-NIDKAEKELDQLLEEEEQYWRIRVREDWLN

Query:  WGDRNTK-----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSK
        +GD+NTK                 L+DA G W++ EE IG +   Y+  LF++   NP  ++T L+G++  +++  N +L +PF   ++  ALK + P  
Subjt:  WGDRNTK-----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSK

Query:  APGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGR
        APGPDG   +F++ FWD VGG++S+  L +LN+  +   LN T+ISLIPK+  P K+  FRPISLC V+YK+I+K LANRLK +L  +IS +Q+AF+  R
Subjt:  APGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGR

Query:  LITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIME
        LITDN+LI  E++H +  KR GK GY+++KLDMSK +DRVEW+Y+ KIME
Subjt:  LITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIME

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]; e value 1.0e-99; % identity 29.36
Query:  QLWFKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKE
        Q  F+   D  RI RGG WS+D+ +L+    K   +V ++ F+  S W+     P        A  +G+ +G  E+ E    +      +RV++ + + +
Subjt:  QLWFKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKE

Query:  PLKRGTNVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECVMVGASNSKERPFGIELRETKGSKGIYKSWKYDNRDFSFWGRGRGRGRNARGAHQFS
        P++RG  +  GS   KTW+S  YE++P FC++CG LGH ++ CV            G    E KG +      +Y   DF     GR +  +A+ +    
Subjt:  PLKRGTNVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECVMVGASNSKERPFGIELRETKGSKGIYKSWKYDNRDFSFWGRGRGRGRNARGAHQFS

Query:  NNHVGEEEPIPGNSGKNYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDRCRQLDTTAVNPDQMMRMPQIKVNQ-------NGAKGPLD
                   G S           E ++ D   +        +V+   +R      N  R    +  AVNP  +  +    + Q       +GAK   +
Subjt:  NNHVGEEEPIPGNSGKNYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDRCRQLDTTAVNPDQMMRMPQIKVNQ-------NGAKGPLD

Query:  LENDKNNNKDMDWERNKQAKGLGQTEKEAQELIQRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQIVHSKEGRASKETKDNNAK
            KN +K             GQ  +E   + +                   HVG   K++ N      NL+        +V   +G+           
Subjt:  LENDKNNNKDMDWERNKQAKGLGQTEKEAQELIQRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQIVHSKEGRASKETKDNNAK

Query:  TSSGPHGKDKNACKGGGKGVKKKDTDEQRHNVK---SWKRIARSHKEESGVLDQNSQNQRKRAKEQEEDLTPVKDNKKQCTYPL-DLSERGSTEAVNQPP
              G D +    G  G           NVK   SW R+ R    + G+ D ++                        T  L  L +RG  E   +P 
Subjt:  TSSGPHGKDKNACKGGGKGVKKKDTDEQRHNVK---SWKRIARSHKEESGVLDQNSQNQRKRAKEQEEDLTPVKDNKKQCTYPL-DLSERGSTEAVNQPP

Query:  CVPSI---GLSGGLILLWKDKLTVSIKSFSKGCIDSII--QDGI-----------------EQWR----FTALDFGG----GDFNETLSISEKKGGRPKS
           +       GGL  LWK+ + + + +F+   + + +  +DG                    WR          G     GDFN  L  SEK   R   
Subjt:  CVPSI---GLSGGLILLWKDKLTVSIKSFSKGCIDSII--QDGI-----------------EQWR----FTALDFGG----GDFNETLSISEKKGGRPKS

Query:  QKQMDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWL
          Q++ F   LS C L D+ FKG  +TW          K RLDR VAN E   R     V HL+ H S H P+++  QS      + G  R  +FEESWL
Subjt:  QKQMDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWL

Query:  QFEECKNIVKDAWGKESHSEAG--ALISKIEESMRKLAAWNSKRL---KGTLKGAIERKLQEINVLASTSQRNLDNNIDKAEKELDQLLEEEEQYWRIRV
          +EC  ++++AWG    +  G  A+  KI+    +L AW S       G +K  I+++L  +N    T     +       K++D LL+++E YW  R 
Subjt:  QFEECKNIVKDAWGKESHSEAG--ALISKIEESMRKLAAWNSKRL---KGTLKGAIERKLQEINVLASTSQRNLDNNIDKAEKELDQLLEEEEQYWRIRV

Query:  REDWLNWGDRNTK-----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALK
        R +WL  GDRNTK                 ++++ G WV++ E +G VA +YF  LF +   +   +   L  +   +TED    L   F+  +++ AL 
Subjt:  REDWLNWGDRNTK-----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALK

Query:  DINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQA
         + P+KAPGPDG +A+F+Q+FW IVG  +    LD LNN  ++ ++N T I LIPKV +P++M  FRPISLC V+YKIISK LANRLK+VL ++IS +Q+
Subjt:  DINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQA

Query:  AFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIME
        AF+PGRLITDNVL+ +E++H ++ ++ GK G +A+KLD+SKA+DRVEW ++  IME
Subjt:  AFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIME

XP_030969743.1 uncharacterized protein LOC115990020 [Quercus lobata]; e value 3.9e-102; % identity 40.22
Query:  GGLILLWKDKLTVSIKSFSKGCIDSIIQDGIEQ-WRFTALDFGG---------------------------GDFNETLSISEKKGGRPKSQKQMDDFSST
        GGL LLWK+ +T+ + SFSK  ID+I+  G E  WR T   +G                            GDFNE L +S+K GG P+S  QM  F   
Subjt:  GGLILLWKDKLTVSIKSFSKGCIDSIIQDGIEQ-WRFTALDFGG---------------------------GDFNETLSISEKKGGRPKSQKQMDDFSST

Query:  LSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWLQFEECKNIVK
        L  C  VD+ F G  FTW    ++G+ I ERLDR VAN E ++R     V+HLN + S HRP+++S  S  +    + +R+  RFE  W+    CK  V 
Subjt:  LSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWLQFEECKNIVK

Query:  DAW-GKESHSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTS-QRNLDNNIDKAEKELDQLLEEEEQYWRIRVREDWLNWGDRNTK
        +AW G    +      +KI+E  ++L  W SK   G +K  I+   +++ V    S QR     +D  + EL  LLE+EE+ W  R R  WL  GD+NT+
Subjt:  DAW-GKESHSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTS-QRNLDNNIDKAEKELDQLLEEEEQYWRIRVREDWLNWGDRNTK

Query:  -----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGA
                         L+D NG W  +E+    +  ++++KLF S+  NP  I+  + G++  +T   N DL +P+S  ++ERA+KD+ P KAPGPDG 
Subjt:  -----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGA

Query:  HAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVL
          +F+Q +W  V  DI++  L  LN+  ++  +N T+I+LIPKV +P+K+  FRPISLC V+YKI+SKA+ANRLK +L+ +IS +Q+AFI  RLITDNVL
Subjt:  HAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVL

Query:  IGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM
        I FES+H + N   GK G++A+KLDMSKA+DRVEW ++ K++
Subjt:  IGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM

XP_042990668.1 uncharacterized protein LOC122317666 [Carya illinoinensis]; e value 1.2e-95; % identity 27.86
Query:  FKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLK
        F+ + DK ++  G  WS+D  ++   E +G  S+  + F    FW+  H LP     ++  + +G+ IG   + E +      G  LR+K  +NV + L 
Subjt:  FKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLK

Query:  RGTNVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECVMVGASNSKERPFGIELRETKGSKGI-----YKSWKYDNRDFSFWGRGRGRGRNARGAHQ
        RG  +K GS  +++W+S  YE++P FC+ CG+  H    C   GA N     +G  LR T           Y   K      S W   +  G +  G   
Subjt:  RGTNVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECVMVGASNSKERPFGIELRETKGSKGI-----YKSWKYDNRDFSFWGRGRGRGRNARGAHQ

Query:  FSNNHVGEEEPIPGNSGK------NYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDR-CRQLDTTAVNPDQMMRMPQIKVNQNGAKGP
         S +  G E P    S +       +T     KE++ +  T     D  G+   P  ER       + +    L    + P  +     + +    A GP
Subjt:  FSNNHVGEEEPIPGNSGK------NYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDR-CRQLDTTAVNPDQMMRMPQIKVNQNGAKGP

Query:  LDLENDKNNNKDMDWERNKQAKGLGQTEKEAQELIQRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQIVHSKEGRASKETKDNN
          +               K+  G+  TE   + L    +S +               G   K       TP + +   +  P             +  NN
Subjt:  LDLENDKNNNKDMDWERNKQAKGLGQTEKEAQELIQRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQIVHSKEGRASKETKDNN

Query:  AKTSSGPHGKDKNACKGGGKGVKKKDTDEQRHNVKSWKRIARSHKEESGVLDQNSQNQRKRAKEQEEDLTPVKDNKKQCTYPLDLSERGSTEAVNQPPCV
        +K S     + + + +  GK  KK+ T     +V+    + ++ +     L +   N  +       D   +    + C                    V
Subjt:  AKTSSGPHGKDKNACKGGGKGVKKKDTDEQRHNVKSWKRIARSHKEESGVLDQNSQNQRKRAKEQEEDLTPVKDNKKQCTYPLDLSERGSTEAVNQPPCV

Query:  PSIGLSGGLILLWKDKLTVSIKSFSKGCIDSIIQDGIE--QWRFTALDFGG---------------------------GDFNETLSISEKKGGRPKSQKQ
         S G SG L LLWKD + V + +++   I ++I   I+  QW+ T   +G                            GDFNE    SEK G   +  +Q
Subjt:  PSIGLSGGLILLWKDKLTVSIKSFSKGCIDSIIQDGIE--QWRFTALDFGG---------------------------GDFNETLSISEKKGGRPKSQKQ

Query:  MDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWLQFE
        M  F ++LS C L D+ F GD FTW  + +     KERLDR   N   I    N  V HL+   S H+ ++V    + S      K R  RFE +W +  
Subjt:  MDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWLQFE

Query:  ECKNIVKDAWGKES-HSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDNNIDKAEKELDQLLEEEEQYWRIRVREDWLNW
        EC+ I+K  W   S  S     +  + +   KL  W+  + +   K A++ K + + +L   +Q  L   I K  + ++ +++ E    + R ++ WL  
Subjt:  ECKNIVKDAWGKES-HSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDNNIDKAEKELDQLLEEEEQYWRIRVREDWLNW

Query:  GDRNTK-----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKA
        GDRNTK                 ++  +G    D + I    +E+F  LF+S+  +P+ I+  L+ ++  IT+D    L   F+  +++ A+  +NP  +
Subjt:  GDRNTK-----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKA

Query:  PGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRL
        PGPDG  A+FFQ++WD VG +++K  L++LN       LN T I+LIPK  +P  +  FRPISLC V+YKII+K LANRLKK+L  +ISP+QAAF+PGRL
Subjt:  PGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRL

Query:  ITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM
        ITDN+++ FE++H +  +  G +GY+A+KLDMSKA+DR+EW ++  +M
Subjt:  ITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM
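
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts those symbols for a single block. The function name `midline_stats` is hypothetical, and the input strings are assumed to have the 8-character label prefix (`Query:  `) already removed. The % identity reported for each hit is computed over the full alignment, so a single block will not necessarily reproduce it.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment block.

    Assumes `query` and `midline` start at the same column, i.e. the
    'Query:  ' label has been stripped from the query line and the same
    number of leading characters from the midline.
    """
    # Trailing spaces of the midline are often lost in plain-text copies,
    # so pad (or trim) it to the width of the query line.
    midline = midline.ljust(len(query))[: len(query)]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "columns": len(query),
        "identities": identities,
        "positives": positives,
    }


if __name__ == "__main__":
    # Last block of the Carya illinoinensis hit (XP_042990668.1).
    query = "ITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM"
    midline = "ITDN+++ FE++H +  +  G +GY+A+KLDMSKA+DR+EW ++  +M"
    stats = midline_stats(query, midline)
    print(stats, round(100 * stats["identities"] / stats["columns"], 2))
```

Summing `identities` and `columns` over every block of a hit, rather than using one block, gives a figure comparable to the reported % identity.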

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EL92 Reverse transcriptase domain-containing protein; e value 9.2e-110; % identity 31.28
Query:  FKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLK
        F+ + D  R+ +   WSYD  ++ F   +   S+  ++ ++VSFWV  H LP      + A ALG ++G  E       E+     +RV++++++ +PL 
Subjt:  FKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLK

Query:  RGTNVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECV----MVGASNSKERPFGIELRETKGSKGIYKSWKYDNR-DFSFWGRGRGRGRNARGAHQ
        RG   ++    ++TWIS  YE++P+FCY+CG L H  ++C       G    +++ +G  LR +          K   R D   WG+          +HQ
Subjt:  RGTNVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECV----MVGASNSKERPFGIELRETKGSKGIYKSWKYDNR-DFSFWGRGRGRGRNARGAHQ

Query:  FSNNHVGEEEPIPGNSG----KNYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDRCRQLDT---TAVNPDQMMRMPQIKVNQNGAKGP
         + +H G+     GN      K++      KE            +P G+    A+  E  +  N      L T   TAVN +Q++               
Subjt:  FSNNHVGEEEPIPGNSG----KNYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDRCRQLDT---TAVNPDQMMRMPQIKVNQNGAKGP

Query:  LDLENDKNNNKDMDWERNKQAKGLGQT-EKEAQELIQRKSSNLDYMSIGKEKSDYAHVGP-PQKNSGNQSSTPFNLLPPSVDNPQIVHSKEGRASKETKD
               N+   M +     A  + +T  K++   I     ++    + K+    A   P    NS N +++  +  P +  N + + S    AS +  D
Subjt:  LDLENDKNNNKDMDWERNKQAKGLGQT-EKEAQELIQRKSSNLDYMSIGKEKSDYAHVGP-PQKNSGNQSSTPFNLLPPSVDNPQIVHSKEGRASKETKD

Query:  NNAKTSSGPHGKDKNACKGGGKGVKKKDTDEQRHNVKSWKRIARSHKEESGVLDQNSQNQRKRAKEQEE-DLTPVKDNKKQCTY-PLDLSERGSTEAVN-
         N   +               K + KK T+                       DQN+   R+R +E +  +L      +++  Y P   +E    E    
Subjt:  NNAKTSSGPHGKDKNACKGGGKGVKKKDTDEQRHNVKSWKRIARSHKEESGVLDQNSQNQRKRAKEQEE-DLTPVKDNKKQCTY-PLDLSERGSTEAVN-

Query:  -QPPCVPSIGLSGGLILLWKDKLTVSIKSFSKGCIDSIIQ--DGIEQWRFT----------------------ALD----FGGGDFNETLSISEKKGGRP
         +PP   S G SGGL LLW D   V+I++FS+  +DS +Q  +G  +WRFT                      +LD       GDFNE LSISE+ G   
Subjt:  -QPPCVPSIGLSGGLILLWKDKLTVSIKSFSKGCIDSIIQ--DGIEQWRFT----------------------ALD----FGGGDFNETLSISEKKGGRP

Query:  KSQKQMDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEES
         S ++M DFS  ++ C LVD+ F+G  FTW+       LI++RLDR +AN   +   N   V H+    S H P+++   +  S+   + KRR  +FEE 
Subjt:  KSQKQMDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEES

Query:  WLQFEECKNIVKDAWGKES--HSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDNN-IDKAEKELDQLLEEEEQYWRIRV
        W    EC+ I++D W +E    S    L  KI+     L  W  K + G  +  I+     +  L S++    +N+ I   + E+++LL  EE +WR R 
Subjt:  WLQFEECKNIVKDAWGKES--HSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDNN-IDKAEKELDQLLEEEEQYWRIRV

Query:  REDWLNWGDRNTK-----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALK
        R  WL  GD NTK                 L +++  W  DE  I  +AV YF  +F ++ P  NL +T L  + + +T + N+ L +PF+  ++  AL 
Subjt:  REDWLNWGDRNTK-----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALK

Query:  DINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQA
         ++PSKAPGPDG  + FFQ++W+IVG D+    L +LN+  ++ ++N T ISLIPK  +P++M  +RPISLC VVYKIISK LANRLK +L  +IS SQ+
Subjt:  DINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQA

Query:  AFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM
        AF+PGRLITDNV + FE IH +  KR GK G +A+KLDMSKA+DRVEW +I  IM
Subjt:  AFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM

A0A2N9F6L9 Reverse transcriptase domain-containing protein; e value 2.7e-109; % identity 29.48
Query:  FKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLK
        F+   D  R+  G  WSYD  ++ F       +VE+L F  V FW+  H LP +   RK AVA+G  IG    +     E   G+ +RV++R+++ +PL 
Subjt:  FKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLK

Query:  RGTNVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECV--MVGASNSKERPFGIELRETKGSKGIYKSWKYDNRDFSFWGRGRGRGRNARGAHQFSN
        RG  + +G+  E  W+S  YE++P+FCY+CG   H  ++C   +   +N +ERP                       ++  W R  G             
Subjt:  RGTNVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECV--MVGASNSKERPFGIELRETKGSKGIYKSWKYDNRDFSFWGRGRGRGRNARGAHQFSN

Query:  NHVGEEEPIPGNSGKNYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDRCRQLDTTAVNPDQMMRMPQIKVNQNGAKGPLDLENDKNNN
                      +     +     +AS   W   A P      P                +   T +   +     Q+K +  GA  P   E    + 
Subjt:  NHVGEEEPIPGNSGKNYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDRCRQLDTTAVNPDQMMRMPQIKVNQNGAKGPLDLENDKNNN

Query:  KDMDWERNKQAKGLGQTEKEAQELIQRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQIVHS-KEGRASKETKDNNAKTSSGPHG
         ++D   N      G  E    EL Q        +S G   +  A V P   NS + S TP + L      P+     K G   K+ +  +  T++    
Subjt:  KDMDWERNKQAKGLGQTEKEAQELIQRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQIVHS-KEGRASKETKDNNAKTSSGPHG

Query:  KDKNACKGGGKGVKKKDTDEQRHNVKSWKRIARSHKEESGVLDQNSQNQRKRAKEQEEDLTPVKDNKKQCTYPLDLSERGSTEAVNQPPCVPSIGLSGGL
          +         V   +  + +  ++   R+ R     +  L +   N        E+ L  ++ N       L  S +    + N+          GGL
Subjt:  KDKNACKGGGKGVKKKDTDEQRHNVKSWKRIARSHKEESGVLDQNSQNQRKRAKEQEEDLTPVKDNKKQCTYPLDLSERGSTEAVNQPPCVPSIGLSGGL

Query:  ILLWKDKLTVSIKSFSKGCIDSIIQDG-IEQWRFTALDFGG---------------------------GDFNETLSISEKKGGRPKSQKQMDDFSSTLSL
         L W   + VSIKS+S   ID++I DG  + WR T + +G                            GDFNE + ++E  G   +  +QM  F + L  
Subjt:  ILLWKDKLTVSIKSFSKGCIDSIIQDG-IEQWRFTALDFGG---------------------------GDFNETLSISEKKGGRPKSQKQMDDFSSTLSL

Query:  CHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWLQFEECKNIVKDAW
        C LVD+ F G  FTW  +    +    RLDR V N+E + R     V+HL+   S H+ +   W +     S++ +R+  RFEE W+    C+  +K+AW
Subjt:  CHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWLQFEECKNIVKDAW

Query:  GKESHSEAGALIS-KIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDNNIDKAEKELDQLLEEEEQYWRIRVREDWLNWGDRNTK----
        G +    A   +S K+ E   +L +W+ +      K     KL+         Q +   N+      L+ L E+EE+ WR R R  WL  GDRNTK    
Subjt:  GKESHSEAGALIS-KIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDNNIDKAEKELDQLLEEEEQYWRIRVREDWLNWGDRNTK----

Query:  -------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAMF
                     LKD  G   D  EG+  + + Y+  LF++ +P+   I   +A +   +TED N+ L R F+  ++E ALK + P+KAPGPDG   +F
Subjt:  -------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAMF

Query:  FQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFE
        +Q+FW +VG D++K  L  LN+  ++  +N T+I+LIPK  +P+++  FRPISLC V+YK+ISK LANRLK +L +++S SQ+AF+PGRLITDNVL+ FE
Subjt:  FQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFE

Query:  SIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM
        ++H +++ ++G+DG +A+KLDMSKA+DRVEW+++ KIM
Subjt:  SIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM

A0A2N9G3I8 Reverse transcriptase domain-containing protein; e value 4.1e-110; % identity 29.55
Query:  FKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLK
        F+   D  R+  G  WSYD  ++ F        VE L F  V FWV  H LP +C  +  A  LG SIG     +    +   G+ +RV++++++ +PL 
Subjt:  FKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLK

Query:  RGTNVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQEC----VMVGASNSKERPFGIELRETKGSKGIYKSWKYDNRDFSFWGRGRGRGRNARGAHQF
        RG  + + +  E  W+S  YE++P+FCY+CG   H  ++C            K+  +G+ LR ++                                   
Subjt:  RGTNVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQEC----VMVGASNSKERPFGIELRETKGSKGIYKSWKYDNRDFSFWGRGRGRGRNARGAHQF

Query:  SNNHVGEEEPIPGNSGKNYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDRCRQLDTTAVNPD-QMMRMPQIKVNQNGAKGPLDLENDK
                             R+ R+ +   +   R T+ P+G                          AV P  Q    P +  +      P D+E  +
Subjt:  SNNHVGEEEPIPGNSGKNYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDRCRQLDTTAVNPD-QMMRMPQIKVNQNGAKGPLDLENDK

Query:  NNNKDMDWERNKQAKGLGQTEKEAQELIQRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQIVHSKEGRASKETK----DNNAKT
                  NK      Q     ++ ++     L+Y S+  EKS  A           QS  P   + P   +P ++H K   ++K+T+    +   K 
Subjt:  NNNKDMDWERNKQAKGLGQTEKEAQELIQRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQIVHSKEGRASKETK----DNNAKT

Query:  SSGPH----GKDKNACKGGGKG---VKKKDTDEQR---HNVKSWKRIARSHKEESGVLDQNSQNQRKRAKEQEEDLTPVKDNK-----KQCTYPLDLSER
        ++  H    G  K   +  G+G    +K    E+R     V+  +   RS K  + VL+ +  +      ++  ++  VKD       +  ++   L + 
Subjt:  SSGPH----GKDKNACKGGGKG---VKKKDTDEQR---HNVKSWKRIARSHKEESGVLDQNSQNQRKRAKEQEEDLTPVKDNK-----KQCTYPLDLSER

Query:  GSTEAVNQPPCVPSIGLSGGLILLWKDKLTVSIKSFSKGCIDSIIQDGIEQ-WRFTALDFGG---------------------------GDFNETLSISE
              N    V S    GGL L WKD   V+IKS+S   ID+II++G E  WR T + +G                            GDFNE L + E
Subjt:  GSTEAVNQPPCVPSIGLSGGLILLWKDKLTVSIKSFSKGCIDSIIQDGIEQ-WRFTALDFGG---------------------------GDFNETLSISE

Query:  KKGGRPKSQKQMDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRK
         KG   +  +QM  F S L  C LVD+ ++G  FTW  +         RLDR VANME + R    +VEH++   S H+ +   W S       + KRR 
Subjt:  KKGGRPKSQKQMDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRK

Query:  SRFEESWL-----QFEECKNIVKDAWG-KESHSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTS-QRNLDNNIDKAEKELDQLLE
         RFEE W+     +   C+  +  AW   +  +    +  K++E  ++L  W+ K   G +K  IE   Q I    S + Q  +  NI+   KEL+ LL 
Subjt:  SRFEESWL-----QFEECKNIVKDAWG-KESHSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTS-QRNLDNNIDKAEKELDQLLE

Query:  EEEQYWRIRVREDWLNWGDRNT-----------------KLKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPF
        +EE++WR R R  WL  GDRNT                 KL D  G W +  E I  + +EY+  LF+++  NP  +  A + ++  +T + N +L R F
Subjt:  EEEQYWRIRVREDWLNWGDRNT-----------------KLKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPF

Query:  SRCDIERALKDINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKV
           ++E+A++ + PSKAPGPDG   +F+Q++W +VG D++   L  LN+  ++  +N T+I+LIPKV +P+K+  FRPISLC V+YK++SK LANRLK +
Subjt:  SRCDIERALKDINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKV

Query:  LDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIME
        L +++S SQ+AF+PGRLITDNVL+ FE++H +++ ++G++G +A+KLDMSKA+DRVEW Y+ +IM+
Subjt:  LDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIME

A0A2N9G933 Reverse transcriptase domain-containing protein; e value 2.5e-107; % identity 29.83
Query:  FKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLK
        F+   D  R+ +G  WSYD  ++ F        V  +E  FVSFWV  H LP     R++AVALG ++G  E       E+     +R+++RI++ +PL 
Subjt:  FKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLK

Query:  RGTNVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECVM----VGASNSKERPFGIELRETKGSKGIYKSWKYDNRDFSFWGRGRGRGRNAR-GAHQ
        RG   ++ S   +TWIS  YE++P FCY+CG L H  ++C +     G    +++ +G  LR +           +D        +  GR    R G   
Subjt:  RGTNVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECVM----VGASNSKERPFGIELRETKGSKGIYKSWKYDNRDFSFWGRGRGRGRNAR-GAHQ

Query:  FSNNHVGEEEPIPGNSGKNYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDRCRQLDTTAVNPDQMMRMPQIKVNQNGAKGPLDLENDK
         ++N V  E     +SG                +T +  A  +G S +     E   S  +  C        N D    + +I    + +   ++L N +
Subjt:  FSNNHVGEEEPIPGNSGKNYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDRCRQLDTTAVNPDQMMRMPQIKVNQNGAKGPLDLENDK

Query:  NNNKDMDWERNKQAKGLGQTEKEAQELIQRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQIVHSKEGRASKETKDNNAKTSSGP
        + ++++      + K +G    E  + ++   + +    I   K+        Q    N  S    L   S+ N ++    E +   E++    + S   
Subjt:  NNNKDMDWERNKQAKGLGQTEKEAQELIQRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQIVHSKEGRASKETKDNNAKTSSGP

Query:  HGKDKNACKGGG--KGVKKKDTDEQRHNVKSWKR--IARSHKEESGVLDQNSQNQRKRAK----------EQEEDLTPVKDNKKQCTYPLDLSERGSTEA
         GK   + K  G  +   + +  +QR + K   R  +     E  GVL  ++  + +  K           +  +   +K   K C              
Subjt:  HGKDKNACKGGG--KGVKKKDTDEQRHNVKSWKR--IARSHKEESGVLDQNSQNQRKRAK----------EQEEDLTPVKDNKKQCTYPLDLSERGSTEA

Query:  VNQPPCVPSIGLSGGLILLWKDKLTVSIKSFSKGCIDSIIQ-DGIEQWRFTALDFGG---------------------------GDFNETLSISEKKGGR
              VPSIG SGGL LLW D++ +SI++FS   ID+ ++  G  +WRFT   +G                            GDFNE LS+ E+ G  
Subjt:  VNQPPCVPSIGLSGGLILLWKDKLTVSIKSFSKGCIDSIIQ-DGIEQWRFTALDFGG---------------------------GDFNETLSISEKKGGR

Query:  PKSQKQMDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEE
          SQ  M +F   L+ C LVD+ ++G  FTW+        +++RLDR VA++  +S      ++HL    S H PI++   +   S  ++ KRR  +FEE
Subjt:  PKSQKQMDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEE

Query:  SWLQFEECKNIVKDAWGKES--HSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDNN-IDKAEKELDQLLEEEEQYWRIR
         W    EC+ ++K+ W + +   S    +  KI++    L  W  K +       I+ +   ++ L + +Q  L+N+ I   ++E++QLL  EE +WR R
Subjt:  SWLQFEECKNIVKDAWGKES--HSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDNN-IDKAEKELDQLLEEEEQYWRIR

Query:  VREDWLNWGDRNTK-----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERAL
         R  WL  GDRNTK                 L+D    W D E+ +  +AV+YF+ +F+++  +P  I   +A +   ++ + N+ L  P++  ++  AL
Subjt:  VREDWLNWGDRNTK-----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERAL

Query:  KDINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQ
          ++PSKAPG DG  + FFQ++W IVG  +S   L +LN+  ++ ++N T+I+LIPK   P+KM  +RPISLC V+YKIISK +ANRLK VL  +IS SQ
Subjt:  KDINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQ

Query:  AAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM
        +AF+PGRLITDNV + FE +H +  KR GK G +AVKLDMSKA+DRVEW ++  +M
Subjt:  AAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM

A0A2N9GIC4 Reverse transcriptase domain-containing protein; e value 1.7e-108; % identity 29.49
Query:  FKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLK
        F++  D  R+  G  WSYD  ++ F       +VE L F  V FWV  H LP +   R+ A+ALG  IG          E   G+ +RV++R+++ +PL 
Subjt:  FKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLK

Query:  RGTNVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECVMVGASNSKERPFGIELRETKGSKGIYKSWKYDNRDFSFWGRGRGRGRNARGAHQFSNNH
        RG  + +G+  E  W+S  YE++P+FCY+CG   H  ++C        +     + LRE             DNR      R  G  RN   + +     
Subjt:  RGTNVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECVMVGASNSKERPFGIELRETKGSKGIYKSWKYDNRDFSFWGRGRGRGRNARGAHQFSNNH

Query:  VGEEEPIPGNSGKNYTSRINRKEEKASDRTWRKTADPVGNSVD-PAMEREVGMSRNIDRCRQLDTTAVNP---DQMMRMPQIKVNQNGAKGPLDLENDKN
             P P                         T +P   SVD P  + EV  +   D     D+ A NP   +  +R   + +N       + + N+  
Subjt:  VGEEEPIPGNSGKNYTSRINRKEEKASDRTWRKTADPVGNSVD-PAMEREVGMSRNIDRCRQLDTTAVNP---DQMMRMPQIKVNQNGAKGPLDLENDKN

Query:  NNKDMDWERNKQAKGLGQTEKEAQELIQRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQIVHSKEGRASKETKDNNAKTSSGPH
        +                      + +I  +S  L YM      +       P     NQ +TP                          +   K   G  
Subjt:  NNKDMDWERNKQAKGLGQTEKEAQELIQRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQIVHSKEGRASKETKDNNAKTSSGPH

Query:  GKDKNACKGGGKGVKKKDTDEQRHNVKSWKRIARSHKEESGVLDQNSQNQRKRAKEQEEDLTPVKDNKKQCTYPLDLSE-RGSTEAVNQPPCVPSIGLSG
                                   SWK+ ARS    +                               T P+ L+E R +TEA   P     +   G
Subjt:  GKDKNACKGGGKGVKKKDTDEQRHNVKSWKRIARSHKEESGVLDQNSQNQRKRAKEQEEDLTPVKDNKKQCTYPLDLSE-RGSTEAVNQPPCVPSIGLSG

Query:  GLILLWKDKLTVSIKSFSKGCIDSIIQDGI-EQWRFTALDFGG---------------------------GDFNETLSISEKKGGRPKSQKQMDDFSSTL
        GL L W + L V IKS+S   ID++I +G+ + WR T + +G                            GDFNE + + E  G   +  +QM  F + L
Subjt:  GLILLWKDKLTVSIKSFSKGCIDSIIQDGI-EQWRFTALDFGG---------------------------GDFNETLSISEKKGGRPKSQKQMDDFSSTL

Query:  SLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWLQFEECKNIVKD
          C LVD+ F G  FTW  +    +    RLDR VA  + + R     V+H++   S H+ +   W       S   KRR  RFEE WL    C+  + +
Subjt:  SLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWLQFEECKNIVKD

Query:  AWGKESHSEAGALIS-KIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDNN-IDKAEKELDQLLEEEEQYWRIRVREDWLNWGDRNTK-
        AW       A   ++ K++E   +L +W+ ++  G +   IE    E+    +++   +++N +    K+L+ L E+EE+ WR R R  WL  GDRNTK 
Subjt:  AWGKESHSEAGALIS-KIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDNN-IDKAEKELDQLLEEEEQYWRIRVREDWLNWGDRNTK-

Query:  ----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAH
                        LKD    W D  +G+  + ++Y+  LFS++  +P+ I   +  +   +TED N+ L R F+  ++E ALK + P+KAPGPDG  
Subjt:  ----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAH

Query:  AMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLI
         +F+Q+FW +VGGD++K  L  LN+  ++  +N T+ISLIPK  +P+++  FRPISLC V+YK+ISK LANRLK +L +V+S SQ+AF+PGRLITDNVL+
Subjt:  AMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLI

Query:  GFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM
         FE++H +++ ++G+DG +A+KLDMSKA+DRVEW ++ KIM
Subjt:  GFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM

SwissProt top hits    e value    %identity    Alignment
O00370 LINE-1 retrotransposable element ORF2 protein    8.5e-20    24
Query:  GDFNETLSISEKKGGRPKSQKQMDDFSSTLSLCHLVDI------------DFKGDNFTWKRSD------------KKGDLIKERL-DRFVANMEL-----
        GDFN  LSI + +  R K  K   + +S L    L+DI             F   + T+ + D            K+ ++I   L D     +EL     
Subjt:  GDFNETLSISEKKGGRPKSQKQMDDFSSTLSLCHLVDI------------DFKGDNFTWKRSD------------KKGDLIKERL-DRFVANMEL-----

Query:  ------ISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWLQFEEC---KNIVKDAW-GKESHSEAGALISKIEESMRKLAAWNSK
                ++NNL +     HN     I + +++         + + + ++  W  F+     K I  +A+  K+  S+   L S+++E + K    +SK
Subjt:  ------ISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEESWLQFEEC---KNIVKDAW-GKESHSEAGALISKIEESMRKLAAWNSK

Query:  RLKGTLKGAIERKLQEINVLASTSQRNLDNN-----IDKAEKELDQLLEEEEQYWRIRVREDWLNWGDRNTKLKDANGAWVDDEEGIGVVAVEYFKKLFS
          +      I  +L+EI    +  + N   +     I+K ++ L +L++++ +  +I               +K+  G    D   I     EY+K L++
Subjt:  RLKGTLKGAIERKLQEINVLASTSQRNLDNN-----IDKAEKELDQLLEEEEQYWRIRVREDWLNWGDRNTKLKDANGAWVDDEEGIGVVAVEYFKKLFS

Query:  SAKPNPNLINTAL-AGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKV
        +   N   ++T L       + +++   L RP +  +I   +  +   K+PGPDG  A F+QR+ + +   + K+   I     +        I LIPK 
Subjt:  SAKPNPNLINTAL-AGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKV

Query:  PHPD--KMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM
        P  D  K E+FRPISL  +  KI++K LANR+++ + K+I   Q  FIPG     N+      I  IN  R     ++ + +D  KAFD+++  ++ K +
Subjt:  PHPD--KMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYICKIM

P11369 LINE-1 retrotransposable element ORF2 protein    1.4e-22    30.9
Query:  KLKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKA-CITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAMFFQRFWDIVGGD
        K+++  G    D E I      ++K+L+S+   N + ++  L   +   + +DQ   L  P S  +IE  +  +   K+PGPDG  A F+Q F +    D
Subjt:  KLKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKA-CITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAMFFQRFWDIVGGD

Query:  ISKVCLDILNNDGVIGQLNCTW----ISLIPK-VPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAIN
        +  +   + +   V G L  ++    I+LIPK    P K+E+FRPISL  +  KI++K LANR+++ +  +I P Q  FIPG     N+      IH IN
Subjt:  ISKVCLDILNNDGVIGQLNCTW----ISLIPK-VPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAIN

Query:  NKRVGKDGYIAVKLDMSKAFDRVEWIYICKIME
          ++    ++ + LD  KAFD+++  ++ K++E
Subjt:  NKRVGKDGYIAVKLDMSKAFDRVEWIYICKIME

P14381 Transposon TX1 uncharacterized 149 kDa protein    2.6e-24    32.72
Query:  NGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCL
        +G  ++D E I   A  +++ LFS    +P+       G+   ++E +   LE P +  ++ +AL+ +  +K+PG DG    FFQ FWD +G D  +V  
Subjt:  NGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCL

Query:  DILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIA
        +      +        +SL+PK      ++++RP+SL    YKI++KA++ RLK VL +VI P Q+  +PGR I DNV +  + +H    +R G      
Subjt:  DILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIA

Query:  VKLDMSKAFDRVEWIYI
        + LD  KAFDRV+  Y+
Subjt:  VKLDMSKAFDRVEWIYI

Arabidopsis top hits    e value    %identity    Alignment
AT1G43760.1 DNAse I-like superfamily protein    1.3e-15    24.46
Query:  GDFNETLSISEKKGGRPKS--QKQMDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQS
        GDF++  + S+       S   + +++F + L    LVDI  +G ++TW        +I+ +LDR +AN +  S   +          S H P I+  + 
Subjt:  GDFNETLSISEKKGGRPKS--QKQMDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQS

Query:  IRSSPSNKGKRRKSRFEESWLQFEECKNIVKDAWGKESHSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQE-INVLASTSQRNLDNNIDK---
              N  KR K  F            +V      E     G+ +  + E + K A    K L     G I+ K +E ++ L S   + L N  D    
Subjt:  IRSSPSNKGKRRKSRFEESWLQFEECKNIVKDAWGKESHSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQE-INVLASTSQRNLDNNIDK---

Query:  ----AEKELDQLLEEEEQYWRIRVREDWLNWGDRNTK-----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKP-------------N
            A K+ +      E ++R + R  WL  GD NT+                 L+  +   V++   +  + V Y+  L  S                +
Subjt:  ----AEKELDQLLEEEEQYWRIRVREDWLNWGDRNTK-----------------LKDANGAWVDDEEGIGVVAVEYFKKLFSSAKP-------------N

Query:  PNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKM
        P   N  LA   + +  D+           +I  A+  +  +KAPGPD   A FF   W +V         +      ++ + N T I+LIPKV   D++
Subjt:  PNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKM

Query:  ESFRPISLCCVVYKIIS
          FRP+S C VVYKII+
Subjt:  ESFRPISLCCVVYKIIS

AT4G20520.1 RNA binding; RNA-directed DNA polymerases    1.5e-11    42.03
Query:  LANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYI
        +  RLK ++  +I P+QA+FIPGR+ TDN++   E++H++  K+ G  G++ +KLD+ KA+DR+ W Y+
Subjt:  LANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYI


Sequences
CDS sequence
ATGCAAAGGAAACAACTTTGGTTTAAGAAGCTCAGAGATAAAGTTAGAATAACCAGGGGAGGGTCGTGGAGCTATGATGATGCTATTCTCATTTTCGACGAGCCAAAAGG
GAGTTGTAGTGTGGAGTCTTTAGAATTCAAATTCGTTTCTTTCTGGGTTCACTTTCACAAGCTCCCTCGTGTGTGCTTTTGCAGGAAATACGCTGTGGCTTTAGGGAATT
CTATTGGCACGTTTGAAGATGCTGAATGGGACGCCAACGAGAAAATGGAAGGTCAATCCTTAAGGGTCAAGATCAGAATAAATGTTAAGGAACCTCTCAAAAGGGGAACT
AACGTGAAGATTGGCTCGATGGCGGAAAAAACATGGATTTCTATTACTTACGAGAAAATGCCGGACTTTTGCTATTTTTGTGGGAAGCTCGGCCATGTGGTCCAAGAATG
TGTGATGGTTGGAGCTAGCAACAGCAAGGAAAGGCCTTTTGGAATTGAGTTGAGAGAAACAAAAGGTAGCAAGGGAATTTACAAATCCTGGAAATATGATAACAGAGATT
TTTCCTTTTGGGGGAGAGGGCGTGGAAGAGGACGAAACGCGAGAGGGGCTCATCAATTCAGTAATAATCACGTCGGGGAAGAAGAACCCATTCCAGGAAATTCAGGCAAG
AATTACACTTCAAGAATCAATCGAAAAGAAGAAAAAGCGAGTGACCGGACATGGAGGAAAACGGCGGATCCCGTCGGAAACTCTGTAGATCCGGCTATGGAGAGGGAGGT
AGGGATGTCTAGAAATATAGACAGGTGCAGGCAGCTTGACACGACAGCTGTCAACCCAGATCAGATGATGAGAATGCCCCAGATTAAAGTTAATCAGAACGGGGCAAAAG
GACCGTTGGACCTGGAAAACGACAAAAATAATAACAAGGACATGGATTGGGAACGCAACAAACAAGCAAAAGGCTTGGGCCAAACTGAAAAAGAAGCCCAAGAATTAATC
CAGAGAAAGAGCTCCAATCTTGACTACATGAGTATTGGGAAGGAAAAGTCTGATTATGCTCACGTGGGACCACCGCAGAAAAATTCTGGAAACCAATCTTCTACCCCCTT
CAATCTTCTACCCCCTTCGGTTGACAACCCTCAAATTGTCCATTCTAAGGAAGGAAGGGCATCGAAGGAGACTAAAGATAACAACGCCAAAACCTCTTCGGGACCTCATG
GCAAGGATAAAAACGCTTGCAAAGGAGGGGGAAAGGGAGTCAAGAAGAAAGATACTGACGAGCAGAGACATAATGTTAAGTCATGGAAACGAATAGCCAGATCTCACAAG
GAAGAATCAGGGGTGTTAGATCAGAATTCTCAAAACCAGAGGAAAAGGGCCAAAGAGCAGGAAGAGGATTTGACTCCAGTAAAGGACAACAAGAAGCAGTGCACATATCC
TCTCGATTTAAGCGAGAGGGGATCGACGGAGGCTGTGAATCAGCCCCCCTGTGTGCCTAGCATTGGCCTAAGTGGGGGCCTAATCTTGCTTTGGAAAGACAAGCTAACAG
TTAGTATCAAATCTTTTTCTAAGGGCTGTATTGATTCGATCATTCAAGATGGCATTGAGCAATGGCGCTTTACAGCCTTGGATTTTGGGGGGGGGGACTTCAACGAGACC
CTCTCTATCTCAGAGAAAAAGGGCGGCAGGCCTAAAAGCCAGAAACAAATGGACGATTTTAGCTCCACTCTCAGTCTTTGTCACTTGGTTGATATTGATTTCAAAGGTGA
TAATTTCACTTGGAAAAGGAGTGATAAAAAAGGGGATTTGATCAAAGAAAGGTTAGACAGATTTGTGGCTAATATGGAACTGATCAGTAGAGTCAACAATCTGGAAGTTG
AACACCTCAATTACCACAACTCTTATCATAGGCCTATCATTGTGTCCTGGCAAAGCATAAGAAGCTCCCCTAGCAATAAGGGGAAGAGGAGAAAGTCGAGGTTCGAAGAA
AGTTGGCTTCAGTTTGAGGAATGCAAGAACATTGTAAAAGACGCTTGGGGCAAAGAATCTCATTCTGAAGCAGGGGCTTTGATATCCAAAATCGAAGAAAGCATGAGAAA
GTTAGCGGCCTGGAATAGTAAAAGACTTAAAGGCACTCTCAAAGGTGCGATTGAGAGGAAGCTTCAAGAGATAAATGTGTTGGCTTCCACAAGCCAAAGGAATCTAGACA
ACAATATTGACAAGGCAGAAAAGGAATTAGACCAGTTGCTTGAAGAAGAAGAGCAGTATTGGAGGATTAGAGTTAGAGAAGATTGGCTCAATTGGGGGGATAGAAACACT
AAGTTGAAAGATGCTAATGGAGCTTGGGTGGATGATGAAGAGGGAATTGGGGTGGTGGCGGTGGAGTACTTCAAAAAGCTTTTCTCCTCAGCCAAACCCAACCCTAATTT
GATTAACACAGCCTTGGCAGGTATCAAAGCCTGCATCACTGAGGACCAAAACCGGGACCTTGAGAGGCCGTTTTCGAGATGTGACATAGAAAGGGCGTTGAAAGACATTA
ACCCGTCCAAGGCTCCAGGTCCTGATGGTGCCCATGCCATGTTCTTCCAGCGGTTTTGGGATATTGTGGGTGGCGATATCTCTAAAGTGTGCTTAGACATTCTTAACAAT
GATGGTGTTATTGGGCAGCTAAATTGCACGTGGATTTCCTTAATCCCCAAGGTTCCCCACCCCGATAAAATGGAAAGTTTTCGACCTATCAGCCTATGCTGCGTGGTTTA
CAAAATTATATCCAAAGCCCTGGCAAATAGATTGAAAAAAGTTCTCGACAAAGTCATTTCTCCTTCTCAGGCGGCTTTCATTCCTGGAAGGCTAATTACTGATAATGTCC
TTATTGGCTTTGAGAGCATCCACGCGATAAACAACAAAAGAGTGGGGAAGGATGGCTACATTGCTGTGAAGCTGGATATGAGTAAAGCCTTCGACCGGGTGGAATGGATC
TATATCTGTAAAATCATGGAGTAG
mRNA sequence
ATGCAAAGGAAACAACTTTGGTTTAAGAAGCTCAGAGATAAAGTTAGAATAACCAGGGGAGGGTCGTGGAGCTATGATGATGCTATTCTCATTTTCGACGAGCCAAAAGG
GAGTTGTAGTGTGGAGTCTTTAGAATTCAAATTCGTTTCTTTCTGGGTTCACTTTCACAAGCTCCCTCGTGTGTGCTTTTGCAGGAAATACGCTGTGGCTTTAGGGAATT
CTATTGGCACGTTTGAAGATGCTGAATGGGACGCCAACGAGAAAATGGAAGGTCAATCCTTAAGGGTCAAGATCAGAATAAATGTTAAGGAACCTCTCAAAAGGGGAACT
AACGTGAAGATTGGCTCGATGGCGGAAAAAACATGGATTTCTATTACTTACGAGAAAATGCCGGACTTTTGCTATTTTTGTGGGAAGCTCGGCCATGTGGTCCAAGAATG
TGTGATGGTTGGAGCTAGCAACAGCAAGGAAAGGCCTTTTGGAATTGAGTTGAGAGAAACAAAAGGTAGCAAGGGAATTTACAAATCCTGGAAATATGATAACAGAGATT
TTTCCTTTTGGGGGAGAGGGCGTGGAAGAGGACGAAACGCGAGAGGGGCTCATCAATTCAGTAATAATCACGTCGGGGAAGAAGAACCCATTCCAGGAAATTCAGGCAAG
AATTACACTTCAAGAATCAATCGAAAAGAAGAAAAAGCGAGTGACCGGACATGGAGGAAAACGGCGGATCCCGTCGGAAACTCTGTAGATCCGGCTATGGAGAGGGAGGT
AGGGATGTCTAGAAATATAGACAGGTGCAGGCAGCTTGACACGACAGCTGTCAACCCAGATCAGATGATGAGAATGCCCCAGATTAAAGTTAATCAGAACGGGGCAAAAG
GACCGTTGGACCTGGAAAACGACAAAAATAATAACAAGGACATGGATTGGGAACGCAACAAACAAGCAAAAGGCTTGGGCCAAACTGAAAAAGAAGCCCAAGAATTAATC
CAGAGAAAGAGCTCCAATCTTGACTACATGAGTATTGGGAAGGAAAAGTCTGATTATGCTCACGTGGGACCACCGCAGAAAAATTCTGGAAACCAATCTTCTACCCCCTT
CAATCTTCTACCCCCTTCGGTTGACAACCCTCAAATTGTCCATTCTAAGGAAGGAAGGGCATCGAAGGAGACTAAAGATAACAACGCCAAAACCTCTTCGGGACCTCATG
GCAAGGATAAAAACGCTTGCAAAGGAGGGGGAAAGGGAGTCAAGAAGAAAGATACTGACGAGCAGAGACATAATGTTAAGTCATGGAAACGAATAGCCAGATCTCACAAG
GAAGAATCAGGGGTGTTAGATCAGAATTCTCAAAACCAGAGGAAAAGGGCCAAAGAGCAGGAAGAGGATTTGACTCCAGTAAAGGACAACAAGAAGCAGTGCACATATCC
TCTCGATTTAAGCGAGAGGGGATCGACGGAGGCTGTGAATCAGCCCCCCTGTGTGCCTAGCATTGGCCTAAGTGGGGGCCTAATCTTGCTTTGGAAAGACAAGCTAACAG
TTAGTATCAAATCTTTTTCTAAGGGCTGTATTGATTCGATCATTCAAGATGGCATTGAGCAATGGCGCTTTACAGCCTTGGATTTTGGGGGGGGGGACTTCAACGAGACC
CTCTCTATCTCAGAGAAAAAGGGCGGCAGGCCTAAAAGCCAGAAACAAATGGACGATTTTAGCTCCACTCTCAGTCTTTGTCACTTGGTTGATATTGATTTCAAAGGTGA
TAATTTCACTTGGAAAAGGAGTGATAAAAAAGGGGATTTGATCAAAGAAAGGTTAGACAGATTTGTGGCTAATATGGAACTGATCAGTAGAGTCAACAATCTGGAAGTTG
AACACCTCAATTACCACAACTCTTATCATAGGCCTATCATTGTGTCCTGGCAAAGCATAAGAAGCTCCCCTAGCAATAAGGGGAAGAGGAGAAAGTCGAGGTTCGAAGAA
AGTTGGCTTCAGTTTGAGGAATGCAAGAACATTGTAAAAGACGCTTGGGGCAAAGAATCTCATTCTGAAGCAGGGGCTTTGATATCCAAAATCGAAGAAAGCATGAGAAA
GTTAGCGGCCTGGAATAGTAAAAGACTTAAAGGCACTCTCAAAGGTGCGATTGAGAGGAAGCTTCAAGAGATAAATGTGTTGGCTTCCACAAGCCAAAGGAATCTAGACA
ACAATATTGACAAGGCAGAAAAGGAATTAGACCAGTTGCTTGAAGAAGAAGAGCAGTATTGGAGGATTAGAGTTAGAGAAGATTGGCTCAATTGGGGGGATAGAAACACT
AAGTTGAAAGATGCTAATGGAGCTTGGGTGGATGATGAAGAGGGAATTGGGGTGGTGGCGGTGGAGTACTTCAAAAAGCTTTTCTCCTCAGCCAAACCCAACCCTAATTT
GATTAACACAGCCTTGGCAGGTATCAAAGCCTGCATCACTGAGGACCAAAACCGGGACCTTGAGAGGCCGTTTTCGAGATGTGACATAGAAAGGGCGTTGAAAGACATTA
ACCCGTCCAAGGCTCCAGGTCCTGATGGTGCCCATGCCATGTTCTTCCAGCGGTTTTGGGATATTGTGGGTGGCGATATCTCTAAAGTGTGCTTAGACATTCTTAACAAT
GATGGTGTTATTGGGCAGCTAAATTGCACGTGGATTTCCTTAATCCCCAAGGTTCCCCACCCCGATAAAATGGAAAGTTTTCGACCTATCAGCCTATGCTGCGTGGTTTA
CAAAATTATATCCAAAGCCCTGGCAAATAGATTGAAAAAAGTTCTCGACAAAGTCATTTCTCCTTCTCAGGCGGCTTTCATTCCTGGAAGGCTAATTACTGATAATGTCC
TTATTGGCTTTGAGAGCATCCACGCGATAAACAACAAAAGAGTGGGGAAGGATGGCTACATTGCTGTGAAGCTGGATATGAGTAAAGCCTTCGACCGGGTGGAATGGATC
TATATCTGTAAAATCATGGAGTAG
Protein sequence
MQRKQLWFKKLRDKVRITRGGSWSYDDAILIFDEPKGSCSVESLEFKFVSFWVHFHKLPRVCFCRKYAVALGNSIGTFEDAEWDANEKMEGQSLRVKIRINVKEPLKRGT
NVKIGSMAEKTWISITYEKMPDFCYFCGKLGHVVQECVMVGASNSKERPFGIELRETKGSKGIYKSWKYDNRDFSFWGRGRGRGRNARGAHQFSNNHVGEEEPIPGNSGK
NYTSRINRKEEKASDRTWRKTADPVGNSVDPAMEREVGMSRNIDRCRQLDTTAVNPDQMMRMPQIKVNQNGAKGPLDLENDKNNNKDMDWERNKQAKGLGQTEKEAQELI
QRKSSNLDYMSIGKEKSDYAHVGPPQKNSGNQSSTPFNLLPPSVDNPQIVHSKEGRASKETKDNNAKTSSGPHGKDKNACKGGGKGVKKKDTDEQRHNVKSWKRIARSHK
EESGVLDQNSQNQRKRAKEQEEDLTPVKDNKKQCTYPLDLSERGSTEAVNQPPCVPSIGLSGGLILLWKDKLTVSIKSFSKGCIDSIIQDGIEQWRFTALDFGGGDFNET
LSISEKKGGRPKSQKQMDDFSSTLSLCHLVDIDFKGDNFTWKRSDKKGDLIKERLDRFVANMELISRVNNLEVEHLNYHNSYHRPIIVSWQSIRSSPSNKGKRRKSRFEE
SWLQFEECKNIVKDAWGKESHSEAGALISKIEESMRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDNNIDKAEKELDQLLEEEEQYWRIRVREDWLNWGDRNT
KLKDANGAWVDDEEGIGVVAVEYFKKLFSSAKPNPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAMFFQRFWDIVGGDISKVCLDILNN
DGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWI
YICKIME