CuGenDBv2

Lag0025369 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025369
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr10:11940708..11943185
RNA-Seq Expression: Lag0025369
Synteny: Lag0025369
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis] (e value: 1.1e-146; % identity: 37.55)
Query:  ETKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLWNSDIEIYWFLW-----------ESGGC-------------QKERFLELVRAVSGEGRYPLASG
        ETK    + E  ++ L F++ + V   G  GGL LLW  D+ +    +           E+G               QK+    L+R ++G    P    
Subjt:  ETKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLWNSDIEIYWFLW-----------ESGGC-------------QKERFLELVRAVSGEGRYPLASG

Query:  GGDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMDAGPIVANIVISSAKHSSRNKGKILHFEEGWLKFKYAKKIVQDSWSGWEGRRSKQINHKLKVS
         GDFNEI +++EK GG  RN +++  F+  +  CRL+D G        S    + RN+ KIL                               N K + S
Subjt:  GGDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMDAGPIVANIVISSAKHSSRNKGKILHFEEGWLKFKYAKKIVQDSWSGWEGRRSKQINHKLKVS

Query:  IKNLHQWSKERFKGNIKSAIRKKEEEIQ-KLEAESHMMDEQD----LDRAEMELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGI
        +  L  WSK+ F G      +K+ E++Q KL++  H     D    L + E +++++L++EE +W+ RSR  WLK GDKNT++FH KAS RRK+NRI GI
Subjt:  IKNLHQWSKERFKGNIKSAIRKKEEEIQ-KLEAESHMMDEQD----LDRAEMELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGI

Query:  FDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVR
         D  G W E+ +++  +  E+F  +F ++ P+ +Q+    +    +++++    L+ P+ ++E+ EA   + P+KAPG DG+ A+FFQ +W  V E  + 
Subjt:  FDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVR

Query:  ECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEG
         CL ILN++ ++ P N T I LIPK  +PK + EFRPISLCNV Y+I+AK++AN LK +LD I+SP QSAF+  RLI+DN+++G+E ++ I   K  K G
Subjt:  ECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEG

Query:  HVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQI
         +A+ LD+SKAYDRVEW F+R    K+GF   WI+  M C+ + +FSVLINGVP  + +P RG+RQG PLSPY+FL+CAE FS +L +  ++Q + G + 
Subjt:  HVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQI

Query:  NNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSC-----------------------------------------------KIK
        N+    +SHL FADDSLVF  +++E+C  +K +F  Y  AS Q  N +                                                 +IK
Subjt:  NNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSC-----------------------------------------------KIK

Query:  DKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINKMCASFWWGATGDKKKLHWARWKELCKSKDVGG
         +I N L  W  KLFS+GG+E+LIKAV QA+P Y+MS FKLP S+C+DI K  A FWWG++ D++ +HWA+W++LC++K  GG
Subjt:  DKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINKMCASFWWGATGDKKKLHWARWKELCKSKDVGG

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber] (e value: 7.8e-158; % identity: 39.1)
Query:  EQVKKRLSFDHGWCVPSVGSSGGLMLLW----NSDIEIYW--------------FLWESGGC-------QKERFLELVRAVSGEGRYPLASGGGDFNEIL
        E++K R+ F +G  VP  G SGG+ LLW    N +++ Y               + W   G        ++     L+  ++ + + P     GDFNEIL
Subjt:  EQVKKRLSFDHGWCVPSVGSSGGLMLLW----NSDIEIYW--------------FLWESGGC-------QKERFLELVRAVSGEGRYPLASGGGDFNEIL

Query:  SVDEKKGGGLRNQNQMKGFQDVINICRLMD-------------------------------------AGPIVANIVISSAKHSS----------RNKGKI
        S++EK GG  R+Q+QM GF+D++N C   D                                      G  V ++V S+  H +          R + K 
Subjt:  SVDEKKGGGLRNQNQMKGFQDVINICRLMD-------------------------------------AGPIVANIVISSAKHSS----------RNKGKI

Query:  LHFEEGWLKFKYAKKIVQDSWS-GWEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMDEQ---DLDRAEMELNSLLEEEE
         HFE  W K +  K I++ SW  G +    + I+  L++    L +WS   + G I   I+ K   +  L      +DE    +++R   E+N+LL++EE
Subjt:  LHFEEGWLKFKYAKKIVQDSWS-GWEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMDEQ---DLDRAEMELNSLLEEEE

Query:  YYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRRTLEQPYSKQ
         YW  R++  WLK GD+NT++FH +AS+RRK+N I GI+D  G W + EE IA  A  YF +I+ SS PS  QI +V E +  +++++   +L + ++K+
Subjt:  YYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRRTLEQPYSKQ

Query:  EVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDT
        EV  A K +HP+KAPG DG+ A FFQ YW IVG +     L +LN+   I   NKT I+LIPK   PKRM +FRPISLCNV YK+++K +ANRLK +L  
Subjt:  EVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDT

Query:  IISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDR
        IIS  QSAF   RLI+DNVLV FE +H ++++  GKEG +AI LDMSKA+DRVEW F+  V  +MGF  RW   +M+C+ SV++S+LINGV H    P R
Subjt:  IISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDR

Query:  GIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSCKI----------
        G+RQGDPLSP +FL+CAEG S L+N+   ++ + G  IN  CP ++HL FADDS++FC ++ EEC  ++ +   YE AS Q IN D   I          
Subjt:  GIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSCKI----------

Query:  -------------------------------------KDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINKMCASFWWGATG
                                             K+K+ + L GW  KL S GGKEILIKAV QAIPTY+MSCF LP+ LC D+ +M  +FWWG   
Subjt:  -------------------------------------KDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINKMCASFWWGATG

Query:  DKKKLHWARWKELCKSKDVGG
         + K+ W  WK +C SK  GG
Subjt:  DKKKLHWARWKELCKSKDVGG

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber] (e value: 8.1e-155; % identity: 45.38)
Query:  FEEGWLKFKYAKKIVQDSW-SGWEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKL-EAESHMMDEQDLDRAEMELNSLLEEEEYYWR
        FE  W + +  K I+Q  W S  E    + I  +L+   +NL +W+K  F GNI   I++K+E +  L  ++ +     +++    E+N LL+ EE  W+
Subjt:  FEEGWLKFKYAKKIVQDSW-SGWEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKL-EAESHMMDEQDLDRAEMELNSLLEEEEYYWR

Query:  SRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRRTLEQPYSKQEVEE
         RSR  WL  GD+NT++FHTKAS RR+RN I GI D NGNW++  E IA +A  YF+ I+ SS+P+R  I++V + +   ++++   +L Q ++++E+E 
Subjt:  SRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRRTLEQPYSKQEVEE

Query:  ATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISP
        A   +HP+KAPG DG+ A FFQ YW+IVG D V   L +LN+   +V  NKT ITL+PK+K P +M +FRPISLCNV YK+++K +ANRLK +L  IIS 
Subjt:  ATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISP

Query:  TQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQ
         QSAF+ GRLI+DNVLV FE +H + ++K+GKEG  AI LDMSKAYDRVEW F++ V  KMGF E+WIK +M C+ SV++S+L+NG  +    P RG+RQ
Subjt:  TQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQ

Query:  GDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDS-----------------
        GDP+SPYIFL+CA+GFS LLN       + G  I   CP ++HL FADDSL+FC ++ +ECQT+ D+ + YE AS Q IN+D                  
Subjt:  GDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDS-----------------

Query:  ------------------------------CKIKDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINKMCASFWWGATGDKKK
                                       ++K+++   L GW EKL S GG+EILIKAV QAIPTY+MSCF++PK+LC++I  M   FWWG  G + K
Subjt:  ------------------------------CKIKDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINKMCASFWWGATGDKKK

Query:  LHWARWKELCKSKDVGG
        + W  WK+LCK+K  GG
Subjt:  LHWARWKELCKSKDVGG

XP_030924668.1 uncharacterized protein LOC115951644 [Quercus lobata] (e value: 6.9e-146; % identity: 37.31)
Query:  METKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLWNSDIEI--------------------YW----FLWESGGCQKERFLELVRAVSGEGRYPLAS
        METK +    +++ +++ + + + VP   S GGL L W +D  +                     W    F  +     +E    L+R +S     P   
Subjt:  METKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLWNSDIEI--------------------YW----FLWESGGCQKERFLELVRAVSGEGRYPLAS

Query:  GGGDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMDAGPIVAN--------IVISSAKHSSR--NKGKILHFEEGWLKFKYAKKIVQDSWSGWEGRR
          GDFNEIL  DEK+G   R + QM+GF+D ++  RL D G    +        I++S     +R   KG+   FE  WL+ +  +++V DSW    G+ 
Subjt:  GGGDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMDAGPIVAN--------IVISSAKHSSR--NKGKILHFEEGWLKFKYAKKIVQDSWSGWEGRR

Query:  SKQ-----INHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAES--HMMDEQDLDRAEMELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTK
          Q      N K+     NL  W+K+ F G+++++++KK E++ K E E+  +  +   +     E+  L  +EE  W+ RSR  WLK GD+NT++FH +
Subjt:  SKQ-----INHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAES--HMMDEQDLDRAEMELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTK

Query:  ASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFF
        A+QR +RN I G+ D  G W E+E+ +  +   YF+ IF SS PS    + +   ++    +  R  +E  +   EV+EA  ++ P  APG DG+   F+
Subjt:  ASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFF

Query:  QAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFEC
        +++W IVGED     L+ LN        N T ITLIPK+K PK++ +FRPISLCNV YK++AK +ANRLKK L   +  +QSAF+ GRLISDN+L+ FE 
Subjt:  QAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFEC

Query:  IHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLN
        +H +  + KGK G +A+ LDMSKAYDRVEW F+  +   +G  ER  + I+ C++SV++S+L+NG P    KP RG+RQGDPLSPY+FL+CA G  GLL 
Subjt:  IHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLN

Query:  REVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSCKI-----------------------------------
        +     ++KG  I+ + P +SHL FADDS++FC ++  ECQ + D+   YE  + Q IN +   I                                   
Subjt:  REVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSCKI-----------------------------------

Query:  ------------KDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINKMCASFWWGATGDKKKLHWARWKELCKSKDVGG
                    K+++   +QGW EKL S  G+E+LIKAVIQAIPTY+MSCFKLPK L K++  +   FWWG     KK+HW  W+ LC++K+VGG
Subjt:  ------------KDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINKMCASFWWGATGDKKKLHWARWKELCKSKDVGG

XP_030969743.1 uncharacterized protein LOC115990020 [Quercus lobata] (e value: 1.6e-147; % identity: 41.4)
Query:  GDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMD---AGPI-------------------VAN--------------------------IVISSAKH
        GDFNE+L V +K GG  R+  QM+ F+D ++ C  +D   +GP                    VAN                          + + S   
Subjt:  GDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMD---AGPI-------------------VAN--------------------------IVISSAKH

Query:  SSRNKGKILHFEEGWLKFKYAKKIVQDSWSG-WEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAES-HMMDEQDLDRAEMELNS
          R + K   FE  W+     K  V ++W+G   G        K+K   K L +WSKE F GN+K  I+  +E++   E ES    D+  +D  + EL+ 
Subjt:  SSRNKGKILHFEEGWLKFKYAKKIVQDSWSG-WEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAES-HMMDEQDLDRAEMELNS

Query:  LLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRRTLE
        LLE+EE  W  RSR  WL+ GD+NT +FH  A+ R+++N I+G+ D NG W+ EE+  + + T++++ +FKSS P    I +V + V+  +++     L 
Subjt:  LLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRRTLE

Query:  QPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIANRL
        +PYS  EVE A K++ P KAPG DG+   F+Q YW  V  D  +  L  LN+ + +   N T ITLIPKVK P+++ EFRPISLCNV YKIV+KAIANRL
Subjt:  QPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIANRL

Query:  KKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHE
        K +L++IIS TQSAF+  RLI+DNVL+ FE +H + N   GK G +A+ LDMSKAYDRVEW F+  V  K+GF E W+  IM+C+ +VT+S+L+NG P  
Subjt:  KKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHE

Query:  EFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDS-------
           P RG+RQGDPLSPY+FL CAEG + +  +      ++GF I    P L+HL FADD L+FC SS EEC+ IK++   YE AS Q +N D        
Subjt:  EFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDS-------

Query:  ----------------------------------------CKIKDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINKMCASF
                                                 +IK++I   +QGW EKL S  GKEI+IKAV+Q+IPTYSMS FKLP  LCKDI  M   F
Subjt:  ----------------------------------------CKIKDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINKMCASF

Query:  WWGATGDKKKLHWARWKELCKSKDVGG
        WWG  G+ +KLHW  W  LC SK VGG
Subjt:  WWGATGDKKKLHWARWKELCKSKDVGG

TrEMBL top hits (e value, % identity, alignment)
A0A2N9G8I6 Reverse transcriptase domain-containing protein (e value: 1.8e-152; % identity: 37.82)
Query:  METKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLWNSDIEIY--------------------W----FLWESGGCQKERFLELVRAVSGEGRYPLAS
        METK D  + E ++ +L FD+ + VPS+G SGGL LLW +D E+                     W    F       ++     L++ +S     P   
Subjt:  METKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLWNSDIEIY--------------------W----FLWESGGCQKERFLELVRAVSGEGRYPLAS

Query:  GGGDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMDAG---------------------------------------------------PIVANIVI
          GDFNEIL+V+EK GG  R+  Q+  FQ+ +NIC  +D G                                                    +V +IV 
Subjt:  GGGDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMDAG---------------------------------------------------PIVANIVI

Query:  SSAKHSSRNKGKILHFEEGWLKFKYAKKIVQDSWSG--WEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESH-MMDEQDLDRA
         ++   SR K  +  FEE W      +K++Q+SW      G    Q+  K+      L  WS+E F+ N    +  K E ++ L  ++H       +   
Subjt:  SSAKHSSRNKGKILHFEEGWLKFKYAKKIVQDSWSG--WEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESH-MMDEQDLDRA

Query:  EMELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQ
          E+N LL ++E +WR RSREVWL +GDKNT +FH KA QRR +N ++G+ DSNG W EEE  + ++  +YF+DIF +S  S  ++      +   ++  
Subjt:  EMELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQ

Query:  QRRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAK
          R L   ++  E+++AT  +HPSKAPG DG+ + FFQ YW IVG+D V   L ++N+   +   N + + LIPK K P+ + ++RPISL NV YKI++K
Subjt:  QRRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAK

Query:  AIANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLI
        A+ANRLK VL  IIS +QSAFVPGR I+DN+ V FE +H +  R+KGK  HVA+ LDMSKAYDRVEW F+ +V  +MGF  RWI+ +M CV++ ++SVL+
Subjt:  AIANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLI

Query:  NGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSC
        NG P    KP RGIRQGDPLSPY+FL+CAEG S LL R    Q + G  +    P +SHL FADDSL+FC ++  EC  + +V   YE AS Q +N +  
Subjt:  NGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSC

Query:  -----------------------------------------------KIKDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDIN
                                                        +K++I   LQGW E+L S  G+ ILIK + QAIPTY+MSCFKLPK+ C DIN
Subjt:  -----------------------------------------------KIKDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDIN

Query:  KMCASFWWGATGDKKKLHWARWKELCKSKDVGG
         + +++WWG   ++ K+HW  W  LC  K+ GG
Subjt:  KMCASFWWGATGDKKKLHWARWKELCKSKDVGG

A0A2N9GII4 Uncharacterized protein (e value: 4.8e-153; % identity: 37.11)
Query:  METKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLWNSDIEI---------------------YWFLWESGGCQKERFLE---LVRAVSGEGRYPLAS
        METK    K E ++ +L F + + VPS+G S GL LLW  ++ +                     +  +   G  +++R  E   L+  ++     P   
Subjt:  METKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLWNSDIEI---------------------YWFLWESGGCQKERFLE---LVRAVSGEGRYPLAS

Query:  GGGDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMDAG-------------------------------------PIVANIVISSAKH---------
          GDFNEIL  +EK+G  LR   +M  F++V+N C+ +D G                                       V ++ IS + H         
Subjt:  GGGDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMDAG-------------------------------------PIVANIVISSAKH---------

Query:  ---SSRNKGKILHFEEGWLKFKYAKKIVQDSWSG--WEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMDEQDLDRAEME
            +RNK ++  FEE W      + +++  W     EG    ++  K+K     L QWSK+ F G+ +  IR + E ++ L  +    +   +   + E
Subjt:  ---SSRNKGKILHFEEGWLKFKYAKKIVQDSWSG--WEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMDEQDLDRAEME

Query:  LNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRR
        +NSLL  +E +W+ RSR  WLK GD NT++FH  A+QR++ N+IEG+ +  G W  E   + S++ +YFKDIF SS P R  I +  E V   ++ +  R
Subjt:  LNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRR

Query:  TLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIA
         L +P++ +EV +A   +HPSK+PG DG+   FFQ +W IVG + +   L +L+    +   N T I LIPKVK P+RM EFRPISLCNV +K+++K + 
Subjt:  TLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIA

Query:  NRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGV
        NRLK VL ++IS  Q+AFVPGRLI+DN+LV +E I+++ +++ G+ G +AI LDMSKAYDRVEW ++  +  KMGF  +WI  +M+CV+S ++S+L+NG 
Subjt:  NRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGV

Query:  PHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSCKI-
        PH    P RGIRQGDPLSPY+FL+CAEGF+ LL +    + L G  +    P +SHL FADDSL+FC +  EEC  + D+   YE +S Q IN D   I 
Subjt:  PHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSCKI-

Query:  ----------------------------------------------KDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINKMC
                                                      KD+I   +QGW+E+  S  G+E+LIKAV QAIPT++MSCF LPKS CKD++ + 
Subjt:  ----------------------------------------------KDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINKMC

Query:  ASFWWGATGDKKKLHWARWKELCKSKDVGG
        A+FWWG + +  K+HWA WK++C  K+ GG
Subjt:  ASFWWGATGDKKKLHWARWKELCKSKDVGG

A0A2N9IBI9 Reverse transcriptase domain-containing protein (e value: 9.0e-152; % identity: 37.7)
Query:  METKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLWNSDIEIY--------------------W----FLWESGGCQKERFLELVRAVSGEGRYPLAS
        METK D  + E ++ +L FD+ + VPS+G SGGL LLW +D E+                     W    F       ++     L++ +S     P   
Subjt:  METKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLWNSDIEIY--------------------W----FLWESGGCQKERFLELVRAVSGEGRYPLAS

Query:  GGGDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMDAG---------------------------------------------------PIVANIVI
          GDFNEIL+V+EK GG  R+  Q+  FQ+ +NIC  +D G                                                    +V +IV 
Subjt:  GGGDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMDAG---------------------------------------------------PIVANIVI

Query:  SSAKHSSRNKGKILHFEEGWLKFKYAKKIVQDSWSG--WEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESH-MMDEQDLDRA
         ++   SR K  +  FEE W      +K++Q+SW      G    Q+  K+      L  WS+E F+ N    +  K E ++ L  ++H       +   
Subjt:  SSAKHSSRNKGKILHFEEGWLKFKYAKKIVQDSWSG--WEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESH-MMDEQDLDRA

Query:  EMELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQ
          E+N LL ++E +WR RSREVWL +GDKNT +FH KA QRR +N ++G+ DSNG W EEE  + ++  +YF+DIF +S  S  ++      +   ++  
Subjt:  EMELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQ

Query:  QRRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAK
          R L   ++  E+++AT  +HPSKAPG DG+ + FFQ YW IVG+D V   L ++N+   +   N + + LIPK K P+ + ++RPISL NV YKI++K
Subjt:  QRRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAK

Query:  AIANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLI
         +ANRLK VL  IIS +QSAFVPGR I+DN+ V FE +H +  R+KGK  HVA+ LDMSKAYDRVEW F+  V  +MGF  RWI  +M CV++ ++SVL+
Subjt:  AIANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLI

Query:  NGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSC
        NG P    KP RGIRQGDPLSPY+FL+CAEG S LL R    Q + G  +    P +SHL FADDSL+FC +++ EC  + +V   YE AS Q +N +  
Subjt:  NGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSC

Query:  -----------------------------------------------KIKDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDIN
                                                        +K++I   LQGW E+L S  G+ ILIK + QAIPTY+MSCFKLPK+ C DIN
Subjt:  -----------------------------------------------KIKDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDIN

Query:  KMCASFWWGATGDKKKLHWARWKELCKSKDVGG
         + +++WWG   ++ K+HW  W  LC  K+ GG
Subjt:  KMCASFWWGATGDKKKLHWARWKELCKSKDVGG

A0A2N9IPS8 Reverse transcriptase domain-containing protein (e value: 1.1e-152; % identity: 36.9)
Query:  ETKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLWNSDIEIYWFLWESGGCQKERFLELVRAVSGEGRYPLASGG-----------------------
        ET+ D +  E+++  + FD  +CVP  G+ GGL +LW + +++    +           E+V    G+G       G                       
Subjt:  ETKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLWNSDIEIYWFLWESGGCQKERFLELVRAVSGEGRYPLASGG-----------------------

Query:  -----GDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMDA-------------------------------------GPIVANIVISSAKHSS----
             GDFNEIL  +E+ G G R + Q++ F++ +  C L D                                      G +V+++ + ++ H      
Subjt:  -----GDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMDA-------------------------------------GPIVANIVISSAKHSS----

Query:  -------RNKGKILHFEEGWLKFKYAKKIVQDSWSG--WEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMDEQDLDRAE
               + K K+  FE  W+K +  ++++  +W     EG     +  K+K+   +L  WS+ERF G++ S+I++K E++Q L  E+       +   +
Subjt:  -------RNKGKILHFEEGWLKFKYAKKIVQDSWSG--WEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMDEQDLDRAE

Query:  MELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQ
         +LN LLE+EE +WR RSR  W+  GDKNT++FH + ++RR+ N I G+ D +G W+ E+  IA +A +YF+ IF SS PS + I  V + +   +++  
Subjt:  MELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQ

Query:  RRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKA
           L+  ++K EV  A K ++P+KAPG DG+ A F+Q YWDIVG +  +  L IL++   +   N T I LIPKVK P+ + +FRPISLCNV YKIV+K 
Subjt:  RRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKA

Query:  IANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLIN
        +ANRLKKVL  +IS  QSAFVPGRLI+DNVLV FE +H+++ ++KGK+G +A+ LDMSKAYDRVEW F+ ++   MGF + WI+ +M C+ SV++SVLIN
Subjt:  IANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLIN

Query:  GVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSC-
        G     F   RGIRQGD LSPY+FL+CAEG S LL +    + L G   +   P L+HL FADDSL+FC ++   C+ +  + + YE+AS Q +N     
Subjt:  GVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSC-

Query:  ----------------------------------------------KIKDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINK
                                                      +IK ++   + GW EK  S  G+E+LIKAV Q+IPTYSMSCFKLP+SLC D+N 
Subjt:  ----------------------------------------------KIKDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINK

Query:  MCASFWWGATGDKKKLHWARWKELCKSKDVGG
        M ++FWWG     KK HW RW +LC SK  GG
Subjt:  MCASFWWGATGDKKKLHWARWKELCKSKDVGG

A0A7N2LIH6 Uncharacterized protein (e value: 2.8e-153; % identity: 38.15)
Query:  METKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLW------------NSDIEIYWFLWESGGCQK-----------ERFL--ELVRAVSGEGRYPLA
        +ETK    K +  + +L F  G  VPS G SGGL LLW            +S I++      SGG  +           +R+   +L+  ++ +   P  
Subjt:  METKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLW------------NSDIEIYWFLWESGGCQK-----------ERFL--ELVRAVSGEGRYPLA

Query:  SGGGDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMDAGPI-------------------------------------VANIVISSAKH--------
           GDFNEI+  DEK G   R+  QM  F++V++ C L+D G +                                     V ++ +S++ H        
Subjt:  SGGGDFNEILSVDEKKGGGLRNQNQMKGFQDVINICRLMDAGPI-------------------------------------VANIVISSAKH--------

Query:  ---SSRNKGKILHFEEGWLKFKYAKKIVQDSWSGWEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMD-EQDLDRAEMEL
           + R   K   FEE W + +  K+IV+ +W  +    +  +  +L+   K L QW++  F GN+   I++K+  +Q+LE+ + + +  +++   + E+
Subjt:  ---SSRNKGKILHFEEGWLKFKYAKKIVQDSWSGWEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMD-EQDLDRAEMEL

Query:  NSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVA--EYVRMEISDQQR
        N L   EE  W+ RSR  WL+ GDKN+++FH  ASQRR++NRI G+ D  G W E++E    +  +YFKDI+ S+ P+   ++  A  E V  E++D+  
Subjt:  NSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVA--EYVRMEISDQQR

Query:  RTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAI
          L++ +   EV +A + +HP+KAPG DG+   F+Q YWDIVG       LQ LN+       NKT I LIPK K P+++ EFRPISLCNV YKI++K +
Subjt:  RTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAI

Query:  ANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLING
        ANRLKKVL  +I   QSAFVPGR+I+DNV+V FE +H+IN R+KGKEG +AI LDMSKAYDRVEW ++ ++  KMGFG+RWI  IM CV SV+FSVLING
Subjt:  ANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLING

Query:  VPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSC--
         P   F P RG+RQGDP+SPY+FL+C EG S ++ ++     ++G       P +SHL FADDS++FC ++ +EC+ +  V + YE  S Q +N D    
Subjt:  VPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSC--

Query:  ---------------------------------------------KIKDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINKM
                                                     +IKD++   + GW  KL S  G+E+LIKAV QA PTY+M+ FKLP SLC ++N M
Subjt:  ---------------------------------------------KIKDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYSMSCFKLPKSLCKDINKM

Query:  CASFWWGATGDKKKLHWARWKELCKSKDVGG
          SFWWG  G +KK+ W  WK LCK K  GG
Subjt:  CASFWWGATGDKKKLHWARWKELCKSKDVGG

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 1.4e-32; % identity: 24.72)
Query:  LHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMDEQDLDRAEMELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWR
        L+ + +++ +  I +   + +E  ++ +  S     Q++ +   EL  +  ++     + SR  + +  +K          ++R++N+I+ I +  G+  
Subjt:  LHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMDEQDLDRAEMELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWR

Query:  EEEEDIASMATEYFKDIFKSSMPSRDQIAKVAE-YVRMEISDQQRRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILN
         +  +I +   EY+K ++ + + + +++    + Y    ++ ++  +L +P +  E+     +L   K+PG DG  A F+Q Y     E+ V   L++  
Subjt:  EEEEDIASMATEYFKDIFKSSMPSRDQIAKVAE-YVRMEISDQQRRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILN

Query:  N--EADIVP--FNKTLITLIPKV-KEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHV
        +  +  I+P  F +  I LIPK  ++  + + FRPISL N+  KI+ K +ANR+++ +  +I   Q  F+PG     N+      I  IN  K   + HV
Subjt:  N--EADIVP--FNKTLITLIPKV-KEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHV

Query:  AINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINN
         I++D  KA+D+++  F+     K+G    ++K I    +  T ++++NG   E F    G RQG PLSP +F +  E  +  + +E E   +KG Q+  
Subjt:  AINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINN

Query:  FCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINL
            LS   FADD +V+  +     Q +  +   +   S   IN+
Subjt:  FCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINL

P08548 LINE-1 reverse transcriptase homolog (e value: 1.9e-34; % identity: 26.01)
Query:  NHKLKVSIKNLHQWSKERFKGN---IKSAIRKKEEE--------IQKLEAESHMMDE----QDLDRAEMELNSLLEEEEYYWRSRSREVWLKSGDKNTEW
        N+    + +NL   +K   +G    +++ ++K E E        +++LE E H   +    +++ +   ELN +  +      ++S+  + +  +K  + 
Subjt:  NHKLKVSIKNLHQWSKERFKGN---IKSAIRKKEEE--------IQKLEAESHMMDE----QDLDRAEMELNSLLEEEEYYWRSRSREVWLKSGDKNTEW

Query:  FHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRM-EISDQQRRTLEQPYSKQEVEEATKNLHPSKAPGIDGV
              ++R ++ I  I + N     +  +I  +  EY+K ++     +  +I +  E   +  +S ++   L +P S  E+    +NL   K+PG DG 
Subjt:  FHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRM-EISDQQRRTLEQPYSKQEVEEATKNLHPSKAPGIDGV

Query:  HASFFQAYWDIVGEDTVRECLQILNN--EADIVP--FNKTLITLIPKV-KEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISPTQSAFVPGRLI
         + F+Q +     E+ V   L +  N  +  I+P  F +  ITLIPK  K+P R + +RPISL N+  KI+ K + NR+++ +  II   Q  F+PG   
Subjt:  HASFFQAYWDIVGEDTVRECLQILNN--EADIVP--FNKTLITLIPKV-KEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISPTQSAFVPGRLI

Query:  SDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQGDPLSPYIFLM
          N+      I  IN  K   + H+ +++D  KA+D ++  F+     K+G    ++K I       T ++++NGV  + F    G RQG PLSP +F +
Subjt:  SDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQGDPLSPYIFLM

Query:  CAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAIN
          E  +  +    E + +KG  I +    LS   FADD +V+  ++R+    + +V K Y   S   IN
Subjt:  CAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAIN

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 1.5e-31; % identity: 25.57)
Query:  SKERFKGNIKSAIRKKEEEIQKLEAESHMMD-EQDLDRAEMELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEE
        SK++ +    S++    + ++K EA S      Q++ +   E+N +         +++R  + +  +K  +         R +  I  I +  G+   + 
Subjt:  SKERFKGNIKSAIRKKEEEIQKLEAESHMMD-EQDLDRAEMELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEE

Query:  EDIASMATEYFKDIFKSSMPSRDQIAKVAE-YVRMEISDQQRRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEA
        E+I +    ++K ++ + + + D++ K  + Y   +++  Q   L  P S +E+E    +L   K+PG DG  A F+Q +     ++ +   L  L ++ 
Subjt:  EDIASMATEYFKDIFKSSMPSRDQIAKVAE-YVRMEISDQQRRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEA

Query:  DIV-----PFNKTLITLIPK-VKEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAI
        ++       F +  ITLIPK  K+P +++ FRPISL N+  KI+ K +ANR+++ +  II P Q  F+PG     N+      IH IN  K   + H+ I
Subjt:  DIV-----PFNKTLITLIPK-VKEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAI

Query:  NLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFC
        +LD  KA+D+++  F+  V  + G    ++  I         ++ +NG   E      G RQG PLSPY+F +  E  +  + ++ E   +KG QI    
Subjt:  NLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFC

Query:  PPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAIN
          +S L  ADD +V+ S  +   + + ++  ++       IN
Subjt:  PPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAIN

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 2.9e-38 | %identity: 26.98
Query:  KGKILHFEEGWLKFKYAKKIVQDSWSGWEGRRSK-----QINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMDEQDLDRAEMELNSL
        K    HF    L+ +   K V+D+W GW   + +     Q     KV +K L Q   +   G   + I     E+  LE      ++Q L    +E    
Subjt:  KGKILHFEEGWLKFKYAKKIVQDSWSGWEGRRSK-----QINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMDEQDLDRAEMELNSL

Query:  LEEEEYYWRS----RSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRR
        L   E         RSR   L   D+ + +F+    ++  R +I  +F  +G   E+ E I   A  +++++F     S D   ++ + + + +S++++ 
Subjt:  LEEEEYYWRS----RSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRR

Query:  TLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIA
         LE P +  E+ +A + +  +K+PG+DG+   FFQ +WD +G D  R   +        +   + +++L+PK  + + ++ +RP+SL +  YKIVAKAI+
Subjt:  TLEQPYSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIA

Query:  NRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGV
         RLK VL  +I P QS  VPGR I DNV +  + +H    R+ G      ++LD  KA+DRV+ +++        FG +++  +     S    V IN  
Subjt:  NRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGV

Query:  PHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAIN
                RG+RQG PLS  ++ +  E F  LL + +    LK   +         LS   D ++  +    + +  ++  + Y  ASS  IN
Subjt:  PHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAIN

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM | e value: 1.0e-14 | %identity: 28.82
Query:  IPKVKEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRN
        IPK    KR Q+FRPIS+ +V  + +   +A RL   ++    P Q  F+P    +DN  +    +   ++ K  +  ++A NLD+SKA+D +    + +
Subjt:  IPKVKEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRN

Query:  VFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSS
             G  + ++  +    E    S+  +G   EEF P RG++QGDPLSP +F +  +     L  E+      G ++ N     +  +FADD LV  + 
Subjt:  VFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSS

Query:  SREECQTIKDVFKAYEFASSQAINLDSCK
        +R   Q + D  K  +F S   + L++ K
Subjt:  SREECQTIKDVFKAYEFASSQAINLDSCK

Arabidopsis top hits (e value, %identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein | e value: 3.0e-14 | %identity: 25.1
Query:  GRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMDEQDLDRAE----MELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTK
        G     +   LK + K     +++ F GNI+   ++  + ++ ++++        L R E     + N      E ++R +SR  WL+ GD NT +FH  
Subjt:  GRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMDEQDLDRAE----MELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTK

Query:  ASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIF--KSSMPSRDQIAKVAEYVRMEISDQQRRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHAS
            + +N I+ +   +    E    +  M   Y+  +    S + + D + ++ +      +D     L    S +E+  A   +  +KAPG D   A 
Subjt:  ASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIF--KSSMPSRDQIAKVAEYVRMEISDQQRRTLEQPYSKQEVEEATKNLHPSKAPGIDGVHAS

Query:  FFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIV
        FF   W +V + T+    +       +  FN T ITLIPKV    ++  FRP+S C V YKI+
Subjt:  FFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIV

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | e value: 1.0e-14 | %identity: 36.36
Query:  IANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMK
        +  RLK ++  +I P Q++F+PGR+ +DN++   E +H++  RKKG +G + + LD+ KAYDR+ W+++ +     GF E W+  I +
Subjt:  IANRLKKVLDTIISPTQSAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMK

AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 2.9e-09 | %identity: 46.3
Query:  AIPTYSMSCFKLPKSLCKDINKMCASFWWGATGDKKKLHWARWKELCKSKDVGG
        A+PTY+M+CF LPK++CK I  + A FWW    + K +HW  W  L   K  GG
Subjt:  AIPTYSMSCFKLPKSLCKDINKMCASFWWGATGDKKKLHWARWKELCKSKDVGG

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 3.1e-11 | %identity: 46.3
Query:  AIPTYSMSCFKLPKSLCKDINKMCASFWWGATGDKKKLHWARWKELCKSKDVGG
        A+P Y+MSCF+L K LCK +      FWW +  +K+K+ W  W++LCKSK+  G
Subjt:  AIPTYSMSCFKLPKSLCKDINKMCASFWWGATGDKKKLHWARWKELCKSKDVGG

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | e value: 3.5e-15 | %identity: 50
Query:  LINGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDS
        +ING P     P RG+RQGDPLSPY+F++C E  SGL  R  E   L G +++N  P ++HL FADD+
Subjt:  LINGVPHEEFKPDRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDS


Sequences
CDS sequence
ATGGAAACAAAATGTGATGGTGTTAAAGGTGAGCAAGTTAAGAAAAGGCTTAGTTTTGATCATGGTTGGTGTGTGCCGAGCGTTGGGAGCAGTGGGGGATTAATGCTCCT
TTGGAATTCTGATATTGAGATTTACTGGTTTTTATGGGAATCCGGAGGTTGCCAAAAGGAAAGATTCTTGGAGCTTGTTAGAGCGGTTAGTGGAGAAGGTAGATATCCCC
TGGCTAGTGGGGGGGGGGACTTCAATGAAATCCTCTCGGTGGATGAAAAAAAAGGTGGTGGTTTGAGGAACCAAAACCAAATGAAAGGGTTCCAAGATGTGATCAATATC
TGTAGGCTTATGGATGCCGGGCCCATAGTCGCCAACATTGTCATTTCTTCGGCTAAGCATTCCAGTAGAAACAAAGGTAAAATTCTGCACTTCGAGGAAGGATGGCTGAA
ATTTAAGTATGCAAAGAAAATTGTGCAAGATAGCTGGAGTGGCTGGGAAGGCAGAAGATCAAAACAGATCAACCATAAACTCAAAGTGAGTATTAAAAATCTGCATCAAT
GGAGTAAAGAAAGGTTTAAAGGCAATATAAAATCAGCTATTAGGAAAAAAGAAGAGGAAATCCAAAAGCTTGAGGCCGAGAGTCATATGATGGATGAACAGGACCTTGAT
AGAGCTGAGATGGAATTGAATTCGCTCCTTGAAGAAGAGGAGTACTACTGGAGAAGCAGATCAAGAGAAGTGTGGCTTAAGAGCGGGGACAAAAACACTGAATGGTTCCA
CACTAAAGCCTCTCAAAGGAGAAAGCGGAATAGGATTGAGGGCATCTTTGATTCTAACGGCAACTGGAGGGAGGAGGAAGAGGATATTGCTAGCATGGCAACCGAATATT
TCAAAGATATTTTTAAGTCTTCCATGCCGAGTAGGGACCAGATCGCAAAGGTTGCAGAATATGTTAGAATGGAAATTTCAGATCAGCAAAGGAGAACTCTTGAGCAGCCA
TACAGTAAGCAAGAGGTGGAGGAGGCAACGAAAAATCTCCATCCTAGCAAGGCCCCAGGGATAGATGGAGTTCATGCTTCTTTTTTCCAAGCCTATTGGGATATTGTGGG
AGAAGATACAGTCCGGGAATGCCTCCAAATTCTCAACAATGAAGCCGACATTGTTCCCTTTAATAAAACTCTGATTACCCTCATCCCAAAAGTCAAAGAGCCAAAGAGAA
TGCAAGAATTCAGACCAATCAGCTTATGCAACGTTACTTATAAGATAGTTGCCAAAGCCATTGCTAACAGGTTAAAAAAGGTTCTTGATACAATCATCTCGCCAACTCAA
TCGGCTTTTGTTCCAGGGAGACTCATTTCTGATAATGTTCTAGTCGGCTTTGAATGCATTCACGCGATCAATAATAGGAAGAAAGGAAAAGAGGGGCATGTTGCCATCAA
CCTCGATATGAGCAAAGCGTACGACAGAGTTGAATGGGAGTTTGTTAGAAATGTATTCACAAAAATGGGGTTCGGTGAGAGGTGGATCAAGAACATCATGAAGTGCGTGG
AGTCCGTGACTTTCTCGGTCCTTATTAATGGAGTGCCGCATGAAGAGTTCAAGCCGGATCGTGGTATTCGTCAAGGGGATCCTTTATCCCCTTACATTTTCTTAATGTGT
GCTGAAGGATTCTCTGGTCTTCTAAACAGGGAAGTAGAATCTCAAAACTTAAAAGGCTTTCAGATTAACAATTTTTGTCCACCCTTATCCCACCTTTCTTTCGCTGATGA
TAGCCTTGTTTTTTGCAGCTCTTCTAGGGAGGAATGCCAAACAATCAAGGACGTGTTCAAGGCCTATGAGTTTGCATCGAGTCAAGCCATAAATCTAGATTCATGCAAGA
TAAAGGACAAGATAAGGAATATTCTCCAGGGATGGAGTGAGAAGCTTTTTTCTGCAGGCGGCAAGGAAATCCTTATTAAAGCTGTGATTCAGGCGATCCCCACCTATTCC
ATGAGTTGTTTTAAGCTCCCAAAAAGCTTATGTAAGGATATTAACAAAATGTGTGCCAGCTTTTGGTGGGGAGCGACAGGGGATAAGAAGAAGCTTCATTGGGCCAGATG
GAAGGAGCTTTGTAAAAGCAAGGATGTTGGGGGCTAG
mRNA sequence
ATGGAAACAAAATGTGATGGTGTTAAAGGTGAGCAAGTTAAGAAAAGGCTTAGTTTTGATCATGGTTGGTGTGTGCCGAGCGTTGGGAGCAGTGGGGGATTAATGCTCCT
TTGGAATTCTGATATTGAGATTTACTGGTTTTTATGGGAATCCGGAGGTTGCCAAAAGGAAAGATTCTTGGAGCTTGTTAGAGCGGTTAGTGGAGAAGGTAGATATCCCC
TGGCTAGTGGGGGGGGGGACTTCAATGAAATCCTCTCGGTGGATGAAAAAAAAGGTGGTGGTTTGAGGAACCAAAACCAAATGAAAGGGTTCCAAGATGTGATCAATATC
TGTAGGCTTATGGATGCCGGGCCCATAGTCGCCAACATTGTCATTTCTTCGGCTAAGCATTCCAGTAGAAACAAAGGTAAAATTCTGCACTTCGAGGAAGGATGGCTGAA
ATTTAAGTATGCAAAGAAAATTGTGCAAGATAGCTGGAGTGGCTGGGAAGGCAGAAGATCAAAACAGATCAACCATAAACTCAAAGTGAGTATTAAAAATCTGCATCAAT
GGAGTAAAGAAAGGTTTAAAGGCAATATAAAATCAGCTATTAGGAAAAAAGAAGAGGAAATCCAAAAGCTTGAGGCCGAGAGTCATATGATGGATGAACAGGACCTTGAT
AGAGCTGAGATGGAATTGAATTCGCTCCTTGAAGAAGAGGAGTACTACTGGAGAAGCAGATCAAGAGAAGTGTGGCTTAAGAGCGGGGACAAAAACACTGAATGGTTCCA
CACTAAAGCCTCTCAAAGGAGAAAGCGGAATAGGATTGAGGGCATCTTTGATTCTAACGGCAACTGGAGGGAGGAGGAAGAGGATATTGCTAGCATGGCAACCGAATATT
TCAAAGATATTTTTAAGTCTTCCATGCCGAGTAGGGACCAGATCGCAAAGGTTGCAGAATATGTTAGAATGGAAATTTCAGATCAGCAAAGGAGAACTCTTGAGCAGCCA
TACAGTAAGCAAGAGGTGGAGGAGGCAACGAAAAATCTCCATCCTAGCAAGGCCCCAGGGATAGATGGAGTTCATGCTTCTTTTTTCCAAGCCTATTGGGATATTGTGGG
AGAAGATACAGTCCGGGAATGCCTCCAAATTCTCAACAATGAAGCCGACATTGTTCCCTTTAATAAAACTCTGATTACCCTCATCCCAAAAGTCAAAGAGCCAAAGAGAA
TGCAAGAATTCAGACCAATCAGCTTATGCAACGTTACTTATAAGATAGTTGCCAAAGCCATTGCTAACAGGTTAAAAAAGGTTCTTGATACAATCATCTCGCCAACTCAA
TCGGCTTTTGTTCCAGGGAGACTCATTTCTGATAATGTTCTAGTCGGCTTTGAATGCATTCACGCGATCAATAATAGGAAGAAAGGAAAAGAGGGGCATGTTGCCATCAA
CCTCGATATGAGCAAAGCGTACGACAGAGTTGAATGGGAGTTTGTTAGAAATGTATTCACAAAAATGGGGTTCGGTGAGAGGTGGATCAAGAACATCATGAAGTGCGTGG
AGTCCGTGACTTTCTCGGTCCTTATTAATGGAGTGCCGCATGAAGAGTTCAAGCCGGATCGTGGTATTCGTCAAGGGGATCCTTTATCCCCTTACATTTTCTTAATGTGT
GCTGAAGGATTCTCTGGTCTTCTAAACAGGGAAGTAGAATCTCAAAACTTAAAAGGCTTTCAGATTAACAATTTTTGTCCACCCTTATCCCACCTTTCTTTCGCTGATGA
TAGCCTTGTTTTTTGCAGCTCTTCTAGGGAGGAATGCCAAACAATCAAGGACGTGTTCAAGGCCTATGAGTTTGCATCGAGTCAAGCCATAAATCTAGATTCATGCAAGA
TAAAGGACAAGATAAGGAATATTCTCCAGGGATGGAGTGAGAAGCTTTTTTCTGCAGGCGGCAAGGAAATCCTTATTAAAGCTGTGATTCAGGCGATCCCCACCTATTCC
ATGAGTTGTTTTAAGCTCCCAAAAAGCTTATGTAAGGATATTAACAAAATGTGTGCCAGCTTTTGGTGGGGAGCGACAGGGGATAAGAAGAAGCTTCATTGGGCCAGATG
GAAGGAGCTTTGTAAAAGCAAGGATGTTGGGGGCTAG
Protein sequence
METKCDGVKGEQVKKRLSFDHGWCVPSVGSSGGLMLLWNSDIEIYWFLWESGGCQKERFLELVRAVSGEGRYPLASGGGDFNEILSVDEKKGGGLRNQNQMKGFQDVINI
CRLMDAGPIVANIVISSAKHSSRNKGKILHFEEGWLKFKYAKKIVQDSWSGWEGRRSKQINHKLKVSIKNLHQWSKERFKGNIKSAIRKKEEEIQKLEAESHMMDEQDLD
RAEMELNSLLEEEEYYWRSRSREVWLKSGDKNTEWFHTKASQRRKRNRIEGIFDSNGNWREEEEDIASMATEYFKDIFKSSMPSRDQIAKVAEYVRMEISDQQRRTLEQP
YSKQEVEEATKNLHPSKAPGIDGVHASFFQAYWDIVGEDTVRECLQILNNEADIVPFNKTLITLIPKVKEPKRMQEFRPISLCNVTYKIVAKAIANRLKKVLDTIISPTQ
SAFVPGRLISDNVLVGFECIHAINNRKKGKEGHVAINLDMSKAYDRVEWEFVRNVFTKMGFGERWIKNIMKCVESVTFSVLINGVPHEEFKPDRGIRQGDPLSPYIFLMC
AEGFSGLLNREVESQNLKGFQINNFCPPLSHLSFADDSLVFCSSSREECQTIKDVFKAYEFASSQAINLDSCKIKDKIRNILQGWSEKLFSAGGKEILIKAVIQAIPTYS
MSCFKLPKSLCKDINKMCASFWWGATGDKKKLHWARWKELCKSKDVGG