; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038403 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038403
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:16784491..16786464
RNA-Seq ExpressionLag0038403
SyntenyLag0038403
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PNY14301.1 ribonuclease H [Trifolium pratense]5.2e-10935.61Show/hide
Query:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVWVGFERRSRRSI--VEKMGECMTKLDQWS
        LDR L N  ++ R S  +V HL    SDH +LL +  E P   G +  +R  RFEE+WTK   C + + + W        + +  ++ +G C  + +   
Subjt:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVWVGFERRSRRSI--VEKMGECMTKLDQWS

Query:  RRMYNGSIRGAISTKEKEIQRLWNQGVQGNESEMLRKEKDLEA----LLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRKI---------------
             G I+  I   EK +Q   +Q +   E   + + K+LE     LL+  E  W+QR+   WLK GD NT++ H KAS R K+               
Subjt:  RRMYNGSIRGAISTKEKEIQRLWNQGVQGNESEMLRKEKDLEA----LLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRKI---------------

Query:  ------------FKDILAAT------------PKSLTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGN
                    F  + A++               L+ EH       FT +++   +  MHP K PG D + A F+QKYW ++G DI +  L +LN  G 
Subjt:  ------------FKDILAAT------------PKSLTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGN

Query:  LNCINKTYILLIPK-----------------------------------------------------------------------GKKGEVALKLDMNKA
         + INKT+++LIPK                                                                       GKKG +ALKLDM+KA
Subjt:  LNCINKTYILLIPK-----------------------------------------------------------------------GKKGEVALKLDMNKA

Query:  YDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLL
        YDR+EW ++  ++  MG+S +++ELIMRC+ SVS+Q+L+NG P   F P RGLRQGDPLSPYLF++CA+ LSGLL+ + V K L G+++ +  P L+HL 
Subjt:  YDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLL

Query:  YVDDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVWKALQGWK
        + DDSL+F +A+    ++I  IL+TY+NASGQ+ NLDKS    S N        I  ++ VK  ++  +YLG P    RSKK IF+ + DR+WK ++GWK
Subjt:  YVDDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVWKALQGWK

Query:  GRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHWRS
         R  S AG+E LIK VAQAIPNY +SC+K  +  C ++++M A+FWWG+ D + KIHW S
Subjt:  GRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHWRS

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]3.2e-11143.85Show/hide
Query:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVWVGFERRSRRSIVEKMGECMTKLDQWSRR
        LDRFLIN +ML +C   +V HL +++SDHR +LA W  + P       +R  RFEE+W + D CR+I+   W         +   K+  C+++L++W++ 
Subjt:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVWVGFERRSRRSIVEKMGECMTKLDQWSRR

Query:  MYNGSIRGAISTKEKEIQRLWNQGVQGNESEMLRKEKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRKIFKDILAATPKSLTEEHNWKLL
          N S++GAI+ KEKE++RL                + L+                                                    +  N  L 
Subjt:  MYNGSIRGAISTKEKEIQRLWNQGVQGNESEMLRKEKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRKIFKDILAATPKSLTEEHNWKLL

Query:  ETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLNCINKTYILLIP----------------------KGKKGEVALK
        + FTREEI   +K MHP+K PG D +QA FFQK+W V+   I K     L    + + I+ T    +P                       GK GEVA+K
Subjt:  ETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLNCINKTYILLIP----------------------KGKKGEVALK

Query:  LDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCP
        LDM+KAYDRVEW Y+  IM KMGF++RW++LIM CVESV F VL+NG P  EF+PNRGLRQGDPLSPYLF++CAEGLS L+N  E K ++T L+IN+ CP
Subjt:  LDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCP

Query:  SLTHLLYVDDSLMFFKASEKNCNSIKIILETYENA-SGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVW
         ++HL Y DD L+FFKAS  NC SIK ILE+YE A SGQ  NLDKS F+ S NT E +   I   L V HT+S+GQYLGLPSQ  R+K+++FN+IKDRVW
Subjt:  SLTHLLYVDDSLMFFKASEKNCNSIKIILETYENA-SGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVW

Query:  KALQGWKGRLFSAAGREILI
        KALQGWKG+LFS  GRE+L+
Subjt:  KALQGWKGRLFSAAGREILI

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]5.0e-11237.42Show/hide
Query:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVW-VGFERRSRRSIVEKMGECMTKLDQWSR
        LDR L  P  +      +V HL    SDH  LL     D         RR  +FE  WT+ +DC++I+  VW    E  S R I  ++  C   L +W++
Subjt:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVW-VGFERRSRRSIVEKMGECMTKLDQWSR

Query:  RMYNGSIRGAISTKEKEIQRLWN---QGVQGNESEMLRKEKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRK------------------
         ++ G+I   I  K++ +  L +    G  G E  MLRKE  +  LL+ +EI W+QR+   WL  GD NTK+ H KAS RR+                  
Subjt:  RMYNGSIRGAISTKEKEIQRLWN---QGVQGNESEMLRKEKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRK------------------

Query:  ---------------------IFKDILAATPKSLTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLN
                                ++L A P ++TEE N  L++ FTREEI   +  MHPTK PG D + A FFQKYW+++G DI    L++LN   ++ 
Subjt:  ---------------------IFKDILAATPKSLTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLN

Query:  CINKTYILLIPK-----------------------------------------------------------------------GKKGEVALKLDMNKAYD
         INKT I L+PK                                                                       GK+G  A+KLDM+KAYD
Subjt:  CINKTYILLIPK-----------------------------------------------------------------------GKKGEVALKLDMNKAYD

Query:  RVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLLYV
        RVEW +I ++M KMGF ++WI+L+M C+ SVS+ +L+NG      +P RGLRQGDP+SPY+FL+CA+G S LLN    K  ++G+ I + CP +THL + 
Subjt:  RVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLLYV

Query:  DDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSM--GQYLGLPSQNARSKKEIFNHIKDRVWKALQGWK
        DDSL+F KA+ + C ++  IL+ YE+ASGQ  N+DKS+   S NT ++  ++ EV+  + H       +YLGLPS   +SK EIF  +K+RV + L GWK
Subjt:  DDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSM--GQYLGLPSQNARSKKEIFNHIKDRVWKALQGWK

Query:  GRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHWRS
         +L S  GREILIK VAQAIP Y MSCF+   +LC E+ +M  RFWWG     +KI W S
Subjt:  GRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHWRS

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]4.7e-11036.21Show/hide
Query:  VFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVW-VGFERRSRRSIVEKMGECMTKLDQWSRRMYNGSIRGAISTKEKEI
        V HL+   SDH +L       P   G    +R   FE  WTK +DC +++ + W  G    +   IV  +  C + L  W++ +  G+I   I  K + +
Subjt:  VFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVW-VGFERRSRRSIVEKMGECMTKLDQWSRRMYNGSIRGAISTKEKEI

Query:  QRLWNQGVQGNESEMLRK-EKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRKI-------------------------------------
          +     QGN    + +  K+L  LL+ +EI W+QR+   W + GD NTK+ H +AS RRK                                      
Subjt:  QRLWNQGVQGNESEMLRK-EKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRKI-------------------------------------

Query:  --FKDILAATPKSLTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLNCINKTYILLIPK--------
            +++ A P+ +T+E N +L +TFT EE+   +K +HPTK PG D + A FF  YWD++G  I    L +LN    +  INKT I LIPK        
Subjt:  --FKDILAATPKSLTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLNCINKTYILLIPK--------

Query:  ---------------------------------------------------------------GKKGEVALKLDMNKAYDRVEWIYIWKIMAKMGFSDRW
                                                                       GK+  +++KLDM+KA+DRVEW +I  +M K+GF ++W
Subjt:  ---------------------------------------------------------------GKKGEVALKLDMNKAYDRVEWIYIWKIMAKMGFSDRW

Query:  IELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLLYVDDSLMFFKASEKNCNSIKII
        I LIM CV SVS+ VL+NG      +P+RG+RQGDPLSP LFL+CAEGLS L++ +   + + G+ I + CP +THL + DDSL+F KA E+ C+++  I
Subjt:  IELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLLYVDDSLMFFKASEKNCNSIKII

Query:  LETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVWKALQGWKGRLFSAAGREILIKYVAQAIPN
        L  YE ASGQ  N DKS+   SPNT+++L   I  +L         +YLGLPS   +SK ++F  +KDRV K L GWKG+L S  GREILIK VAQA+P 
Subjt:  LETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVWKALQGWKGRLFSAAGREILIKYVAQAIPN

Query:  YAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHWRS
        Y MSCF+   +LC +L S+   FWWG +D  NKI W S
Subjt:  YAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHWRS

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]3.0e-10936.83Show/hide
Query:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVWVGFE--RRSRRSIVEKMGECMTKLDQWS
        LDR + N     R  + RV HLS  ASDH  LL   +       +    R  +FEE+W   D+C  ++   W   +  R    ++ EK+  C  +L  W 
Subjt:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVWVGFE--RRSRRSIVEKMGECMTKLDQWS

Query:  RRMYNGSIRGAISTKEKEIQRL-WNQGVQGNESEMLRKEKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRK-------------------
          + +    GAI   +K++ RL   +  + +++E L   K ++ LL+  EIYW QR+  +WL+ GD NTK+ H KAS RR+                   
Subjt:  RRMYNGSIRGAISTKEKEIQRL-WNQGVQGNESEMLRKEKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRK-------------------

Query:  --------------------IFKDILAATPKSLTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLNC
                              ++ L A    +TE+    L   FT EE+   +  M PTK PG D + A F+QK+W ++G  +    L+ LN    L  
Subjt:  --------------------IFKDILAATPKSLTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLNC

Query:  INKTYILLIP-----------------------------------------------------------------------KGKKGEVALKLDMNKAYDR
        IN T I+LIP                                                                       KGKKG+VALKLD++KAYDR
Subjt:  INKTYILLIP-----------------------------------------------------------------------KGKKGEVALKLDMNKAYDR

Query:  VEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLLYVD
        VEW ++  IM KMGF   WIE +M CV + SF +L+NG P     P+RG+RQGDP+SPYLFL+CAEGL+ LLN +E+   +TG+ I +  P +T+L++ D
Subjt:  VEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLLYVD

Query:  DSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVWKALQGWKGRL
        DSL+F +A+     +I  IL+ YE ASGQ  NL+KS+   S NT+E    +I  +L VK  D   +YLGLP+   R+K   F+ +KDRVWK LQGWKG L
Subjt:  DSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVWKALQGWKGRL

Query:  FSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHWRS
         S AG+EILIK VAQAIP Y MS F+  + LC+EL ++CARFWWG   +  KIHW+S
Subjt:  FSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHWRS

TrEMBL top hitse value%identityAlignment
A0A2N9F9E4 Reverse transcriptase domain-containing protein7.8e-11138.18Show/hide
Query:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVWVGFERRSRR-SIVEKMGECMTKLDQWSR
        LDR +     L R  +  V H+ ++ SDH+ +   W +       +  +RP RFEE W     C E++   W   +  +R  ++ +K+ EC  +L +WSR
Subjt:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVWVGFERRSRR-SIVEKMGECMTKLDQWSR

Query:  RMYNGSIRGAISTKEKEIQRLWNQGVQGNESEMLRK-EKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRK--------------------
          + G+I+  I   +++I +  N  +Q    + +    ++L  LLE +E  W+QR+   WL  GD NTK+ H KAS RR+                    
Subjt:  RMYNGSIRGAISTKEKEIQRLWNQGVQGNESEMLRK-EKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRK--------------------

Query:  -------------------IFKDILAATPKSLTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLNCI
                              + +A  PK +T + N  L+  F  EE+   +K M P+K PG D +   F+QKYW V+G D+    L  LN    L  I
Subjt:  -------------------IFKDILAATPKSLTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLNCI

Query:  NKTYILLIPK--------------------GKKGEVALKLDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQ
        N T+I LIPK                    G++G +ALKLDM+KAYDRVEW Y+  +M KMGF  +WI L++ C+ SVS+ VL+NG P     P+RGLRQ
Subjt:  NKTYILLIPK--------------------GKKGEVALKLDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQ

Query:  GDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLLYVDDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKI
        GDPLSPYLFL+CAEGL  L+  +E   D+ G+ + +  P +THL + DDSL+F KA+   C+ I  ILE YE ASGQ  N DK+    S +        I
Subjt:  GDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLLYVDDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKI

Query:  EVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVWKALQGWKGRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNK
        +  L V       +YLGLPS   R+K E F  IK+RVW  L+GWK +L S AGRE+LIK VAQAIP Y+MSCF+  + LC +L +M  RFWW     + K
Subjt:  EVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVWKALQGWKGRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNK

Query:  IHWRS
        I+W S
Subjt:  IHWRS

A0A2N9I2P8 Reverse transcriptase domain-containing protein6.6e-11035.26Show/hide
Query:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPR--RFEEAWTKYDDCREIVMSVWVGFERRSRR--SIVEKMGECMTKLDQ
        LDR L + T +       V+HL +  SDH  LL     D P +G  +++R +  RFE  WTK + CR ++   W    R   R   + EK+ +C   L  
Subjt:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPR--RFEEAWTKYDDCREIVMSVWVGFERRSRR--SIVEKMGECMTKLDQ

Query:  WSRRMYNGSIRGAISTKEKEIQRLWNQGVQGNESEMLRKEKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRK------------------
        WS+  + GS+  +I  K +++Q   N  + G  S ++  + +L  LLE +EI+W+QR+   W+  GD NTK+ H   + RR+                  
Subjt:  WSRRMYNGSIRGAISTKEKEIQRLWNQGVQGNESEMLRKEKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRK------------------

Query:  ---------IFKDILAATPKS--------------LTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGN
                  F++I  ++                 +T + N  LL  FT EE++  ++ M+PTK PG D + A F+Q YW+V+G ++ +  L +++    
Subjt:  ---------IFKDILAATPKS--------------LTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGN

Query:  LNCINKTYILLIPK-----------------------------------------------------------------------GKKGEVALKLDMNKA
        L+ IN T+I L+PK                                                                       G+KG++ALKLDM+KA
Subjt:  LNCINKTYILLIPK-----------------------------------------------------------------------GKKGEVALKLDMNKA

Query:  YDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLL
        YDRVEW ++  IM ++GF++ WI LIM C++SVS+ VL+NG     F+ +RG+RQGD LSPYLFL+CAEGLS LL  +E +  +TG+  ++  P LTHL 
Subjt:  YDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLL

Query:  YVDDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVWKALQGWK
        + DDSL+F +A+  +C ++  IL+ YE ASGQ  N  K++   + NT   + ++I+ + QV    S  +YLGLPS   RSK   F  +K RVW+ + GWK
Subjt:  YVDDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVWKALQGWK

Query:  GRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHW
         +  S+AGREIL+K VAQ+IP Y MSCFK   SLCN+LNSM + FWWG  D   K HW
Subjt:  GRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHW

A0A6J1DUG8 uncharacterized protein LOC1110241351.6e-11143.85Show/hide
Query:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVWVGFERRSRRSIVEKMGECMTKLDQWSRR
        LDRFLIN +ML +C   +V HL +++SDHR +LA W  + P       +R  RFEE+W + D CR+I+   W         +   K+  C+++L++W++ 
Subjt:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVWVGFERRSRRSIVEKMGECMTKLDQWSRR

Query:  MYNGSIRGAISTKEKEIQRLWNQGVQGNESEMLRKEKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRKIFKDILAATPKSLTEEHNWKLL
          N S++GAI+ KEKE++RL                + L+                                                    +  N  L 
Subjt:  MYNGSIRGAISTKEKEIQRLWNQGVQGNESEMLRKEKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRKIFKDILAATPKSLTEEHNWKLL

Query:  ETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLNCINKTYILLIP----------------------KGKKGEVALK
        + FTREEI   +K MHP+K PG D +QA FFQK+W V+   I K     L    + + I+ T    +P                       GK GEVA+K
Subjt:  ETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLNCINKTYILLIP----------------------KGKKGEVALK

Query:  LDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCP
        LDM+KAYDRVEW Y+  IM KMGF++RW++LIM CVESV F VL+NG P  EF+PNRGLRQGDPLSPYLF++CAEGLS L+N  E K ++T L+IN+ CP
Subjt:  LDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCP

Query:  SLTHLLYVDDSLMFFKASEKNCNSIKIILETYENA-SGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVW
         ++HL Y DD L+FFKAS  NC SIK ILE+YE A SGQ  NLDKS F+ S NT E +   I   L V HT+S+GQYLGLPSQ  R+K+++FN+IKDRVW
Subjt:  SLTHLLYVDDSLMFFKASEKNCNSIKIILETYENA-SGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVW

Query:  KALQGWKGRLFSAAGREILI
        KALQGWKG+LFS  GRE+L+
Subjt:  KALQGWKGRLFSAAGREILI

A0A7N2L6Z9 Reverse transcriptase domain-containing protein1.9e-11238.67Show/hide
Query:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPR----RFEEAWTKYDDCREIVMSVWVG-FERRSRRSIVEKMGECMTKLD
        LDR L     +      +V HL    SDH  L          +  +I++RPR     FE  WTK +DCR I+ SVW    +  +   +   +  C ++L 
Subjt:  LDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPR----RFEEAWTKYDDCREIVMSVWVG-FERRSRRSIVEKMGECMTKLD

Query:  QWSRRMYNGSIRGAISTKEKEIQRLWNQ---GVQGNESEMLRKEKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRK--------------
         W+       I   I  K K + +L  Q   G  G E  ++R+E  L  LL+D+EI+W QR+   WLK GD NTK+ H +AS RRK              
Subjt:  QWSRRMYNGSIRGAISTKEKEIQRLWNQ---GVQGNESEMLRKEKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRK--------------

Query:  -------------IFKDI------------LAATPKSLTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGE
                      F+DI             AA P  +TEE N +L   FTREEI   +K +HPTK PG D + A FFQKYWD++G ++    L +LN  
Subjt:  -------------IFKDI------------LAATPKSLTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGE

Query:  GNLNCINKTYILLIPK-----------------------------------------------------------------------GKKGEVALKLDMN
         +L+ INKT I+LIPK                                                                       GK   +A KLDM+
Subjt:  GNLNCINKTYILLIPK-----------------------------------------------------------------------GKKGEVALKLDMN

Query:  KAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTH
        KA+DRVEW +I ++M KMGF++ WI LIMRC+ SVS+ V++NG       P RGLRQGDPLSPYLFL+CAEGLS LL+ +   + L G+ + + CP +TH
Subjt:  KAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTH

Query:  LLYVDDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVWKALQG
        L + DDSL+F KA+ + C  +K ILE YE ASGQ  N DKS+   SPNT  +L   I  +L         +YLGLPS   RSKK +F  IK+RV   L G
Subjt:  LLYVDDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVWKALQG

Query:  WKGRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHWRS
        WKG+L S+ G+EILIK VAQAIP Y MSCF    SLC+EL  M   FWWG ++  +K+ W S
Subjt:  WKGRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHWRS

A0A803PYN5 Uncharacterized protein4.6e-11138.59Show/hide
Query:  HLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVWVGFERRSRRSIVEKMGECMTKLDQWSRRMYNGSIRGAISTKEKEIQRL
        HL   +S+HR +         +   +  +   RFE+ W    D   I+ + W           +  +  C   L QW  R + G++R  IS  ++E+ RL
Subjt:  HLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVWVGFERRSRRSIVEKMGECMTKLDQWSRRMYNGSIRGAISTKEKEIQRL

Query:  WNQGVQGNESEMLRK--EKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSR--RKIFK------DILAATPKSLTEEHNWKLLETFTREEIYG
         N  V+ + + +  K  E  L+ LLE +E YW Q +  DWL+ GD NTK+ H  ASSR  + + K      DIL A P ++T E N  L   FT  E+Y 
Subjt:  WNQGVQGNESEMLRK--EKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSR--RKIFK------DILAATPKSLTEEHNWKLLETFTREEIYG

Query:  VVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLNCINKTYILLIPK-----------------------------------------
         +++M P K PG D + A F+Q YW+++G  +    L +LN   ++  INK+ I LIPK                                         
Subjt:  VVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLNCINKTYILLIPK-----------------------------------------

Query:  ---------GKKGEVALKLDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLN
                 G+ G  ALKLDM+KA+DRVEW Y+  IM KMGF   W  LIM+C+ + SF   LNG      +P+RGLRQGDPLSPYLFLIC+EGLS LL 
Subjt:  ---------GKKGEVALKLDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLN

Query:  TSEVKKDLTGLRINKFCPSLTHLLYVDDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQ
          E    L GL + +  PS++HLL+ DDSL+F + +E++  SIK  L+TY  ASGQ+ N DKS    SPNT +D        L +  T+   +YLGLPS 
Subjt:  TSEVKKDLTGLRINKFCPSLTHLLYVDDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLGLPSQ

Query:  NARSKKEIFNHIKDRVWKALQGWKGRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHWR
        + R K+E+F++IK++VWK L  W  ++FS  G+E+L+K V Q+IP YAMSCFK     C++L SM A FWWG+  +  KIHW+
Subjt:  NARSKKEIFNHIKDRVWKALQGWKGRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHWR

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657505.0e-0631.76Show/hide
Query:  LPSQNARSKKEIFNHIKDRVWKALQGWKGRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIH
        +P    R  K+ F  I +RV   + GW+ +  S AGR  L K V  ++P ++MS      S+ N L+ +   F WG+   + K H
Subjt:  LPSQNARSKKEIFNHIKDRVWKALQGWKGRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIH

P11369 LINE-1 retrotransposable element ORF2 protein8.0e-1222.22Show/hide
Query:  KGEVALKLDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGL
        K  + + LD  KA+D+++  ++ K++ + G    ++ +I          + +NG          G RQG PLSPYLF I  E L+  +     +K++ G+
Subjt:  KGEVALKLDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGL

Query:  RINKFCPSLTHLLYVDDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLG--LPSQNARSKKEIF
        +I K    +   L  DD +++    + +   +  ++ ++    G   N +KS        N+   ++I          +  +YLG  L  +      + F
Subjt:  RINKFCPSLTHLLYVDDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKIEVVLQVKHTDSMGQYLG--LPSQNARSKKEIF

Query:  NHIKDRVWKALQGWKGRLFSAAGREILIK--YVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSR
          +K  + + L+ WK    S  GR  ++K   + +AI  +     K      NEL     +F W  +  R
Subjt:  NHIKDRVWKALQGWKGRLFSAAGREILIK--YVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSR

P14381 Transposon TX1 uncharacterized 149 kDa protein5.5e-0532.95Show/hide
Query:  LTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELL-NGEGNLNCINKTYILLIPKGKKGEVAL
        ++E    +L    T +E+   ++ M   K PGLD L   FFQ +WD +G D ++   E    GE  L+C  +  + L+P  KKG++ L
Subjt:  LTEEHNWKLLETFTREEIYGVVKNMHPTKGPGLDELQANFFQKYWDVIGQDIYKFCLELL-NGEGNLNCINKTYILLIPKGKKGEVAL

P92555 Uncharacterized mitochondrial protein AtMg012501.0e-1450Show/hide
Query:  LLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLLYVDDS
        ++NGAP+   +P+RGLRQGDPLSPYLF++C E LSGL   ++ +  L G+R++   P + HLL+ DD+
Subjt:  LLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLLYVDDS

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)3.8e-0630.08Show/hide
Query:  KGKKGEVALKLDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDL
        KGK   V + LD+ KA+D V    I + M   G  D   + IM  +      +++ G    +     G++QGDPLSP LF I  + L   LN  +     
Subjt:  KGKKGEVALKLDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDL

Query:  TGLRINKFCPSLTHLLYVDDSLM
         G  +   C  +  L + DD L+
Subjt:  TGLRINKFCPSLTHLLYVDDSLM

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases8.8e-0634.78Show/hide
Query:  KGKKGEVALKLDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRC---VESVSFQV-LLNGAPRAEFSPNR-GLRQGDPLSPYL--FLICAE
        KG KG + LKLD+ KAYDR+ W Y+   +   GF + W+  I R       V+ +V   + + R   S +R G R  D  +P+    + CAE
Subjt:  KGKKGEVALKLDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRC---VESVSFQV-LLNGAPRAEFSPNR-GLRQGDPLSPYL--FLICAE

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.7e-0545Show/hide
Query:  AIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHW
        A+P YAMSCF+    LC +L S    FWW + +++ KI W
Subjt:  AIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)7.2e-1650Show/hide
Query:  LLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLLYVDDS
        ++NGAP+   +P+RGLRQGDPLSPYLF++C E LSGL   ++ +  L G+R++   P + HLL+ DD+
Subjt:  LLNGAPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLLYVDDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAGATCGATTCCTTATTAATCCGACTATGCTGATTAGATGCAGTGTATTTCGAGTGTTTCACTTATCCATGATAGCATCAGATCATAGGCTACTTCTGGCAGAGTG
GAAAGAAGATCCACCGGACACTGGGTACAAGATTCTTAGACGTCCTAGGCGTTTTGAGGAGGCCTGGACAAAGTATGATGATTGCAGGGAGATTGTTATGAGCGTTTGGG
TTGGTTTTGAGAGGAGGAGCAGGCGGTCGATAGTTGAAAAAATGGGGGAGTGTATGACTAAGTTAGACCAGTGGAGTAGAAGAATGTATAATGGGTCGATAAGGGGGGCT
ATATCCACAAAGGAAAAAGAAATTCAGAGGCTTTGGAATCAAGGGGTCCAGGGTAATGAATCTGAGATGTTGAGAAAAGAGAAAGATCTGGAGGCGCTTCTTGAGGATGA
TGAAATATACTGGAAACAGAGGGCAATGGAGGATTGGCTTAAATGGGGTGACATGAATACTAAATGGTCACATATGAAGGCTAGCAGTAGAAGGAAAATTTTTAAGGATA
TTTTGGCAGCAACTCCTAAAAGCCTGACAGAAGAGCATAATTGGAAACTCTTGGAAACATTCACAAGGGAGGAGATCTATGGGGTGGTTAAAAACATGCACCCAACAAAG
GGCCCTGGTCTAGATGAGTTGCAAGCCAATTTCTTTCAGAAGTATTGGGATGTGATTGGACAGGATATCTATAAGTTTTGCTTAGAGCTTCTGAATGGAGAGGGGAATCT
GAATTGTATTAACAAAACCTATATTCTGTTGATTCCAAAGGGGAAGAAAGGGGAAGTGGCGCTCAAACTGGATATGAACAAAGCTTATGATCGGGTGGAATGGATCTATA
TTTGGAAGATTATGGCTAAAATGGGGTTCAGTGACCGTTGGATAGAGCTCATTATGCGTTGTGTGGAATCGGTCAGTTTTCAGGTTCTTTTGAATGGAGCCCCTAGAGCT
GAATTCTCGCCAAATAGAGGGTTGAGGCAAGGCGATCCGTTATCTCCGTACTTATTCCTAATATGTGCAGAAGGTCTATCTGGGCTCCTTAACACCTCAGAGGTTAAAAA
AGATCTGACAGGTTTGCGTATTAATAAGTTTTGTCCTAGTTTAACTCATTTATTATATGTTGATGATAGTCTCATGTTCTTTAAGGCTTCTGAAAAAAATTGCAATTCTA
TTAAAATTATCCTTGAGACTTACGAAAATGCTTCTGGCCAAATTAAAAATCTTGATAAATCTAACTTCATGACTAGCCCGAATACTAATGAGGATCTAGCTAGAAAGATC
GAGGTTGTTTTGCAGGTGAAACATACAGATAGTATGGGACAATACTTAGGGCTTCCATCACAAAATGCACGAAGCAAAAAGGAGATTTTCAACCACATCAAAGACCGGGT
TTGGAAAGCGCTTCAAGGATGGAAAGGGAGATTATTCTCAGCTGCTGGAAGGGAGATCCTTATTAAATATGTGGCTCAGGCCATCCCTAATTATGCGATGAGTTGTTTTA
AATTTCTGGTGTCTTTGTGTAACGAGTTAAACTCTATGTGTGCTAGGTTCTGGTGGGGCGCAGAGGACTCAAGAAATAAGATACACTGGAGGAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTAGATCGATTCCTTATTAATCCGACTATGCTGATTAGATGCAGTGTATTTCGAGTGTTTCACTTATCCATGATAGCATCAGATCATAGGCTACTTCTGGCAGAGTG
GAAAGAAGATCCACCGGACACTGGGTACAAGATTCTTAGACGTCCTAGGCGTTTTGAGGAGGCCTGGACAAAGTATGATGATTGCAGGGAGATTGTTATGAGCGTTTGGG
TTGGTTTTGAGAGGAGGAGCAGGCGGTCGATAGTTGAAAAAATGGGGGAGTGTATGACTAAGTTAGACCAGTGGAGTAGAAGAATGTATAATGGGTCGATAAGGGGGGCT
ATATCCACAAAGGAAAAAGAAATTCAGAGGCTTTGGAATCAAGGGGTCCAGGGTAATGAATCTGAGATGTTGAGAAAAGAGAAAGATCTGGAGGCGCTTCTTGAGGATGA
TGAAATATACTGGAAACAGAGGGCAATGGAGGATTGGCTTAAATGGGGTGACATGAATACTAAATGGTCACATATGAAGGCTAGCAGTAGAAGGAAAATTTTTAAGGATA
TTTTGGCAGCAACTCCTAAAAGCCTGACAGAAGAGCATAATTGGAAACTCTTGGAAACATTCACAAGGGAGGAGATCTATGGGGTGGTTAAAAACATGCACCCAACAAAG
GGCCCTGGTCTAGATGAGTTGCAAGCCAATTTCTTTCAGAAGTATTGGGATGTGATTGGACAGGATATCTATAAGTTTTGCTTAGAGCTTCTGAATGGAGAGGGGAATCT
GAATTGTATTAACAAAACCTATATTCTGTTGATTCCAAAGGGGAAGAAAGGGGAAGTGGCGCTCAAACTGGATATGAACAAAGCTTATGATCGGGTGGAATGGATCTATA
TTTGGAAGATTATGGCTAAAATGGGGTTCAGTGACCGTTGGATAGAGCTCATTATGCGTTGTGTGGAATCGGTCAGTTTTCAGGTTCTTTTGAATGGAGCCCCTAGAGCT
GAATTCTCGCCAAATAGAGGGTTGAGGCAAGGCGATCCGTTATCTCCGTACTTATTCCTAATATGTGCAGAAGGTCTATCTGGGCTCCTTAACACCTCAGAGGTTAAAAA
AGATCTGACAGGTTTGCGTATTAATAAGTTTTGTCCTAGTTTAACTCATTTATTATATGTTGATGATAGTCTCATGTTCTTTAAGGCTTCTGAAAAAAATTGCAATTCTA
TTAAAATTATCCTTGAGACTTACGAAAATGCTTCTGGCCAAATTAAAAATCTTGATAAATCTAACTTCATGACTAGCCCGAATACTAATGAGGATCTAGCTAGAAAGATC
GAGGTTGTTTTGCAGGTGAAACATACAGATAGTATGGGACAATACTTAGGGCTTCCATCACAAAATGCACGAAGCAAAAAGGAGATTTTCAACCACATCAAAGACCGGGT
TTGGAAAGCGCTTCAAGGATGGAAAGGGAGATTATTCTCAGCTGCTGGAAGGGAGATCCTTATTAAATATGTGGCTCAGGCCATCCCTAATTATGCGATGAGTTGTTTTA
AATTTCTGGTGTCTTTGTGTAACGAGTTAAACTCTATGTGTGCTAGGTTCTGGTGGGGCGCAGAGGACTCAAGAAATAAGATACACTGGAGGAGTTAG
Protein sequenceShow/hide protein sequence
MLDRFLINPTMLIRCSVFRVFHLSMIASDHRLLLAEWKEDPPDTGYKILRRPRRFEEAWTKYDDCREIVMSVWVGFERRSRRSIVEKMGECMTKLDQWSRRMYNGSIRGA
ISTKEKEIQRLWNQGVQGNESEMLRKEKDLEALLEDDEIYWKQRAMEDWLKWGDMNTKWSHMKASSRRKIFKDILAATPKSLTEEHNWKLLETFTREEIYGVVKNMHPTK
GPGLDELQANFFQKYWDVIGQDIYKFCLELLNGEGNLNCINKTYILLIPKGKKGEVALKLDMNKAYDRVEWIYIWKIMAKMGFSDRWIELIMRCVESVSFQVLLNGAPRA
EFSPNRGLRQGDPLSPYLFLICAEGLSGLLNTSEVKKDLTGLRINKFCPSLTHLLYVDDSLMFFKASEKNCNSIKIILETYENASGQIKNLDKSNFMTSPNTNEDLARKI
EVVLQVKHTDSMGQYLGLPSQNARSKKEIFNHIKDRVWKALQGWKGRLFSAAGREILIKYVAQAIPNYAMSCFKFLVSLCNELNSMCARFWWGAEDSRNKIHWRS