; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016446 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016446
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr12:37863853..37869031
RNA-Seq ExpressionLag0016446
SyntenyLag0016446
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017250619.1 PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus]6.2e-20138.97Show/hide
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        MN++L   +T  E+  A+ +  P K+PGPDG  A+F+Q++W+IVG  +    L  LN E  +E  N T + LIPK ++P+ V D+RPISLCNV YK I K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL
        V+ NRLK +L  II+  QSAF+PGR I+DN ++++E LH L+  R GK  + A+KLDMSKAYDRVEWI++ +++ KLGF  +W++ +MKC+T+ ++S  +
Subjt:  VIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL

Query:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNS-MAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKS
        NG+ +G + P+RG+RQGDPLSPYLFL+C+EG SALL  A N S + G+ IAR  P ISHLFFADDSL+FLKA        ++I   Y + SGQ +NF KS
Subjt:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNS-MAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKS

Query:  TVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIM
         +FFS N   D +      L+M  +E++ +YLGLP    + K   F+ I ++VW  L  W+   FSQGGKE+L+K++IQA+PTY M CF+IP+G   +I 
Subjt:  TVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIM

Query:  SLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLK
         L AR+WWGS   KR +HW+ W ++  PK  GGL FR  + +NQA+LAKQAWR+L  P   LS+VL  KYF   S L        S  W+  VWG  LL 
Subjt:  SLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLK

Query:  QGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPIS-LTAPDRWIWHYNNTGEYSVKSGYK
        +G+R+ +GNGQ+   FKDPWL RP +F  L     S   +KV E+      W+   ++Q  +  D+++I  +P+S     D W WHYN+ G Y+VKSGYK
Subjt:  QGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPIS-LTAPDRWIWHYNNTGEYSVKSGYK

Query:  LSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDM
        L        S S+  +  +WWK  W  +IP ++ IF W+ +H  +P+   L    + +H  CP+C    ++  HA+F C  AQE+W +      + + + 
Subjt:  LSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDM

Query:  LEIKD--RWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGSLVQSEDEVIQILAEGEEVIMHTDATFIDA--
        +  KD   +++   + D +   + +  W IW +RN +++ +   TP     W+  Y  E           +   D    I A   E +      F+DA  
Subjt:  LEIKD--RWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGSLVQSEDEVIQILAEGEEVIMHTDATFIDA--

Query:  ---TTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNFH
           T + G+G  +       +A     + G  S L AEA+A++ GLQ A+        VL+D  +L++++N + +  + +   + D   L+  F  V   
Subjt:  ---TTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNFH

Query:  FVNRRYNRFAHNMASEGYSAPPCLWLGAFPAWME
         V+R  N  AHN+A +       L L    +W+E
Subjt:  FVNRRYNRFAHNMASEGYSAPPCLWLGAFPAWME

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]3.8e-19840.77Show/hide
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        +N+ LLAPYT+EEI  A+R   PTKA GPDGFPALFYQ YW +VG K +  CL  LN+   I+ WN T I LIPK +QPR +SD+RPISLCNVSYKII+K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL
         I NRLK V+  +I + QSAF+P R+ISDN+I+ HE LH +   + G +G AALKLD+SKA+DRVEW YL  IM K+GF++ WI+ +++CI+T  FSI L
Subjt:  VIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL

Query:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNS--MAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAK
        NG   G  +P+RGIRQGDPLSPYLFLLC+EGLSA L++  NNS  + G+        I+HL FADDSL+FL++   E    R +L  Y +ASGQC+NF+K
Subjt:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNS--MAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAK

Query:  STVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKI
        S + FS N+  + +QYL  IL++ +    G+YLGLPS F R                                                           
Subjt:  STVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKI

Query:  MSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLL
                   + E R +HW +W  +C PKE GGLNFRDL  FNQA++AK  WR L +P+L +SKVL  KYF   S+L  S ++ SS+FWKGF+WG  LL
Subjt:  MSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLL

Query:  KQGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPI-SLTAPDRWIWHYNNTGEYSVKSGY
         +G+R  +GNG TI  F DPWLPRP TFK L + N    +  VA F      WD   +      ED ++I  +PI S    D W+WHY+  G YSV+SGY
Subjt:  KQGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPI-SLTAPDRWIWHYNNTGEYSVKSGY

Query:  KLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNI-WMDQW
        KL M      + ++       W  +WKL +P ++KIF+W+S H  IP+  NL    +     C +C +  E+  HA F C RA++IWR   P +  +   
Subjt:  KLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNI-WMDQW

Query:  DMLEIKDRWLSFAAQ---PDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGS--LVQSEDEVIQILAEGEEVI--MHTDA
        D +   + W S   Q    D  LA+I    W IWNDRNS+++ + +     +CEW+  +L  + +      S     +   V+Q       V   ++TDA
Subjt:  DMLEIKDRWLSFAAQ---PDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGS--LVQSEDEVIQILAEGEEVI--MHTDA

Query:  TFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVN
            A+T    G ++R     L AA    +    SPL AE   IL GL+ A       L V SD L  I+ I  ++  +      + +I+AL   F  ++
Subjt:  TFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVN

Query:  FHFVNRRYNRFAHNMASEGYSAPPC--LWLGAFPAWM
        F   +R+ NR AH +A  G ++P     WL  FP W+
Subjt:  FHFVNRRYNRFAHNMASEGYSAPPC--LWLGAFPAWM

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]3.9e-19539.25Show/hide
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        MN+ L+  +T+EE+  A++  HP KAPGPDG  A+F+QKYW IVG+ +    L +LN  + I + N TNI LIPK   P+ ++D+RPISLCNV YK+I+K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL
        ++ANRLK +L  II E QSAF   R I+DN++++ E +H+L  K  GK G+ A+KLDMSKA+DRVEW ++ ++ME++GF +RW  LVM+CIT+ S+SIL+
Subjt:  VIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL

Query:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVD-ARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKS
        NG A G I P+RG+RQGDPLSP LFLLC+EGLSAL+   ARN  + G+SI R CPK++HLFFADDS++F KA  EE    RSIL  YE+ASGQ +N  KS
Subjt:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVD-ARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKS

Query:  TVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIM
        ++FFS N   +T+  + +IL    +     YLGLPS   R K++ F  + ++V   L GWK +  S GGKE+LIK++ QAIPTY M CF +P+G+   + 
Subjt:  TVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIM

Query:  SLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLK
         +   FWWG + ++  + W  W+ +C  K  GGL FR+L +FN AMLAKQAWR+L NP+  + +VL  +YFP   +L+    +S S+ W+     +++++
Subjt:  SLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLK

Query:  QGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIK---VAEFFNPSVQWDEAK-LQQFLIKEDVEMIKGLPISLTAP-DRWIWHYNNTGEYSVK
        +G R  +GNG+ I +++D WLP P T+KV+S   P + N +   V+   +P  +W + + L+   +  +VE I  +P+S   P D+ IW  N  GE+SVK
Subjt:  QGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIK---VAEFFNPSVQWDEAK-LQQFLIKEDVEMIKGLPISLTAP-DRWIWHYNNTGEYSVK

Query:  SGYKL--SMLRPMGPSMSARGLDSR-WWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNI
        S Y +  S++ P      + G   R  WKK+W L +P ++KIF W++  + +P+  N+ +  +    TCP+C    E  +HAL  C  A  +W  +    
Subjt:  SGYKL--SMLRPMGPSMSARGLDSR-WWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNI

Query:  WMDQWDMLEIKDRWLSFA-AQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGSLVQSEDEVIQILAEGEEVI-MHTDA
           Q       D  L    ++   +L    V +WAIW +RN I++N    +P        + L ++ K   L   ++      I+  A    +  ++ D 
Subjt:  WMDQWDMLEIKDRWLSFA-AQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGSLVQSEDEVIQILAEGEEVI-MHTDA

Query:  TFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVN
           D      IG+++R   G++ AA +  + G  +    EA+A+  G+ LA  L++ R+ +  D L +I+++N +  G + +   L  I ++  +FE   
Subjt:  TFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVN

Query:  FHFVNRRYNRFAHNMAS-EGYSAPPCLWLG
        F  VNR +N  AH +A         C+W G
Subjt:  FHFVNRRYNRFAHNMAS-EGYSAPPCLWLG

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]7.4e-20240.77Show/hide
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        MN  L+  +TREEI TA+   HPTKAPGPDG  A+F+QKYW+IVG+ +V   L +LNS +S+ + N TNI L+PK + P  +SD+RPISLCNV YK+I+K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL
        V+ANRLK +L +II E QSAF+ GR I+DN++++ E +H+L+ K++GK G+AA+KLDMSKAYDRVEW +++Q+MEK+GFH++WI+LVM CIT+ S+SIL+
Subjt:  VIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL

Query:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVD-ARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKS
        NG A+G I PTRG+RQGDP+SPY+FLLC++G S+LL D AR   ++GVSI R CPKI+HLFFADDSL+F KA ++E      IL  YE ASGQ +N  KS
Subjt:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVD-ARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKS

Query:  TVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIM
        +VFFS N P + +  +  +L          YLGLPS   + K   F  + +RV   L GWK +  S GG+E+LIK++ QAIPTY M CF+IPK +  +I 
Subjt:  TVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIM

Query:  SLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLK
        ++  RFWWG + ++  + W  W++LCK K+ GG+ FR+L +FN AMLAKQ WR+++NP+  ++++   +Y+P   V       S S+ W+    G+++++
Subjt:  SLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLK

Query:  QGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFN-PSVQWDEAKLQQFLIKEDVEMIKGLPISLTAP-DRWIWHYNNTGEYSVKSGY
        +G R  +GNG+ I +++D WLP P T+KV+S   P     +V+   +    +W +  ++   +  +   I  +P+S   P D+ IW  N  GE+SVKS Y
Subjt:  QGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFN-PSVQWDEAKLQQFLIKEDVEMIKGLPISLTAP-DRWIWHYNNTGEYSVKSGY

Query:  KLS--MLRPMGPSMSARGLDSR--WWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPN---
         ++  ++  +    S+ G DSR   W+K+W L IP +V+IF WK   N++P+ +NL R  V +   CP C  E E+  H   +C  A+ +WR +  N   
Subjt:  KLS--MLRPMGPSMSARGLDSR--WWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPN---

Query:  IWMDQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGSLVQSEDEVIQILAEGEEVIMHTDAT
        +     D+++I  + L F    D  L    V AWAIW +RN I++      P     +   Y+ E+   +     ++   D        G   I + D  
Subjt:  IWMDQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGSLVQSEDEVIQILAEGEEVIMHTDAT

Query:  FIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNF
          +      +G+++R   G + AA    + G +S    EA+A+  GL LAK  K+ ++ + SD L ++  +N   +    +      I +LL++F     
Subjt:  FIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNF

Query:  HFVNRRYNRFAHNMA
        + V R YN+ AH +A
Subjt:  HFVNRRYNRFAHNMA

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa]1.5e-19438.86Show/hide
Query:  NQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKV
        N  LL  +TR ++  A+++    K+PG DG  A+FYQ YW IVGD +    L +LN   S   +N T + LIPK ++P+ + D+RPISLCNV YKII+K+
Subjt:  NQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKV

Query:  IANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILLN
        +A RLK VL  +I E QSAF+  R I+DN++++ E +H LK +++G  G+AALK DMSKA+DRVEW ++  +M K+GF+ RWI L+M C+ T  FS  +N
Subjt:  IANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILLN

Query:  GEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALL-VDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKST
        GE  G + P RG+RQGDPLSPYLFL+CSEGLS LL  + +   + G++++R  P ISHLFFADDSL+F +A     G  +  L  Y +ASGQ +N  KS 
Subjt:  GEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALL-VDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKST

Query:  VFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMS
        + FS N P   +     IL M + E   +YLGLP+   R K++ F  I +++W ++  W  + FS GGKEVL+K+++Q+IPTYAM CFR+P  +  +I +
Subjt:  VFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMS

Query:  LCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLKQ
        + A+FWWGS ++ + +HWK+W  LCK K  GG+ FR  V FNQA+LAKQAWR+  +P   LS+VL G YF +   +       SS  W+G VWG +LL +
Subjt:  LCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLKQ

Query:  GIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPIS-LTAPDRWIWHYNNTGEYSVKSGYKL
        G+R  +G G  I    D W+P    FK          +  VA++   + +W+   LQ      DV+ I  +P+S L   DRWIWHY ++G+YSV SGY L
Subjt:  GIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPIS-LTAPDRWIWHYNNTGEYSVKSGYKL

Query:  SMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDML
        +         S       WWK  WKL +P++VKIF WK   +SIP   +L    +    TC +C    E+  HALF C  A+E+W+    +I     D L
Subjt:  SMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDML

Query:  EIKDRWLSFAA-QPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPL---GGSLVQSEDEVIQILAEGEEVI-MHTDATFIDA
        +  D  +  ++    S   +I    W IW+DRN+ ++ + + TP+        Y+ +Y           S   S+  V       E    ++ DA    +
Subjt:  EIKDRWLSFAA-QPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPL---GGSLVQSEDEVIQILAEGEEVI-MHTDATFIDA

Query:  TTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNFHFVN
         +K GIG+++R   G +KAA      G       EA A+  GL  AK  ++    V +DCL L+ ++NGD+   S     + D++  L+ F       + 
Subjt:  TTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNFHFVN

Query:  RRYNRFAHNMASEGYSAP-PCLWLGAFPA
        R  N+ AH +A         C+WL   P+
Subjt:  RRYNRFAHNMASEGYSAP-PCLWLGAFPA

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248741.8e-19840.77Show/hide
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        +N+ LLAPYT+EEI  A+R   PTKA GPDGFPALFYQ YW +VG K +  CL  LN+   I+ WN T I LIPK +QPR +SD+RPISLCNVSYKII+K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL
         I NRLK V+  +I + QSAF+P R+ISDN+I+ HE LH +   + G +G AALKLD+SKA+DRVEW YL  IM K+GF++ WI+ +++CI+T  FSI L
Subjt:  VIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL

Query:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNS--MAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAK
        NG   G  +P+RGIRQGDPLSPYLFLLC+EGLSA L++  NNS  + G+        I+HL FADDSL+FL++   E    R +L  Y +ASGQC+NF+K
Subjt:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNS--MAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAK

Query:  STVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKI
        S + FS N+  + +QYL  IL++ +    G+YLGLPS F R                                                           
Subjt:  STVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKI

Query:  MSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLL
                   + E R +HW +W  +C PKE GGLNFRDL  FNQA++AK  WR L +P+L +SKVL  KYF   S+L  S ++ SS+FWKGF+WG  LL
Subjt:  MSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLL

Query:  KQGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPI-SLTAPDRWIWHYNNTGEYSVKSGY
         +G+R  +GNG TI  F DPWLPRP TFK L + N    +  VA F      WD   +      ED ++I  +PI S    D W+WHY+  G YSV+SGY
Subjt:  KQGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPI-SLTAPDRWIWHYNNTGEYSVKSGY

Query:  KLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNI-WMDQW
        KL M      + ++       W  +WKL +P ++KIF+W+S H  IP+  NL    +     C +C +  E+  HA F C RA++IWR   P +  +   
Subjt:  KLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNI-WMDQW

Query:  DMLEIKDRWLSFAAQ---PDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGS--LVQSEDEVIQILAEGEEVI--MHTDA
        D +   + W S   Q    D  LA+I    W IWNDRNS+++ + +     +CEW+  +L  + +      S     +   V+Q       V   ++TDA
Subjt:  DMLEIKDRWLSFAAQ---PDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGS--LVQSEDEVIQILAEGEEVI--MHTDA

Query:  TFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVN
            A+T    G ++R     L AA    +    SPL AE   IL GL+ A       L V SD L  I+ I  ++  +      + +I+AL   F  ++
Subjt:  TFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVN

Query:  FHFVNRRYNRFAHNMASEGYSAPPC--LWLGAFPAWM
        F   +R+ NR AH +A  G ++P     WL  FP W+
Subjt:  FHFVNRRYNRFAHNMASEGYSAPPC--LWLGAFPAWM

A0A7N2L6Z9 Reverse transcriptase domain-containing protein1.1e-20040.51Show/hide
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        MN  L   +TREEI+TA++  HPTK+PGPDG  A+F+QKYWDIVG  +    L +LN  +S++  N TNIVLIPK   P+ ++D+RPISLCNV YK+I+K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL
         +ANRLK  L  II E QSAF   R I+DN+++++E +H+LK K+ GK  + A KLDMSKA+DRVEW ++ ++M K+GF++ WI L+M+CI++ S+S+++
Subjt:  VIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL

Query:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVD-ARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKS
        NGE FG I PTRG+RQGDPLSPYLFLLC+EGLSALL D ARN  + G+S+ R CP+I+HLFFADDSL+F KA  EE    + IL  YE ASGQ VN  KS
Subjt:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVD-ARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKS

Query:  TVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIM
        ++FFS N   + K+ + +IL          YLGLPS   R K   F  I +RV   L GWK +  S GGKE+LIK++ QAIPTY M CF +PK +  ++ 
Subjt:  TVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIM

Query:  SLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLK
         +   FWWG + ++  V W  W ++CKPK +GGL FR+L +FN A+LAKQAWR+LTNP    +++L  KYFP   VL+ S  ++ S+ W+     +++LK
Subjt:  SLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLK

Query:  QGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEA-KLQQFLIKEDVEMIKGLPISLTAP-DRWIWHYNNTGEYSVKSGY
        +G R  +GNG+ I ++ D WLP P T+KV++    +     V+   +P  +W +   ++   +  D E I  +P+S   P DR IW  N  GE+SVKS Y
Subjt:  QGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEA-KLQQFLIKEDVEMIKGLPISLTAP-DRWIWHYNNTGEYSVKSGY

Query:  KLSM-LRPMGPSMSARGLDS--RWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMD
         +++ L      +     D     WK +WKL +P +VKIF W++  N +P+M N+    +  ++ CPVC EE+E  +H L  C  A  +W ++Q      
Subjt:  KLSM-LRPMGPSMSARGLDS--RWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMD

Query:  QWDMLEIKDRWLSFAA-QPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGSLVQSEDEVIQILAEGEEVIMHTDATFID
          +  +IK   L F A  P   L      +WAIW++RN  +++    +P+   E  Q  + +Y     L    +QS          G   +    A  ID
Subjt:  QWDMLEIKDRWLSFAA-QPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGSLVQSEDEVIQILAEGEEVIMHTDATFID

Query:  ATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSIN-GDLKGQSSISTTLWDIEALL---AAFEIVN
             G+G+V+R + G++ AA    +         E  AI  GL LA+ + + ++ + SD L+ I +IN  + +G++        +E +L   A F   +
Subjt:  ATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSIN-GDLKGQSSISTTLWDIEALL---AAFEIVN

Query:  FHFVNRRYNRFAHNMASEGYS-APPCLWLGAFPAWMERMTLME
        F ++ R YN  AH +A    S     +W G  P ++  + + +
Subjt:  FHFVNRRYNRFAHNMASEGYS-APPCLWLGAFPAWMERMTLME

A0A803PV25 Uncharacterized protein5.5e-19539.3Show/hide
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        MN  LLA +  EE+I A++  +PTKAPG DG PALFYQK+W  +   +V   L +LN+   ++  N T + LIPK  +P+ + ++RPISLCNV YKI++K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL
         +ANR++  L  ++ + QSAF+ GR I DN I+ +E+LH ++K R       ALKLDM+KAYDRVEW +L  +M KLG+   W+  +M C+T+  FS ++
Subjt:  VIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL

Query:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDA-RNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKS
        NGE  G + P RG+RQGDPLSP+LFLLC+E  S L+  A +   + GV   RQ   +SHLFFADDSLVFL A  +E   FR +L  Y  ASGQ VNF KS
Subjt:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDA-RNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKS

Query:  TVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIM
         + F  ++    + +L+  + + + ++ G YLGLPS   R K + F+FI ++VW  L+GWK  FFS  GKEVLIK+I+QAIPTY M CFR+PK  +  I 
Subjt:  TVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIM

Query:  SLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLK
        S+ ARFWWGS  +   +HW +W  LCK KE GGL FRDL  FNQA+LAKQ WR +  P+   SKVL   Y+P + VL       +SF W+  VWG K+++
Subjt:  SLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLK

Query:  QGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPIS-LTAPDRWIWHYNNTGEYSVKSGYK
         G R  +GNG ++ +  DPWLPRP TFK+  +  P   N+ V +    + +WDE  ++      D E+I  +  S     D+ +WHY+  GEYSV+SGY+
Subjt:  QGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPIS-LTAPDRWIWHYNNTGEYSVKSGYK

Query:  LSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHE-ELETTDHALFRCTRAQEIWRI--FQPNI-WMD
        ++    +    S     +RWW+++WKL+IP +VK FVWK  H+ IP+   L    V +   C  C     E   HAL+ C    ++W+I  F   I    
Subjt:  LSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHE-ELETTDHALFRCTRAQEIWRI--FQPNI-WMD

Query:  QWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGSLVQSEDEVIQILAEGEEVIMHTDATFIDA
        + D+L    R  S  A+ D       V  W +W  RNS+ +    P      EW   +L E+  +N      V  +    +  A     +  +    +DA
Subjt:  QWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGSLVQSEDEVIQILAEGEEVIMHTDATFIDA

Query:  TTKCGIGI-----VLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVN
          K G G+     V+R   G +K A    +    SPL AE  AI  G++     K+    V +DCL  +  +  D  G   +   +  I  LL    +  
Subjt:  TTKCGIGI-----VLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVN

Query:  FHFVNRRYNRFAHNMASEG-YSAPPCLWLGAFPAWMERMTLMES
          FV R  N+FAH +A+E   +    +W+G  P    +  L++S
Subjt:  FHFVNRRYNRFAHNMASEG-YSAPPCLWLGAFPAWMERMTLMES

A0A803Q8J4 Uncharacterized protein1.5e-19538.88Show/hide
Query:  NQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKV
        N  L   +T  E+  A++  +   +PG DG  ALFYQ  WDIVGD +    L +LN   S E  N T I LIPK ++P+ + D+RPISLCNV YK+I+K 
Subjt:  NQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKV

Query:  IANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILLN
        +  R K +L  +I E QSAF+P R I+DN++++ E +H LK K +G+ G++ALKLDMSKA+DRVEW ++  +M K+GF  RW+ L+M C+ T   S ++N
Subjt:  IANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILLN

Query:  GEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALL-VDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKST
        G   G ++P RG+RQGDPLSPYLFL+CSEGLS LL  +     + G++I+R  P ISHL FADDSL+F +A     G  + +L  Y KASGQ +N  KS 
Subjt:  GEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALL-VDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKST

Query:  VFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMS
        + FS N     K    +IL M + +   SYLGLP+   R K   F  I +R+W +L  W  + FS GGKEVL+K++IQ+IPTYAM CF++P     +I S
Subjt:  VFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMS

Query:  LCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLKQ
        + + +WWG+  +K+ +HWK+W+ LC  K  GGL FR+ + FNQA+LAKQAWR+  N +  L +VL G+YFPR   L  +   +SS  W+G  WG +LLK+
Subjt:  LCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLKQ

Query:  GIRKNLGNGQTIFMFKDPWLPRPFTF-KVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPISLTA-PDRWIWHYNNTGEYSVKSGYK
        GIRK +GNG +I    DPW+P    F  +    NP   N  VA++  P  +W+ +KL       DV  I  LP+S  A PD WIWH    GEY VKSGY 
Subjt:  GIRKNLGNGQTIFMFKDPWLPRPFTF-KVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPISLTA-PDRWIWHYNNTGEYSVKSGYK

Query:  LSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDM
         +       + S    ++ WWK  W+L++P +VKIF WK+ HN++P    L +       +C +C    E+  HALF C  A+ +W++            
Subjt:  LSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDM

Query:  LEIKDRWLSFAAQ-PDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGSLVQSEDEVIQILAEGEE-----VIMHTDATFI
        + I+D     +     S L +I    W+IW+DRN++++ +    P       Q++L  Y  T  L      S      I           + ++ DA F 
Subjt:  LEIKDRWLSFAAQ-PDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGSLVQSEDEVIQILAEGEE-----VIMHTDATFI

Query:  DATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNFHF
        +A  K G G ++R   G +KAA  H I+GC  P   EA  +   L+ A+ L  +   V +D L L  ++      +SS    ++D++  L+    V  + 
Subjt:  DATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNFHF

Query:  VNRRYNRFAHNMASEGYSAP-PCLWLGAFPAWMERMTLMESHFL
        V R  N+ AH +A +       C WL  FP+ +  + + +S  L
Subjt:  VNRRYNRFAHNMASEGYSAP-PCLWLGAFPAWMERMTLMESHFL

A0A803QGT2 Uncharacterized protein7.2e-19538.86Show/hide
Query:  NQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKV
        N  LL  +TR ++  A+++    K+PG DG  A+FYQ YW IVGD +    L +LN   S   +N T + LIPK ++P+ + D+RPISLCNV YKII+K+
Subjt:  NQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKV

Query:  IANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILLN
        +A RLK VL  +I E QSAF+  R I+DN++++ E +H LK +++G  G+AALK DMSKA+DRVEW ++  +M K+GF+ RWI L+M C+ T  FS  +N
Subjt:  IANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILLN

Query:  GEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALL-VDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKST
        GE  G + P RG+RQGDPLSPYLFL+CSEGLS LL  + +   + G++++R  P ISHLFFADDSL+F +A     G  +  L  Y +ASGQ +N  KS 
Subjt:  GEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALL-VDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKST

Query:  VFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMS
        + FS N P   +     IL M + E   +YLGLP+   R K++ F  I +++W ++  W  + FS GGKEVL+K+++Q+IPTYAM CFR+P  +  +I +
Subjt:  VFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMS

Query:  LCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLKQ
        + A+FWWGS ++ + +HWK+W  LCK K  GG+ FR  V FNQA+LAKQAWR+  +P   LS+VL G YF +   +       SS  W+G VWG +LL +
Subjt:  LCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLKQ

Query:  GIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPIS-LTAPDRWIWHYNNTGEYSVKSGYKL
        G+R  +G G  I    D W+P    FK          +  VA++   + +W+   LQ      DV+ I  +P+S L   DRWIWHY ++G+YSV SGY L
Subjt:  GIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPIS-LTAPDRWIWHYNNTGEYSVKSGYKL

Query:  SMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDML
        +         S       WWK  WKL +P++VKIF WK   +SIP   +L    +    TC +C    E+  HALF C  A+E+W+    +I     D L
Subjt:  SMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDML

Query:  EIKDRWLSFAA-QPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPL---GGSLVQSEDEVIQILAEGEEVI-MHTDATFIDA
        +  D  +  ++    S   +I    W IW+DRN+ ++ + + TP+        Y+ +Y           S   S+  V       E    ++ DA    +
Subjt:  EIKDRWLSFAA-QPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPL---GGSLVQSEDEVIQILAEGEEVI-MHTDATFIDA

Query:  TTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNFHFVN
         +K GIG+++R   G +KAA      G       EA A+  GL  AK  ++    V +DCL L+ ++NGD+   S     + D++  L+ F       + 
Subjt:  TTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNFHFVN

Query:  RRYNRFAHNMASEGYSAP-PCLWLGAFPA
        R  N+ AH +A         C+WL   P+
Subjt:  RRYNRFAHNMASEGYSAP-PCLWLGAFPA

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.6e-3725.93Show/hide
Query:  QMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPK-GRQPRLVSDYRPISLCNVSYKIITKV
        + L  P T  EI+  + +    K+PGPDGF A FYQ+Y + +   ++    +I    +    +   +I+LIPK GR      ++RPISL N+  KI+ K+
Subjt:  QMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPK-GRQPRLVSDYRPISLCNVSYKIITKV

Query:  IANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETL-HFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL
        +ANR++  + ++I   Q  FIPG     N+  S   + H  + K K  V    + +D  KA+D+++  ++ + + KLG    +++++       + +I+L
Subjt:  IANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETL-HFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL

Query:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKST
        NG+         G RQG PLSP LF +  E L+  +   +   + G+ + ++  K+S   FADD +V+L+           ++ ++ K SG  +N  KS 
Subjt:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKST

Query:  VFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGK--TRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGC--FRIPKGILT
         F   N      Q +   L   ++     YLG+  T         +++ +L  +      WK+   S  G+  ++K  I     Y       ++P    T
Subjt:  VFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGK--TRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGC--FRIPKGILT

Query:  KIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAW
        ++     +F W    +KR+   K    L +  + GG+   D   + +A + K AW
Subjt:  KIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAW

P08548 LINE-1 reverse transcriptase homolog4.3e-3525.71Show/hide
Query:  QMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPK-GRQPRLVSDYRPISLCNVSYKIITKV
        +ML  P +  EI + ++N    K+PGPDGF + FYQ + + +   ++     I    +    +   NI LIPK G+ P    +YRPISL N+  KI+ K+
Subjt:  QMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPK-GRQPRLVSDYRPISLCNVSYKIITKV

Query:  IANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETL-HFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL
        + NR++  + +II   Q  FIPG     N+  S   + H  K K K    +  L +D  KA+D ++  ++ + ++K+G    +++L+    +  + +I+L
Subjt:  IANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETL-HFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILL

Query:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKST
        NG          G RQG PLSP LF +  E L+  + + +  ++ G+ I  +  K+S   FADD +V+L+   +       ++ +Y   SG  +N  KS 
Subjt:  NGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKST

Query:  VFFSGNIPYDTKQYLSHILSMNMSESLGSYLG--LPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSII--QAIPTYAMGCFRIPKGILT
         F   N     K     I    + + +  YLG  L          +++ +   +   +  WK+   S  G+  ++K  I  +AI  +     + P     
Subjt:  VFFSGNIPYDTKQYLSHILSMNMSESLGSYLG--LPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSII--QAIPTYAMGCFRIPKGILT

Query:  KIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAW
         +  +   F W  +  + +        L    + GG+   DL  + ++++ K AW
Subjt:  KIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAW

P0C2F6 Putative ribonuclease H protein At1g657509.8e-4024.45Show/hide
Query:  FQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQA
        F  IL+RV + + GW+ +  S  G+  L K+++ ++P ++M    +P+ IL ++  L   F WGS AEK+  H  +W ++C PK+ GGL  R   S N+A
Subjt:  FQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQA

Query:  MLAKQAWRVLTNPHLTLSKVLCGKYF------PRISVLHVSYSTSSSFFWKGFVWGMK-LLKQGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLT
        +++K  WR+L   +   + VL  KY        R  +   S+S++    W+    G++ ++  G+    G+GQ I  + D W+      ++ +   P+  
Subjt:  MLAKQAWRVLTNPHLTLSKVLCGKYF------PRISVLHVSYSTSSSFFWKGFVWGMK-LLKQGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLT

Query:  NIKVA-EFFNPSVQWDEAKLQQFLIKEDVEMIKGLPISLT--APDRWIWHYNNTGEYSVKSGYK-LSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKI
        +  VA + + P   WD AK+  +        ++ + + L   A DR  W ++  G++SV+S Y+ L++     P+M++      ++  +WK+R+P RVK 
Subjt:  NIKVA-EFFNPSVQWDEAKLQQFLIKEDVEMIKGLPISLT--APDRWIWHYNNTGEYSVKSGYK-LSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKI

Query:  FVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQP--------NIWMDQWDMLEIKDRWLSFAAQPDSILASICVGAWA
        F+W   + ++ +     R H+     C VC   +E+  H L  C     IW    P        +  + +W    + DR         +I A I    W 
Subjt:  FVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQP--------NIWMDQWDMLEIKDRWLSFAAQPDSILASICVGAWA

Query:  IWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKT---NPLGGSLVQSEDEVIQILAEGEE-VIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISG
         W   N    N        R ++++++  E ++    N L G      + +I  ++     V ++TD            G VLR   G        +I  
Subjt:  IWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKT---NPLGGSLVQSEDEVIQILAEGEE-VIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISG

Query:  CHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNFHFVNRRYNRFAHNMASEGYS
        C +P  AE   +  GL  A   KV R+ +  D   ++  +   +     +S  +      L    +V    V R  NR A  +A+  +S
Subjt:  CHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNFHFVNRRYNRFAHNMASEGYS

P11369 LINE-1 retrotransposable element ORF2 protein4.3e-3525.76Show/hide
Query:  LLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIE-----DWNHTNIVLIPK-GRQPRLVSDYRPISLCNVSYKII
        L +P + +EI   + +    K+PGPDGF A FYQ +      + +   L  L  ++ +E      +    I LIPK  + P  + ++RPISL N+  KI+
Subjt:  LLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIE-----DWNHTNIVLIPK-GRQPRLVSDYRPISLCNVSYKII

Query:  TKVIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSI
         K++ANR++  +  II   Q  FIPG     N+  S   +H++ K +     +  + LD  KA+D+++  ++ +++E+ G    ++ ++    +    +I
Subjt:  TKVIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSI

Query:  LLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAK
         +NGE    I    G RQG PLSPYLF +  E L+  +   +   + G+ I ++  KIS L  ADD +V++           +++  + +  G  +N  K
Subjt:  LLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAK

Query:  STVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRD--FQFILDRVWTVLQGWKSQFFSQGGKEVLIKSII--QAIPTYAMGCFRIPKGI
        S  F         K+ +      ++  +   YLG+  T       D  F+ +   +   L+ WK    S  G+  ++K  I  +AI  +     +IP   
Subjt:  STVFFSGNIPYDTKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRD--FQFILDRVWTVLQGWKSQFFSQGGKEVLIKSII--QAIPTYAMGCFRIPKGI

Query:  LTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEV-GGLNFRDLVSFNQAMLAKQAW
          ++     +F W ++  + +      + L K K   GG+   DL  + +A++ K AW
Subjt:  LTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEV-GGLNFRDLVSFNQAMLAKQAW

P14381 Transposon TX1 uncharacterized 149 kDa protein3.1e-3327.43Show/hide
Query:  QMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVG---DKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIIT
        + L  P T +E+  A+R     K+PG DG    F+Q +WD +G    +++ E  A    E+ +       + L+PK    RL+ ++RP+SL +  YKI+ 
Subjt:  QMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVG---DKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIIT

Query:  KVIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSIL
        K I+ RLK VL E+I   QS  +PGR+I DN+ L  + LHF    R+  +  A L LD  KA+DRV+  YL   ++   F  +++  +     +A   + 
Subjt:  KVIANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSIL

Query:  LNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKS
        +N      +   RG+RQG PLS  L+ L  E    LL       + G+ +     ++    +ADD ++ +     +    +     Y  AS   +N++KS
Subjt:  LNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKS

Query:  TVFFSGNIPYDTKQYLSHILS--MNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWK--SQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGIL
        +    G++  D        +S    + + LG YL   S      +++F  + + V T L  WK  ++  S  G+ ++I  ++ +   Y + C    +  +
Subjt:  TVFFSGNIPYDTKQYLSHILS--MNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWK--SQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGIL

Query:  TKIMSLCARFWW
         KI      F W
Subjt:  TKIMSLCARFWW

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein5.0e-3927.15Show/hide
Query:  KYFPRISVLHVSYSTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTIFM----FKDPWLPRPF----TFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQF
        +YF  +S+L        S+ W   + G+ LLK+G R  +G+GQ I +      D   PRP     T+K ++  N  L   K + +F     WD++K+ QF
Subjt:  KYFPRISVLHVSYSTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTIFM----FKDPWLPRPF----TFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQF

Query:  LIKEDVEMIKGLPISLT-APDRWIWHYNNTGEYSVKSGYKLSMLRP------MGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRH
        + + D   I  + ++ +  PD+ IW+YN TGEY+V+SGY L    P      + P   +  L +R    +W L I  ++K F+W++   ++ +   L   
Subjt:  LIKEDVEMIKGLPISLT-APDRWIWHYNNTGEYSVKSGYKLSMLRP------MGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRH

Query:  HVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQW---DMLEIKDRWLSFAAQPDSILASI-----CVGAWAIWNDRNSILYNR----PIP
         + +  +CP CH E E+ +HALF C  A   WR+   ++  +Q    D  E     L+F    D+ ++           W IW  RN++++N+    P  
Subjt:  HVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQW---DMLEIKDRWLSFAAQPDSILASI-----CVGAWAIWNDRNSILYNR----PIP

Query:  TPMCRCEWIQDYL--TEYWKTNPLGGSLVQSEDEVIQILAEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQ
        T +       D+L  T+  K  P     + +E+++         V  + DA F     +   G ++R   G   +     ++   +PL AE  A+L  LQ
Subjt:  TPMCRCEWIQDYL--TEYWKTNPLGGSLVQSEDEVIQILAEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQ

Query:  LAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNFHFVNRRYNRFAHNMASEG------YSAPPCLWLGAFPAWMERMTLMES
                ++ +  DC  LI  ING +   SS++  L DI      F  + F F+ R+ N+ AH +A  G      YS       G+ P W++R    +S
Subjt:  LAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNFHFVNRRYNRFAHNMASEG------YSAPPCLWLGAFPAWMERMTLMES

Query:  H
        +
Subjt:  H

AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.1e-1340.96Show/hide
Query:  IANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWI
        +  RLK ++T +I   Q++FIPGR  +DN++   E +H +++K KG  G+  LKLD+ KAYDR+ W YL   +   GF + W+
Subjt:  IANRLKGVLTEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWI

AT4G29090.1 Ribonuclease H-like superfamily protein1.3e-5828.9Show/hide
Query:  AIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHV
        A+PTY M CF +PK +  +I+S+ A FWW ++ E + +HWK W+ L   K  GG+ F+D+ +FN A+L KQ WR+L+ P   ++KV   +YF +   L+ 
Subjt:  AIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHV

Query:  SYSTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTIFMFKDPWL-PRPFTFKVLSQVNP-----SLTNI-KVAEFFNPS-VQWDEAKLQQFLIKEDVEMIKG
           +  SF WK      ++L+QG R  +GNG+ I +++  WL  +P +  +  Q  P     S+++I KV++  + S  +W +  ++    + + ++I  
Subjt:  SYSTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTIFMFKDPWL-PRPFTFKVLSQVNP-----SLTNI-KVAEFFNPS-VQWDEAKLQQFLIKEDVEMIKG

Query:  L-PISLTAPDRWIWHYNNTGEYSVKSGY----KLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHE
        L P      D + W Y ++G+Y+VKSGY    ++   R     +S   L+   ++K+WK +   +++ F+WK   NS+P    L   H+     C  C  
Subjt:  L-PISLTAPDRWIWHYNNTGEYSVKSGY----KLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTCPVCHE

Query:  ELETTDHALFRCTRAQEIWRIFQPNIWM-DQW-DMLEIKDRWLSFA--AQPDSILASICVG--AWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKT
          ET +H LF+CT A+  W I    I +  +W D + +   W+       P    AS  V    W +W +RN +++              +D L E W+ 
Subjt:  ELETTDHALFRCTRAQEIWRIFQPNIWM-DQW-DMLEIKDRWLSFA--AQPDSILASICVG--AWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKT

Query:  NPLGGSLVQSEDEVIQILA------EGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSD
             S   ++ +V +           + V  +TDAT+     +CGIG VLR + G +K     ++    S L AE  A+   +      +   +   SD
Subjt:  NPLGGSLVQSEDEVIQILA------EGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSD

Query:  CLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNFHFVNRRYNRFAHNMASEGYS
           LI+ +N D +   S+  T+ D++ LL+ F  V F F+ R  N  A  +A E  S
Subjt:  CLNLIKSINGDLKGQSSISTTLWDIEALLAAFEIVNFHFVNRRYNRFAHNMASEGYS

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.9e-3242.66Show/hide
Query:  AIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKE-VGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLH
        A+P YAM CFR+ K +  K+ S    FWW S   KR + W  W++LCK KE  GGL FRDL  FNQA+LAKQ++R++  PH  LS++L  +YFP  S++ 
Subjt:  AIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKE-VGGLNFRDLVSFNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLH

Query:  VSYSTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTIFMFKDPWL
         S  T  S+ W+  + G +LL +G+ + +G+G    ++ D W+
Subjt:  VSYSTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTIFMFKDPWL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.9e-1538.94Show/hide
Query:  LLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNS-MAGVSIARQCPKISHLFFADD--SLVFLKAKAEEFGFFRSILLDYEKASGQCVN
        ++NG   G + P+RG+RQGDPLSPYLF+LC+E LS L   A+    + G+ ++   P+I+HL FADD  S  ++   A+ +  F    L      G  VN
Subjt:  LLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLSALLVDARNNS-MAGVSIARQCPKISHLFFADD--SLVFLKAKAEEFGFFRSILLDYEKASGQCVN

Query:  FAKSTVFFSGNIP
           S ++F G++P
Subjt:  FAKSTVFFSGNIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCAGATGTTGTTGGCTCCTTATACCAGAGAGGAAATTATCACTGCAATGCGAAATTTTCATCCAACTAAGGCCCCGGGACCGGATGGGTTTCCAGCATTATTCTA
CCAGAAATATTGGGACATTGTAGGAGACAAAATGGTATTTGAGTGCTTGGCTATTCTCAATTCGGAGGTTTCGATAGAGGACTGGAATCATACCAATATAGTGCTTATTC
CAAAAGGACGGCAGCCCAGGTTAGTATCAGATTATCGCCCAATTAGTCTATGCAATGTCTCTTATAAAATAATTACTAAGGTCATCGCTAATAGACTCAAGGGTGTGTTA
ACTGAGATAATCGATGAATGTCAATCCGCGTTCATTCCTGGTAGATCAATATCTGATAACATGATTTTAAGTCATGAGACGCTTCATTTTCTTAAAAAAAAAAGGAAAGG
AAAAGTTGGTTATGCTGCACTAAAACTAGATATGAGCAAAGCCTATGATAGGGTGGAGTGGATATATTTGAGACAAATCATGGAGAAGTTGGGGTTTCATGATCGCTGGA
TCCGGTTAGTTATGAAATGTATTACAACCGCCTCTTTTTCTATCTTGTTAAATGGGGAAGCTTTTGGTTTCATTAGACCAACTCGTGGAATTCGTCAAGGCGATCCTTTA
TCACCTTATCTGTTTTTACTATGTTCAGAAGGCCTGTCGGCTTTGTTGGTGGATGCGAGGAATAATTCAATGGCGGGGGTGTCCATAGCGCGCCAGTGTCCAAAAATTTC
GCATTTATTCTTTGCGGATGATAGTTTGGTTTTTTTGAAAGCAAAGGCGGAGGAATTTGGTTTTTTCAGATCTATTCTGTTAGATTATGAAAAGGCTTCAGGACAGTGTG
TTAATTTTGCAAAATCAACGGTGTTCTTCTCGGGGAATATTCCATATGACACCAAACAGTATCTTAGTCATATTTTGTCTATGAATATGTCTGAATCTTTGGGTTCCTAC
CTTGGATTGCCATCAACTTTCCATAGAGGAAAAACTCGTGATTTCCAGTTTATTTTGGATAGGGTCTGGACTGTGCTTCAAGGATGGAAGAGCCAATTTTTTTCACAAGG
AGGGAAAGAGGTGCTGATAAAATCTATTATTCAAGCGATTCCAACCTATGCAATGGGGTGCTTTCGGATCCCAAAAGGTATTCTGACAAAGATTATGTCTTTATGCGCTA
GATTCTGGTGGGGTTCTCAGGCAGAAAAGCGTAGTGTGCATTGGAAGAGATGGGAAGAGCTTTGCAAACCAAAAGAGGTGGGGGGGTTGAATTTCCGTGATTTGGTTAGT
TTCAACCAGGCAATGCTTGCTAAACAAGCCTGGAGGGTGCTTACTAATCCTCACCTCACACTATCCAAAGTGTTATGCGGAAAGTACTTTCCTCGCATTTCAGTGTTACA
TGTCTCATATTCAACCTCGTCTTCCTTTTTTTGGAAAGGTTTTGTGTGGGGCATGAAACTATTAAAACAAGGAATTCGGAAAAATCTAGGAAACGGTCAAACTATTTTTA
TGTTCAAAGACCCTTGGCTTCCTCGGCCTTTTACTTTTAAGGTGTTGTCACAGGTTAATCCAAGTTTGACGAATATCAAGGTAGCGGAGTTTTTTAATCCAAGTGTGCAA
TGGGATGAGGCGAAGCTCCAACAATTTCTAATAAAGGAGGATGTAGAGATGATTAAAGGTTTACCTATTAGTCTAACTGCTCCAGACAGGTGGATTTGGCACTACAATAA
TACAGGAGAATATTCCGTTAAGAGTGGCTATAAATTAAGCATGTTGAGACCGATGGGACCATCAATGTCGGCTCGTGGATTGGATAGTAGGTGGTGGAAAAAGGTTTGGA
AGTTGCGAATTCCCAATAGAGTAAAAATTTTTGTGTGGAAATCGTTCCATAACTCTATCCCGTCTATGGTTAATTTAAAAAGACATCATGTTCCAGTACATATGACTTGC
CCTGTATGTCATGAGGAGTTAGAAACTACGGATCATGCTCTTTTCCGTTGCACTAGAGCTCAAGAGATTTGGAGGATTTTTCAACCGAATATTTGGATGGATCAATGGGA
CATGCTAGAGATCAAGGATCGGTGGTTGAGTTTTGCTGCCCAACCGGATTCTATTTTAGCAAGTATTTGTGTTGGTGCTTGGGCGATTTGGAATGATAGAAATAGTATAC
TCTATAACCGTCCAATTCCAACGCCTATGTGTCGTTGTGAATGGATTCAAGACTATCTGACGGAATACTGGAAGACAAACCCTTTGGGTGGATCTTTGGTTCAGTCAGAG
GATGAGGTGATACAGATTCTTGCTGAAGGGGAGGAAGTCATCATGCATACTGATGCTACCTTTATTGATGCTACGACTAAATGTGGAATTGGCATTGTATTACGTACTAA
GGGAGGCATTTTGAAGGCAGCGCAACATCACTCTATCTCTGGGTGTCATTCCCCATTGGGGGCTGAAGCAGTTGCCATTCTGACGGGGCTTCAATTAGCAAAGGGCTTGA
AGGTGAGACGTCTAACAGTTTTGTCAGATTGTTTGAATCTCATAAAGTCTATCAACGGTGATCTTAAAGGACAGTCAAGCATTTCTACGACCCTTTGGGATATTGAAGCA
CTTCTGGCTGCTTTTGAGATTGTTAATTTCCATTTTGTTAATCGTCGTTATAATAGGTTTGCTCATAATATGGCCAGTGAAGGCTACTCGGCACCTCCATGCTTATGGTT
AGGTGCTTTTCCTGCATGGATGGAACGGATGACACTCATGGAAAGCCATTTCCTCCAAAGTGCTCTATTCTGTCCTCTTTACGGCTCCAAAAGGTCCCAAATCGATCCCC
CTCAAAACCTAGCCGAAGTGAAGCTTATATTGCAGATTTTTCGTCTTAGCGTCGAGACGCTGTCAAGACAGCGCCGCGACGCTGCCTCTTATACGCGCGTTCTGAATAAG
GAAAAACGGAAGCGTCGCGACGCTTCCTATCATAGCGTCTCGATGCTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATCAGATGTTGTTGGCTCCTTATACCAGAGAGGAAATTATCACTGCAATGCGAAATTTTCATCCAACTAAGGCCCCGGGACCGGATGGGTTTCCAGCATTATTCTA
CCAGAAATATTGGGACATTGTAGGAGACAAAATGGTATTTGAGTGCTTGGCTATTCTCAATTCGGAGGTTTCGATAGAGGACTGGAATCATACCAATATAGTGCTTATTC
CAAAAGGACGGCAGCCCAGGTTAGTATCAGATTATCGCCCAATTAGTCTATGCAATGTCTCTTATAAAATAATTACTAAGGTCATCGCTAATAGACTCAAGGGTGTGTTA
ACTGAGATAATCGATGAATGTCAATCCGCGTTCATTCCTGGTAGATCAATATCTGATAACATGATTTTAAGTCATGAGACGCTTCATTTTCTTAAAAAAAAAAGGAAAGG
AAAAGTTGGTTATGCTGCACTAAAACTAGATATGAGCAAAGCCTATGATAGGGTGGAGTGGATATATTTGAGACAAATCATGGAGAAGTTGGGGTTTCATGATCGCTGGA
TCCGGTTAGTTATGAAATGTATTACAACCGCCTCTTTTTCTATCTTGTTAAATGGGGAAGCTTTTGGTTTCATTAGACCAACTCGTGGAATTCGTCAAGGCGATCCTTTA
TCACCTTATCTGTTTTTACTATGTTCAGAAGGCCTGTCGGCTTTGTTGGTGGATGCGAGGAATAATTCAATGGCGGGGGTGTCCATAGCGCGCCAGTGTCCAAAAATTTC
GCATTTATTCTTTGCGGATGATAGTTTGGTTTTTTTGAAAGCAAAGGCGGAGGAATTTGGTTTTTTCAGATCTATTCTGTTAGATTATGAAAAGGCTTCAGGACAGTGTG
TTAATTTTGCAAAATCAACGGTGTTCTTCTCGGGGAATATTCCATATGACACCAAACAGTATCTTAGTCATATTTTGTCTATGAATATGTCTGAATCTTTGGGTTCCTAC
CTTGGATTGCCATCAACTTTCCATAGAGGAAAAACTCGTGATTTCCAGTTTATTTTGGATAGGGTCTGGACTGTGCTTCAAGGATGGAAGAGCCAATTTTTTTCACAAGG
AGGGAAAGAGGTGCTGATAAAATCTATTATTCAAGCGATTCCAACCTATGCAATGGGGTGCTTTCGGATCCCAAAAGGTATTCTGACAAAGATTATGTCTTTATGCGCTA
GATTCTGGTGGGGTTCTCAGGCAGAAAAGCGTAGTGTGCATTGGAAGAGATGGGAAGAGCTTTGCAAACCAAAAGAGGTGGGGGGGTTGAATTTCCGTGATTTGGTTAGT
TTCAACCAGGCAATGCTTGCTAAACAAGCCTGGAGGGTGCTTACTAATCCTCACCTCACACTATCCAAAGTGTTATGCGGAAAGTACTTTCCTCGCATTTCAGTGTTACA
TGTCTCATATTCAACCTCGTCTTCCTTTTTTTGGAAAGGTTTTGTGTGGGGCATGAAACTATTAAAACAAGGAATTCGGAAAAATCTAGGAAACGGTCAAACTATTTTTA
TGTTCAAAGACCCTTGGCTTCCTCGGCCTTTTACTTTTAAGGTGTTGTCACAGGTTAATCCAAGTTTGACGAATATCAAGGTAGCGGAGTTTTTTAATCCAAGTGTGCAA
TGGGATGAGGCGAAGCTCCAACAATTTCTAATAAAGGAGGATGTAGAGATGATTAAAGGTTTACCTATTAGTCTAACTGCTCCAGACAGGTGGATTTGGCACTACAATAA
TACAGGAGAATATTCCGTTAAGAGTGGCTATAAATTAAGCATGTTGAGACCGATGGGACCATCAATGTCGGCTCGTGGATTGGATAGTAGGTGGTGGAAAAAGGTTTGGA
AGTTGCGAATTCCCAATAGAGTAAAAATTTTTGTGTGGAAATCGTTCCATAACTCTATCCCGTCTATGGTTAATTTAAAAAGACATCATGTTCCAGTACATATGACTTGC
CCTGTATGTCATGAGGAGTTAGAAACTACGGATCATGCTCTTTTCCGTTGCACTAGAGCTCAAGAGATTTGGAGGATTTTTCAACCGAATATTTGGATGGATCAATGGGA
CATGCTAGAGATCAAGGATCGGTGGTTGAGTTTTGCTGCCCAACCGGATTCTATTTTAGCAAGTATTTGTGTTGGTGCTTGGGCGATTTGGAATGATAGAAATAGTATAC
TCTATAACCGTCCAATTCCAACGCCTATGTGTCGTTGTGAATGGATTCAAGACTATCTGACGGAATACTGGAAGACAAACCCTTTGGGTGGATCTTTGGTTCAGTCAGAG
GATGAGGTGATACAGATTCTTGCTGAAGGGGAGGAAGTCATCATGCATACTGATGCTACCTTTATTGATGCTACGACTAAATGTGGAATTGGCATTGTATTACGTACTAA
GGGAGGCATTTTGAAGGCAGCGCAACATCACTCTATCTCTGGGTGTCATTCCCCATTGGGGGCTGAAGCAGTTGCCATTCTGACGGGGCTTCAATTAGCAAAGGGCTTGA
AGGTGAGACGTCTAACAGTTTTGTCAGATTGTTTGAATCTCATAAAGTCTATCAACGGTGATCTTAAAGGACAGTCAAGCATTTCTACGACCCTTTGGGATATTGAAGCA
CTTCTGGCTGCTTTTGAGATTGTTAATTTCCATTTTGTTAATCGTCGTTATAATAGGTTTGCTCATAATATGGCCAGTGAAGGCTACTCGGCACCTCCATGCTTATGGTT
AGGTGCTTTTCCTGCATGGATGGAACGGATGACACTCATGGAAAGCCATTTCCTCCAAAGTGCTCTATTCTGTCCTCTTTACGGCTCCAAAAGGTCCCAAATCGATCCCC
CTCAAAACCTAGCCGAAGTGAAGCTTATATTGCAGATTTTTCGTCTTAGCGTCGAGACGCTGTCAAGACAGCGCCGCGACGCTGCCTCTTATACGCGCGTTCTGAATAAG
GAAAAACGGAAGCGTCGCGACGCTTCCTATCATAGCGTCTCGATGCTGTGA
Protein sequenceShow/hide protein sequence
MNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKMVFECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVL
TEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDRWIRLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPL
SPYLFLLCSEGLSALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILLDYEKASGQCVNFAKSTVFFSGNIPYDTKQYLSHILSMNMSESLGSY
LGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVS
FNQAMLAKQAWRVLTNPHLTLSKVLCGKYFPRISVLHVSYSTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTIFMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQ
WDEAKLQQFLIKEDVEMIKGLPISLTAPDRWIWHYNNTGEYSVKSGYKLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMVNLKRHHVPVHMTC
PVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKTNPLGGSLVQSE
DEVIQILAEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGDLKGQSSISTTLWDIEA
LLAAFEIVNFHFVNRRYNRFAHNMASEGYSAPPCLWLGAFPAWMERMTLMESHFLQSALFCPLYGSKRSQIDPPQNLAEVKLILQIFRLSVETLSRQRRDAASYTRVLNK
EKRKRRDASYHSVSML