; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027791 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027791
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:5071890..5075478
RNA-Seq ExpressionLag0027791
SyntenyLag0027791
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]1.1e-21637.45Show/hide
Query:  THLDFSRSDHRPLLLSMSVNARVVRSAGQKIRRFEESWLQSSDFGEAVSAGWRSEAPVGSLVELASVATR----------SNVQQ-----------VMGR
        +HL    SDH P+L+    +     +       FEE W +  DF + +   W+    V S+    S+  +          SNV+            + GR
Subjt:  THLDFSRSDHRPLLLSMSVNARVVRSAGQKIRRFEESWLQSSDFGEAVSAGWRSEAPVGSLVELASVATR----------SNVQQ-----------VMGR

Query:  LGSSESRADLQRAEGELESLLVEEEVYWRQRSRDSWLKWGDRNTRAGRVLGLIEGYFENIFSTSSPSDGEIDQVTSRVHPLVSDVMNNSLMRPFHREEVL
        L +++      + E  +  LL ++E+ WRQRSR     W    T   R+  L   YF+ +F  SS    +++++ + V P+++  MN+ L++ F REE+ 
Subjt:  LGSSESRADLQRAEGELESLLVEEEVYWRQRSRDSWLKWGDRNTRAGRVLGLIEGYFENIFSTSSPSDGEIDQVTSRVHPLVSDVMNNSLMRPFHREEVL

Query:  RALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCITEYRPISLCNVSYKIVSKVLVNRMKGILNGLIS
          L Q+ P KAPG DG+   F+Q  W +VG  V   CL ILN + S    N T+I LIPK K P  ++E+RPISLC   YK+++K + NR+K +L  +I+
Subjt:  RALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCITEYRPISLCNVSYKIVSKVLVNRMKGILNGLIS

Query:  HNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVIPSRGLR
         NQSAF+P R ++DN +  +E +H +K  K GR    +LKLDM+KAYDRVEWVFL  MML++GF+  WV  V+ CIS+  +S    G   G ++P RGLR
Subjt:  HNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVIPSRGLR

Query:  QGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHHILQCYERASGQTINFDKSIISFSPSTAMGVRHQ
        QG PLSPYLFL+C EG S +L  AE    + G+++ARG P ++HL FADDS+LF +A     R +  + Q YE  SGQ IN+ KS  S SP+        
Subjt:  QGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHHILQCYERASGQTINFDKSIISFSPSTAMGVRHQ

Query:  VSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDR
        +  +  V V  CH++YLGLP+   + R      +K+++W+ I GWK K  S  G+E+L+K+++QAIP Y+M+CFR+PK L ++++  M+ FWW    + R
Subjt:  VSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDR

Query:  RIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSV
         IHWV W+ LCK K  GG+GFRDLE FNQALLAKQCWRI++ P S ++R+ + RY PS  FL A VG+ PS+IWRSL WG+ELL KG+RWR+GNG  + V
Subjt:  RIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSV

Query:  YGANWIPDDVCFKVRSPVMLGLDTRVADLLTASGQWNEELLRHNFSPREVRSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLG-----------QVA
        Y   W+P    FK+ SP  L L T V DL T+SGQWN  LL+  F  +EV + L IPL  ++  D +IWH+E+ G+YSVKSGYRL             V 
Subjt:  YGANWIPDDVCFKVRSPVMLGLDTRVADLLTASGQWNEELLRHNFSPREVRSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLG-----------QVA

Query:  VLARSHLCLRVRRCPVGGR-----GVGKWGSRTKGIDVLN-------VCVHCGWHGESCMHLFWQCKFFRDALMGSEWEVLLQGVQANSMLNLLRDVKDK
        V   S    ++    +  +         W     G  + N       +C +C    ES +H  W C+  ++    S W  + +  + NS   L   ++  
Subjt:  VLARSHLCLRVRRCPVGGR-----GVGKWGSRTKGIDVLN-------VCVHCGWHGESCMHLFWQCKFFRDALMGSEWEVLLQGVQANSMLNLLRDVKDK

Query:  VDWAKFEEFVVVLWAVWCCRNQQKFRGRF-PSTGLVDWAVNYLNVFRGASRI--YCEGMRGVPRSSVK-WLPPEEGWYKVNVDAAF-CGDLFRAGVGVVV
            +   F  + W +W  RN   F G+   +T L+         F  A+ +     G +  P++ +  W PP  G YK+NVD A   GD  R GVGVVV
Subjt:  VDWAKFEEFVVVLWAVWCCRNQQKFRGRF-PSTGLVDWAVNYLNVFRGASRI--YCEGMRGVPRSSVK-WLPPEEGWYKVNVDAAF-CGDLFRAGVGVVV

Query:  RDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLL---QEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRV
        R+  G  M +   R          E +A ++G+R  ++MG    +LE D+      +   +E   +D    G+L+ E    +  +  +   +T R GN+V
Subjt:  RDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLL---QEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRV

Query:  AHELGHFA
        AH L  FA
Subjt:  AHELGHFA

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]3.1e-21437.59Show/hide
Query:  HLDFSRSDHRPLLLSMSVNARVVRSAGQKIR-RFEESWLQSSDFGEAVSAGWRS----EAPVGSLVELASVATR----------------SNVQQVMGRL
        HL  S SDH  LL++   +A VV+    + R +FE  W +  D  + +   W S     +P G    L   A                     ++ +  L
Subjt:  HLDFSRSDHRPLLLSMSVNARVVRSAGQKIR-RFEESWLQSSDFGEAVSAGWRS----EAPVGSLVELASVATR----------------SNVQQVMGRL

Query:  GSSESR----ADLQRAEGELESLLVEEEVYWRQRSRDSWLKWGDRNT-----------RAGRVLGLIE-----------------GYFENIFSTSSPSDG
          S+       ++     E+  LL  EE+ W+QRSR  WL  GDRNT           R   + G+++                  YF+ I+S+S P+  
Subjt:  GSSESR----ADLQRAEGELESLLVEEEVYWRQRSRDSWLKWGDRNT-----------RAGRVLGLIE-----------------GYFENIFSTSSPSDG

Query:  EIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCITE
         I +V   +   V++ MN+SL++ F REE+  AL Q+HP KAPGPDG+S  F+Q  W++VG+D+V   L +LN+  S   +N+T I L+PK K+P  +++
Subjt:  EIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCITE

Query:  YRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEWV
        +RPISLCNV YK++SKVL NR+K IL  +IS NQSAF+ GR + DN ++ +E +H L+ KK G+ G+A++KLDMSKAYDRVEW F++++M +MGF  +W+
Subjt:  YRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEWV

Query:  ELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHHIL
        +LV+ CI+SV YS  VNG   G + P+RGLRQGDP+SPY+FLLCA+G S +L+D      I+G+ I RGCP I+HLFFADDSLLF +A   E +T+  IL
Subjt:  ELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHHIL

Query:  QCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCY
        Q YE ASGQ IN DKS + FS +T    R +V ++        H++YLGLPS + ++++     +KERV R++ GWK K  SVGGRE+L+K++ QAIP Y
Subjt:  QCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCY

Query:  TMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGSR
        TM+CF++PK L ++I   M  FWW    ++ +I WVSWK LCK K  GGMGFR+L+ FN A+LAKQ WR++  P S ++++ K RY+P GD   A +G+ 
Subjt:  TMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGSR

Query:  PSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPVMLGLD-TRVADLLTAS-GQWNEELLRHNFSPREVRSILTIPLRHISSEDKV
        PSY WRS+  G E++ +G RWR+GNGE++ ++   W+P  + +KV SP     D  RV+ L+     +W ++++R  F P E R+IL+IPL H   ED++
Subjt:  PSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPVMLGLD-TRVADLLTAS-GQWNEELLRHNFSPREVRSILTIPLRHISSEDKV

Query:  IWHFEKCGVYSVKSGYRLGQVAVL------------ARSHLCLRVRRCPVGGR-GVGKW-----------GSRTKGIDVLNVCVHCGWHGESCMHLFWQC
        IW   + G +SVKS Y +  V V+            +RS L  ++    +  +  +  W               KG+++ +VC  CG   ES +H+F +C
Subjt:  IWHFEKCGVYSVKSGYRLGQVAVL------------ARSHLCLRVRRCPVGGR-GVGKW-----------GSRTKGIDVLNVCVHCGWHGESCMHLFWQC

Query:  KFFRDALMGSEWEVLLQG----VQAN-SMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQQKFRGRFPSTGLV-DWAVNYLNVFRGASRIYCEGMRGVP
        +  +       W   L      V  N  ++++   + D    +  E F VV WA+WC RN+  F         +  +A+ Y+  F+ AS  YC+ +   P
Subjt:  KFFRDALMGSEWEVLLQG----VQAN-SMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQQKFRGRFPSTGLV-DWAVNYLNVFRGASRIYCEGMRGVP

Query:  RSSVKWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLLQEPAMVDL
        +S  KW+ P  G +K+NVD A       + VGV++RD  G V  +         S +  E LA+  G+ L  E  L   I+E+D++ V + +        
Subjt:  RSSVKWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLLQEPAMVDL

Query:  SEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFA-LKE
           G +       + ++     +  +RE N+ AHEL  +A LKE
Subjt:  SEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFA-LKE

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]2.6e-20836.96Show/hide
Query:  HLDFSRSDHRPLLLSMSVNARVVRSAGQKIRRFEESWLQSSDFGEAVSAGWRS-------EAPVGSLVELASVATRSNVQQVMG----------RLGSSE
        HL  S SDH    +  + ++      G++   FE  W +  D  + + A W S       E  V +L   AS     N Q V+G          R  +S 
Subjt:  HLDFSRSDHRPLLLSMSVNARVVRSAGQKIRRFEESWLQSSDFGEAVSAGWRS-------EAPVGSLVELASVATRSNVQQVMG----------RLGSSE

Query:  SRADLQRAEG--------ELESLLVEEEVYWRQRSRDSWLKWGDRNTR----------------------------AGRVLGLIEGYFENIFSTSSPSDG
        +  D Q   G        EL  LL  EE+ WRQRS+  W + GDRNT+                               +      YFENI+++SSP+  
Subjt:  SRADLQRAEG--------ELESLLVEEEVYWRQRSRDSWLKWGDRNTR----------------------------AGRVLGLIEGYFENIFSTSSPSDG

Query:  EIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCITE
         I++V + +   V+D MN+ L + F  EEVL+AL+Q+HP KAPGPDG+S  F+ + W +VG  + N  L++LN+      +N+T I LIPK   P  +TE
Subjt:  EIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCITE

Query:  YRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEWV
        +RPISLCN +YKI+SKVL NR K IL  +IS NQSAF P R + DN ++ +E +H L  K  G+  + S+KLDMSKA+DRVEW F++ +M ++GF  +W+
Subjt:  YRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEWV

Query:  ELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHHIL
         L++ C+SSV YS  +NG   G++ PSRG+RQGDPLSP LFLLCAEGLS ++H+A   + I G+ I RGCP I+HLFFADDSLLF +AKE E   +  IL
Subjt:  ELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHHIL

Query:  QCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCY
          YE ASGQ IN DKS + FSP+T+  +R  +  +      + H +YLGLPS + +++      +K+RV +++ GWKGK  S+GGRE+L+K++ QA+P Y
Subjt:  QCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCY

Query:  TMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGSR
        TM+CF+LPK L QD+   M  FWW  + ++ +I WVSW+ +C+ K  GGMGFR+++ FN A+LAKQ WRI+  P S ++RV K +YFP  D L +  GS 
Subjt:  TMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGSR

Query:  PSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSP-VMLGLDTRVADLLTA-SGQWNEELLRHNFSPREVRSILTIPLRHISSEDKV
        PSY WRS+    +++ KG RWR+GNG ++ ++   W+P     KV SP V  G    V+ L+   + +W  +++   F P E  +IL IPL     ED +
Subjt:  PSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSP-VMLGLDTRVADLLTA-SGQWNEELLRHNFSPREVRSILTIPLRHISSEDKV

Query:  IWHFEKCGVYSVKSGYRLGQVAV-----------LARSHLCLRVRRCPVGGR-GVGKWGSRTKGIDVLNVCVHCGWHGES-CMHLFWQCKFFRDALMGSE
        IW   K G ++VKS Y +    V            + S L  R+ +  V  +  +  W +   G+  +    H G H  S C       +    AL+  E
Subjt:  IWHFEKCGVYSVKSGYRLGQVAV-----------LARSHLCLRVRRCPVGGR-GVGKWGSRTKGIDVLNVCVHCGWHGES-CMHLFWQCKFFRDALMGSE

Query:  WEVLLQGVQANS---------MLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQQKFR-GRFPSTGLVDWAVNYLNVFRGASRIYCEGM-RGVPRSSVKW
           L      NS         M+ +  D+  K      E F  V W++W  RNQ        P     + A   L  F+ A    C  + + +P S  +W
Subjt:  WEVLLQGVQANS---------MLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQQKFR-GRFPSTGLVDWAVNYLNVFRGASRIYCEGM-RGVPRSSVKW

Query:  LPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSP-DLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLLQEPAMVDLSEFGV
          P  G++K+N DAA   D   + +GVV+RD  G V+L+A+ +      P +++E LA+ +G+ L  EM ++  I E+DS+ +   + E   +   EFG 
Subjt:  LPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSP-DLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLLQEPAMVDLSEFGV

Query:  LVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFALKEG-------GCSAVIELVKIE
        ++        ++    F   +REGN+VAHEL   A   G        C +++E + +E
Subjt:  LVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFALKEG-------GCSAVIELVKIE

XP_024172304.2 uncharacterized protein LOC112178381 [Rosa chinensis]3.9e-20937.57Show/hide
Query:  LTHLDFSRSDHRPLLLSMSVNARVVRSAGQKIR-RFEESWLQSSDFGEAVSAGWRSEA------PVGSLVELASVA-----------TRSNVQQVMGRLG
        +THL  S+SDH P+L  + V + + R   +K R RFEE WL  ++    V  GW S A       +   +E    A            ++ ++++  +L 
Subjt:  LTHLDFSRSDHRPLLLSMSVNARVVRSAGQKIR-RFEESWLQSSDFGEAVSAGWRSEA------PVGSLVELASVA-----------TRSNVQQVMGRLG

Query:  -------SSESRADLQRAEGELESLLVEEEVYWRQRSRDSWLKWGDRNTR-----------AGRVLGLIEG-----------------YFENIFSTSSPS
               S+    +    E +L  LL  E  YW+QRSR  WL  GD NTR              + GL                    YF  +FSTSSP 
Subjt:  -------SSESRADLQRAEGELESLLVEEEVYWRQRSRDSWLKWGDRNTR-----------AGRVLGLIEG-----------------YFENIFSTSSPS

Query:  DGEIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCI
        + E+   T+    +V+  MN+ L+R F  EE+L+AL Q+HP KAPGPDG S  FYQ  WSVVG DV+      +N++     +N T + LIPK K  + +
Subjt:  DGEIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCI

Query:  TEYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVE
         + RPISLCNV YK+ SKVL NR+K +L  +I+  QSAF+PGR + DN++L +E  H LK +  G  G+ +LKLDMSKAYDRVEW F+E +M  MGF   
Subjt:  TEYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVE

Query:  WVELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHH
        W++ ++ C+++V YSF +NG   G +IP+RGLRQGD +SPYLFLLCAEGLSRML   E    + G+ IA G P I+HLFFADDS +F +A+  E   V  
Subjt:  WVELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHH

Query:  ILQCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIP
        IL+ YE ASGQ +NF KS ISFS +  +G + +++++F V+    H +YLGLP+ +  ++     FI E+   +++ WK K  SV G+EV++KS+VQ++P
Subjt:  ILQCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIP

Query:  CYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVG
         Y M+CF LPK L Q++ R M+ FWW    + R+IHW++W  +C PK  GG+GFR++E FNQALLAKQ WRI+++P S L + LK +YFP+ DF+ A V 
Subjt:  CYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVG

Query:  SRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPVMLGL-DTRVADLLTA-SGQWNEELLRHNFSPREVRSILTIPLRHISSED
           SY WRSL+ G+ LL KG+R+++G+G ++SV+   WIP    F+  S VM GL D  VADL+   S  W  + L   F   EV  I  IPL   + ED
Subjt:  SRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPVMLGL-DTRVADLLTA-SGQWNEELLRHNFSPREVRSILTIPLRHISSED

Query:  KVIWHFEKCGVYSVKSGYRLGQVAVLARSHLCLRVRRCPVGGRGVGK--WGSRTK-------------------------GIDVLNVCVHCGWHGESCMH
        ++IWHF+K G+YSVKSGY + +      SH+     +   G + + +  W +R +                          +D   +C  C    E+ +H
Subjt:  KVIWHFEKCGVYSVKSGYRLGQVAVLARSHLCLRVRRCPVGGRGVGK--WGSRTK-------------------------GIDVLNVCVHCGWHGESCMH

Query:  LFWQCKFFRDALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQQKFR-GRFPSTGLVDWAVNYLNVFRGASRIYCE-GMRGV
        +F +C       + S   +  +    NS+   + D+ D ++ ++ + F ++LWA+W  RN+  +  G F     V W+++ L+ ++   R + E      
Subjt:  LFWQCKFFRDALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQQKFR-GRFPSTGLVDWAVNYLNVFRGASRIYCE-GMRGV

Query:  PR-SSVKWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLLQEPAMV
        PR ++ KW+ P  G  K+NVD A+  +    G+GVVVRDE+G    + + +  ++ S    E  A   G+ + +  G     LETD   + T L +  M 
Subjt:  PR-SSVKWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLLQEPAMV

Query:  DLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFA
        D SE   ++ + +  +  +  +      RE N VA+ L HFA
Subjt:  DLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFA

XP_030509050.1 uncharacterized protein LOC115723712 [Cannabis sativa]3.0e-20936.15Show/hide
Query:  LTHLDFSRSDHRPLLLSMSVNARVVRSAGQKIR-RFEESWLQSSDFGEAVSAGWRSEA------PVGSLVELAS----------------VATRSNVQQV
        L HLD+  SDHR L L++S+    +     K R RFE+ WLQ S+  + +   W +         + ++   AS                + T     +V
Subjt:  LTHLDFSRSDHRPLLLSMSVNARVVRSAGQKIR-RFEESWLQSSDFGEAVSAGWRSEA------PVGSLVELAS----------------VATRSNVQQV

Query:  MGRLGSS--ESRADLQRAEGELESLLVEEEVYWRQRSRDSWLKWGDRNTR----------------------------AGRVLGLIEGYFENIFSTSSPS
        +    S+  E    LQ +E  L+ LL +EE YW QRSR SWLK GD NT+                            +  +L  +E YF +IF++    
Subjt:  MGRLGSS--ESRADLQRAEGELESLLVEEEVYWRQRSRDSWLKWGDRNTR----------------------------AGRVLGLIEGYFENIFSTSSPS

Query:  DGEIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCI
           I  V   +  L+S    + L  PF   EV  AL+ +  + +PG DG+S  FY   W +VGS V +  L +LN+   P+  N T+I LIPK K P  I
Subjt:  DGEIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCI

Query:  TEYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVE
        T+YRPISLCNV YK+VSK +V R++  +  +IS  QSAF+  R + DN ++ +E +H+LK +K GR G+A++KLDMSKA+DRVEW FLE++ML++GF   
Subjt:  TEYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVE

Query:  WVELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHH
         VEL+LRC+ SV YSF +NG   G + P RG+RQGDPLSPYLFL+C+EG SR+L   E   A+ GL+++R  PPI+HL FADDS+LF RA    AR +H 
Subjt:  WVELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHH

Query:  ILQCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIP
         L  Y RASGQ +N +KS++SFSP+T    +H    L  + +  CH+QYLGLPSF  R++    + I +++W+ +  W+ + FS+GG+EVLLK++VQAIP
Subjt:  ILQCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIP

Query:  CYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVG
         Y M+CF+LP KL   I + MS FWW        IHW +WK LCK K  GGMGFR+   FNQALLAKQ WRI++ P+S ++RVLK RYF +G FL A  G
Subjt:  CYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVG

Query:  SRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPVMLGLDTRVADLLTASGQWNEELLRHNFSPREVRSILTIPLRHISSEDKV
        ++PS  W+S++WG+ELL KG+RWRIG+G  V+     WIP    FK    +    D +VAD +T S QW+   L+  F+P +V  IL+IPL     ED +
Subjt:  SRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPVMLGLDTRVADLLTASGQWNEELLRHNFSPREVRSILTIPLRHISSEDKV

Query:  IWHFEKCGVYSVKSGYRLGQVAVLARSHLCLRVR---------RCPVGGRGVGKWGSRTKGIDVLNV-----------CVHCGWHGESCMHLFWQCKFFR
        +WH+   G Y+VKSGY+L      ++ +               + P   R +  W +  + +    +           C  C  H E+  H F+ C   +
Subjt:  IWHFEKCGVYSVKSGYRLGQVAVLARSHLCLRVR---------RCPVGGRGVGKWGSRTKGIDVLNV-----------CVHCGWHGESCMHLFWQCKFFR

Query:  DALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQQKFRGRFPSTGL--VDWAVNYLNVFR----------------------
              +  +        S  + L  V  ++   K E F+  LW +W  RN +    + P + L    +A +YL  FR                      
Subjt:  DALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQQKFRGRFPSTGL--VDWAVNYLNVFR----------------------

Query:  -----------------GASRIYCEGMRGV-PRSSVKWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGM
                          A + +     GV P +  KWL P  G  K+N DAA        G+  ++R+  G ++ +           +  E LA+  G+
Subjt:  -----------------GASRIYCEGMRGV-PRSSVKWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGM

Query:  RLVVEMGLAPTILETDSMRVFTLLQEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFAL
        +L++ + L    +E+DS+ V   L+      LS F V++++    +  + +   S   R  N  AH L  FAL
Subjt:  RLVVEMGLAPTILETDSMRVFTLLQEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFAL

TrEMBL top hitse value%identityAlignment
A0A2N9GB96 Uncharacterized protein5.4e-21236.72Show/hide
Query:  HLDFSRSDHRPLLLSMSVNARVVRSAGQKIRRFEESWLQSSDFGEAVSAGWRSEAPVGSLVELASVATR--------SNVQQVMGRLGSSESRADLQRAE
        HLD + SDH+P+ L+ +     ++   +K+ RFE+ W   S   E ++  W  +     +V++  + TR        S V          E + +L++AE
Subjt:  HLDFSRSDHRPLLLSMSVNARVVRSAGQKIRRFEESWLQSSDFGEAVSAGWRSEAPVGSLVELASVATR--------SNVQQVMGRLGSSESRADLQRAE

Query:  ----------------GELESLLVEEEVYWRQRSRDSWLKWGDRNT-----------RAGRVLGLIEG-----------------YFENIFSTSSPSDGE
                         EL +LL +EE  W+QRSR  WLK GD+NT           R   ++ L +G                 Y+E++F  ++P + E
Subjt:  ----------------GELESLLVEEEVYWRQRSRDSWLKWGDRNT-----------RAGRVLGLIEG-----------------YFENIFSTSSPSDGE

Query:  IDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCITEY
        +DQV + ++P V+  MN  L RP+   EV  AL+Q+ P KAPGPDG+  AFYQ  W VVG +VV   LS +N+   P  +N T + LIPK K+P+ +TEY
Subjt:  IDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCITEY

Query:  RPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVE
        RPISLCNV YK++SKVL NR+K +L  +I+  QSAF+PGR + DN ++ +E +H +  ++ GR G  +LKLDMSKAYDRVEW FL ++ML+MGF  +WV 
Subjt:  RPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVE

Query:  LVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHHILQ
        L++ CI++V YS  +NG   G + PSRGLRQGDP+SPYLFLLCAEGL+ +L+ A     I G+ + R  P ++HLFFADDSLLF RA + E   +  +L 
Subjt:  LVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHHILQ

Query:  CYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCYT
         YE+ASGQ +N  K+ + FS +T+   +  +  +  V     +++YLGLPS + + +M   + IK+RVW +++GWK K  S  GRE+L+K+++QAIP YT
Subjt:  CYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCYT

Query:  MNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGSRP
        MNCF+LP KL +DI   M  FWW  + ++R++HW+SW  LC+PK  GG+GFR+L+ FN ALLAKQ WR +    S L +V   ++FP+G+ L A + +R 
Subjt:  MNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGSRP

Query:  SYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPV-MLGLDTRVADLLTASG-QWNEELLRHNFSPREVRSILTIPLRHISSEDKVI
        S+ WRS++  + L+  G  WR+G+G+K+ +  ANW+ D+   +V SP+ M    ++VA L+  S  +W+ E +R +F P +  +IL IP+   S  DK+I
Subjt:  SYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPV-MLGLDTRVADLLTASG-QWNEELLRHNFSPREVRSILTIPLRHISSEDKVI

Query:  WHFEKCGVYSVKSGY---------------RLGQVAVLARS--HLCLRVRRCPVGGRGV-----GKWGSRTKGIDVLNVCVHCGWHGESCMHLFWQCKFF
        WH  + G YSV+SGY               R G+   L +    +C   +      R        K G   + I     C +CG   E C+H  W+C   
Subjt:  WHFEKCGVYSVKSGY---------------RLGQVAVLARS--HLCLRVRRCPVGGRGV-----GKWGSRTKGIDVLNVCVHCGWHGESCMHLFWQCKFF

Query:  RDALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQQKFRGRFPSTGLVDWAVNYLNVFRGASRIYC-EGMRGVPRSSVKWLP
          +         ++  +  S  +L+R V         E+F  + W +W  RNQ +     PS            + +  +RI+  E  +  P   V W P
Subjt:  RDALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQQKFRGRFPSTGLVDWAVNYLNVFRGASRIYC-EGMRGVPRSSVKWLP

Query:  PEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLLQEPAMVDLSEFGVLVS
        P    YKVN D A   +    G+GVV+RD+ G V+ + + R     S ++ E  A    ++  +E+G+   I E DS  +   +  P  +  + +G+++ 
Subjt:  PEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLLQEPAMVDLSEFGVLVS

Query:  EARMEVPAYIQLGFSFTRREGNRVAHELGHFAL
        +A+  +  + +  F+ TRR GN VAH L   AL
Subjt:  EARMEVPAYIQLGFSFTRREGNRVAHELGHFAL

A0A2N9H0J9 Uncharacterized protein5.4e-21236.72Show/hide
Query:  HLDFSRSDHRPLLLSMSVNARVVRSAGQKIRRFEESWLQSSDFGEAVSAGWRSEAPVGSLVELASVATR--------SNVQQVMGRLGSSESRADLQRAE
        HLD + SDH+P+ L+ +     ++   +K+ RFE+ W   S   E ++  W  +     +V++  + TR        S V          E + +L++AE
Subjt:  HLDFSRSDHRPLLLSMSVNARVVRSAGQKIRRFEESWLQSSDFGEAVSAGWRSEAPVGSLVELASVATR--------SNVQQVMGRLGSSESRADLQRAE

Query:  ----------------GELESLLVEEEVYWRQRSRDSWLKWGDRNT-----------RAGRVLGLIEG-----------------YFENIFSTSSPSDGE
                         EL +LL +EE  W+QRSR  WLK GD+NT           R   ++ L +G                 Y+E++F  ++P + E
Subjt:  ----------------GELESLLVEEEVYWRQRSRDSWLKWGDRNT-----------RAGRVLGLIEG-----------------YFENIFSTSSPSDGE

Query:  IDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCITEY
        +DQV + ++P V+  MN  L RP+   EV  AL+Q+ P KAPGPDG+  AFYQ  W VVG +VV   LS +N+   P  +N T + LIPK K+P+ +TEY
Subjt:  IDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCITEY

Query:  RPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVE
        RPISLCNV YK++SKVL NR+K +L  +I+  QSAF+PGR + DN ++ +E +H +  ++ GR G  +LKLDMSKAYDRVEW FL ++ML+MGF  +WV 
Subjt:  RPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVE

Query:  LVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHHILQ
        L++ CI++V YS  +NG   G + PSRGLRQGDP+SPYLFLLCAEGL+ +L+ A     I G+ + R  P ++HLFFADDSLLF RA + E   +  +L 
Subjt:  LVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHHILQ

Query:  CYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCYT
         YE+ASGQ +N  K+ + FS +T+   +  +  +  V     +++YLGLPS + + +M   + IK+RVW +++GWK K  S  GRE+L+K+++QAIP YT
Subjt:  CYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCYT

Query:  MNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGSRP
        MNCF+LP KL +DI   M  FWW  + ++R++HW+SW  LC+PK  GG+GFR+L+ FN ALLAKQ WR +    S L +V   ++FP+G+ L A + +R 
Subjt:  MNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGSRP

Query:  SYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPV-MLGLDTRVADLLTASG-QWNEELLRHNFSPREVRSILTIPLRHISSEDKVI
        S+ WRS++  + L+  G  WR+G+G+K+ +  ANW+ D+   +V SP+ M    ++VA L+  S  +W+ E +R +F P +  +IL IP+   S  DK+I
Subjt:  SYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPV-MLGLDTRVADLLTASG-QWNEELLRHNFSPREVRSILTIPLRHISSEDKVI

Query:  WHFEKCGVYSVKSGY---------------RLGQVAVLARS--HLCLRVRRCPVGGRGV-----GKWGSRTKGIDVLNVCVHCGWHGESCMHLFWQCKFF
        WH  + G YSV+SGY               R G+   L +    +C   +      R        K G   + I     C +CG   E C+H  W+C   
Subjt:  WHFEKCGVYSVKSGY---------------RLGQVAVLARS--HLCLRVRRCPVGGRGV-----GKWGSRTKGIDVLNVCVHCGWHGESCMHLFWQCKFF

Query:  RDALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQQKFRGRFPSTGLVDWAVNYLNVFRGASRIYC-EGMRGVPRSSVKWLP
          +         ++  +  S  +L+R V         E+F  + W +W  RNQ +     PS            + +  +RI+  E  +  P   V W P
Subjt:  RDALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQQKFRGRFPSTGLVDWAVNYLNVFRGASRIYC-EGMRGVPRSSVKWLP

Query:  PEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLLQEPAMVDLSEFGVLVS
        P    YKVN D A   +    G+GVV+RD+ G V+ + + R     S ++ E  A    ++  +E+G+   I E DS  +   +  P  +  + +G+++ 
Subjt:  PEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLLQEPAMVDLSEFGVLVS

Query:  EARMEVPAYIQLGFSFTRREGNRVAHELGHFAL
        +A+  +  + +  F+ TRR GN VAH L   AL
Subjt:  EARMEVPAYIQLGFSFTRREGNRVAHELGHFAL

A0A5E4FZN9 PREDICTED: retrotransposon5.5e-21737.45Show/hide
Query:  THLDFSRSDHRPLLLSMSVNARVVRSAGQKIRRFEESWLQSSDFGEAVSAGWRSEAPVGSLVELASVATR----------SNVQQ-----------VMGR
        +HL    SDH P+L+    +     +       FEE W +  DF + +   W+    V S+    S+  +          SNV+            + GR
Subjt:  THLDFSRSDHRPLLLSMSVNARVVRSAGQKIRRFEESWLQSSDFGEAVSAGWRSEAPVGSLVELASVATR----------SNVQQ-----------VMGR

Query:  LGSSESRADLQRAEGELESLLVEEEVYWRQRSRDSWLKWGDRNTRAGRVLGLIEGYFENIFSTSSPSDGEIDQVTSRVHPLVSDVMNNSLMRPFHREEVL
        L +++      + E  +  LL ++E+ WRQRSR     W    T   R+  L   YF+ +F  SS    +++++ + V P+++  MN+ L++ F REE+ 
Subjt:  LGSSESRADLQRAEGELESLLVEEEVYWRQRSRDSWLKWGDRNTRAGRVLGLIEGYFENIFSTSSPSDGEIDQVTSRVHPLVSDVMNNSLMRPFHREEVL

Query:  RALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCITEYRPISLCNVSYKIVSKVLVNRMKGILNGLIS
          L Q+ P KAPG DG+   F+Q  W +VG  V   CL ILN + S    N T+I LIPK K P  ++E+RPISLC   YK+++K + NR+K +L  +I+
Subjt:  RALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCITEYRPISLCNVSYKIVSKVLVNRMKGILNGLIS

Query:  HNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVIPSRGLR
         NQSAF+P R ++DN +  +E +H +K  K GR    +LKLDM+KAYDRVEWVFL  MML++GF+  WV  V+ CIS+  +S    G   G ++P RGLR
Subjt:  HNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVIPSRGLR

Query:  QGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHHILQCYERASGQTINFDKSIISFSPSTAMGVRHQ
        QG PLSPYLFL+C EG S +L  AE    + G+++ARG P ++HL FADDS+LF +A     R +  + Q YE  SGQ IN+ KS  S SP+        
Subjt:  QGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHHILQCYERASGQTINFDKSIISFSPSTAMGVRHQ

Query:  VSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDR
        +  +  V V  CH++YLGLP+   + R      +K+++W+ I GWK K  S  G+E+L+K+++QAIP Y+M+CFR+PK L ++++  M+ FWW    + R
Subjt:  VSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDR

Query:  RIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSV
         IHWV W+ LCK K  GG+GFRDLE FNQALLAKQCWRI++ P S ++R+ + RY PS  FL A VG+ PS+IWRSL WG+ELL KG+RWR+GNG  + V
Subjt:  RIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSV

Query:  YGANWIPDDVCFKVRSPVMLGLDTRVADLLTASGQWNEELLRHNFSPREVRSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLG-----------QVA
        Y   W+P    FK+ SP  L L T V DL T+SGQWN  LL+  F  +EV + L IPL  ++  D +IWH+E+ G+YSVKSGYRL             V 
Subjt:  YGANWIPDDVCFKVRSPVMLGLDTRVADLLTASGQWNEELLRHNFSPREVRSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLG-----------QVA

Query:  VLARSHLCLRVRRCPVGGR-----GVGKWGSRTKGIDVLN-------VCVHCGWHGESCMHLFWQCKFFRDALMGSEWEVLLQGVQANSMLNLLRDVKDK
        V   S    ++    +  +         W     G  + N       +C +C    ES +H  W C+  ++    S W  + +  + NS   L   ++  
Subjt:  VLARSHLCLRVRRCPVGGR-----GVGKWGSRTKGIDVLN-------VCVHCGWHGESCMHLFWQCKFFRDALMGSEWEVLLQGVQANSMLNLLRDVKDK

Query:  VDWAKFEEFVVVLWAVWCCRNQQKFRGRF-PSTGLVDWAVNYLNVFRGASRI--YCEGMRGVPRSSVK-WLPPEEGWYKVNVDAAF-CGDLFRAGVGVVV
            +   F  + W +W  RN   F G+   +T L+         F  A+ +     G +  P++ +  W PP  G YK+NVD A   GD  R GVGVVV
Subjt:  VDWAKFEEFVVVLWAVWCCRNQQKFRGRF-PSTGLVDWAVNYLNVFRGASRI--YCEGMRGVPRSSVK-WLPPEEGWYKVNVDAAF-CGDLFRAGVGVVV

Query:  RDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLL---QEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRV
        R+  G  M +   R          E +A ++G+R  ++MG    +LE D+      +   +E   +D    G+L+ E    +  +  +   +T R GN+V
Subjt:  RDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLL---QEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRV

Query:  AHELGHFA
        AH L  FA
Subjt:  AHELGHFA

A0A803PV25 Uncharacterized protein1.3e-21337.41Show/hide
Query:  ELTHLDFSRSDHRPLLLSMSV--NARVVRSAGQKIR-RFEESWLQSSDFGEAVSAGWRSE---APVGSL---VELASVATRSNVQQVMGRLGSSESR---
        +++ LD+  SDHR L++ + V  +      A +K R  FEE+W Q  +  E V   W  E     VGS    +     A ++  ++   +L S   +   
Subjt:  ELTHLDFSRSDHRPLLLSMSV--NARVVRSAGQKIR-RFEESWLQSSDFGEAVSAGWRSE---APVGSL---VELASVATRSNVQQVMGRLGSSESR---

Query:  --------------ADLQRAEGELESLLVEEEVYWRQRSRDSWLKWGDRNTR-------AGR---------------------VLGLIEGYFENIFSTSS
                        +Q+ E +L  LL ++E YWRQRSR  WL+WGDRNT+       A R                     V  ++E Y+E +F++S 
Subjt:  --------------ADLQRAEGELESLLVEEEVYWRQRSRDSWLKWGDRNTR-------AGR---------------------VLGLIEGYFENIFSTSS

Query:  PSDGEIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPK
         ++  +++V S V P VS VMNN L+  F  EEV+RA+++++P KAPG DGL   FYQ  WS +  DVV   L++LNN A    LN+T++ LIPK   P+
Subjt:  PSDGEIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPK

Query:  CITEYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFA
         I E+RPISLCNV YKIVSK L NRM+  L  ++S +QSAF+ GR + DNAI+GYE +H ++  +       +LKLDM+KAYDRVEW FLE MM+++G++
Subjt:  CITEYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFA

Query:  VEWVELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTV
          WV  ++ C++SV++SF +NG   G V+P RGLRQGDPLSP+LFLLCAE  S ++  AE    + G+   R    +SHLFFADDSL+F  A E E R  
Subjt:  VEWVELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTV

Query:  HHILQCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQA
          +L+ Y  ASGQ +NF KS + F  S +  VR  ++    V+V   + +YLGLPSF+ R +     FI  +VW +++GWKG FFS  G+EVL+K+IVQA
Subjt:  HHILQCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQA

Query:  IPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAG
        IP YTM+CFRLPKK +  I    + FWW    +D +IHW  W  LCK K  GG+GFRDL  FNQALLAKQ WR ++YP S  S+VLK  Y+P+   L A 
Subjt:  IPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAG

Query:  VGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPVMLGLDTRVADLLTASGQWNEELLRHNFSPREVRSILTIPLRHISSED
         G+  S++WRSL+WG++++  G RWRIGNG  V V    W+P  V FK+     L  +  V DL   +G+W+EE +R  F+P +   IL +       ED
Subjt:  VGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPVMLGLDTRVADLLTASGQWNEELLRHNFSPREVRSILTIPLRHISSED

Query:  KVIWHFEKCGVYSVKSGYRLGQVAVLARSHLCLRVRRCPVGGRGVGKWGSRT-----------------------------KGIDVLNVCVHCGWHG-ES
        K++WH+ K G YSV+SGYR+            L VR          +W  +                              + + +   C  C     E+
Subjt:  KVIWHFEKCGVYSVKSGYRLGQVAVLARSHLCLRVRRCPVGGRGVGKWGSRT-----------------------------KGIDVLNVCVHCGWHG-ES

Query:  CMHLFWQCKFFRDALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQQKFRGRFP-STGLVDWAVNYLNVFRGASRIYCEGMR
          H  W C+   D    S +   ++      +L  L  +   +    FE F+V+ W +W  RN     G  P +  +V+W   +L  FR ++        
Subjt:  CMHLFWQCKFFRDALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQQKFRGRFP-STGLVDWAVNYLNVFRGASRIYCEGMR

Query:  GVPRSSVKWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLLQEPAM
        G  R++ +W+ P  G Y +NVDA        A V  V+RD  GRV ++A    +   SP  AE  A+ DG++  ++  L    +ETD ++   L+ +   
Subjt:  GVPRSSVKWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLLQEPAM

Query:  VDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFAL
            +   LV++ R  +      G SF  RE N+ AH L + AL
Subjt:  VDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFAL

A0A803QC75 Uncharacterized protein1.4e-21235.8Show/hide
Query:  HLDFSRSDHRPLLLSMSVNARVVRSAGQKIR-RFEESWLQSSDFGEAVSAGWRS---EAPVGSLVE-LASVAT-------------RSNVQQVMGRL---
        HLD+  SDHR L +S S  ++    A  K R RFE+ WL   +  E +   W S     P+ ++++ L++ AT             + N+  +  ++   
Subjt:  HLDFSRSDHRPLLLSMSVNARVVRSAGQKIR-RFEESWLQSSDFGEAVSAGWRS---EAPVGSLVE-LASVAT-------------RSNVQQVMGRL---

Query:  -----GSSESRADLQRAEGELESLLVEEEVYWRQRSRDSWLKWGDRNTR----------------------------AGRVLGLIEGYFENIFSTSSPSD
              SS+   +L  AE  LE LL +EEVYW+QRSR  WL  GDRNT+                             G +  +++ ++ ++FS++S  +
Subjt:  -----GSSESRADLQRAEGELESLLVEEEVYWRQRSRDSWLKWGDRNTR----------------------------AGRVLGLIEGYFENIFSTSSPSD

Query:  GEIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCIT
          ++   S +   VS  +N SL++PF  +EV  AL  + P+K+PG DG+S  FYQ +W++VG  V    LSILN+ A P  LN+T+I LIPK K P+ I 
Subjt:  GEIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMIVLIPKKKSPKCIT

Query:  EYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEW
        EYRPISLCNV  K+V+KVLV+R K  L  +IS  QSAF+P R + DN ++ +E +HA+K K  GR G AS KLDMSKA+DRVEW F+E +M +MGFA  W
Subjt:  EYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEW

Query:  VELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHHI
        + L++ C+++  +SF +NG   G++ PSRGLRQG PLSPYLFL+C+EG SR+L   + +  + G ++ R  PPI+HLFFADDSLLF +A E     +  +
Subjt:  VELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRAKEGEARTVHHI

Query:  LQCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPC
        L  Y +ASGQ +N DKS++SFSP+T +  +    Q   + +  CH++YLGLPS+  R++    + IKER+W+ +  W  K FS GG+E+LLK++VQ+IP 
Subjt:  LQCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPC

Query:  YTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGS
        Y M+CFRLP    + +   M+ FWW       RIHW SW  LCK K  GGMGFR    FNQALLAKQ WRI + P S L R+LK RYFP+ +FL A +G 
Subjt:  YTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGS

Query:  RPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDD---VCFKVRSPVMLGLDTRVADLLTASGQWNEELLRHNFSPREVRSILTIPLRHISSED
         PS  W+ + W RELL KG+RW++G+G  +      WIP     + F    P     +  V++L+T   QW+  LL+  FS  +V  IL++PL   SS D
Subjt:  RPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDD---VCFKVRSPVMLGLDTRVADLLTASGQWNEELLRHNFSPREVRSILTIPLRHISSED

Query:  KVIWHFEKCGVYSVKSGYRL-GQVAVLARSHLCLR---------VRRCPVGGRGVGKWGSRTKGIDVL-----------NVCVHCGWHGESCMHLFWQCK
         +IWH    G+Y+VKSGY L   + ++ +S    R           + P   + +  W +    + V            + C  C    ES  H  + CK
Subjt:  KVIWHFEKCGVYSVKSGYRL-GQVAVLARSHLCLR---------VRRCPVGGRGVGKWGSRTKGIDVL-----------NVCVHCGWHGESCMHLFWQCK

Query:  F----FRDALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQ--QKFRGRFPSTGLVDWAVNYLNVFRGASR-----------
        +    +R + +  +W +     + + +L+ L  +  K   A+ E+    LW +W  RN+     + R P+  L  +AV YL  +R A +           
Subjt:  F----FRDALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCRNQ--QKFRGRFPSTGLVDWAVNYLNVFRGASR-----------

Query:  ----------IYCEGMRGVPRSSV-----KWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMG
                   + +     P+  V     +W PPE   YK+NVDAA        G+G ++RD  G V+ + + +     S    E +A+   +  V++  
Subjt:  ----------IYCEGMRGVPRSSV-----KWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMG

Query:  LAPTILETDSMRVFTLLQEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFAL-KEGGCS-------AVIELVKIE--IGKRIRDPS
        L  +++E+D++ V   L+ P    +S F  L+ +    +     +  +  +RE N  AH L  FAL  +  CS        +  ++++E  + K  + PS
Subjt:  LAPTILETDSMRVFTLLQEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFAL-KEGGCS-------AVIELVKIE--IGKRIRDPS

Query:  E
        E
Subjt:  E

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657505.3e-3924.92Show/hide
Query:  LPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGG
        +P    R   +T   I ERV  ++ GW+ K  S  GR  L K+++ ++P ++M+   LP+ ++  + +    F W    E ++ H V W  +C PK  GG
Subjt:  LPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGG

Query:  MGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRY-----------FPSGDFLAAGVGSRPSYIWRSLIWG-RELLGKGIRWRIGNGEKVSVYGANWI
        +G R  ++ N+AL++K  WR++Q   S  + VL+ +Y            P G +         S  WRS+  G R+++  G+ W  G+G+++  +   W+
Subjt:  MGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRY-----------FPSGDFLAAGVGSRPSYIWRSLIWG-RELLGKGIRWRIGNGEKVSVYGANWI

Query:  PDDVCFKV-RSPVMLGLDTRVA-DLLTASGQWNEELL---RHNFSPREVRSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQVAVLARSHL-----
              ++         DT VA DL      W+   +     N +  E+R+++   +    + D++ W F + G +SV+S Y +  V  + R ++     
Subjt:  PDDVCFKV-RSPVMLGLDTRVA-DLLTASGQWNEELL---RHNFSPREVRSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQVAVLARSHL-----

Query:  CLRVRRCP---------VGGRGVGKWGSR-TKGIDVLNVCVHCGWHGESCMHLFWQCKFFRDALMGSEWEVLLQGVQANSMLNLL------RDVKDKVDW
        CL   R P         VG + V     R  + +   NVC  C    ES +H+   C       +    +   QG  + S+   L      R   + + W
Subjt:  CLRVRRCP---------VGGRGVGKWGSR-TKGIDVLNVCVHCGWHGESCMHLFWQCKFFRDALMGSEWEVLLQGVQANSMLNLL------RDVKDKVDW

Query:  AKFEEFVVVLWAVW---C---------CRNQQKFRGRFPSTGLVDWAVNYL-----NVFRGASRIYCEGMRGVPRSSVKWLPPEEGWYKVNVDAAFCGDL
        +    F V++W  W   C         CR++ KF        + +WAV        NV  G ++   E M G       W+ P  GW KVN D A  G+ 
Subjt:  AKFEEFVVVLWAVW---C---------CRNQQKFRGRFPSTGLVDWAVNYL-----NVFRGASRIYCEGMRGVPRSSVKWLPPEEGWYKVNVDAAFCGDL

Query:  FRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLLQEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTR
          A  G V+RD  G      ++      +P  AE   V  G+    E  +    LE DS  +   L +  + D      LV      +     +      
Subjt:  FRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSMRVFTLLQEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTR

Query:  REGNRVAHELGHFA
        RE NR+A  L ++A
Subjt:  REGNRVAHELGHFA

P11369 LINE-1 retrotransposable element ORF2 protein4.8e-4025.74Show/hide
Query:  GDRNTRAGRVLGLIEGYFENIFSTSSPSDGEIDQVTSRVH-PLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCL
        GD  T    +   I  +++ ++ST   +  E+D+   R   P ++    + L  P   +E+   +  +   K+PGPDG S  FYQ     +   +     
Subjt:  GDRNTRAGRVLGLIEGYFENIFSTSSPSDGEIDQVTSRVH-PLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCL

Query:  SILNNKASPAPLNETMIVLIPK-KKSPKCITEYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWA
         I      P    E  I LIPK +K P  I  +RPISL N+  KI++K+L NR++  +  +I  +Q  FIPG     N       IH +   K+      
Subjt:  SILNNKASPAPLNETMIVLIPK-KKSPKCITEYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWA

Query:  SLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIAR
         + LD  KA+D+++  F+ +++ R G    ++ ++    S    +  VNG +   +    G RQG PLSPYLF +  E L+R +      + I G++I +
Subjt:  SLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIAR

Query:  GCPPISHLFFADDSLLFFRAKEGEARTVHHILQCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNF--IK
            IS L  ADD +++    +   R + +++  +    G  IN +KS ++F  +       ++ +     +   + +YLG+        +   NF  +K
Subjt:  GCPPISHLFFADDSLLFFRAKEGEARTVHHILQCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNF--IK

Query:  ERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCYTMNC--FRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPK-CLGGMGFRDLETFNQALL
        + +   ++ WK    S  GR  ++K  +     Y  N    ++P +   ++  A+  F WN +    RI     K+L K K   GG+   DL+ + +A++
Subjt:  ERVWRQIQGWKGKFFSVGGREVLLKSIVQAIPCYTMNC--FRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPK-CLGGMGFRDLETFNQALL

Query:  AKQCW
         K  W
Subjt:  AKQCW

P14381 Transposon TX1 uncharacterized 149 kDa protein2.6e-3826.32Show/hide
Query:  YFENIFSTSSPSDGEIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMI
        +++N+FS    S    +++   + P+VS+     L  P   +E+ +ALR +  NK+PG DGL+  F+Q  W  +G D              P      ++
Subjt:  YFENIFSTSSPSDGEIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLNETMI

Query:  VLIPKKKSPKCITEYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFL
         L+PKK   + I  +RP+SL +  YKIV+K +  R+K +L  +I  +QS  +PGR + DN  L  + +H   A++ G    A L LD  KA+DRV+  +L
Subjt:  VLIPKKKSPKCITEYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFL

Query:  ERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFF
           +    F  ++V  +    +S      +N      +   RG+RQG PLS  L+ L  E    +L      + +TGL +      +    +ADD +L  
Subjt:  ERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFF

Query:  RAKEGEARTVHHILQCYERASGQTINFDKSI--------ISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKG
        +    +        + Y  AS   IN+ KS         + F P     +  +   +  + V    ++Y    +F+          ++E V  ++  WKG
Subjt:  RAKEGEARTVHHILQCYERASGQTINFDKSI--------ISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKG

Query:  --KFFSVGGREVLLKSIVQAIPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMG
          K  S+ GR +++  +V +   Y + C    ++ +  I R +  F W G+      HWVS      P   GG G
Subjt:  --KFFSVGGREVLLKSIVQAIPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMG

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM1.8e-1527.74Show/hide
Query:  YFENIFSTSSPS--DGEIDQVTSRVHPLVSDVMNNSLMRPFH--REEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLN
        Y+  + +  SPS   GE+ Q            M++SL R +    E+ LRA R +  + +PGPDG++    +   S +   ++N  L   N    P  + 
Subjt:  YFENIFSTSSPS--DGEIDQVTSRVHPLVSDVMNNSLMRPFH--REEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCCLSILNNKASPAPLN

Query:  ETMIVLIPKKKSPKCITEYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNA-ILGYECIHALKAKKVGRTGWASLKLDMSKAYDRV
            V IPK  + K   ++RPIS+ +V  + ++ +L  R+   +N      Q  F+P     DNA I+     H+ K  +          LD+SKA+D +
Subjt:  ETMIVLIPKKKSPKCITEYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNA-ILGYECIHALKAKKVGRTGWASLKLDMSKAYDRV

Query:  EWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADD
            +   +   G    +V+ V         S N +G    + +P+RG++QGDPLSP LF L  + L R L    GA+    +  A          FADD
Subjt:  EWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADD

Query:  SLLFFRAKEG
         +LF   + G
Subjt:  SLLFFRAKEG

P93295 Uncharacterized mitochondrial protein AtMg003104.5e-3850.68Show/hide
Query:  AIPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPK-CLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLA
        A+P Y M+CFRL K L + ++ AM+ FWW+     R+I WV+W+ LCK K   GG+GFRDL  FNQALLAKQ +RI+  P + LSR+L+ RYFP    + 
Subjt:  AIPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPK-CLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLA

Query:  AGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDD
          VG+RPSY WRS+I GRELL +G+   IG+G    V+   WI D+
Subjt:  AGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDD

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein3.2e-2322.56Show/hide
Query:  LKGRYFPSGDFLAAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPVMLGLDTRVADLLTASGQ---WNEELLRHNFSP
        +K RYF     L A V  + SY W SL+ G  LL KG R  IG+G+ + + G + I D    +  +      +  + +L    G    W++  +      
Subjt:  LKGRYFPSGDFLAAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPVMLGLDTRVADLLTASGQ---WNEELLRHNFSP

Query:  REVRSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRL------GQVAVLARSH----LCLRVRRCPVGGR-GVGKWGS-----------RTKGIDVLNV
         +   I  I L      DK+IW++   G Y+V+SGY L        +  +   H    L  R+   P+  +     W +            T+G+ +   
Subjt:  REVRSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRL------GQVAVLARSH----LCLRVRRCPVGGR-GVGKWGS-----------RTKGIDVLNV

Query:  CVHCGWHGESCMHLFWQCKFFRDALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEF-----VVVLWAVWCCRNQ---QKFRGRFPSTGLVDWAV-
        C  C    ES  H  + C F   A   S+  ++   + +N     + ++ + V      +F     V ++W +W  RN     KFR     T L   A  
Subjt:  CVHCGWHGESCMHLFWQCKFFRDALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEF-----VVVLWAVWCCRNQ---QKFRGRFPSTGLVDWAV-

Query:  -NYLNVFRGASRIYCEGMRGVPRSSVKWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAP
         ++LN  +   +      R +  + ++W  P   + K N DA F      A  G ++R+  G  +   +++  H  +P  AE  A++  ++     G   
Subjt:  -NYLNVFRGASRIYCEGMRGVPRSSVKWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAP

Query:  TILETDSMRVFTLLQEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHF
          +E D   +  L+    +   S     + +       +  + F F RR+GN++AH L  +
Subjt:  TILETDSMRVFTLLQEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHF

AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.7e-1439.36Show/hide
Query:  LVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVR
        +V R+K ++  LI   Q++FIPGR   DN +   E +H+++ KK G  GW  LKLD+ KAYDR+ W +LE  ++  GF   W+  + R     R
Subjt:  LVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAYDRVEWVFLERMMLRMGFAVEWVELVLRCISSVR

AT4G29090.1 Ribonuclease H-like superfamily protein2.4e-5528.65Show/hide
Query:  AIPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAA
        A+P YTM CF LPK + + I   ++ FWW  + E + +HW +W  L   K  GG+GF+D+E FN ALL KQ WR++  P S +++V K RYF   D L A
Subjt:  AIPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAA

Query:  GVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWI---PDDVCFKV-RSP----VMLGLDTRVADLLTASG-QWNEELLRHNFSPREVRSILT
         +GSRPS++W+S+   +E+L +G R  +GNGE + ++   W+   P     ++ R P      +    +V+DL+  SG +W ++++   F   E + I  
Subjt:  GVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWI---PDDVCFKV-RSP----VMLGLDTRVADLLTASG-QWNEELLRHNFSPREVRSILT

Query:  IPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQVAVLARSHLCLRVRRCPVGGRGVGKWGSRT------------------------KGIDVLNVCVHCGW
        +        D   W +   G Y+VKSGY +    +  RS     V    +       W S+T                        + +   + C+ C  
Subjt:  IPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQVAVLARSHLCLRVRRCPVGGRGVGKWGSRT------------------------KGIDVLNVCVHCGW

Query:  HGESCMHLFWQCKFFRDALMGSEWEVLLQGVQANSMLNLLRDV----KDKVDWAKFEEFVV-VLWAVWCCRNQQKFRGR-FPSTGLVDWAVNYLNVFRGA
          E+  HL ++C F R     S   + L G  A+S+   L  V         W K  + V  +LW +W  RN+  FRGR F +  ++  A + L  +R  
Subjt:  HGESCMHLFWQCKFFRDALMGSEWEVLLQGVQANSMLNLLRDV----KDKVDWAKFEEFVV-VLWAVWCCRNQQKFRGR-FPSTGLVDWAVNYLNVFRGA

Query:  SRI-YCEGMRGVPRSSV-KWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSM
        +    C     V RSS  +W PP   W K N DA +  D  R G+G V+R+E G V    A     + S   AE  A+   +  +        I E+DS 
Subjt:  SRI-YCEGMRGVPRSSV-KWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRLVVEMGLAPTILETDSM

Query:  RVFTLLQEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFAL
         +  +L    +         + + +  +  + ++ F F  REGN +A  +   +L
Subjt:  RVFTLLQEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFAL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.2e-3950.68Show/hide
Query:  AIPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPK-CLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLA
        A+P Y M+CFRL K L + ++ AM+ FWW+     R+I WV+W+ LCK K   GG+GFRDL  FNQALLAKQ +RI+  P + LSR+L+ RYFP    + 
Subjt:  AIPCYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPK-CLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLA

Query:  AGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDD
          VG+RPSY WRS+I GRELL +G+   IG+G    V+   WI D+
Subjt:  AGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVSVYGANWIPDD

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)4.2e-1553.62Show/hide
Query:  FNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDS
        F +NG   G V PSRGLRQGDPLSPYLF+LC E LS +   A+    + G+R++   P I+HL FADD+
Subjt:  FNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGAGCTGACCCATCTGGATTTTAGCCGGTCGGACCATAGGCCTCTTCTGCTTTCGATGTCAGTGAATGCTCGGGTGGTTCGTAGCGCAGGGCAGAAGATTCGGAG
GTTCGAGGAATCTTGGCTCCAGTCCTCTGATTTTGGGGAGGCAGTGTCAGCAGGTTGGAGGTCTGAGGCTCCTGTTGGATCGCTTGTGGAGCTGGCTTCAGTAGCTACCA
GGTCGAATGTCCAACAGGTGATGGGTAGGCTGGGTTCTTCTGAGTCTCGAGCAGACTTACAAAGGGCAGAAGGAGAACTAGAGTCTCTTCTTGTAGAGGAGGAGGTGTAC
TGGAGGCAGAGATCCCGAGATAGTTGGCTAAAATGGGGAGATAGAAATACCCGAGCCGGTAGGGTGTTGGGTCTGATTGAGGGGTATTTTGAGAACATCTTCTCGACATC
CTCCCCGTCTGATGGGGAGATTGATCAGGTGACATCTCGAGTTCATCCTTTGGTCTCGGATGTGATGAATAACAGTTTGATGCGACCTTTCCATCGCGAAGAGGTACTTC
GAGCTTTGAGGCAGATCCATCCTAATAAAGCTCCGGGACCAGATGGCCTGTCGGGAGCCTTTTACCAGCATTCCTGGTCGGTAGTGGGGTCTGATGTGGTTAACTGTTGT
TTGAGCATCCTGAACAACAAAGCCTCTCCAGCACCTTTGAATGAAACGATGATTGTGTTGATCCCCAAGAAGAAGAGCCCCAAGTGTATCACGGAGTACAGGCCAATCTC
TCTGTGCAATGTTTCGTACAAAATCGTGTCCAAGGTTTTGGTGAACCGCATGAAGGGTATACTCAATGGGCTGATTTCCCACAATCAGAGTGCATTTATCCCAGGGCGCT
GCGTGGTGGACAATGCTATTCTGGGGTATGAATGCATTCACGCTCTGAAAGCGAAGAAGGTAGGGAGAACAGGGTGGGCCTCGCTTAAATTGGACATGAGCAAAGCCTAC
GACAGGGTGGAGTGGGTCTTCCTGGAACGGATGATGCTTAGAATGGGGTTTGCGGTGGAGTGGGTGGAGCTTGTGCTACGGTGCATATCATCGGTCCGGTACTCTTTTAA
TGTGAATGGTGTGAGATGTGGGGATGTCATCCCTAGCAGGGGTCTTCGTCAGGGAGATCCTCTTTCCCCGTATCTGTTCTTACTGTGTGCTGAAGGGCTCTCCCGTATGT
TGCATGATGCGGAAGGGGCGAGGGCTATCACAGGGTTGAGAATAGCGCGTGGGTGTCCTCCTATCTCCCATTTGTTTTTTGCTGATGACAGCCTTCTTTTCTTTCGGGCT
AAGGAGGGTGAAGCTCGGACTGTCCATCACATCCTCCAGTGCTATGAGCGAGCATCCGGGCAAACAATAAATTTTGACAAATCGATTATCTCCTTTAGTCCGAGCACTGC
AATGGGGGTCCGACACCAGGTGAGTCAGCTTTTTCAGGTTCAAGTAACTGCCTGCCATCAGCAGTATTTAGGTTTGCCGTCTTTCATGCCTCGGAACAGAATGAACACCC
TGAATTTTATTAAGGAGCGGGTTTGGCGTCAGATTCAGGGTTGGAAGGGGAAGTTTTTTTCTGTGGGAGGTAGGGAGGTGTTACTGAAGTCAATAGTGCAGGCTATCCCG
TGTTATACTATGAATTGTTTTCGCTTGCCTAAAAAGCTGGTTCAGGACATTAGTAGGGCAATGTCGGGATTCTGGTGGAATGGAGAGACAGAAGATAGAAGGATCCACTG
GGTGAGTTGGAAGACGCTGTGCAAGCCAAAATGTTTGGGTGGAATGGGTTTCAGGGATTTGGAAACTTTCAACCAAGCCCTCTTGGCCAAACAGTGTTGGAGGATTGTTC
AGTATCCTGCCTCGTTTCTCTCTCGGGTGTTGAAGGGGCGGTATTTTCCTAGTGGGGACTTCTTGGCTGCAGGGGTGGGCTCCCGTCCCTCGTATATTTGGAGGAGCCTG
ATTTGGGGGAGGGAGCTTTTGGGGAAGGGTATTCGCTGGAGGATTGGGAATGGGGAGAAAGTCAGTGTATATGGAGCTAACTGGATCCCAGATGATGTCTGTTTTAAGGT
TCGGTCCCCAGTTATGTTGGGTCTTGATACTCGAGTTGCTGACCTCCTCACTGCCTCTGGCCAGTGGAACGAGGAACTTCTTCGACATAACTTTAGCCCTCGCGAGGTAC
GCTCTATCCTTACTATTCCATTGCGACACATATCGTCCGAAGATAAAGTCATATGGCATTTTGAGAAGTGTGGGGTCTATTCAGTGAAAAGTGGGTACCGGCTTGGCCAG
GTGGCTGTGCTGGCCCGGTCCCATCTTTGTCTTCGAGTGAGGCGTTGTCCAGTTGGTGGAAGGGGTGTTGGAAAATGGGGATCCCGAACAAAGGGCATTGATGTTCTGAA
TGTGTGTGTGCATTGTGGTTGGCATGGGGAATCATGCATGCATTTGTTTTGGCAGTGCAAGTTTTTTCGGGATGCATTGATGGGGTCCGAGTGGGAGGTTTTGCTGCAGG
GTGTCCAGGCGAATTCTATGCTTAATCTGTTAAGGGATGTGAAGGATAAGGTTGACTGGGCTAAGTTCGAAGAGTTTGTGGTGGTGCTGTGGGCTGTGTGGTGTTGTCGA
AACCAACAGAAGTTTAGAGGGCGGTTTCCTTCAACAGGCCTCGTGGATTGGGCGGTGAACTATCTTAATGTGTTTCGGGGGGCCTCTCGGATCTATTGTGAGGGGATGAG
AGGTGTGCCCCGGAGTTCAGTAAAATGGCTCCCGCCGGAGGAGGGATGGTATAAGGTTAATGTGGATGCTGCTTTTTGTGGGGATCTGTTTCGGGCAGGTGTTGGGGTTG
TTGTTCGGGATGAGGTTGGGCGCGTCATGCTATCAGCCGCTATTAGGCATGATCATGTGGGGAGTCCAGACTTGGCAGAGGGTCTGGCGGTGGTGGACGGGATGAGGCTT
GTGGTGGAAATGGGTTTAGCTCCGACAATTCTTGAGACTGACTCTATGCGGGTTTTTACGCTACTTCAGGAGCCTGCGATGGTGGATCTGTCGGAGTTTGGTGTGCTAGT
TTCTGAGGCTCGAATGGAGGTGCCTGCGTATATTCAACTCGGGTTCAGCTTTACAAGAAGGGAAGGAAATCGTGTCGCCCATGAGTTAGGCCACTTTGCTTTAAAAGAAG
GGGGTTGTAGCGCGGTGATTGAATTAGTGAAGATCGAGATTGGAAAGCGGATCAGAGATCCATCTGAACTGGAAGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATGAGCTGACCCATCTGGATTTTAGCCGGTCGGACCATAGGCCTCTTCTGCTTTCGATGTCAGTGAATGCTCGGGTGGTTCGTAGCGCAGGGCAGAAGATTCGGAG
GTTCGAGGAATCTTGGCTCCAGTCCTCTGATTTTGGGGAGGCAGTGTCAGCAGGTTGGAGGTCTGAGGCTCCTGTTGGATCGCTTGTGGAGCTGGCTTCAGTAGCTACCA
GGTCGAATGTCCAACAGGTGATGGGTAGGCTGGGTTCTTCTGAGTCTCGAGCAGACTTACAAAGGGCAGAAGGAGAACTAGAGTCTCTTCTTGTAGAGGAGGAGGTGTAC
TGGAGGCAGAGATCCCGAGATAGTTGGCTAAAATGGGGAGATAGAAATACCCGAGCCGGTAGGGTGTTGGGTCTGATTGAGGGGTATTTTGAGAACATCTTCTCGACATC
CTCCCCGTCTGATGGGGAGATTGATCAGGTGACATCTCGAGTTCATCCTTTGGTCTCGGATGTGATGAATAACAGTTTGATGCGACCTTTCCATCGCGAAGAGGTACTTC
GAGCTTTGAGGCAGATCCATCCTAATAAAGCTCCGGGACCAGATGGCCTGTCGGGAGCCTTTTACCAGCATTCCTGGTCGGTAGTGGGGTCTGATGTGGTTAACTGTTGT
TTGAGCATCCTGAACAACAAAGCCTCTCCAGCACCTTTGAATGAAACGATGATTGTGTTGATCCCCAAGAAGAAGAGCCCCAAGTGTATCACGGAGTACAGGCCAATCTC
TCTGTGCAATGTTTCGTACAAAATCGTGTCCAAGGTTTTGGTGAACCGCATGAAGGGTATACTCAATGGGCTGATTTCCCACAATCAGAGTGCATTTATCCCAGGGCGCT
GCGTGGTGGACAATGCTATTCTGGGGTATGAATGCATTCACGCTCTGAAAGCGAAGAAGGTAGGGAGAACAGGGTGGGCCTCGCTTAAATTGGACATGAGCAAAGCCTAC
GACAGGGTGGAGTGGGTCTTCCTGGAACGGATGATGCTTAGAATGGGGTTTGCGGTGGAGTGGGTGGAGCTTGTGCTACGGTGCATATCATCGGTCCGGTACTCTTTTAA
TGTGAATGGTGTGAGATGTGGGGATGTCATCCCTAGCAGGGGTCTTCGTCAGGGAGATCCTCTTTCCCCGTATCTGTTCTTACTGTGTGCTGAAGGGCTCTCCCGTATGT
TGCATGATGCGGAAGGGGCGAGGGCTATCACAGGGTTGAGAATAGCGCGTGGGTGTCCTCCTATCTCCCATTTGTTTTTTGCTGATGACAGCCTTCTTTTCTTTCGGGCT
AAGGAGGGTGAAGCTCGGACTGTCCATCACATCCTCCAGTGCTATGAGCGAGCATCCGGGCAAACAATAAATTTTGACAAATCGATTATCTCCTTTAGTCCGAGCACTGC
AATGGGGGTCCGACACCAGGTGAGTCAGCTTTTTCAGGTTCAAGTAACTGCCTGCCATCAGCAGTATTTAGGTTTGCCGTCTTTCATGCCTCGGAACAGAATGAACACCC
TGAATTTTATTAAGGAGCGGGTTTGGCGTCAGATTCAGGGTTGGAAGGGGAAGTTTTTTTCTGTGGGAGGTAGGGAGGTGTTACTGAAGTCAATAGTGCAGGCTATCCCG
TGTTATACTATGAATTGTTTTCGCTTGCCTAAAAAGCTGGTTCAGGACATTAGTAGGGCAATGTCGGGATTCTGGTGGAATGGAGAGACAGAAGATAGAAGGATCCACTG
GGTGAGTTGGAAGACGCTGTGCAAGCCAAAATGTTTGGGTGGAATGGGTTTCAGGGATTTGGAAACTTTCAACCAAGCCCTCTTGGCCAAACAGTGTTGGAGGATTGTTC
AGTATCCTGCCTCGTTTCTCTCTCGGGTGTTGAAGGGGCGGTATTTTCCTAGTGGGGACTTCTTGGCTGCAGGGGTGGGCTCCCGTCCCTCGTATATTTGGAGGAGCCTG
ATTTGGGGGAGGGAGCTTTTGGGGAAGGGTATTCGCTGGAGGATTGGGAATGGGGAGAAAGTCAGTGTATATGGAGCTAACTGGATCCCAGATGATGTCTGTTTTAAGGT
TCGGTCCCCAGTTATGTTGGGTCTTGATACTCGAGTTGCTGACCTCCTCACTGCCTCTGGCCAGTGGAACGAGGAACTTCTTCGACATAACTTTAGCCCTCGCGAGGTAC
GCTCTATCCTTACTATTCCATTGCGACACATATCGTCCGAAGATAAAGTCATATGGCATTTTGAGAAGTGTGGGGTCTATTCAGTGAAAAGTGGGTACCGGCTTGGCCAG
GTGGCTGTGCTGGCCCGGTCCCATCTTTGTCTTCGAGTGAGGCGTTGTCCAGTTGGTGGAAGGGGTGTTGGAAAATGGGGATCCCGAACAAAGGGCATTGATGTTCTGAA
TGTGTGTGTGCATTGTGGTTGGCATGGGGAATCATGCATGCATTTGTTTTGGCAGTGCAAGTTTTTTCGGGATGCATTGATGGGGTCCGAGTGGGAGGTTTTGCTGCAGG
GTGTCCAGGCGAATTCTATGCTTAATCTGTTAAGGGATGTGAAGGATAAGGTTGACTGGGCTAAGTTCGAAGAGTTTGTGGTGGTGCTGTGGGCTGTGTGGTGTTGTCGA
AACCAACAGAAGTTTAGAGGGCGGTTTCCTTCAACAGGCCTCGTGGATTGGGCGGTGAACTATCTTAATGTGTTTCGGGGGGCCTCTCGGATCTATTGTGAGGGGATGAG
AGGTGTGCCCCGGAGTTCAGTAAAATGGCTCCCGCCGGAGGAGGGATGGTATAAGGTTAATGTGGATGCTGCTTTTTGTGGGGATCTGTTTCGGGCAGGTGTTGGGGTTG
TTGTTCGGGATGAGGTTGGGCGCGTCATGCTATCAGCCGCTATTAGGCATGATCATGTGGGGAGTCCAGACTTGGCAGAGGGTCTGGCGGTGGTGGACGGGATGAGGCTT
GTGGTGGAAATGGGTTTAGCTCCGACAATTCTTGAGACTGACTCTATGCGGGTTTTTACGCTACTTCAGGAGCCTGCGATGGTGGATCTGTCGGAGTTTGGTGTGCTAGT
TTCTGAGGCTCGAATGGAGGTGCCTGCGTATATTCAACTCGGGTTCAGCTTTACAAGAAGGGAAGGAAATCGTGTCGCCCATGAGTTAGGCCACTTTGCTTTAAAAGAAG
GGGGTTGTAGCGCGGTGATTGAATTAGTGAAGATCGAGATTGGAAAGCGGATCAGAGATCCATCTGAACTGGAAGGGTGA
Protein sequenceShow/hide protein sequence
MHELTHLDFSRSDHRPLLLSMSVNARVVRSAGQKIRRFEESWLQSSDFGEAVSAGWRSEAPVGSLVELASVATRSNVQQVMGRLGSSESRADLQRAEGELESLLVEEEVY
WRQRSRDSWLKWGDRNTRAGRVLGLIEGYFENIFSTSSPSDGEIDQVTSRVHPLVSDVMNNSLMRPFHREEVLRALRQIHPNKAPGPDGLSGAFYQHSWSVVGSDVVNCC
LSILNNKASPAPLNETMIVLIPKKKSPKCITEYRPISLCNVSYKIVSKVLVNRMKGILNGLISHNQSAFIPGRCVVDNAILGYECIHALKAKKVGRTGWASLKLDMSKAY
DRVEWVFLERMMLRMGFAVEWVELVLRCISSVRYSFNVNGVRCGDVIPSRGLRQGDPLSPYLFLLCAEGLSRMLHDAEGARAITGLRIARGCPPISHLFFADDSLLFFRA
KEGEARTVHHILQCYERASGQTINFDKSIISFSPSTAMGVRHQVSQLFQVQVTACHQQYLGLPSFMPRNRMNTLNFIKERVWRQIQGWKGKFFSVGGREVLLKSIVQAIP
CYTMNCFRLPKKLVQDISRAMSGFWWNGETEDRRIHWVSWKTLCKPKCLGGMGFRDLETFNQALLAKQCWRIVQYPASFLSRVLKGRYFPSGDFLAAGVGSRPSYIWRSL
IWGRELLGKGIRWRIGNGEKVSVYGANWIPDDVCFKVRSPVMLGLDTRVADLLTASGQWNEELLRHNFSPREVRSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQ
VAVLARSHLCLRVRRCPVGGRGVGKWGSRTKGIDVLNVCVHCGWHGESCMHLFWQCKFFRDALMGSEWEVLLQGVQANSMLNLLRDVKDKVDWAKFEEFVVVLWAVWCCR
NQQKFRGRFPSTGLVDWAVNYLNVFRGASRIYCEGMRGVPRSSVKWLPPEEGWYKVNVDAAFCGDLFRAGVGVVVRDEVGRVMLSAAIRHDHVGSPDLAEGLAVVDGMRL
VVEMGLAPTILETDSMRVFTLLQEPAMVDLSEFGVLVSEARMEVPAYIQLGFSFTRREGNRVAHELGHFALKEGGCSAVIELVKIEIGKRIRDPSELEG