; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005683 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005683
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold8:19913584..19919423
RNA-Seq ExpressionSpg005683
SyntenySpg005683
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]4.4e-4323.95Show/hide
Query:  EERASVFKLKEEEIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWNQEQTI-ISMVGFNLYLCKFKNGKIKSLIADNGPWFFDKALLLFKEPKGGNY
        E+      ++ EEI   E+  +  LV K+ ++   N   FK  ++  W  + +I +  +  NL+L +F   K    +  NGPW FD+ LL+     G   
Subjt:  EERASVFKLKEEEIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWNQEQTI-ISMVGFNLYLCKFKNGKIKSLIADNGPWFFDKALLLFKEPKGGNY

Query:  GDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKLPDFCYGCGL
          D++   V+FW+  + LPF   S  +A ++G+I+G   ++D ++      G  LR+K  +D  KPLKRG  +  +D  ++ W+   YE+LP+FC+ CG 
Subjt:  GDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKLPDFCYGCGL

Query:  LGHTLKECEG------SNHDGSPVEELPYGAWLREPVLLKAREGGWRGGHHHDEEAYGAGGNERRQGGTEEATIRDQPSASGPPTMNKPPANIPTGAGPE
        +GH +KECE       +N+     +   YG WLR   L +  E   +              + + +GG  E     +      P +      +PT A   
Subjt:  LGHTLKECEG------SNHDGSPVEELPYGAWLREPVLLKAREGGWRGGHHHDEEAYGAGGNERRQGGTEEATIRDQPSASGPPTMNKPPANIPTGAGPE

Query:  TD----------------------------------TMANNSDMERVEKVTELKKVPVKEGEIKVSLKD---------------NFISKGKVNSPNMDED
         D                                      +S   + +   +L K   K   + V++ +                F+ + ++  P +D  
Subjt:  TD----------------------------------TMANNSDMERVEKVTELKKVPVKEGEIKVSLKD---------------NFISKGKVNSPNMDED

Query:  SGNNG----------PIGKEKESYCSIMEVDLEDNGSK------VEATSTEVKSTQDNDV-GVFLDPKGKGVL---------------------------
            G           +G+E+    ++   D  D   K      +     +V++ +  D+ G++ + K  G++                           
Subjt:  SGNNG----------PIGKEKESYCSIMEVDLEDNGSK------VEATSTEVKSTQDNDV-GVFLDPKGKGVL---------------------------

Query:  -------------------CDNSSNFSKGKAKAKVSHLTRIASDHRSLLAEWSIEPLNPSFVIPRRPR--RFEEGWCKYGECREIVATVWNSQGCHDISD
                             +S  F    +   V HL R  SDH  L+               RR R  RFEE W     C  ++   W SQ C   SD
Subjt:  -------------------CDNSSNFSKGKAKAKVSHLTRIASDHRSLLAEWSIEPLNPSFVIPRRPR--RFEEGWCKYGECREIVATVWNSQGCHDISD

Query:  FNTKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLSKKDD-QTSRTALREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRANTRRKRNR
           ++ D    L   S     GSI   I + E+ IQ+    D+ +TS    +  E TLE LL+++E  W+QRSR  WL  GD+NTK+FH +A+ RRK N 
Subjt:  FNTKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLSKKDD-QTSRTALREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRANTRRKRNR

Query:  IRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQSSEP
        I+ + ++ G+W   +  +E ++ +YF +LF S+ P
Subjt:  IRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQSSEP

GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]1.3e-1334.43Show/hide
Query:  EDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPV
        ++AL I++TP   N+  D+I+W+ +  G +SV+ AY L C           +    +++WK  WK  +P K+K    R+  +ILPT  NL  +G+ +   
Subjt:  EDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPV

Query:  CFLCREKEETTSHLFWHCKMTK
        C LC    E   HLF HC M K
Subjt:  CFLCREKEETTSHLFWHCKMTK

GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]2.2e-4227.04Show/hide
Query:  TEEERASVFKLKEEEIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWN-QEQTIISMVGFNLYLCKFKNGKIKSLIADNGPWFFDKALLLFKEPKGG
        TEEE   + +  E E D    K   +L  K+ +    N   FK  +   W  + Q  I  +G NLYL +F   +    +  NGPW FD+ +L+ K   G 
Subjt:  TEEERASVFKLKEEEIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWN-QEQTIISMVGFNLYLCKFKNGKIKSLIADNGPWFFDKALLLFKEPKGG

Query:  NYGDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKLPDFCYGC
            +I+   +SFW   + LP    S  +A ++G ++G+  +ID +E  +   G  L++K+ ID  KP+KRG  +  +   +D  +   YE+LP FC+ C
Subjt:  NYGDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKLPDFCYGC

Query:  GLLGHTLKECEGSN-HDGSPVE-----ELPYGAWLREPVLLKA-----REGG--------WRGGHHHDEEAYG--AGGNERRQGGTEEATIRDQPSASGP
        G +GH ++ECE +   DG   E     ELP+G WLR   L +A     +E G        + G      ++ G   G  E  Q  T+   I  Q   +  
Subjt:  GLLGHTLKECEGSN-HDGSPVE-----ELPYGAWLREPVLLKA-----REGG--------WRGGHHHDEEAYG--AGGNERRQGGTEEATIRDQPSASGP

Query:  PTMNKPPANIP-------------TGAGPETDTMANNSDMERVEKVTELKKVPVKEGEIK---VSLKDNFISKGKVNSPNMDEDSGN-NGPIGKEKESYC
         T +     I              + A P+          E  +    LK      G+ +   ++L  N  +   ++S +++  +G  +  +  E  S  
Subjt:  PTMNKPPANIP-------------TGAGPETDTMANNSDMERVEKVTELKKVPVKEGEIK---VSLKDNFISKGKVNSPNMDEDSGN-NGPIGKEKESYC

Query:  SIMEVDLEDNGSKVEATSTEVK-STQDN-----DVGVFLDPKGKG-------------VLCDNSSNFSKGKAKAKVSHLTRIASDHRSLLAEWSIEPLNP
         I     E N  K  A   E+  ST +N     D+   L    K               LC  SS+F+      KV+HL R  SDH ++     +   + 
Subjt:  SIMEVDLEDNGSKVEATSTEVK-STQDN-----DVGVFLDPKGKG-------------VLCDNSSNFSKGKAKAKVSHLTRIASDHRSLLAEWSIEPLNP

Query:  SFVIPRRPR--RFEEGWCKYGECREIVATVW-NSQGCHDISDFNTKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLSKKDDQTSR-TALREKEKTL
        +    ++P   RFEE W K   C  ++   W N++G     D  TK++  L SL+   +      ++  I KTE  ++     D         RE E+  
Subjt:  SFVIPRRPR--RFEEGWCKYGECREIVATVW-NSQGCHDISDFNTKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLSKKDDQTSR-TALREKEKTL

Query:  ESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRANTRRKRNRIRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQSSEPQMDSIAHILESIPTSISEAQNN
          LL+ +E+ W+QRSR  WL  GD+NTK+FH +A+ R+K N I+ + ++ GIW    + ++ ++ ++F+ LF SS P   +  H+ E++  ++S+ Q +
Subjt:  ESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRANTRRKRNRIRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQSSEPQMDSIAHILESIPTSISEAQNN

PWA36168.1 hypothetical protein CTI12_AA602590 [Artemisia annua]4.4e-4325.04Show/hide
Query:  AKVSHLTRIASDHRSLLAEWSIEPLNPSFVIPRRPRRFEEGWCKYGECREIVATVWN---SQGC-HDISDFNTKIMDCLMSLNHWSRRKYGGSIRGAIAK
        A   +L RIASDH  ++       L+P      R  RFE  W +      +V   W    + G  HD       + +C   L+ W++R + G ++ +I  
Subjt:  AKVSHLTRIASDHRSLLAEWSIEPLNPSFVIPRRPRRFEEGWCKYGECREIVATVWN---SQGC-HDISDFNTKIMDCLMSLNHWSRRKYGGSIRGAIAK

Query:  TERDIQHLSKKDDQTSRTALREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRANTRRKRNRIRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQ
         +R +Q L  + D ++R   +   + ++ LL  +E+ WKQRSR EWL  GD+NT++FH RA+ R++RN I  +    G W EE N +  +V++YF+ LF 
Subjt:  TERDIQHLSKKDDQTSRTALREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRANTRRKRNRIRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQ

Query:  SSEPQ-MDSIAHILE-----------SIPTSISEAQ----------NNDLEDALAILATPTK------SNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQ
        SS PQ  +S+   ++             P + SE +          N++L  +L      +K      S    D + W+ +  GRFS K AY L  + ++
Subjt:  SSEPQ-MDSIAHILE-----------SIPTSISEAQ----------NNDLEDALAILATPTK------SNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQ

Query:  RFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTKGLWAKYLPLDNLSCLFDRED--
            ++         W+  WK ++P K+K+  WR +N+ +PT+ NL +RG++    C  C +  E   H+ + C + K +W +     N  C +D +   
Subjt:  RFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTKGLWAKYLPLDNLSCLFDRED--

Query:  ------RRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIRNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGRGG
              + I E     W+               ++I W +W+ RN   H  Q   +E   +++ + + +  H+               Q E +  SG  G
Subjt:  ------RRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIRNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGRGG

Query:  VLLGDPTVRRKWSPISDGCW--------KLSYDASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDS
        V             I  G W        K++ DA+W+ +     +G+V R++   +L +G +    AS     EA ++   +  +    G   ++ E +S
Subjt:  VLLGDPTVRRKWSPISDGCW--------KLSYDASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDS

Query:  LQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQKAYEEDGPKSWSHSF
        L +V  +  + V   ++     E         V + + + R  N LAHS+A  A       S SH F
Subjt:  LQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQKAYEEDGPKSWSHSF

TXG53848.1 hypothetical protein EZV62_019104 [Acer yangbiense]3.2e-4126.24Show/hide
Query:  AEETLVKQLSGLKVTEEERASVFKLKEEEIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWNQ-EQTIISMVGFNLYLCKFKNGKIKSLIADNGPWF
        AE  +VK    L + +E+ A + ++ EE+    E +++  LV KILS K++N E F   +  +W+   +  I  VG N+++  F N + ++ I   GPW+
Subjt:  AEETLVKQLSGLKVTEEERASVFKLKEEEIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWNQ-EQTIISMVGFNLYLCKFKNGKIKSLIADNGPWF

Query:  FDKALLLFKEPKGGNYGDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWI
        FDK+L++ ++ +G      + F  V  WI  H +P  C +R  A  +   +G+V  ID+  E   C G  ++VK+QID +KPLK             RW+
Subjt:  FDKALLLFKEPKGGNYGDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWI

Query:  PITYEKLPDFCYGCGLLGHTLKECEGSNHDGSPVEELPYGAWLREPVLLKAREGGWRGGHHHDEEAYGAGGNERRQGGTEEATIRDQPSASGPPTMNKPP
         +  +K  +   G                     +   +G+W+R   L + +    R  H  D       GN + Q   EE    ++   SG  +     
Subjt:  PITYEKLPDFCYGCGLLGHTLKECEGSNHDGSPVEELPYGAWLREPVLLKAREGGWRGGHHHDEEAYGAGGNERRQGGTEEATIRDQPSASGPPTMNKPP

Query:  ANIPTGAGPETDTMANNSDMERVEKVTELKKVPVKEGEIKVSLKDNFISKGKVNSPNMDEDSGNNGPIGKEKESYCSIMEVDLEDNGSKVEATSTE----
         ++    GP  +            K+   K V V         K+ F   G + S +++ +SG+                 DL  +G  +E   T+    
Subjt:  ANIPTGAGPETDTMANNSDMERVEKVTELKKVPVKEGEIKVSLKDNFISKGKVNSPNMDEDSGNNGPIGKEKESYCSIMEVDLEDNGSKVEATSTE----

Query:  VKSTQDNDVGVFLDPKGKGVLCDNSSNFSKGKAKAKVSHLTRIASDHRSLLAEWSIEPLNPSFVIPRRPRRFEEGWCKYGECREIVATVWNSQG-CHDIS
        V  +Q  ++ V   PK       N+ N+ K  A+AK + +  I         + + +     F       RFE  W K  +   ++   W  +G  +   
Subjt:  VKSTQDNDVGVFLDPKGKGVLCDNSSNFSKGKAKAKVSHLTRIASDHRSLLAEWSIEPLNPSFVIPRRPRRFEEGWCKYGECREIVATVWNSQG-CHDIS

Query:  DFNTKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLSKKDDQTSRTA-LREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRANTRRKRN
        DF  K+  C   L  WS+ ++  ++   I    R+I++L K  ++    A ++E EKT+E LL+ +E++WKQRSR +WL  GDRN+K+FH RA+ R+K+ 
Subjt:  DFNTKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLSKKDDQTSRTA-LREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRANTRRKRN

Query:  RIRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQSSEPQMDSIAHILESIPTSISEAQNNDLEDA
            + N+ G   + + GM  ++ +YF  LFQSS P    I+   E I +  ++ Q  +L  A
Subjt:  RIRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQSSEPQMDSIAHILESIPTSISEAQNNDLEDA

TXG58188.1 hypothetical protein EZV62_016017 [Acer yangbiense]6.4e-4227.33Show/hide
Query:  INPEMFKSKMSHIWN-QEQTIISMVGFNLYLCKFKNGKIKSLIADNGPWFFDKALLLFKEPKGGNYGDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSI
        +N E FK  +  IWN   Q  + MV  N+++  F N + ++ I    PW F  +L+  ++P G      + F    FWI  H +P  C +R  A  +   
Subjt:  INPEMFKSKMSHIWN-QEQTIISMVGFNLYLCKFKNGKIKSLIADNGPWFFDKALLLFKEPKGGNYGDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSI

Query:  LGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKLPDFCYGCGLLGHTLKECEGSNHDGSPVEELP--YGAWLREPVL
        +G V +I LE    +C G  +RVK+ ID +KPLKR + L    S E   + + YEKLP+FCY CG +G  + EC  +      +E  P  YG+WL+   L
Subjt:  LGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKLPDFCYGCGLLGHTLKECEGSNHDGSPVEELP--YGAWLREPVL

Query:  LKAREGGWRGGHHHDEEAYGAGGN-------ERRQGGTEEATIRDQPSASGPPTMNKPPANI----------PTGAGP-ETDTMA---------------
         K++    +       + YG+  +        R + G   A++R    AS         A I           T  GP  T+ M                
Subjt:  LKAREGGWRGGHHHDEEAYGAGGN-------ERRQGGTEEATIRDQPSASGPPTMNKPPANI----------PTGAGP-ETDTMA---------------

Query:  ---NNSDMERVEK---VTELKKVPVKEGEIKVSLKDNFISKGK-VNSPNMD-EDSGNNGPIGKEKESYCSIMEVDLEDN---------GSKVEATSTEVK
           NNSD   VE       L     +  E+K  ++D   S  K ++SP    + S    P+   K+      + +  D+          S ++      K
Subjt:  ---NNSDMERVEK---VTELKKVPVKEGEIKVSLKDNFISKGK-VNSPNMD-EDSGNNGPIGKEKESYCSIMEVDLEDN---------GSKVEATSTEVK

Query:  STQDNDVGVFLDPK--GKGVLCDNSSNFSKGKAKAKVSHL-------TRIASDHRSLLAEWSIEPLNPSFVIPRRPRR-FEE------GWCKYGECREIV
        S + +       PK    G+L  N+ +  K +   +  H        TR+ S+  +   +   E     F  P++ +  F +       W K  E   I+
Subjt:  STQDNDVGVFLDPK--GKGVLCDNSSNFSKGKAKAKVSHL-------TRIASDHRSLLAEWSIEPLNPSFVIPRRPRR-FEE------GWCKYGECREIV

Query:  ATVWNSQG-CHDISDFNTKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLSKKDDQTS-RTALREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNT
          +W   G    I D   K+  C  +L  WS+ ++G   R    KT R+I+HL            +R  E+ +E L + +EIYWKQRSR EWL  GDRN+
Subjt:  ATVWNSQG-CHDISDFNTKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLSKKDDQTS-RTALREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNT

Query:  KWFHMRANTRRKRNRIRGIMNDLGIWTEEDNGM
        K+FH +A  R+K+N I  + +  G     D GM
Subjt:  KWFHMRANTRRKRNRIRGIMNDLGIWTEEDNGM

TrEMBL top hitse value%identityAlignment
A0A2N9E949 CCHC-type domain-containing protein6.4e-4823.09Show/hide
Query:  VGFNLYLCKFKNGKIKSLIADNGPWFFDKALLLFKEPKGGNYGDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVK
        +G NL++  F +   + L+  NGPW FDK L+L K   G      ++    SFWI  H+LPF   + E A  +G+ LG + +ID+ E+     G  +RV+
Subjt:  VGFNLYLCKFKNGKIKSLIADNGPWFFDKALLLFKEPKGGNYGDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVK

Query:  IQIDATKPL--KRGIFLSSEDSTEDRWIPITYEKLPDFCYGCGLLGHTLKEC-----EGSNHDGSPVEELPYGAWLREPVLLKAREGGWRGGHHHDEEAY
        ++ID + PL  ++ + L  E+S    W+ + YEKLP FCY CG+LGH+ +EC        + DG+  E   YG+WLR      A  G  + G    E A+
Subjt:  IQIDATKPL--KRGIFLSSEDSTEDRWIPITYEKLPDFCYGCGLLGHTLKEC-----EGSNHDGSPVEELPYGAWLREPVLLKAREGGWRGGHHHDEEAY

Query:  GAGG-----NERRQGGTEEATIRDQPSAS------GPPTMNKPPANIPTGAGPE----TDTMANNSDMERVEKVTELKKVPVKEGEI-KVSLKDNFISKG
                 N+   GG + A+       S      G P   +    +     P      +T     +ME+  ++T     P + G +    L  +   + 
Subjt:  GAGG-----NERRQGGTEEATIRDQPSAS------GPPTMNKPPANIPTGAGPE----TDTMANNSDMERVEKVTELKKVPVKEGEI-KVSLKDNFISKG

Query:  K---VNSPNMDEDSGNNGPIGK------EKESYCSIMEV-DLEDNGSKVEATSTEVKSTQDNDVGVFLDPKGKGVLCDNSSNFSKGKAKAKVSHLTRIAS
        +   V   + +E  GN+  +GK      +  S+   +E   L D G + +  + + +     ++   LD   + V     +N     ++A V HL   AS
Subjt:  K---VNSPNMDEDSGNNGPIGK------EKESYCSIMEV-DLEDNGSKVEATSTEVKSTQDNDVGVFLDPKGKGVLCDNSSNFSKGKAKAKVSHLTRIAS

Query:  DHRSLLAEWSIEPLNPSFVIPRRPR--RFEEGWCKYGECREIVATVWNS--QGCHDISDFNTKIMDCLMSLNHWSRRKYGGS-IRGAIAKTERDI--QHL
        DH  +L       L  S ++PRR R  RFE+ WC+   C E V   W+S  +G H + +    I  C   L +W R  Y  S +R      E  +  +  
Subjt:  DHRSLLAEWSIEPLNPSFVIPRRPR--RFEEGWCKYGECREIVATVWNS--QGCHDISDFNTKIMDCLMSLNHWSRRKYGGS-IRGAIAKTERDI--QHL

Query:  SKKDDQTSRTALREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNT----------------------------KWFHMRANTRRK--------------
          ++DQ     LRE+   L  LLE +EI W+QRSR +WL  G+RNT                                +RA   +K              
Subjt:  SKKDDQTSRTALREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNT----------------------------KWFHMRANTRRK--------------

Query:  --------------------------------------------------------------------------------RNRIRGIMNDLGIWT-----
                                                                                           +RG+  DLGIW      
Subjt:  --------------------------------------------------------------------------------RNRIRGIMNDLGIWT-----

Query:  ----------EEDNGMEFIV--NNYF-------AKLFQSSEPQMDSIA-----------------------------------------------HILES
                  + +N +  +V    YF       A + Q S     SIA                                                + E 
Subjt:  ----------EEDNGMEFIV--NNYF-------AKLFQSSEPQMDSIA-----------------------------------------------HILES

Query:  IPTSISEAQNNDL------EDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYN
        I   I +   N L      ED   IL  P    + ED  LWN    G+F+V  AYRL  + N   Q  S++      +W+  WK++LP  IK+  WR  +
Subjt:  IPTSISEAQNNDL------EDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYN

Query:  DILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTKGLWAKYLPLDNLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIRN
        + LPTL NL  R +     C  C+ + ET  H  W C      W++     N    F    + + E  +     G         ++  + I ++IW  RN
Subjt:  DILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTKGLWAKYLPLDNLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIRN

Query:  LISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGRGGVLLGDPTVRR---KWSPISDGCWKLSYDASWRSDRECGSVGWVLR
         ++        E I     Q    +  E     + Y                +  +L   P++ R   KW       +KL++D +         +G ++R
Subjt:  LISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGRGGVLLGDPTVRR---KWSPISDGCWKLSYDASWRSDRECGSVGWVLR

Query:  DWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSL
        +     +A+      + +D    E  + ++GLQ    + G  RL++E DS   +  +   D + + L   IKEA+++    +   ++ I R  N +AH L
Subjt:  DWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSL

Query:  AQKA
        A+ A
Subjt:  AQKA

A0A2N9F5Y8 Reverse transcriptase domain-containing protein1.3e-4325.32Show/hide
Query:  EIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWNQEQTIISM-VGFNLYLCKFKNGKIKSLIADNGPWFFDKALLLFKEPKGGNYGDDIEFRYVSFW
        ++   +++ +  L  K  +++ IN E        +W  ++   +  +G N+ L +F++      +  + PW +DK+L+ F++ +  +  + I     +FW
Subjt:  EIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWNQEQTIISM-VGFNLYLCKFKNGKIKSLIADNGPWFFDKALLLFKEPKGGNYGDDIEFRYVSFW

Query:  IHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGG--TLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKLPDFCYGCGLLGHTLKECEG
        +  H LP     +++A  +G  +G V +     E D+  GG   +RV++Q++ +KPL RG  L   +  E RWI   YE+LP FCY CGLL H  K+C+ 
Subjt:  IHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGG--TLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKLPDFCYGCGLLGHTLKECEG

Query:  --SNHDGSPVEELPYGAWLREPVLLKAREGGWRGGHHHDEEAYGAGGNERRQGGTEEATIRDQPSASGPPTMNKPPANIPTGAGPETDTMANNSDME---
          +N +    E+  YG WLR  +     E  +R      +      G  R +    E+  + Q  A  PPT   PP +  +G     D        E   
Subjt:  --SNHDGSPVEELPYGAWLREPVLLKAREGGWRGGHHHDEEAYGAGGNERRQGGTEEATIRDQPSASGPPTMNKPPANIPTGAGPETDTMANNSDME---

Query:  ---RVEKVTELKKVPVKEGEIKVSLKDNFISKG--------KVNSPNMDEDSGNNGPIGKEKESY-------CSIMEVDLEDNGS---KVEATSTEVKST
                  L+ + ++ G       D +   G         +  P    ++ +  PI  + E Y        S   +    NG+   K +A +    S 
Subjt:  ---RVEKVTELKKVPVKEGEIKVSLKDNFISKG--------KVNSPNMDEDSGNNGPIGKEKESY-------CSIMEVDLEDNGS---KVEATSTEVKST

Query:  Q------------------DNDVGVFLDPKGKGVLCDNSSNFSKGK-----------AKAKVSHLTRIASDHRSLLAEWSIEPLNPSFVIPRRPRRFEEG
        Q                  + D G++    G       S  ++  K              +V H+    SDH+ L   W      P+    R+P  F+E 
Subjt:  Q------------------DNDVGVFLDPKGKGVLCDNSSNFSKGK-----------AKAKVSHLTRIASDHRSLLAEWSIEPLNPSFVIPRRPRRFEEG

Query:  WCKYGECREIVATVWNSQGCHDISDFN--TKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLS-KKDDQTSRTALREKEKTLESLLEDDEIYWKQRS
        W +   C E +   WN       + F   TKI  C  SL  WSRR + GSIR  + + ++ ++             A +     +  LL+ +E  W+QRS
Subjt:  WCKYGECREIVATVWNSQGCHDISDFN--TKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLS-KKDDQTSRTALREKEKTLESLLEDDEIYWKQRS

Query:  REEWLLWGDRNTKWFHMRANTRRKRNRIRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQSSEPQMDSIAHILESIPTSISEAQNNDL-------EDALAIL
        R  WL  GDRNTK+FH RA+ RR+RN I+ + +  GIW E ++ +  +  +YF  LF +S P+  +I   +ES P  ++++ N+ L       E  LAI 
Subjt:  REEWLLWGDRNTKWFHMRANTRRKRNRIRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQSSEPQMDSIAHILESIPTSISEAQNNDL-------EDALAIL

Query:  ATPTKSNMGED
             + +G D
Subjt:  ATPTKSNMGED

A0A2Z6NZV1 Uncharacterized protein2.1e-4323.95Show/hide
Query:  EERASVFKLKEEEIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWNQEQTI-ISMVGFNLYLCKFKNGKIKSLIADNGPWFFDKALLLFKEPKGGNY
        E+      ++ EEI   E+  +  LV K+ ++   N   FK  ++  W  + +I +  +  NL+L +F   K    +  NGPW FD+ LL+     G   
Subjt:  EERASVFKLKEEEIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWNQEQTI-ISMVGFNLYLCKFKNGKIKSLIADNGPWFFDKALLLFKEPKGGNY

Query:  GDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKLPDFCYGCGL
          D++   V+FW+  + LPF   S  +A ++G+I+G   ++D ++      G  LR+K  +D  KPLKRG  +  +D  ++ W+   YE+LP+FC+ CG 
Subjt:  GDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKLPDFCYGCGL

Query:  LGHTLKECEG------SNHDGSPVEELPYGAWLREPVLLKAREGGWRGGHHHDEEAYGAGGNERRQGGTEEATIRDQPSASGPPTMNKPPANIPTGAGPE
        +GH +KECE       +N+     +   YG WLR   L +  E   +              + + +GG  E     +      P +      +PT A   
Subjt:  LGHTLKECEG------SNHDGSPVEELPYGAWLREPVLLKAREGGWRGGHHHDEEAYGAGGNERRQGGTEEATIRDQPSASGPPTMNKPPANIPTGAGPE

Query:  TD----------------------------------TMANNSDMERVEKVTELKKVPVKEGEIKVSLKD---------------NFISKGKVNSPNMDED
         D                                      +S   + +   +L K   K   + V++ +                F+ + ++  P +D  
Subjt:  TD----------------------------------TMANNSDMERVEKVTELKKVPVKEGEIKVSLKD---------------NFISKGKVNSPNMDED

Query:  SGNNG----------PIGKEKESYCSIMEVDLEDNGSK------VEATSTEVKSTQDNDV-GVFLDPKGKGVL---------------------------
            G           +G+E+    ++   D  D   K      +     +V++ +  D+ G++ + K  G++                           
Subjt:  SGNNG----------PIGKEKESYCSIMEVDLEDNGSK------VEATSTEVKSTQDNDV-GVFLDPKGKGVL---------------------------

Query:  -------------------CDNSSNFSKGKAKAKVSHLTRIASDHRSLLAEWSIEPLNPSFVIPRRPR--RFEEGWCKYGECREIVATVWNSQGCHDISD
                             +S  F    +   V HL R  SDH  L+               RR R  RFEE W     C  ++   W SQ C   SD
Subjt:  -------------------CDNSSNFSKGKAKAKVSHLTRIASDHRSLLAEWSIEPLNPSFVIPRRPR--RFEEGWCKYGECREIVATVWNSQGCHDISD

Query:  FNTKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLSKKDD-QTSRTALREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRANTRRKRNR
           ++ D    L   S     GSI   I + E+ IQ+    D+ +TS    +  E TLE LL+++E  W+QRSR  WL  GD+NTK+FH +A+ RRK N 
Subjt:  FNTKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLSKKDD-QTSRTALREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRANTRRKRNR

Query:  IRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQSSEP
        I+ + ++ G+W   +  +E ++ +YF +LF S+ P
Subjt:  IRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQSSEP

A0A2Z6NZV1 Uncharacterized protein6.1e-1434.43Show/hide
Query:  EDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPV
        ++AL I++TP   N+  D+I+W+ +  G +SV+ AY L C           +    +++WK  WK  +P K+K    R+  +ILPT  NL  +G+ +   
Subjt:  EDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPV

Query:  CFLCREKEETTSHLFWHCKMTK
        C LC    E   HLF HC M K
Subjt:  CFLCREKEETTSHLFWHCKMTK

A0A803P5M6 Uncharacterized protein4.3e-5220.81Show/hide
Query:  KQLSGLKVTEEERASVFKLKEEEIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWNQEQTIISMVGFNLYLCKFKNGKIKSLIADNGPWFFDKALLL
        K  + + +TE+E  SVF+  + E       ++  L  KIL++K++     +++M+  W+    +      ++++  F     K  + D  P+ F    ++
Subjt:  KQLSGLKVTEEERASVFKLKEEEIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWNQEQTIISMVGFNLYLCKFKNGKIKSLIADNGPWFFDKALLL

Query:  FKEPKGGNYGDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKL
           P+ G      +  +  FW+  ++LPF   +R +A  +G+I+G+   +  E+ +++  G  LRV++ +D +KPLKRG  +S     +  W+   YE+L
Subjt:  FKEPKGGNYGDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKL

Query:  PDFCYGCGLLGHTLKEC-----EGSNHDGSPVEELPY---------------------GAW-----LREPVLLKAREGGWRGGHHHDEEAYGAGGNERRQ
        P++C  CG++GH   +C     +  N +   +E  P+                      AW     L +  L  A     + G  H    +    +    
Subjt:  PDFCYGCGLLGHTLKEC-----EGSNHDGSPVEELPY---------------------GAW-----LREPVLLKAREGGWRGGHHHDEEAYGAGGNERRQ

Query:  GGTEEATIRDQPSASG--PPTMNKPPA-------------NIPTG--AGPETDTMANNSDMERV-------------------EKVTELK-KVPVKEGEI
               + D  +A    P    KPP+             N+  G  A    D+  N SD+  +                     +T+++ K P+     
Subjt:  GGTEEATIRDQPSASG--PPTMNKPPA-------------NIPTG--AGPETDTMANNSDMERV-------------------EKVTELK-KVPVKEGEI

Query:  K--------VSLKDNFISKGKVN-SPNMDEDSGN------------NGPIGKEKESYCSIMEVDLEDNGSKVEATSTEVKSTQDNDVGVFLDPKGKGV--
                  ++    +  G  N +PN+     N             G I  +  S  S+ EVD       V     +    +DN V    DP    +  
Subjt:  K--------VSLKDNFISKGKVN-SPNMDEDSGN------------NGPIGKEKESYCSIMEVDLEDNGSKVEATSTEVKSTQDNDVGVFLDPKGKGV--

Query:  -LCDNSSNFSKGKAKAK-----------------------VSHLTRIASDHRSLLAEWSIEPLNPSFVIPRRPR--RFEEGWCKYGECREIVATVWNSQG
            +   + KG+                            +HL   +SDHR++    +IE +  +   P R    RFE+ W K  +   I+   W+   
Subjt:  -LCDNSSNFSKGKAKAK-----------------------VSHLTRIASDHRSLLAEWSIEPLNPSFVIPRRPR--RFEEGWCKYGECREIVATVWNSQG

Query:  CHDISDFNTKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLSKKDDQTSRT--ALREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRAN
           +  F + +  C  +L  W  RK+ G+++  I   ++ +  L+   D++  T   L++ E  L+ LLE +E YW QRSR +WL  GD+NT +FH  A 
Subjt:  CHDISDFNTKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLSKKDDQTSRT--ALREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRAN

Query:  TRRKRNRIRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQSSEPQMDSIAHILESIPTSI--------------SEAQNND---------------------
        +R+ +N I+ ++N  G+       M  ++ +Y+  LF S     DS+  IL ++P SI              +E  NND                     
Subjt:  TRRKRNRIRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQSSEPQMDSIAHILESIPTSI--------------SEAQNND---------------------

Query:  -------------------------------------------------------------------------------LE-------------------
                                                                                       LE                   
Subjt:  -------------------------------------------------------------------------------LE-------------------

Query:  ------------------------------------DALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWK
                                            D   IL+ P     G D ++W+    G +SVK  + L   +  +  +S++N   Q   WK FW 
Subjt:  ------------------------------------DALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWK

Query:  LKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTKGLWAKYLPLDNLSCLFDREDRRISETLDGLWQRGGNTSTNILH
        LKLPPKI+I  W+++ +ILPT   L  R +     C LC    E+  H  + CK  K +W        LS  F  +  +     +G +    +T      
Subjt:  LKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTKGLWAKYLPLDNLSCLFDREDRRISETLDGLWQRGGNTSTNILH

Query:  IKCSLIICWRIWSIRNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGRGGVLLGDPTVRRKWSPISDGCWKLSYDASWR
         +  L + W IW+ RN + H  Q  +   I      Q ++  +E     +          +   +PS        D  V+R   P+ +G +KL+ DA+  
Subjt:  IKCSLIICWRIWSIRNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGRGGVLLGDPTVRRKWSPISDGCWKLSYDASWR

Query:  SDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSIT
         +++   +G +LRD   T+LAA  K +  +     +EA ++   +  + S +      +E D+ +V N +N  + D +  +  I + + + S      +T
Subjt:  SDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSIT

Query:  HIPRAHNYLAHSLAQKAYEEDGPKSWSHSFP
        H+ R  N  AH LA+ A   D    W    P
Subjt:  HIPRAHNYLAHSLAQKAYEEDGPKSWSHSFP

A0A803QPR1 Uncharacterized protein2.1e-4621.26Show/hide
Query:  EERASVFKLKEEEIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWNQEQTI-ISMVGFNLYLCKFKNGKIKSLIADNGPWFFDKALLLFKEPKGGNY
        EE      L +++ D SE      LV + L+++ I+ +  + +M+ +W   + + +  +  N +L +F +    + + D  PW FD+  L+F+  K G  
Subjt:  EERASVFKLKEEEIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWNQEQTI-ISMVGFNLYLCKFKNGKIKSLIADNGPWFFDKALLLFKEPKGGNY

Query:  GDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKLPDFCYGCGL
           I    +  WI  H +     S      + + +G   + D        R   LRV+  I+  KPLK+ + L  ++  +   +   YE LP FC+ CG+
Subjt:  GDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKLPDFCYGCGL

Query:  LGHT--------------LKECEGSNHDGSP--VEELPYGAWLREPVLLKAREGGWRGGHHHDEEAYGA--GGNERRQG--------GTEEATIRDQPSA
        LGH+              +K+    +   +P   +      WLR      +  G   GG    +        G E  QG        G  +  IR   S 
Subjt:  LGHT--------------LKECEGSNHDGSP--VEELPYGAWLREPVLLKAREGGWRGGHHHDEEAYGA--GGNERRQG--------GTEEATIRDQPSA

Query:  SGPPTMNKPPANIPTGAGPETDTMANNSDMERVEKVTELKKVPVKEGEIKVSLKD----NFISKGKVNSPN-----MDEDSGNNGPIGKEKESYCSIMEV
                   +I  G    ++ + +N+ +  VE       V    GE  V   D    + I  G++++ N     +D+D+  +  +     S+  +  V
Subjt:  SGPPTMNKPPANIPTGAGPETDTMANNSDMERVEKVTELKKVPVKEGEIKVSLKD----NFISKGKVNSPN-----MDEDSGNNGPIGKEKESYCSIMEV

Query:  DLEDNGSKVEATSTEVKSTQDN---------DVGVFLDPKGK---------------------------------------------GVLCDNSSNFSKG
          E N ++ E T T +K+   N         D+   L  + K                                              V  D +   +  
Subjt:  DLEDNGSKVEATSTEVKSTQDN---------DVGVFLDPKGK---------------------------------------------GVLCDNSSNFSKG

Query:  KA---KAKVSHLTRIASDHRSLLAEWSIEPL-NPSFVIPRRPRRFEEGWCKYGECREIVATVWNSQGCHDISDFNTKIMDCLMSLNHWSRRKYGGSIRGA
        +    +A +++L    SDH  +     +EP+   SF  PRR  RFE  W     C ++V   W++Q     S    KI  C+  L  W  ++  G  +  
Subjt:  KA---KAKVSHLTRIASDHRSLLAEWSIEPL-NPSFVIPRRPRRFEEGWCKYGECREIVATVWNSQGCHDISDFNTKIMDCLMSLNHWSRRKYGGSIRGA

Query:  IAKTERDIQHLSKKDDQTSRTALREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRANTRRKRNRIRGIMNDLGIWTEEDNGMEFIVNNYFAK
        I + +  +  L  K D  S     E    L  +LE  E +WKQR+++ WL  GD+N+K+FH  A++R++ N I  + +D G W + +N +  ++N+Y+  
Subjt:  IAKTERDIQHLSKKDDQTSRTALREKEKTLESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRANTRRKRNRIRGIMNDLGIWTEEDNGMEFIVNNYFAK

Query:  LFQSSEPQMDSIAHILESIPTSISEAQNNDLEDAL-------AILATPTKSNMGEDEI--------------------------------LWNLDSKGRF
        LF ++         +++S+   +    N +L   +       A+       + G D +                                 W+ ++   F
Subjt:  LFQSSEPQMDSIAHILESIPTSISEAQNNDLEDAL-------AILATPTKSNMGEDEI--------------------------------LWNLDSKGRF

Query:  SVKGAYRLGCQMNQRFQASSANYKDQ--EAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTKGLWAKY
        SVK AY +        Q    N+ D   +  W+  W+LKLP K++   WR  N+ L T+  L  + +DV P+C +C  + ET  H    C+  +  W   
Subjt:  SVKGAYRLGCQMNQRFQASSANYKDQ--EAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTKGLWAKY

Query:  LPLDNLSCLFDREDRRISETLDGLWQRGGNTS--TNILHIKCSL-IICWRIWSIRNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQ
                  DR     S T    +    + +   N   + C + ++CW +W  RN +  N +                     +  D   Y  ++L+  
Subjt:  LPLDNLSCLFDREDRRISETLDGLWQRGGNTS--TNILHIKCSL-IICWRIWSIRNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQ

Query:  TERLAPSGRGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVE
              S     +L    V  +W+P      K++ DA+  +D     +G V R+    L+    K      +++  EA+ I E L  +       ++ +E
Subjt:  TERLAPSGRGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVE

Query:  NDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQKA
         DSL VV  I+      +     I++ + M +      ++ + R+ N +AH  A+ A
Subjt:  NDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQKA

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657507.9e-1121.69Show/hide
Query:  DEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWH
        D + W     G+FSV+ AY +                +  + +   WK+++P ++K   W + N  + T    + R +    VC +C+   E+  H+   
Subjt:  DEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWH

Query:  CKMTKGLWAKYLPLDNLSCLFDREDRRISETL-DGLWQRGGNTSTNILHIKCSLIICWRIWSIR--NLISHNNQRLNQETIRDILQQQISASIHELIGDE
        C    G+W + +P       F +    + E L D L  R G    +I       +I W  W  R  N+   N +                        D 
Subjt:  CKMTKGLWAKYLPLDNLSCLFDREDRRISETL-DGLWQRGGNTSTNILHIKCSLIICWRIWSIR--NLISHNNQRLNQETIRDILQQQISASIHELIGDE

Query:  EPYQMQWLEGQTERLAPSGRGGVLLG--DPTVRRK--WSPISDGCWKLSYDASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGL
          +  +W       +  +  G VL+G   P V R   W     G  K++ D + R +    S G VLRD +      GF            E   +  GL
Subjt:  EPYQMQWLEGQTERLAPSGRGGVLLG--DPTVRRK--WSPISDGCWKLSYDASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGL

Query:  -----QAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQKAY
             + +P      R+ +E DS  +V  +     D   L+F ++          +  I H+ R  N LA  LA  A+
Subjt:  -----QAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQKAY

Arabidopsis top hitse value%identityAlignment
AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.3e-0528.38Show/hide
Query:  KDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTKGLW----AKYLPLDNLSCLFDREDRRISETLDGLWQRG
        K  W     PK     W    D LPT   L + G      C LC  + E+  HL + C+    +W    ++  P   L C +       +E L   W R 
Subjt:  KDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTKGLW----AKYLPLDNLSCLFDREDRRISETLDGLWQRG

Query:  GNTSTNILHIKCSL-IICWRIWSIRNLISHNNQRLNQETIRDILQQQI
         ++S   L  K S   I + IW  RN + HNN R+    I  I+ ++I
Subjt:  GNTSTNILHIKCSL-IICWRIWSIRNLISHNNQRLNQETIRDILQQQI

AT3G25270.1 Ribonuclease H-like superfamily protein6.4e-1625Show/hide
Query:  WKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTKGLW-AKYLPLDNLSCLFDREDRRISETLDGLWQRGGNTSTN
        WKLK  PKIK   W++ +  L T  NL  R +   P C  C +++ET+ HLF+ C   + +W A  +P   L       + ++   L        N    
Subjt:  WKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTKGLW-AKYLPLDNLSCLFDREDRRISETLDGLWQRGGNTSTN

Query:  ILHIKCSLIICWRIWSIRNLISHNNQRLN-QETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGRGGVLLGDPTV-RRKWSPISDGCWKLSY
        + ++  ++ I WR+W  RN +    + ++ Q T+     Q+    + E   ++    +Q L  Q      S R       PT+ R KW        K +Y
Subjt:  ILHIKCSLIICWRIWSIRNLISHNNQRLN-QETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGRGGVLLGDPTV-RRKWSPISDGCWKLSY

Query:  DASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKK
        D ++         GW++RD +   + +G    ++ SD    E  +++  +Q   S  G  +++ E DS QV  L+N E ++    N +I+E +      +
Subjt:  DASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKK

Query:  VDSITHIPRAHNYLAHSLAQ
              +PR +N  A  LA+
Subjt:  VDSITHIPRAHNYLAHSLAQ

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.3e-0738.6Show/hide
Query:  DFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHC
        D W LK+ PKIK+  W+  N+ LP  + L +R + + P C  CR+  ET +H+ ++C
Subjt:  DFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHC

AT3G42140.1 zinc ion binding;nucleic acid binding7.1e-0721.53Show/hide
Query:  IADNGPWFFDKALLLFKEPKGGNYGDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSE
        I   GPW F+  + + +  +      D EF+ + FWI    +P    +  +   IG                                   + G+FL + 
Subjt:  IADNGPWFFDKALLLFKEPKGGNYGDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSE

Query:  DSTEDRWIPITYEKLPDFCYGCGLLGHTLKECEGSNHDGSPVEE
           +   +   YEKL +FC  CG+L H   EC  S + G   ++
Subjt:  DSTEDRWIPITYEKLPDFCYGCGLLGHTLKECEGSNHDGSPVEE

AT4G29090.1 Ribonuclease H-like superfamily protein3.2e-1521.73Show/hide
Query:  DEILWNLDSKGRFSVKGAYRLGCQ-MNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFW
        D   W+  S G ++VK  Y +  Q +N+R      +      +++  WK +  PKI+   W+  ++ LP    L  R +     C  C   +ET +HL +
Subjt:  DEILWNLDSKGRFSVKGAYRLGCQ-MNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFW

Query:  HCKMTKGLWA-KYLPLDNLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLII---CWRIWSIRNLISHNNQRLNQETIRDILQQQISASIHELIG
         C   +  WA   +P+     L       I   L  ++  G     N    K S ++    WR+W  RN +    +  N + +           +     
Subjt:  HCKMTKGLWA-KYLPLDNLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLII---CWRIWSIRNLISHNNQRLNQETIRDILQQQISASIHELIG

Query:  DEEPYQMQWLEGQTERLAPSGRGGVLLGDPTVRR----KWSPISDGCWKLSYDASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVE
        D E +++     +TE  +   +       P V R    +W P      K + DA+W  D E   +GWVLR              N   ++ W+ A ++ +
Subjt:  DEEPYQMQWLEGQTERLAPSGRGGVLLGDPTVRR----KWSPISDGCWKLSYDASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVE

Query:  GLQAIPSVTGGVR-------------LLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQK--AYEEDGPKSWSH
            + +    +R             ++ E+DS  ++ ++N +++    L   I++ QR+ S         IPR  N LA  +A++  ++    PK +S 
Subjt:  GLQAIPSVTGGVR-------------LLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQK--AYEEDGPKSWSH

Query:  SFPDW
          P W
Subjt:  SFPDW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGATCAGAAGGCTGAGGAAACCTTGGTCAAACAACTCTCGGGACTTAAAGTTACTGAAGAAGAACGTGCTAGTGTCTTCAAACTCAAAGAGGAGGAAATTGACAG
ATCGGAGAAGAAGCTGGAGAATGCTCTGGTCTGCAAAATATTATCACAAAAACAAATTAATCCTGAGATGTTCAAATCCAAGATGTCGCACATATGGAATCAAGAGCAGA
CGATTATCAGCATGGTGGGATTCAATTTGTATTTGTGCAAATTCAAGAATGGGAAGATAAAAAGCTTAATTGCAGACAATGGTCCTTGGTTCTTTGACAAGGCTTTATTA
TTATTCAAGGAACCAAAAGGAGGCAACTACGGGGATGATATCGAGTTCAGGTATGTATCCTTCTGGATCCATTTCCATAAACTCCCATTTGCTTGTTTTTCCAGGGAAGT
GGCAGCGGAAATAGGGAGCATACTTGGACAGGTGTGTCAGATTGATCTAGAGGAGGAAGTTGATCAATGTCGAGGTGGCACTTTGAGGGTGAAAATTCAAATTGATGCTA
CCAAGCCTTTGAAGAGGGGAATTTTCTTATCATCGGAGGATTCTACAGAGGACCGGTGGATTCCAATTACATATGAAAAGTTACCGGACTTTTGTTATGGGTGCGGCTTA
TTGGGACATACATTAAAAGAATGTGAGGGTTCAAACCATGACGGCTCTCCGGTTGAGGAACTGCCGTATGGGGCGTGGCTTCGGGAGCCGGTTTTGTTGAAAGCCCGAGA
GGGTGGATGGAGGGGAGGGCATCATCATGACGAAGAGGCTTATGGGGCTGGTGGGAATGAAAGGCGACAGGGAGGAACTGAGGAAGCCACGATCCGGGATCAACCATCGG
CAAGTGGACCTCCGACGATGAACAAACCGCCGGCAAACATTCCGACAGGGGCAGGTCCAGAAACGGATACAATGGCTAATAACTCTGATATGGAAAGAGTGGAAAAGGTA
ACAGAATTGAAGAAGGTGCCAGTAAAGGAAGGTGAAATAAAAGTGTCCTTAAAGGATAATTTTATTTCAAAGGGAAAAGTAAACTCACCGAACATGGATGAAGATTCTGG
GAACAATGGTCCAATAGGAAAAGAAAAGGAAAGTTATTGCTCAATTATGGAAGTGGATTTGGAAGATAATGGGTCAAAAGTGGAGGCCACGTCCACAGAGGTGAAGTCAA
CACAGGACAATGATGTAGGTGTTTTTCTAGATCCTAAAGGTAAGGGGGTTTTGTGCGATAATTCTAGTAACTTTAGTAAGGGTAAGGCAAAGGCTAAAGTCTCTCACTTG
ACGCGCATTGCGTCTGACCATAGATCATTGCTTGCAGAGTGGTCCATTGAGCCTCTGAATCCAAGTTTTGTTATCCCTAGGAGGCCCAGAAGATTTGAAGAAGGTTGGTG
TAAGTATGGGGAATGTCGAGAGATTGTGGCGACAGTTTGGAATTCTCAAGGGTGTCATGATATTTCAGACTTTAATACTAAGATTATGGATTGTCTTATGAGTCTTAACC
ATTGGAGCCGTCGCAAATATGGCGGCTCAATTAGAGGAGCTATTGCAAAAACTGAAAGAGACATTCAACATCTCTCCAAAAAGGACGACCAAACTTCCAGGACTGCTTTG
AGGGAGAAGGAGAAAACTCTTGAAAGCTTGTTAGAAGACGACGAAATATATTGGAAGCAGCGGTCTCGTGAAGAATGGTTACTTTGGGGGGATAGAAATACCAAATGGTT
TCATATGAGAGCCAACACTAGAAGGAAAAGGAATCGGATTAGAGGCATCATGAATGACTTGGGAATATGGACTGAGGAAGATAATGGAATGGAGTTTATTGTGAACAACT
ATTTTGCAAAACTTTTCCAATCGTCGGAACCTCAGATGGACTCCATTGCACATATTCTAGAATCAATTCCCACTTCTATTTCTGAGGCGCAGAATAATGATCTTGAGGAT
GCCTTAGCCATTTTAGCCACACCTACTAAGTCTAATATGGGGGAGGATGAAATTCTATGGAACCTTGATTCAAAAGGGAGATTCTCGGTGAAGGGTGCTTATCGTTTGGG
GTGTCAAATGAATCAAAGATTTCAAGCTTCCTCTGCGAATTACAAGGATCAAGAGGCCATGTGGAAGGATTTTTGGAAGCTCAAATTACCCCCGAAGATCAAAATATGTG
GCTGGAGGATCTACAATGACATCTTACCCACATTATCCAACCTTAATAATAGAGGGATGGATGTGTGGCCAGTATGTTTCCTGTGTAGGGAAAAAGAAGAGACAACATCC
CACCTCTTTTGGCATTGCAAGATGACTAAGGGATTGTGGGCTAAATATTTACCTCTTGATAACTTGAGCTGTCTTTTTGACAGGGAGGATAGGCGGATATCAGAGACTCT
AGATGGGTTATGGCAGAGAGGCGGGAACACTTCGACAAACATTCTTCACATCAAATGCAGTCTTATTATATGTTGGAGAATATGGTCTATTCGTAATTTAATCAGTCACA
ACAATCAGAGACTCAATCAAGAGACCATCAGAGACATACTTCAGCAACAAATTAGTGCATCCATTCACGAGCTAATAGGAGATGAGGAGCCTTACCAGATGCAGTGGCTG
GAGGGACAAACTGAGCGCCTTGCACCGTCCGGCCGAGGAGGAGTCTTGCTGGGAGATCCAACGGTACGGAGGAAATGGTCCCCAATCTCCGATGGCTGCTGGAAGCTCAG
CTACGATGCCTCCTGGCGTTCAGATCGCGAGTGTGGAAGCGTCGGTTGGGTGCTTCGAGATTGGAGCAGAACATTGTTAGCAGCGGGTTTCAAATGTATTAATTCGGCGT
CGGACATCAGCTGGCTAGAAGCTCTATCGATCGTCGAAGGTTTGCAGGCGATCCCTTCGGTCACTGGTGGAGTGCGTCTTCTTGTGGAGAACGATTCCTTGCAAGTGGTG
AATCTGATAAATGGGGAAGACGTGGATGAAACTGAGTTGAACTTCTTCATTAAAGAAGCTCAACGCATGTGTTCTATTAAAAAAGTGGATTCCATAACTCACATTCCTCG
GGCCCATAATTATTTGGCTCATAGTCTTGCCCAGAAGGCTTATGAAGAAGATGGCCCAAAGAGTTGGTCTCATTCATTCCCAGATTGGCTTTTAGATGAAAATGAGAGAG
ATACCGGTTGTGTACATCACAAAAATAGGGGATCCTGTCCTATTTGTGATCATGTTTCGAACACTTTTGCTGCGCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGATCAGAAGGCTGAGGAAACCTTGGTCAAACAACTCTCGGGACTTAAAGTTACTGAAGAAGAACGTGCTAGTGTCTTCAAACTCAAAGAGGAGGAAATTGACAG
ATCGGAGAAGAAGCTGGAGAATGCTCTGGTCTGCAAAATATTATCACAAAAACAAATTAATCCTGAGATGTTCAAATCCAAGATGTCGCACATATGGAATCAAGAGCAGA
CGATTATCAGCATGGTGGGATTCAATTTGTATTTGTGCAAATTCAAGAATGGGAAGATAAAAAGCTTAATTGCAGACAATGGTCCTTGGTTCTTTGACAAGGCTTTATTA
TTATTCAAGGAACCAAAAGGAGGCAACTACGGGGATGATATCGAGTTCAGGTATGTATCCTTCTGGATCCATTTCCATAAACTCCCATTTGCTTGTTTTTCCAGGGAAGT
GGCAGCGGAAATAGGGAGCATACTTGGACAGGTGTGTCAGATTGATCTAGAGGAGGAAGTTGATCAATGTCGAGGTGGCACTTTGAGGGTGAAAATTCAAATTGATGCTA
CCAAGCCTTTGAAGAGGGGAATTTTCTTATCATCGGAGGATTCTACAGAGGACCGGTGGATTCCAATTACATATGAAAAGTTACCGGACTTTTGTTATGGGTGCGGCTTA
TTGGGACATACATTAAAAGAATGTGAGGGTTCAAACCATGACGGCTCTCCGGTTGAGGAACTGCCGTATGGGGCGTGGCTTCGGGAGCCGGTTTTGTTGAAAGCCCGAGA
GGGTGGATGGAGGGGAGGGCATCATCATGACGAAGAGGCTTATGGGGCTGGTGGGAATGAAAGGCGACAGGGAGGAACTGAGGAAGCCACGATCCGGGATCAACCATCGG
CAAGTGGACCTCCGACGATGAACAAACCGCCGGCAAACATTCCGACAGGGGCAGGTCCAGAAACGGATACAATGGCTAATAACTCTGATATGGAAAGAGTGGAAAAGGTA
ACAGAATTGAAGAAGGTGCCAGTAAAGGAAGGTGAAATAAAAGTGTCCTTAAAGGATAATTTTATTTCAAAGGGAAAAGTAAACTCACCGAACATGGATGAAGATTCTGG
GAACAATGGTCCAATAGGAAAAGAAAAGGAAAGTTATTGCTCAATTATGGAAGTGGATTTGGAAGATAATGGGTCAAAAGTGGAGGCCACGTCCACAGAGGTGAAGTCAA
CACAGGACAATGATGTAGGTGTTTTTCTAGATCCTAAAGGTAAGGGGGTTTTGTGCGATAATTCTAGTAACTTTAGTAAGGGTAAGGCAAAGGCTAAAGTCTCTCACTTG
ACGCGCATTGCGTCTGACCATAGATCATTGCTTGCAGAGTGGTCCATTGAGCCTCTGAATCCAAGTTTTGTTATCCCTAGGAGGCCCAGAAGATTTGAAGAAGGTTGGTG
TAAGTATGGGGAATGTCGAGAGATTGTGGCGACAGTTTGGAATTCTCAAGGGTGTCATGATATTTCAGACTTTAATACTAAGATTATGGATTGTCTTATGAGTCTTAACC
ATTGGAGCCGTCGCAAATATGGCGGCTCAATTAGAGGAGCTATTGCAAAAACTGAAAGAGACATTCAACATCTCTCCAAAAAGGACGACCAAACTTCCAGGACTGCTTTG
AGGGAGAAGGAGAAAACTCTTGAAAGCTTGTTAGAAGACGACGAAATATATTGGAAGCAGCGGTCTCGTGAAGAATGGTTACTTTGGGGGGATAGAAATACCAAATGGTT
TCATATGAGAGCCAACACTAGAAGGAAAAGGAATCGGATTAGAGGCATCATGAATGACTTGGGAATATGGACTGAGGAAGATAATGGAATGGAGTTTATTGTGAACAACT
ATTTTGCAAAACTTTTCCAATCGTCGGAACCTCAGATGGACTCCATTGCACATATTCTAGAATCAATTCCCACTTCTATTTCTGAGGCGCAGAATAATGATCTTGAGGAT
GCCTTAGCCATTTTAGCCACACCTACTAAGTCTAATATGGGGGAGGATGAAATTCTATGGAACCTTGATTCAAAAGGGAGATTCTCGGTGAAGGGTGCTTATCGTTTGGG
GTGTCAAATGAATCAAAGATTTCAAGCTTCCTCTGCGAATTACAAGGATCAAGAGGCCATGTGGAAGGATTTTTGGAAGCTCAAATTACCCCCGAAGATCAAAATATGTG
GCTGGAGGATCTACAATGACATCTTACCCACATTATCCAACCTTAATAATAGAGGGATGGATGTGTGGCCAGTATGTTTCCTGTGTAGGGAAAAAGAAGAGACAACATCC
CACCTCTTTTGGCATTGCAAGATGACTAAGGGATTGTGGGCTAAATATTTACCTCTTGATAACTTGAGCTGTCTTTTTGACAGGGAGGATAGGCGGATATCAGAGACTCT
AGATGGGTTATGGCAGAGAGGCGGGAACACTTCGACAAACATTCTTCACATCAAATGCAGTCTTATTATATGTTGGAGAATATGGTCTATTCGTAATTTAATCAGTCACA
ACAATCAGAGACTCAATCAAGAGACCATCAGAGACATACTTCAGCAACAAATTAGTGCATCCATTCACGAGCTAATAGGAGATGAGGAGCCTTACCAGATGCAGTGGCTG
GAGGGACAAACTGAGCGCCTTGCACCGTCCGGCCGAGGAGGAGTCTTGCTGGGAGATCCAACGGTACGGAGGAAATGGTCCCCAATCTCCGATGGCTGCTGGAAGCTCAG
CTACGATGCCTCCTGGCGTTCAGATCGCGAGTGTGGAAGCGTCGGTTGGGTGCTTCGAGATTGGAGCAGAACATTGTTAGCAGCGGGTTTCAAATGTATTAATTCGGCGT
CGGACATCAGCTGGCTAGAAGCTCTATCGATCGTCGAAGGTTTGCAGGCGATCCCTTCGGTCACTGGTGGAGTGCGTCTTCTTGTGGAGAACGATTCCTTGCAAGTGGTG
AATCTGATAAATGGGGAAGACGTGGATGAAACTGAGTTGAACTTCTTCATTAAAGAAGCTCAACGCATGTGTTCTATTAAAAAAGTGGATTCCATAACTCACATTCCTCG
GGCCCATAATTATTTGGCTCATAGTCTTGCCCAGAAGGCTTATGAAGAAGATGGCCCAAAGAGTTGGTCTCATTCATTCCCAGATTGGCTTTTAGATGAAAATGAGAGAG
ATACCGGTTGTGTACATCACAAAAATAGGGGATCCTGTCCTATTTGTGATCATGTTTCGAACACTTTTGCTGCGCCTTAA
Protein sequenceShow/hide protein sequence
MADQKAEETLVKQLSGLKVTEEERASVFKLKEEEIDRSEKKLENALVCKILSQKQINPEMFKSKMSHIWNQEQTIISMVGFNLYLCKFKNGKIKSLIADNGPWFFDKALL
LFKEPKGGNYGDDIEFRYVSFWIHFHKLPFACFSREVAAEIGSILGQVCQIDLEEEVDQCRGGTLRVKIQIDATKPLKRGIFLSSEDSTEDRWIPITYEKLPDFCYGCGL
LGHTLKECEGSNHDGSPVEELPYGAWLREPVLLKAREGGWRGGHHHDEEAYGAGGNERRQGGTEEATIRDQPSASGPPTMNKPPANIPTGAGPETDTMANNSDMERVEKV
TELKKVPVKEGEIKVSLKDNFISKGKVNSPNMDEDSGNNGPIGKEKESYCSIMEVDLEDNGSKVEATSTEVKSTQDNDVGVFLDPKGKGVLCDNSSNFSKGKAKAKVSHL
TRIASDHRSLLAEWSIEPLNPSFVIPRRPRRFEEGWCKYGECREIVATVWNSQGCHDISDFNTKIMDCLMSLNHWSRRKYGGSIRGAIAKTERDIQHLSKKDDQTSRTAL
REKEKTLESLLEDDEIYWKQRSREEWLLWGDRNTKWFHMRANTRRKRNRIRGIMNDLGIWTEEDNGMEFIVNNYFAKLFQSSEPQMDSIAHILESIPTSISEAQNNDLED
ALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTS
HLFWHCKMTKGLWAKYLPLDNLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIRNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWL
EGQTERLAPSGRGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVV
NLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQKAYEEDGPKSWSHSFPDWLLDENERDTGCVHHKNRGSCPICDHVSNTFAAP