; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019847 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019847
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPolynucleotidyl transferase, ribonuclease H-like superfamily protein
Genome locationChr04:26089246..26096773
RNA-Seq ExpressionHG10019847
SyntenyHG10019847
Gene Ontology termsGO:0006139 - nucleobase-containing compound metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008408 - 3'-5' exonuclease activity (molecular function)
InterPro domainsIPR002562 - 3'-5' exonuclease domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053895.1 Filamentous hemagglutinin [Cucumis melo var. makuwa]2.7e-23084.31Show/hide
Query:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN
        MGKSEEEQPLPVGVSS+ELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPI SAYRDH+IVASFHAWKP PFL+N
Subjt:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN

Query:  HIFELEDNIFGEIPVPFVK---------------------------VAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL
        HIFELEDNIFGEIP+P VK                           VAILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVK---------------------------VAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPEGITIIPPQSAFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFP GITIIPPQSAF LQ  QIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSNERGSTM  PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPEGITIIPPQSAFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGT

Query:  N--SSKQRLKQLAQTITNSHSGNLGLNNTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYS-HHHHHHHHHHHHHHHRHHHHHHHHHHYN--QVAAY
        N  SSKQRLKQLA TITNSHSGNLGLNNT+FGKVKQVRL S LNHSLGGGG+A SPSPAPLP+S HHHHHHHHHHHHHHHRHHHHHHHHHH+N  Q AAY
Subjt:  N--SSKQRLKQLAQTITNSHSGNLGLNNTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYS-HHHHHHHHHHHHHHHRHHHHHHHHHHYN--QVAAY

Query:  SPSPGTEEHKHALKNGVSSAPEAGPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVL
        SPSPGTEEHKHA KNGVSSAPEAG SP+  PTS KR  E+TPPAF+YGYKRSS K+RKQ HLGPI SPSS P SPYLRVGLPAPVSDSISASSPLSGVVL
Subjt:  SPSPGTEEHKHALKNGVSSAPEAGPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVL

Query:  SSVQPPNTGSGHSENFERSAHSVLPPQFS
        S+VQPPNTGSGH+ENFERS+ SVLPPQFS
Subjt:  SSVQPPNTGSGHSENFERSAHSVLPPQFS

XP_004136773.3 uncharacterized protein LOC101213172 isoform X1 [Cucumis sativus]4.3e-22887.58Show/hide
Query:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN
        MGKSEEEQPLPVG SS+ELSDRNVE+RCGGGGCS IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRP+ SAYRDH+IVASFHA KP PFL+ 
Subjt:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN

Query:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS
        HIFELEDNIFGEIP+P VKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLN SLFGNTSLFEVLKFP GITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS

Query:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN
        AF LQ  QIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSNERGST+  PT+VQSSVLMAIGTN  SSKQRLKQLA TITNSHSGNLGLN
Subjt:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN

Query:  NTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYSHHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEAGPSPVGS
        NT+FGKVKQVRL S LNHSLGGGG+ARSPSPAPLP+SHHH HHHHHHHHHHH HHHHHHHHHH +  AAYSPSPGTEEHKHA KNGVSSAPEAG SP+  
Subjt:  NTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYSHHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEAGPSPVGS

Query:  PTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPNTGSGHSENFERSAHSVLPPQFS
        PTS KR  E+TPPAF+YGYKRS  K+RK  +LGPI SPSS PSSPYLRVG PAPVSDSISASSPLSGVVLS+VQPPNTGSGH+ENFERS+ SVLPPQFS
Subjt:  PTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPNTGSGHSENFERSAHSVLPPQFS

XP_008443610.1 PREDICTED: uncharacterized protein LOC103487165 [Cucumis melo]1.7e-23285.06Show/hide
Query:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN
        MGKSEEEQPLPVGVSS+ELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPI SAYRDH+IVASFHAWKP PFL+N
Subjt:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN

Query:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS
        HIFELEDNIFGEIP+P VKVAILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLN SLFGNTSLFEVLKFP GITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS

Query:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN
        AF LQ  QIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSNERGSTM  PT+VQSSVLMAIGTN  SSKQRLKQLA TITNSHSGNLGLN
Subjt:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN

Query:  NTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYSHHHHHHHHHHHHHHHRHHHHHHHHHH-----------------------YNQVAAYSPSPGTE
        NT+FGKVKQVRL S LNHSLGGGG+A SPSPAPLP+SHHHHHHHHHHHHHHH HHHHHHHHHH                       ++Q AAYSPSPGTE
Subjt:  NTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYSHHHHHHHHHHHHHHHRHHHHHHHHHH-----------------------YNQVAAYSPSPGTE

Query:  EHKHALKNGVSSAPEAGPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPN
        EHKHA KNGVSSAPEAG SP+  PTS KR  E+TPPAF+YGYKRSS K+RKQ HLGPI SPSS P SPYLRVGLPAPVSDSISASSPLSGVVLS+VQPPN
Subjt:  EHKHALKNGVSSAPEAGPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPN

Query:  TGSGHSENFERSAHSVLPPQFS
        TGSGH+ENFERS+ SVLPPQFS
Subjt:  TGSGHSENFERSAHSVLPPQFS

XP_022934949.1 uncharacterized protein LOC111441963 [Cucurbita moschata]3.8e-21682.84Show/hide
Query:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN
        MGKSEEEQPLPVGVSS+ELSD  V+SRCGGGGC  IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYG+WPD+   S YRDHEIVA F A KP PFL+N
Subjt:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN

Query:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS
        HIFELEDNIFGEIPVPFVKVA+LSLQSLGG NVT I+F+VD DAKYSKIPPTSQSLIKETFETLVIN+PPLRLNASLFGNTSLFEVLKFP GITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS

Query:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT
        AF LQ  QIYFNFTLNYSIYQIQVNFDDLTSQLRSGL LS YENLYVSLSNERGSTM  PTIVQSSVLMAIGTNSS QRLKQLAQTITNSHSGNLGLNNT
Subjt:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT

Query:  IFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYS---------HHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEA
        +FGKVKQVRLSSVLNHSL  GG ARSPSPAPLP+S         HHHHHHH HHHHHHH HHHHHHHHHH++Q A YSPSPGTEEHK+A KNG+SSAPEA
Subjt:  IFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYS---------HHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEA

Query:  GPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPNTGSGHSENFERSAHSV
        G SPV SP S+KR  E+TPP F+YGYK  S KVRK+SHLG I SPSSPPSSPYLRVGLPAPV+ SISASSPL GV LS+VQPP  G       +RSA SV
Subjt:  GPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPNTGSGHSENFERSAHSV

Query:  LPPQFSF
        LPPQFSF
Subjt:  LPPQFSF

XP_022983747.1 uncharacterized protein LOC111482272 isoform X2 [Cucurbita maxima]7.1e-21581.55Show/hide
Query:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN
        MGKSEEEQPLPVGVSS+ELSD  V+SRCGGGGC  IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYG+WPD+   S YRDHEIVA F A KP PFL+N
Subjt:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN

Query:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS
        HIFELEDNIFGEIPVPFVKVA+LSLQSLGG NVT I+F+VD DAKYSKIPPTSQSLIKETFETLVIN+PPLRLNASLFGNTSLFEVLKFP GITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS

Query:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT
        AF LQ  QIYFNFTLNYSIYQIQVNF+DLTSQLRSGL LS YENLYVSLSNERGSTM  PTIVQSSVLMAIGTNSS QRLKQLAQTITNSHSGNLGLNNT
Subjt:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT

Query:  IFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYS-----------------HHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKN
        +FGKVKQVRLSSVLNHSL  GG ARSPSPAPLP+S                 HHH HHHHHHHHHHH HHHHHHHH H++Q AAYSPSPGTEEHKHA KN
Subjt:  IFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYS-----------------HHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKN

Query:  GVSSAPEAGPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPNTGSGHSEN
        G+SSAPEAG SPV SP S+KR  E+TPP F+YGYK  S KVRK+SHLG I SPSSPPSSPYLRVGLPAPV+ SISASSPL GV LS+VQPP  G      
Subjt:  GVSSAPEAGPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPNTGSGHSEN

Query:  FERSAHSVLPPQFSF
         +RSA SVLPPQFSF
Subjt:  FERSAHSVLPPQFSF

TrEMBL top hitse value%identityAlignment
A0A0A0LHD1 Uncharacterized protein4.1e-22485.97Show/hide
Query:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN
        MGKSEEEQPLPVG SS+ELSDRNVE+RCGGGGCS IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRP+ SAYRDH+IVASFHA KP PFL+ 
Subjt:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN

Query:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS
        HIFELEDNIFGEIP+P VKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLN SLFGNTSLFEVLKFP GITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS

Query:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN
        AF LQ  QIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSNERGST+  PT+VQSSVLMAIGTN  SSKQRLKQLA TITNSHSGNLGLN
Subjt:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN

Query:  NTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYSHHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEAGPSPVGS
        NT+FGKVKQVRL S LNHSLGGGG+ARSPSPAPLP+SHHH HHHHHHHHHHH HH            AAYSPSPGTEEHKHA KNGVSSAPEAG SP+  
Subjt:  NTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYSHHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEAGPSPVGS

Query:  PTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPNTGSGHSENFERSAHSVLPPQFS
        PTS KR  E+TPPAF+YGYKRS  K+RK  +LGPI SPSS PSSPYLRVG PAPVSDSISASSPLSGVVLS+VQPPNTGSGH+ENFERS+ SVLPPQFS
Subjt:  PTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPNTGSGHSENFERSAHSVLPPQFS

A0A1S3B8E9 uncharacterized protein LOC1034871658.2e-23385.06Show/hide
Query:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN
        MGKSEEEQPLPVGVSS+ELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPI SAYRDH+IVASFHAWKP PFL+N
Subjt:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN

Query:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS
        HIFELEDNIFGEIP+P VKVAILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLN SLFGNTSLFEVLKFP GITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS

Query:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN
        AF LQ  QIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSNERGSTM  PT+VQSSVLMAIGTN  SSKQRLKQLA TITNSHSGNLGLN
Subjt:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTN--SSKQRLKQLAQTITNSHSGNLGLN

Query:  NTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYSHHHHHHHHHHHHHHHRHHHHHHHHHH-----------------------YNQVAAYSPSPGTE
        NT+FGKVKQVRL S LNHSLGGGG+A SPSPAPLP+SHHHHHHHHHHHHHHH HHHHHHHHHH                       ++Q AAYSPSPGTE
Subjt:  NTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYSHHHHHHHHHHHHHHHRHHHHHHHHHH-----------------------YNQVAAYSPSPGTE

Query:  EHKHALKNGVSSAPEAGPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPN
        EHKHA KNGVSSAPEAG SP+  PTS KR  E+TPPAF+YGYKRSS K+RKQ HLGPI SPSS P SPYLRVGLPAPVSDSISASSPLSGVVLS+VQPPN
Subjt:  EHKHALKNGVSSAPEAGPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPN

Query:  TGSGHSENFERSAHSVLPPQFS
        TGSGH+ENFERS+ SVLPPQFS
Subjt:  TGSGHSENFERSAHSVLPPQFS

A0A5A7UJM2 Filamentous hemagglutinin1.3e-23084.31Show/hide
Query:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN
        MGKSEEEQPLPVGVSS+ELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPI SAYRDH+IVASFHAWKP PFL+N
Subjt:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN

Query:  HIFELEDNIFGEIPVPFVK---------------------------VAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL
        HIFELEDNIFGEIP+P VK                           VAILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVK---------------------------VAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPEGITIIPPQSAFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFP GITIIPPQSAF LQ  QIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LS YENLYVSLSNERGSTM  PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPEGITIIPPQSAFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGT

Query:  N--SSKQRLKQLAQTITNSHSGNLGLNNTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYS-HHHHHHHHHHHHHHHRHHHHHHHHHHYN--QVAAY
        N  SSKQRLKQLA TITNSHSGNLGLNNT+FGKVKQVRL S LNHSLGGGG+A SPSPAPLP+S HHHHHHHHHHHHHHHRHHHHHHHHHH+N  Q AAY
Subjt:  N--SSKQRLKQLAQTITNSHSGNLGLNNTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYS-HHHHHHHHHHHHHHHRHHHHHHHHHHYN--QVAAY

Query:  SPSPGTEEHKHALKNGVSSAPEAGPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVL
        SPSPGTEEHKHA KNGVSSAPEAG SP+  PTS KR  E+TPPAF+YGYKRSS K+RKQ HLGPI SPSS P SPYLRVGLPAPVSDSISASSPLSGVVL
Subjt:  SPSPGTEEHKHALKNGVSSAPEAGPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVL

Query:  SSVQPPNTGSGHSENFERSAHSVLPPQFS
        S+VQPPNTGSGH+ENFERS+ SVLPPQFS
Subjt:  SSVQPPNTGSGHSENFERSAHSVLPPQFS

A0A6J1F409 uncharacterized protein LOC1114419631.8e-21682.84Show/hide
Query:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN
        MGKSEEEQPLPVGVSS+ELSD  V+SRCGGGGC  IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYG+WPD+   S YRDHEIVA F A KP PFL+N
Subjt:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN

Query:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS
        HIFELEDNIFGEIPVPFVKVA+LSLQSLGG NVT I+F+VD DAKYSKIPPTSQSLIKETFETLVIN+PPLRLNASLFGNTSLFEVLKFP GITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS

Query:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT
        AF LQ  QIYFNFTLNYSIYQIQVNFDDLTSQLRSGL LS YENLYVSLSNERGSTM  PTIVQSSVLMAIGTNSS QRLKQLAQTITNSHSGNLGLNNT
Subjt:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT

Query:  IFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYS---------HHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEA
        +FGKVKQVRLSSVLNHSL  GG ARSPSPAPLP+S         HHHHHHH HHHHHHH HHHHHHHHHH++Q A YSPSPGTEEHK+A KNG+SSAPEA
Subjt:  IFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYS---------HHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEA

Query:  GPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPNTGSGHSENFERSAHSV
        G SPV SP S+KR  E+TPP F+YGYK  S KVRK+SHLG I SPSSPPSSPYLRVGLPAPV+ SISASSPL GV LS+VQPP  G       +RSA SV
Subjt:  GPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPNTGSGHSENFERSAHSV

Query:  LPPQFSF
        LPPQFSF
Subjt:  LPPQFSF

A0A6J1J074 uncharacterized protein LOC111482272 isoform X23.5e-21581.55Show/hide
Query:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN
        MGKSEEEQPLPVGVSS+ELSD  V+SRCGGGGC  IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYG+WPD+   S YRDHEIVA F A KP PFL+N
Subjt:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN

Query:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS
        HIFELEDNIFGEIPVPFVKVA+LSLQSLGG NVT I+F+VD DAKYSKIPPTSQSLIKETFETLVIN+PPLRLNASLFGNTSLFEVLKFP GITIIPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS

Query:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT
        AF LQ  QIYFNFTLNYSIYQIQVNF+DLTSQLRSGL LS YENLYVSLSNERGSTM  PTIVQSSVLMAIGTNSS QRLKQLAQTITNSHSGNLGLNNT
Subjt:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT

Query:  IFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYS-----------------HHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKN
        +FGKVKQVRLSSVLNHSL  GG ARSPSPAPLP+S                 HHH HHHHHHHHHHH HHHHHHHH H++Q AAYSPSPGTEEHKHA KN
Subjt:  IFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYS-----------------HHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKN

Query:  GVSSAPEAGPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPNTGSGHSEN
        G+SSAPEAG SPV SP S+KR  E+TPP F+YGYK  S KVRK+SHLG I SPSSPPSSPYLRVGLPAPV+ SISASSPL GV LS+VQPP  G      
Subjt:  GVSSAPEAGPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPNTGSGHSEN

Query:  FERSAHSVLPPQFSF
         +RSA SVLPPQFSF
Subjt:  FERSAHSVLPPQFSF

SwissProt top hitse value%identityAlignment
Q84LH3 Werner Syndrome-like exonuclease6.9e-1938.56Show/hide
Query:  VGLDIEWRPNNRSYDNP--VATLQLCI-GRRCLILQLIHTPEIPKSLFEFLENESFTFVGVGIDEDAEKLNCDYGLKVGKRVDLRNLAESVTGRGDLKNA
        VGLDIEWRP+ R    P  VAT+Q+C+    C ++ + H+  IP+SL   +E+ +   VG+GID D+ KL  DYG+ +    DL +LA    G GD K  
Subjt:  VGLDIEWRPNNRSYDNP--VATLQLCI-GRRCLILQLIHTPEIPKSLFEFLENESFTFVGVGIDEDAEKLNCDYGLKVGKRVDLRNLAESVTGRGDLKNA

Query:  GLKKLGKEVLGKEIQKPKRVTMSRWDQEWLTLNQVKYACIDAFFSFEIGRHLQ
        GL  L + ++ KE+ KP R+ +  W+   L+  Q++YA  DA+ S+ + + L+
Subjt:  GLKKLGKEVLGKEIQKPKRVTMSRWDQEWLTLNQVKYACIDAFFSFEIGRHLQ

Q8VEG4 Exonuclease 3'-5' domain-containing protein 22.4e-1133.55Show/hide
Query:  IVGLDIEWRPNNRSYDNPVATLQLCI-GRRCLILQLIHT----PEIPKSLFEFLENESFTFVGVGIDEDAEKLNCDYGLKVGKRVDLRNLAESVTGRGDL
        ++G+D EW  N     +P++ LQ+      C +++L         +P++L + L + +   VGVG  EDA KL  DYGL V   +DLR LA         
Subjt:  IVGLDIEWRPNNRSYDNPVATLQLCI-GRRCLILQLIHT----PEIPKSLFEFLENESFTFVGVGIDEDAEKLNCDYGLKVGKRVDLRNLAESVTGRGDL

Query:  KNAGLKKLGKEVLGKEIQKPKRVTMSRWDQEWLTLNQVKYACIDAFFSFEIGRHL
            LK L + +L   + K   +  S WD E LT +QV YA  DA  S  +  HL
Subjt:  KNAGLKKLGKEVLGKEIQKPKRVTMSRWDQEWLTLNQVKYACIDAFFSFEIGRHL

Q9VGN7 Exonuclease 3'-5' domain-containing protein 23.0e-1438.41Show/hide
Query:  IVGLDIEWRPNNRSYDNPVATLQLCIGR-RCLILQLIHTPEIPKSLFEFLENESFTFVGVGIDEDAEKLNCDYGLKVGKRVDLRNLAESVTGRGDLKNAG
        ++G D EW     S   PVA LQL   R  C + +L H  +IP+ L E LE++S   VGV   EDA KL+ DYG+ V   +DLR L   + G    K  G
Subjt:  IVGLDIEWRPNNRSYDNPVATLQLCIGR-RCLILQLIHTPEIPKSLFEFLENESFTFVGVGIDEDAEKLNCDYGLKVGKRVDLRNLAESVTGRGDLKNAG

Query:  LKKLGKEVLGKEIQKPKRVTMSRWDQEWLTLNQVKYACIDAFFSFEIGRHL
        L KL K  L   + K  R+  S W+ + L   Q+ YA  DA  +  I + L
Subjt:  LKKLGKEVLGKEIQKPKRVTMSRWDQEWLTLNQVKYACIDAFFSFEIGRHL

Arabidopsis top hitse value%identityAlignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2)1.1e-3532.43Show/hide
Query:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGC-SGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLE
        M K  +E  L +   + +L +     R  G  C S   RL+ +RC+  L+LS A+ LSAIFWL P  S   +  +  G+   +  + ASF   KP   + 
Subjt:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGC-SGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLE

Query:  NHIFELEDNIFGEIPVP-FVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPP
         H  ++E +I   I +    KV +LSL   G  N T + FAV       +I   S SL++ +F  L      L+L  S FG  + F+VLKFP GIT+ P 
Subjt:  NHIFELEDNIFGEIPVP-FVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPP

Query:  QSAFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLN
        + A       + F+ T+  SI  +Q   D L       L L  YE+++  L+N++GST+ PP   Q  V   +      QRL    Q I  S + NLGL+
Subjt:  QSAFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLN

Query:  NTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAP
          +FG+VK +  S+ L+  +       +P+P P
Subjt:  NTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein1.8e-9447.4Show/hide
Query:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN
        MGK+E++  L V    A        +RC  G C  I   +  +C+F LLLS A+FLSA+F L PF    +  D  +   +R H IVASF   + A FL  
Subjt:  MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLEN

Query:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS
        +  +L+++IF E+    +KV IL+++     N+TK+VF +D D  Y +I P S S IKE FE+++IN+  L+L  SLFG T LFEVLKFP GIT+IPPQS
Subjt:  HIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQS

Query:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT
        AFPLQ  +I FNFTLNYSI+QIQ+NF+ L SQL++GL+L+ YENLYVSLSN  GST+ PPT V SSVL+ +GT++S  RLKQL  TIT S S NLGLNNT
Subjt:  AFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNT

Query:  IFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYSHHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEAGPSPVGSPT
        IFGKVKQVRLSS L +S     S +SPSP+P P+S HHHHHHHHHHHHHH HH+HHHHHHH       + SP               APE  P    +P 
Subjt:  IFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYSHHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEAGPSPVGSPT

Query:  SEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDS----ISASSPLSGVVLS-SVQPPNTGSGHSENFERSAHSVLPPQ
          +++  S PP    G +   ++ R Q    P  +PS+   +P+ ++  PAP+S +    +  S+PL  VV + + QPP T     E  E  A+ V  PQ
Subjt:  SEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPVSDS----ISASSPLSGVVLS-SVQPPNTGSGHSENFERSAHSVLPPQ

AT3G12410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.4e-2232.55Show/hide
Query:  DSHNFFDVTFDSDEPILTLLTTSPSMVDNWISETL---AIQTPPLIVGLDIEW---------RPNN--------RSY-DNPVATLQLCIGRRCLILQLIH
        ++H  + V F  DE I+T +T   S++  WI   L      + PL+VG+ ++W         RPNN        R Y DNP   LQLC+G RCLI+QL +
Subjt:  DSHNFFDVTFDSDEPILTLLTTSPSMVDNWISETL---AIQTPPLIVGLDIEW---------RPNN--------RSY-DNPVATLQLCIGRRCLILQLIH

Query:  TPEIPKSLFEFLENESFTFVGVGIDEDAEKL-NCDYGLKVGKRVDLRNLAESVTGRGDLKNAGLKKLGKEVLGKE-IQKPKRVTMSRWDQEWLTLNQVKY
          ++P +L  FL +   TFVGV   +DA KL  C + L++G+ +D+R       GR  ++ +  +++ +E +G + +     ++MS W    L L+Q+  
Subjt:  TPEIPKSLFEFLENESFTFVGVGIDEDAEKL-NCDYGLKVGKRVDLRNLAESVTGRGDLKNAGLKKLGKEVLGKE-IQKPKRVTMSRWDQEWLTLNQVKY

Query:  ACIDAFFSFEIG
        A +DA+   ++G
Subjt:  ACIDAFFSFEIG

AT3G56590.1 hydroxyproline-rich glycoprotein family protein3.0e-9446.31Show/hide
Query:  MGKSE-EEQPLPVGVSSAELSDRNVESRCGGGG------CSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWK
        MGK+  EEQ LP  VS    S RN     GGGG      C  I    ++RCV  L  SAAVFLSA+FWLPPFL + +  D  +   ++DH IVASF   K
Subjt:  MGKSE-EEQPLPVGVSSAELSDRNVESRCGGGG------CSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWK

Query:  PAPFLENHIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGI
        P  F+E+++ +LE++I  EI  P  KV +L+L+ LG  N T ++FA+D + + SKIP   +SLIK  FETLV  +   RL  SLFG    FEVLKFP GI
Subjt:  PAPFLENHIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGI

Query:  TIIPPQSAFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSG
        T+IPPQ  FPLQ  Q+ FNFTLN+SIYQIQ NF++L SQL+ G++L+SYENLY++LSN RGST+ PPTIV SSVL+  G++S   RLKQLAQTIT+SHS 
Subjt:  TIIPPQSAFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSG

Query:  NLGLNNTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYSHHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEAGP
        NLGLN+T+FGKVKQVRLSS+L HS     ++ +PSP+P P +H + HHH HHHHHHH                  +P P              S P  G 
Subjt:  NLGLNNTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYSHHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEAGP

Query:  SPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPV-SDSISASSPLSGVVLSSVQPPNTGSGHSENFERSAHSVL
        +P  +PT         PP   Y  +R         H  P  +P+   S P+     PAP    +I  SSPL  VV + + PP+  S  SE     + S  
Subjt:  SPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPV-SDSISASSPLSGVVLSSVQPPNTGSGHSENFERSAHSVL

Query:  P
        P
Subjt:  P

AT3G56590.2 hydroxyproline-rich glycoprotein family protein3.0e-9446.31Show/hide
Query:  MGKSE-EEQPLPVGVSSAELSDRNVESRCGGGG------CSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWK
        MGK+  EEQ LP  VS    S RN     GGGG      C  I    ++RCV  L  SAAVFLSA+FWLPPFL + +  D  +   ++DH IVASF   K
Subjt:  MGKSE-EEQPLPVGVSSAELSDRNVESRCGGGG------CSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWK

Query:  PAPFLENHIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGI
        P  F+E+++ +LE++I  EI  P  KV +L+L+ LG  N T ++FA+D + + SKIP   +SLIK  FETLV  +   RL  SLFG    FEVLKFP GI
Subjt:  PAPFLENHIFELEDNIFGEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGI

Query:  TIIPPQSAFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSG
        T+IPPQ  FPLQ  Q+ FNFTLN+SIYQIQ NF++L SQL+ G++L+SYENLY++LSN RGST+ PPTIV SSVL+  G++S   RLKQLAQTIT+SHS 
Subjt:  TIIPPQSAFPLQAPQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSG

Query:  NLGLNNTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYSHHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEAGP
        NLGLN+T+FGKVKQVRLSS+L HS     ++ +PSP+P P +H + HHH HHHHHHH                  +P P              S P  G 
Subjt:  NLGLNNTIFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPYSHHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEAGP

Query:  SPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPV-SDSISASSPLSGVVLSSVQPPNTGSGHSENFERSAHSVL
        +P  +PT         PP   Y  +R         H  P  +P+   S P+     PAP    +I  SSPL  VV + + PP+  S  SE     + S  
Subjt:  SPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPPSSPYLRVGLPAPV-SDSISASSPLSGVVLSSVQPPNTGSGHSENFERSAHSVL

Query:  P
        P
Subjt:  P


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAGAGTGAAGAAGAACAGCCGCTGCCGGTTGGAGTGAGCTCCGCTGAGCTTTCTGACCGGAATGTGGAGAGCAGATGCGGCGGCGGTGGGTGCTCTGGGATTCG
TAGACTGATTGCGGTGAGATGTGTCTTCTTCCTGTTACTGTCGGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCACCATTCCTATCCTATGGAAACTGGCCGGATC
GGCCTATTGGTTCTGCGTATAGAGATCATGAAATAGTAGCAAGTTTTCATGCTTGGAAGCCAGCTCCTTTTCTGGAAAACCATATTTTTGAGCTTGAGGATAACATTTTT
GGAGAAATACCCGTACCTTTTGTCAAGGTGGCTATCCTCTCACTACAATCATTAGGTGGACCAAACGTAACAAAAATTGTTTTTGCGGTAGATTCTGATGCCAAGTATTC
AAAAATTCCCCCAACATCTCAAAGTTTGATCAAGGAAACCTTTGAAACATTGGTTATAAATGAACCTCCTCTGAGATTGAATGCATCATTATTTGGCAATACATCCTTAT
TCGAGGTGTTGAAATTTCCTGAAGGAATTACTATTATTCCTCCTCAGAGTGCATTTCCTCTGCAGGCGCCACAGATCTATTTCAATTTCACATTAAATTATTCTATTTAT
CAAATTCAAGTGAATTTTGATGATCTTACCAGCCAGCTGAGGTCAGGATTACATCTATCTTCTTATGAGAATTTATATGTTAGCCTATCGAATGAAAGAGGTTCAACAAT
GCATCCCCCCACTATTGTCCAGTCATCTGTTCTGATGGCAATTGGGACTAATTCATCGAAACAAAGGCTAAAACAGTTGGCTCAAACCATCACAAATTCTCATTCAGGAA
ACCTTGGCCTGAATAACACTATATTTGGTAAGGTCAAGCAGGTGCGTCTTTCATCAGTCCTAAACCACTCTCTTGGTGGTGGTGGAAGTGCACGGTCACCTTCACCTGCG
CCTTTGCCTTATTCTCACCACCACCACCACCACCACCATCACCATCACCACCACCATCACCGCCATCATCACCATCACCACCATCACCACCACTACAATCAGGTTGCTGC
ATATTCACCCAGCCCTGGAACAGAGGAGCACAAACATGCACTGAAGAATGGGGTATCATCTGCTCCCGAAGCTGGTCCATCCCCAGTGGGAAGTCCAACTTCAGAAAAAA
GAAAGGATGAATCAACTCCGCCTGCTTTTCAATATGGATATAAAAGGTCTTCAAGAAAAGTCAGAAAACAATCTCATTTAGGCCCTATTTCTTCTCCAAGCAGTCCTCCA
TCGTCACCATACTTACGAGTAGGCCTGCCAGCACCTGTTTCTGATTCTATTTCTGCTTCAAGTCCACTGTCAGGTGTAGTTCTATCTAGTGTACAGCCTCCTAATACAGG
CAGTGGACATTCAGAAAATTTTGAAAGAAGCGCCCATTCAGTCTTACCACCACAATTTTCTTTCGTTCAACGTCACCTGCTTAACTCTTTGAAGTTCTTCTTCTTCTTCT
CTGCCATGGCGATCACCATCGTTGACCATGAAATTCCCTCCGATTCCCACAATTTCTTCGACGTAACTTTCGACTCCGACGAGCCGATTCTCACTCTGCTCACCACTTCA
CCATCCATGGTAGACAATTGGATTTCGGAAACCCTCGCCATTCAAACTCCACCTCTCATCGTCGGCCTCGACATCGAATGGCGCCCTAACAATCGCTCCTACGACAACCC
CGTCGCCACCTTGCAACTCTGCATCGGCCGCCGATGCTTGATTTTGCAACTGATCCACACACCTGAGATCCCTAAATCTCTGTTCGAGTTTCTGGAAAACGAATCCTTCA
CATTTGTAGGAGTGGGAATCGACGAGGATGCTGAAAAGCTCAACTGTGACTACGGATTGAAAGTGGGGAAGAGAGTAGATCTGAGGAATTTGGCTGAGAGTGTAACGGGA
AGAGGAGATTTGAAGAATGCGGGATTGAAGAAACTGGGAAAAGAGGTGTTGGGGAAAGAGATTCAGAAGCCGAAGCGTGTGACGATGAGTAGATGGGATCAAGAATGGCT
GACTCTTAATCAGGTTAAGTATGCTTGTATTGATGCGTTCTTTTCGTTTGAGATTGGAAGGCATTTGCAATCTTCATCTAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAAGAGTGAAGAAGAACAGCCGCTGCCGGTTGGAGTGAGCTCCGCTGAGCTTTCTGACCGGAATGTGGAGAGCAGATGCGGCGGCGGTGGGTGCTCTGGGATTCG
TAGACTGATTGCGGTGAGATGTGTCTTCTTCCTGTTACTGTCGGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCACCATTCCTATCCTATGGAAACTGGCCGGATC
GGCCTATTGGTTCTGCGTATAGAGATCATGAAATAGTAGCAAGTTTTCATGCTTGGAAGCCAGCTCCTTTTCTGGAAAACCATATTTTTGAGCTTGAGGATAACATTTTT
GGAGAAATACCCGTACCTTTTGTCAAGGTGGCTATCCTCTCACTACAATCATTAGGTGGACCAAACGTAACAAAAATTGTTTTTGCGGTAGATTCTGATGCCAAGTATTC
AAAAATTCCCCCAACATCTCAAAGTTTGATCAAGGAAACCTTTGAAACATTGGTTATAAATGAACCTCCTCTGAGATTGAATGCATCATTATTTGGCAATACATCCTTAT
TCGAGGTGTTGAAATTTCCTGAAGGAATTACTATTATTCCTCCTCAGAGTGCATTTCCTCTGCAGGCGCCACAGATCTATTTCAATTTCACATTAAATTATTCTATTTAT
CAAATTCAAGTGAATTTTGATGATCTTACCAGCCAGCTGAGGTCAGGATTACATCTATCTTCTTATGAGAATTTATATGTTAGCCTATCGAATGAAAGAGGTTCAACAAT
GCATCCCCCCACTATTGTCCAGTCATCTGTTCTGATGGCAATTGGGACTAATTCATCGAAACAAAGGCTAAAACAGTTGGCTCAAACCATCACAAATTCTCATTCAGGAA
ACCTTGGCCTGAATAACACTATATTTGGTAAGGTCAAGCAGGTGCGTCTTTCATCAGTCCTAAACCACTCTCTTGGTGGTGGTGGAAGTGCACGGTCACCTTCACCTGCG
CCTTTGCCTTATTCTCACCACCACCACCACCACCACCATCACCATCACCACCACCATCACCGCCATCATCACCATCACCACCATCACCACCACTACAATCAGGTTGCTGC
ATATTCACCCAGCCCTGGAACAGAGGAGCACAAACATGCACTGAAGAATGGGGTATCATCTGCTCCCGAAGCTGGTCCATCCCCAGTGGGAAGTCCAACTTCAGAAAAAA
GAAAGGATGAATCAACTCCGCCTGCTTTTCAATATGGATATAAAAGGTCTTCAAGAAAAGTCAGAAAACAATCTCATTTAGGCCCTATTTCTTCTCCAAGCAGTCCTCCA
TCGTCACCATACTTACGAGTAGGCCTGCCAGCACCTGTTTCTGATTCTATTTCTGCTTCAAGTCCACTGTCAGGTGTAGTTCTATCTAGTGTACAGCCTCCTAATACAGG
CAGTGGACATTCAGAAAATTTTGAAAGAAGCGCCCATTCAGTCTTACCACCACAATTTTCTTTCGTTCAACGTCACCTGCTTAACTCTTTGAAGTTCTTCTTCTTCTTCT
CTGCCATGGCGATCACCATCGTTGACCATGAAATTCCCTCCGATTCCCACAATTTCTTCGACGTAACTTTCGACTCCGACGAGCCGATTCTCACTCTGCTCACCACTTCA
CCATCCATGGTAGACAATTGGATTTCGGAAACCCTCGCCATTCAAACTCCACCTCTCATCGTCGGCCTCGACATCGAATGGCGCCCTAACAATCGCTCCTACGACAACCC
CGTCGCCACCTTGCAACTCTGCATCGGCCGCCGATGCTTGATTTTGCAACTGATCCACACACCTGAGATCCCTAAATCTCTGTTCGAGTTTCTGGAAAACGAATCCTTCA
CATTTGTAGGAGTGGGAATCGACGAGGATGCTGAAAAGCTCAACTGTGACTACGGATTGAAAGTGGGGAAGAGAGTAGATCTGAGGAATTTGGCTGAGAGTGTAACGGGA
AGAGGAGATTTGAAGAATGCGGGATTGAAGAAACTGGGAAAAGAGGTGTTGGGGAAAGAGATTCAGAAGCCGAAGCGTGTGACGATGAGTAGATGGGATCAAGAATGGCT
GACTCTTAATCAGGTTAAGTATGCTTGTATTGATGCGTTCTTTTCGTTTGAGATTGGAAGGCATTTGCAATCTTCATCTAATTGA
Protein sequenceShow/hide protein sequence
MGKSEEEQPLPVGVSSAELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIGSAYRDHEIVASFHAWKPAPFLENHIFELEDNIF
GEIPVPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNASLFGNTSLFEVLKFPEGITIIPPQSAFPLQAPQIYFNFTLNYSIY
QIQVNFDDLTSQLRSGLHLSSYENLYVSLSNERGSTMHPPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNTIFGKVKQVRLSSVLNHSLGGGGSARSPSPA
PLPYSHHHHHHHHHHHHHHHRHHHHHHHHHHYNQVAAYSPSPGTEEHKHALKNGVSSAPEAGPSPVGSPTSEKRKDESTPPAFQYGYKRSSRKVRKQSHLGPISSPSSPP
SSPYLRVGLPAPVSDSISASSPLSGVVLSSVQPPNTGSGHSENFERSAHSVLPPQFSFVQRHLLNSLKFFFFFSAMAITIVDHEIPSDSHNFFDVTFDSDEPILTLLTTS
PSMVDNWISETLAIQTPPLIVGLDIEWRPNNRSYDNPVATLQLCIGRRCLILQLIHTPEIPKSLFEFLENESFTFVGVGIDEDAEKLNCDYGLKVGKRVDLRNLAESVTG
RGDLKNAGLKKLGKEVLGKEIQKPKRVTMSRWDQEWLTLNQVKYACIDAFFSFEIGRHLQSSSN