CuGenDBv2

CcUC10G201540 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC10G201540
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Polynucleotidyl transferase, ribonuclease H-like superfamily protein
Genome location: CicolChr10:28709142..28717612
RNA-Seq Expression: CcUC10G201540
Synteny: CcUC10G201540
Gene Ontology terms:
GO:0006139 - nucleobase-containing compound metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008408 - 3'-5' exonuclease activity (molecular function)
InterPro domains:
IPR002562 - 3'-5' exonuclease domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
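
The genome location above is written as a sequence name, a colon, and a start..end pair. A minimal Python sketch for splitting such a string is shown here. The function name parse_location is hypothetical, and the span length assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re

def parse_location(loc):
    """Split 'name:start..end' into (name, start, end, span_length)."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    name, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return name, start, end, end - start + 1

name, start, end, span = parse_location("CicolChr10:28709142..28717612")
print(name, start, end, span)
```

Under the inclusive-coordinate assumption, the gene spans 8471 bases on CicolChr10.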


Homology
GenBank top hits | e value | % identity
KAA0053895.1 Filamentous hemagglutinin [Cucumis melo var. makuwa] | 2.9e-246 | 87.33
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVGVSSSELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLS GN PDRPIDSAYRDH+IVASFHAWKPVPF++N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKVSSSVSCF WLD S+FW FD IL  NQV ILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGSTM  PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT

Query:  N--SLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHH-------HHHHHHHHHHHN--QDAAY
        N  S KQRLKQLA TIT+SHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+A SPSPAPLPHSHHHHHHHHHH       HHHHHHHHHHHN  Q AAY
Subjt:  N--SLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHH-------HHHHHHHHHHHN--QDAAY

Query:  SPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDSISASSPLSGVVL
        SPSPGTEEHKHA KNGVSSAPEAGSSP+E PT+ KRN+EATP AF+YGYKRSS K+RKQ HLGPIPS SS P  P+LRVGLPAPVSDSISASSPLSGVVL
Subjt:  SPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDSISASSPLSGVVL

Query:  SSVQSPNTGSGHAENFERSPPSVLPPQFS
        S+VQ PNTGSGHAENFERS PSVLPPQFS
Subjt:  SSVQSPNTGSGHAENFERSPPSVLPPQFS

XP_004136773.3 uncharacterized protein LOC101213172 isoform X1 [Cucumis sativus] | 9.7e-226 | 83.02
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVG SSSELSDRNVE+RCGGGGCS IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLS GN PDRP+DSAYRDH+IVASFHA KPVPF++ 
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKV+                           ILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGST+  PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT

Query:  N--SLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHS----HHHHHHHHHHHHHHHHHHHHHNQDAAYSPSPG
        N  S KQRLKQLA TIT+SHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+ARSPSPAPLPHS    HHHHHHHHHHHHHHHHHHHHH++DAAYSPSPG
Subjt:  N--SLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHS----HHHHHHHHHHHHHHHHHHHHHNQDAAYSPSPG

Query:  TEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDSISASSPLSGVVLSSVQS
        TEEHKHA KNGVSSAPEAGSSP+E PT+ KRN+EATP AF+YGYKRS  K+RK  +LGPIPS SS P  P+LRVG PAPVSDSISASSPLSGVVLS+VQ 
Subjt:  TEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDSISASSPLSGVVLSSVQS

Query:  PNTGSGHAENFERSPPSVLPPQFS
        PNTGSGHAENFERS PSVLPPQFS
Subjt:  PNTGSGHAENFERSPPSVLPPQFS

XP_008443610.1 PREDICTED: uncharacterized protein LOC103487165 [Cucumis melo] | 3.5e-228 | 80.33
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVGVSSSELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLS GN PDRPIDSAYRDH+IVASFHAWKPVPF++N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKV+                           ILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGSTM  PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT

Query:  N--SLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHH----------------
        N  S KQRLKQLA TIT+SHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+A SPSPAPLPHSHHHHHHHHHHHHHHHHHHHH                
Subjt:  N--SLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHH----------------

Query:  -------------HNQDAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVG
                     H+Q AAYSPSPGTEEHKHA KNGVSSAPEAGSSP+E PT+ KRN+EATP AF+YGYKRSS K+RKQ HLGPIPS SS P  P+LRVG
Subjt:  -------------HNQDAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVG

Query:  LPAPVSDSISASSPLSGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFS
        LPAPVSDSISASSPLSGVVLS+VQ PNTGSGHAENFERS PSVLPPQFS
Subjt:  LPAPVSDSISASSPLSGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFS

XP_022983747.1 uncharacterized protein LOC111482272 isoform X2 [Cucurbita maxima] | 7.4e-210 | 76.1
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVGVSSSELSD  V+SRCGGGGC  IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLS G+ PD+  DS YRDHEIVA F A KPVPF++N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIPVPFVKV+                           +LSLQSLGG NVT I+F+VD DAKYSKIPPTSQSLIK+TFETLVIN+PPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT
        NASLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNF+DLTSQLRSGL LS YENLYVSLSN+RGSTM  PTIVQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT

Query:  NSLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHS-----------------------HHHHHHHHHHHHHHH
        NS  QRLKQLAQTIT+SHSGNLGLNNTVFGKVKQVRLSSVLNHSL  GG ARSPSPAPLPHS                       HHHHHHHHHHHHHHH
Subjt:  NSLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHS-----------------------HHHHHHHHHHHHHHH

Query:  HHHHHHNQDAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDS
        HHH HH+QDAAYSPSPGTEEHKHA KNG+SSAPEAGSSPVESP + KRN+EATP  F+YGYK  S KVRK+SHLG IPS SSPP  P+LRVGLPAPV+ S
Subjt:  HHHHHHNQDAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDS

Query:  ISASSPLSGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFSWGI
        ISASSPL GV LS+VQ P  G       +RS PSVLPPQFS+ +
Subjt:  ISASSPLSGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFSWGI

XP_038904490.1 uncharacterized protein LOC120090859 [Benincasa hispida] | 3.3e-210 | 78.38
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQ LPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLS AVFLSAIFWLPPFLS GN PDRP+DSAYRDHEIVASFHAWKP P +EN
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIPVPFVKV+                           ILSLQSLGGPN TKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVIN+PPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT
        NASLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSN+RGSTMH PTIVQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT

Query:  NSLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHNQDAAYSPSPGTEEHKH
        NS KQRLKQLAQTIT+SHS NLGLNNT+FGKVKQVRLSSVLNHSLGGGGSAR                                            E +H
Subjt:  NSLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHNQDAAYSPSPGTEEHKH

Query:  ALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDSISASSPLSGVVLSSVQSPNTGSG
         LKNGVSSAPEAGSSPVESPT+  RN+EATP AFQYGYKRSSRKVRKQ+HLGPIPS SS P  P+LRVGLPAPVSDSISASSPLSGVVLS+VQ PN+GS 
Subjt:  ALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDSISASSPLSGVVLSSVQSPNTGSG

Query:  HAENFERSPPSVLPPQFS
        HAENF  S PSVLPPQFS
Subjt:  HAENFERSPPSVLPPQFS

TrEMBL top hits | e value | % identity
A0A0A0LHD1 Uncharacterized protein | 3.4e-224 | 82.88
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVG SSSELSDRNVE+RCGGGGCS IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLS GN PDRP+DSAYRDH+IVASFHA KPVPF++ 
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKV+                           ILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGST+  PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT

Query:  N--SLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHNQDAAYSPSPGTEEH
        N  S KQRLKQLA TIT+SHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+ARSPSPAPLPHSHHH HHHHHHHHHHHHHH    +DAAYSPSPGTEEH
Subjt:  N--SLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHNQDAAYSPSPGTEEH

Query:  KHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDSISASSPLSGVVLSSVQSPNTG
        KHA KNGVSSAPEAGSSP+E PT+ KRN+EATP AF+YGYKRS  K+RK  +LGPIPS SS P  P+LRVG PAPVSDSISASSPLSGVVLS+VQ PNTG
Subjt:  KHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDSISASSPLSGVVLSSVQSPNTG

Query:  SGHAENFERSPPSVLPPQFS
        SGHAENFERS PSVLPPQFS
Subjt:  SGHAENFERSPPSVLPPQFS

A0A1S3B8E9 uncharacterized protein LOC103487165 | 1.7e-228 | 80.33
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVGVSSSELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLS GN PDRPIDSAYRDH+IVASFHAWKPVPF++N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKV+                           ILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGSTM  PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT

Query:  N--SLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHH----------------
        N  S KQRLKQLA TIT+SHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+A SPSPAPLPHSHHHHHHHHHHHHHHHHHHHH                
Subjt:  N--SLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHH----------------

Query:  -------------HNQDAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVG
                     H+Q AAYSPSPGTEEHKHA KNGVSSAPEAGSSP+E PT+ KRN+EATP AF+YGYKRSS K+RKQ HLGPIPS SS P  P+LRVG
Subjt:  -------------HNQDAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVG

Query:  LPAPVSDSISASSPLSGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFS
        LPAPVSDSISASSPLSGVVLS+VQ PNTGSGHAENFERS PSVLPPQFS
Subjt:  LPAPVSDSISASSPLSGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFS

A0A5A7UJM2 Filamentous hemagglutinin | 1.4e-246 | 87.33
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVGVSSSELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLS GN PDRPIDSAYRDH+IVASFHAWKPVPF++N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKVSSSVSCF WLD S+FW FD IL  NQV ILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGSTM  PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT

Query:  N--SLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHH-------HHHHHHHHHHHN--QDAAY
        N  S KQRLKQLA TIT+SHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+A SPSPAPLPHSHHHHHHHHHH       HHHHHHHHHHHN  Q AAY
Subjt:  N--SLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHH-------HHHHHHHHHHHN--QDAAY

Query:  SPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDSISASSPLSGVVL
        SPSPGTEEHKHA KNGVSSAPEAGSSP+E PT+ KRN+EATP AF+YGYKRSS K+RKQ HLGPIPS SS P  P+LRVGLPAPVSDSISASSPLSGVVL
Subjt:  SPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDSISASSPLSGVVL

Query:  SSVQSPNTGSGHAENFERSPPSVLPPQFS
        S+VQ PNTGSGHAENFERS PSVLPPQFS
Subjt:  SSVQSPNTGSGHAENFERSPPSVLPPQFS

A0A5D3DPD6 Filamentous hemagglutinin | 4.7e-210 | 77.5
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVGVSSSELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLS GN PDRPIDSAYRDH+IVASFHAWKPVPF++N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKV+                           ILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGSTM  PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT

Query:  N--SLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHNQDAAYSPSPGTEEH
        N  S KQRLKQLA TIT+SHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+A                                         PGTEEH
Subjt:  N--SLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHNQDAAYSPSPGTEEH

Query:  KHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDSISASSPLSGVVLSSVQSPNTG
        KHA KNGVSSAPEAGSSP+E PT+ KRN+EATP AF+YGYKRSS K+RKQ HLGPIPS SS P  P+LRVGLPAPVSDSISASSPLSGVVLS+VQ PNTG
Subjt:  KHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDSISASSPLSGVVLSSVQSPNTG

Query:  SGHAENFERSPPSVLPPQFS
        SGHAENFERS PSVLPPQFS
Subjt:  SGHAENFERSPPSVLPPQFS

A0A6J1J074 uncharacterized protein LOC111482272 isoform X2 | 3.6e-210 | 76.1
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVGVSSSELSD  V+SRCGGGGC  IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLS G+ PD+  DS YRDHEIVA F A KPVPF++N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIPVPFVKV+                           +LSLQSLGG NVT I+F+VD DAKYSKIPPTSQSLIK+TFETLVIN+PPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT
        NASLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNF+DLTSQLRSGL LS YENLYVSLSN+RGSTM  PTIVQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGT

Query:  NSLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHS-----------------------HHHHHHHHHHHHHHH
        NS  QRLKQLAQTIT+SHSGNLGLNNTVFGKVKQVRLSSVLNHSL  GG ARSPSPAPLPHS                       HHHHHHHHHHHHHHH
Subjt:  NSLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHS-----------------------HHHHHHHHHHHHHHH

Query:  HHHHHHNQDAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDS
        HHH HH+QDAAYSPSPGTEEHKHA KNG+SSAPEAGSSPVESP + KRN+EATP  F+YGYK  S KVRK+SHLG IPS SSPP  P+LRVGLPAPV+ S
Subjt:  HHHHHHNQDAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDS

Query:  ISASSPLSGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFSWGI
        ISASSPL GV LS+VQ P  G       +RS PSVLPPQFS+ +
Subjt:  ISASSPLSGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFSWGI

SwissProt top hits | e value | % identity
Q84LH3 Werner Syndrome-like exonuclease | 2.5e-19 | 39.22
Query:  VGLDIEWRPDNRSYDNP--VATLQLCI-GRRCLILQLIHTPEIPKSLFEFLENESFTFVGVGIDEDAEKLNCDYGLKLGKRVDLRNLAESVTGRGDLKNA
        VGLDIEWRP  R    P  VAT+Q+C+    C ++ + H+  IP+SL   +E+ +   VG+GID D+ KL  DYG+ +    DL +LA    G GD K  
Subjt:  VGLDIEWRPDNRSYDNP--VATLQLCI-GRRCLILQLIHTPEIPKSLFEFLENESFTFVGVGIDEDAEKLNCDYGLKLGKRVDLRNLAESVTGRGDLKNA

Query:  GLKRLVKEVLGKEIEKPKRVTLSRWDQQWLTLNQVKYACIDAFFSFEIGRFLQ
        GL  L + ++ KE+ KP R+ L  W+   L+  Q++YA  DA+ S+ + + L+
Subjt:  GLKRLVKEVLGKEIEKPKRVTLSRWDQQWLTLNQVKYACIDAFFSFEIGRFLQ

Q8VEG4 Exonuclease 3'-5' domain-containing protein 2 | 1.8e-09 | 31.76
Query:  IVGLDIEWRPDNRSYDNPVATLQLCI-GRRCLILQLIHT----PEIPKSLFEFLENESFTFVGVGIDEDAEKLNCDYGLKLGKRVDLRNLAESVTGRGDL
        ++G+D EW  +     +P++ LQ+      C +++L         +P++L + L + +   VGVG  EDA KL  DYGL +   +DLR LA         
Subjt:  IVGLDIEWRPDNRSYDNPVATLQLCI-GRRCLILQLIHT----PEIPKSLFEFLENESFTFVGVGIDEDAEKLNCDYGLKLGKRVDLRNLAESVTGRGDL

Query:  KNAGLKRLVKEVLGKEIEKPKRVTLSRWDQQWLTLNQVKYACIDAFFS
            LK L + +L   ++K   +  S WD + LT +QV YA  DA  S
Subjt:  KNAGLKRLVKEVLGKEIEKPKRVTLSRWDQQWLTLNQVKYACIDAFFS

Q9VGN7 Exonuclease 3'-5' domain-containing protein 2 | 2.1e-13 | 37.41
Query:  IVGLDIEWRPDNRSYDNPVATLQLCIGR-RCLILQLIHTPEIPKSLFEFLENESFTFVGVGIDEDAEKLNCDYGLKLGKRVDLRNLAESVTGRGDLKNAG
        ++G D EW     S   PVA LQL   R  C + +L H  +IP+ L E LE++S   VGV   EDA KL+ DYG+ +   +DLR L   + G    K  G
Subjt:  IVGLDIEWRPDNRSYDNPVATLQLCIGR-RCLILQLIHTPEIPKSLFEFLENESFTFVGVGIDEDAEKLNCDYGLKLGKRVDLRNLAESVTGRGDLKNAG

Query:  LKRLVKEVLGKEIEKPKRVTLSRWDQQWLTLNQVKYACIDAFFSFEI
        L +L K  L   ++K  R+  S W+ + L   Q+ YA  DA  +  I
Subjt:  LKRLVKEVLGKEIEKPKRVTLSRWDQQWLTLNQVKYACIDAFFSFEI

Arabidopsis top hits | e value | % identity
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2) | 4.0e-36 | 31.51
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGC-SGIRRLIAVRCVFFLLLSAAVFLSAIFWLPP------FLSDGNLPDRPIDSAYRDHEIVASFHAWK
        M K  +E  L +   + +L +     R  G  C S   RL+ +RC+  L+LS A+ LSAIFWL P      F +DG +          +  + ASF   K
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGC-SGIRRLIAVRCVFFLLLSAAVFLSAIFWLPP------FLSDGNLPDRPIDSAYRDHEIVASFHAWK

Query:  PVPFMENHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVI
        PV  +  H  ++E +I   I                           + + ++V +LSL   G  N T + FAV       +I   S SL++ +F  L  
Subjt:  PVPFMENHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVI

Query:  NEPPLRLNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSS
            L+L  S FG  + F+VLKFPGGIT+ P + A     A + F+ T+  SI  +Q   D L       L L PYE+++  L+NK+GST+ PP   Q  
Subjt:  NEPPLRLNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSS

Query:  VLMAIGTNSLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAP
        V   +    L QRL    Q I +S + NLGL+  VFG+VK +  S+ L+  +       +P+P P
Subjt:  VLMAIGTNSLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein | 1.7e-90 | 45.33
Query:  MGKSEEEQPLPV--GVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFM
        MGK+E++  L V  G ++ + + RN  +RC  G C  I   +  +C+F LLLS A+FLSA+F L PF  D    D  +D  +R H IVASF   +   F+
Subjt:  MGKSEEEQPLPV--GVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFM

Query:  ENHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPL
          +  +L+++IF E+    +KV+                           IL+++     N+TK+VF +D D  Y +I P S S IK+ FE+++IN+  L
Subjt:  ENHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPL

Query:  RLNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAI
        +L  SLFG T LFEVLKFPGGIT+IPPQSAFPLQ  +I FNFTLNYSI+QIQ+NF+ L SQL++GL+L+PYENLYVSLSN  GST+ PPT V SSVL+ +
Subjt:  RLNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAI

Query:  GTNSLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHNQDAAYSPSPGTEEH
        GT++   RLKQL  TIT S S NLGLNNT+FGKVKQVRLSS L +S     S +SPSP+P PHS HHHHHHHHHHHHHHHHH+HH              H
Subjt:  GTNSLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHNQDAAYSPSPGTEEH

Query:  KHALKNGVSSAPEAGSSPVESPTANKRNHEA--TPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDS----ISASSPLSGVVLSSV
         H        APE   SPV SP  ++    A   P     G +   ++ R Q    P P+ S+    P  ++  PAP+S +    +  S+PL  VV +  
Subjt:  KHALKNGVSSAPEAGSSPVESPTANKRNHEA--TPTAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDS----ISASSPLSGVVLSSV

Query:  QSP
          P
Subjt:  QSP

AT3G12410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 7.3e-22 | 31.6
Query:  DSHNLFDVTFDSEEPILTLLTTSPSMVDEWISETL---AIRTPPLIVGLDIEW---------RPDN--------RSY-DNPVATLQLCIGRRCLILQLIH
        ++H  + V F  +E I+T +T   S++  WI   L      + PL+VG+ ++W         RP+N        R Y DNP   LQLC+G RCLI+QL +
Subjt:  DSHNLFDVTFDSEEPILTLLTTSPSMVDEWISETL---AIRTPPLIVGLDIEW---------RPDN--------RSY-DNPVATLQLCIGRRCLILQLIH

Query:  TPEIPKSLFEFLENESFTFVGVGIDEDAEKL-NCDYGLKLGKRVDLRNLAESVTGRGDLKNAGLKRLVKEVLGKE-IEKPKRVTLSRWDQQWLTLNQVKY
          ++P +L  FL +   TFVGV   +DA KL  C + L++G+ +D+R       GR  ++ +  + +V+E +G + +     +++S W    L L+Q+  
Subjt:  TPEIPKSLFEFLENESFTFVGVGIDEDAEKL-NCDYGLKLGKRVDLRNLAESVTGRGDLKNAGLKRLVKEVLGKE-IEKPKRVTLSRWDQQWLTLNQVKY

Query:  ACIDAFFSFEIG
        A +DA+   ++G
Subjt:  ACIDAFFSFEIG

AT3G56590.1 hydroxyproline-rich glycoprotein family protein | 1.8e-92 | 45.33
Query:  MGKSE-EEQPLPVGVSSSELSDRNVESRCGGGG------CSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWK
        MGK+  EEQ LP  VS    S RN     GGGG      C  I    ++RCV  L  SAAVFLSA+FWLPPFL   +  D  +D  ++DH IVASF   K
Subjt:  MGKSE-EEQPLPVGVSSSELSDRNVESRCGGGG------CSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWK

Query:  PVPFMENHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVI
        P+ FME+++ +LE++I  EI  P  K                           VV+L+L+ LG  N T ++FA+D + + SKIP   +SLIK  FETLV 
Subjt:  PVPFMENHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVI

Query:  NEPPLRLNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSS
         +   RL  SLFG    FEVLKFPGGIT+IPPQ  FPLQ AQ+ FNFTLN+SIYQIQ NF++L SQL+ G++L+ YENLY++LSN RGST+ PPTIV SS
Subjt:  NEPPLRLNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSS

Query:  VLMAIGTNSLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHNQDAAYSPSP
        VL+  G++S   RLKQLAQTITSSHS NLGLN+TVFGKVKQVRLSS+L HS     ++ +PSP+P P +H + HHH HHHHHHH            +P P
Subjt:  VLMAIGTNSLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHNQDAAYSPSP

Query:  GTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLG---PIPSRSSPPLPPFLRVGLPAPV-SDSISASSPLSGVVL
                      S P  G +P  +PT +       P    Y  +R         H     P P RS P  P       PAP    +I  SSPL  VV 
Subjt:  GTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLG---PIPSRSSPPLPPFLRVGLPAPV-SDSISASSPLSGVVL

Query:  SSVQSPNTGSGHAENFERSPPSVLP
        + +  P+  S  +E      PS  P
Subjt:  SSVQSPNTGSGHAENFERSPPSVLP

AT3G56590.2 hydroxyproline-rich glycoprotein family protein | 1.0e-92 | 45.11
Query:  MGKSE-EEQPLPVGVSSSELSDRNVESRCGGGG------CSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWK
        MGK+  EEQ LP  VS    S RN     GGGG      C  I    ++RCV  L  SAAVFLSA+FWLPPFL   +  D  +D  ++DH IVASF   K
Subjt:  MGKSE-EEQPLPVGVSSSELSDRNVESRCGGGG------CSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWK

Query:  PVPFMENHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVI
        P+ FME+++ +LE++I  EI  P  K                           VV+L+L+ LG  N T ++FA+D + + SKIP   +SLIK  FETLV 
Subjt:  PVPFMENHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVI

Query:  NEPPLRLNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSS
         +   RL  SLFG    FEVLKFPGGIT+IPPQ  FPLQ AQ+ FNFTLN+SIYQIQ NF++L SQL+ G++L+ YENLY++LSN RGST+ PPTIV SS
Subjt:  NEPPLRLNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSS

Query:  VLMAIGTNSLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHNQDAAYSPSP
        VL+  G++S   RLKQLAQTITSSHS NLGLN+TVFGKVKQVRLSS+L HS     ++ +PSP+P P +H + HHH HHHHHHH            +P P
Subjt:  VLMAIGTNSLKQRLKQLAQTITSSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHNQDAAYSPSP

Query:  GTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLG---PIPSRSSPPLPPFLRVGLPAPV-SDSISASSPLSGVVL
                      S P  G +P  +PT +       P    Y  +R         H     P P RS P  P       PAP    +I  SSPL  VV 
Subjt:  GTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKRSSRKVRKQSHLG---PIPSRSSPPLPPFLRVGLPAPV-SDSISASSPLSGVVL

Query:  SSVQSPNTGSGHAENFERSPPSVLPPQFSWGI
        + +  P+  S  +E      PS  P   S  I
Subjt:  SSVQSPNTGSGHAENFERSPPSVLPPQFSWGI
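
In the alignment blocks above, the unlabeled line between Query and Subjt follows the usual BLAST convention: a letter marks an identical residue, a plus sign marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these columns for one block. The function name midline_stats is hypothetical, and the example string is the middle line of the last block of the first GenBank hit. The % identity values listed on this page cover a whole alignment, so per-block counts would need to be summed over all blocks before comparing.

```python
def midline_stats(midline):
    """Return (identical, positive, columns) for one BLAST midline string."""
    identical = sum(ch.isalpha() for ch in midline)   # letters = exact matches
    positive = identical + midline.count("+")         # '+' = conservative change
    return identical, positive, len(midline)

# Middle line of the final block of the KAA0053895.1 alignment.
block = "S+VQ PNTGSGHAENFERS PSVLPPQFS"
identical, positive, columns = midline_stats(block)
print(identical, positive, columns)
print(f"block identity: {100.0 * identical / columns:.2f}%")
```

Leading and trailing spaces in a midline are significant, so the string should be sliced to the same column range as the Query line rather than stripped.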


Sequences
CDS sequence
ATGGGAAAGAGTGAAGAAGAACAGCCGCTGCCGGTTGGAGTGAGCTCCTCTGAGCTTTCTGACCGGAATGTGGAGAGCAGATGCGGCGGCGGTGGGTGCTCTGGGATTCG
TAGACTGATTGCGGTGAGATGTGTCTTCTTCCTGTTATTATCGGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCACCGTTCCTTTCCGATGGAAATTTGCCGGATC
GGCCTATTGATTCTGCTTATAGAGATCATGAAATAGTAGCAAGTTTTCATGCTTGGAAGCCAGTTCCTTTTATGGAAAACCATATTTTTGAGCTTGAGGATAACATTTTT
GGAGAAATACCCGTACCATTTGTCAAGGTATCCAGTTCAGTTTCTTGTTTCCTCTGGCTTGATGTGTCACATTTTTGGTCGTTTGATCAAATACTCTCTTTTAACCAGGT
GGTTATCCTCTCACTACAATCATTAGGTGGACCAAACGTAACAAAAATTGTTTTTGCGGTAGATTCTGATGCAAAGTATTCGAAAATTCCCCCAACATCTCAAAGTTTAA
TCAAGGATACCTTTGAAACATTGGTTATAAATGAACCTCCTCTGAGATTGAATGCATCATTATTTGGTAATACATCCTTATTCGAGGTGTTGAAATTTCCTGGAGGAATA
ACTATTATTCCTCCTCAGAGTGCATTTCCTCTGCAGGCGGCACAGATCTATTTCAATTTCACATTAAATTATTCTATTTATCAAATTCAAGTGAATTTTGATGATCTTAC
CAGCCAGTTGAGGTCAGGATTACATCTATCTCCTTATGAGAATTTGTATGTTAGCCTATCGAACAAAAGAGGTTCAACAATGCATCCCCCCACTATTGTCCAGTCATCTG
TTCTGATGGCAATTGGGACTAACTCATTGAAACAAAGGCTAAAACAGTTGGCTCAAACCATCACAAGTTCTCATTCAGGAAACCTTGGCCTGAACAACACTGTATTTGGT
AAGGTCAAGCAGGTGCGTCTTTCATCAGTCCTAAACCACTCTCTTGGTGGTGGTGGCAGTGCTCGGTCACCTTCACCTGCGCCTCTGCCTCATTCTCACCACCACCACCA
CCATCACCACCACCATCACCACCATCACCACCATCACCACCACCACCACAATCAGGATGCTGCATACTCACCTAGTCCTGGAACAGAGGAGCACAAACATGCACTGAAGA
ATGGGGTATCATCTGCTCCCGAAGCTGGTTCATCTCCAGTGGAAAGTCCAACTGCAAATAAAAGAAACCATGAAGCTACTCCGACTGCTTTTCAATATGGATATAAGAGG
TCTTCAAGAAAAGTCAGAAAACAATCTCATTTAGGCCCTATTCCTTCTCGAAGCAGTCCTCCATTGCCACCATTCTTACGAGTAGGCCTGCCAGCACCTGTTTCTGATTC
TATTTCTGCTTCAAGTCCACTGTCAGGTGTAGTTCTATCTAGTGTACAGTCTCCAAATACAGGCAGTGGACATGCAGAAAATTTTGAAAGAAGTCCCCCTTCAGTCTTAC
CACCTCAATTTTCTTGGGGAATTGTATATTTGGTTTTGGTTAGAATTACACAGAGGCACAAGGCTGCAACTGTTACAGTTTCTGATGGAAAATCAAGCAAGTTTGGAGAT
GCATTTTTCCAGGTCAAAGTCGCAGAGGTGGCAGGCCTTTGTTCCTTCTTCTTATGTGCCATGGCGATCACCATCGTTGACCATCAAGTTCCCTCCGATTCCCACAATTT
GTTCGACGTAACTTTCGATTCCGAGGAGCCAATTCTCACTCTTCTCACCACTTCACCATCCATGGTAGATGAGTGGATATCCGAAACCCTCGCCATTCGAACTCCACCTC
TCATCGTCGGCCTCGACATCGAATGGCGCCCTGATAATCGGTCCTACGACAACCCCGTCGCCACCTTGCAACTCTGCATCGGCCGCCGCTGCCTGATTCTGCAACTGATC
CACACACCTGAGATCCCTAAATCTCTGTTCGAGTTTCTGGAAAACGAATCCTTCACATTCGTAGGAGTGGGAATCGACGAGGATGCTGAAAAGCTCAACTGTGATTACGG
ATTGAAATTGGGGAAGAGAGTGGATCTGAGGAATTTGGCCGAGAGTGTAACGGGAAGAGGAGATTTGAAGAATGCGGGATTGAAGAGATTGGTGAAAGAGGTTTTGGGGA
AAGAGATTGAAAAGCCGAAGAGGGTGACGTTGAGTAGATGGGATCAACAGTGGCTTACTCTTAATCAGGTTAAGTATGCTTGTATTGATGCCTTTTTTTCGTTTGAGATT
GGAAGGTTTTTGCAATCTTCATCCAATTAA
mRNA sequence
GAATTTGGTTCTGGGTTTGCGGTGGATTAGCTCCACGAGCTGTAATGGGTGACGATAATTCAGACCCAATTGAGGGAGGTGACAATGGAGGTTATTAACCCCACTTCACA
TGCATTGCTTCCATGGGAAAGAGTGAAGAAGAACAGCCGCTGCCGGTTGGAGTGAGCTCCTCTGAGCTTTCTGACCGGAATGTGGAGAGCAGATGCGGCGGCGGTGGGTG
CTCTGGGATTCGTAGACTGATTGCGGTGAGATGTGTCTTCTTCCTGTTATTATCGGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCACCGTTCCTTTCCGATGGAA
ATTTGCCGGATCGGCCTATTGATTCTGCTTATAGAGATCATGAAATAGTAGCAAGTTTTCATGCTTGGAAGCCAGTTCCTTTTATGGAAAACCATATTTTTGAGCTTGAG
GATAACATTTTTGGAGAAATACCCGTACCATTTGTCAAGGTATCCAGTTCAGTTTCTTGTTTCCTCTGGCTTGATGTGTCACATTTTTGGTCGTTTGATCAAATACTCTC
TTTTAACCAGGTGGTTATCCTCTCACTACAATCATTAGGTGGACCAAACGTAACAAAAATTGTTTTTGCGGTAGATTCTGATGCAAAGTATTCGAAAATTCCCCCAACAT
CTCAAAGTTTAATCAAGGATACCTTTGAAACATTGGTTATAAATGAACCTCCTCTGAGATTGAATGCATCATTATTTGGTAATACATCCTTATTCGAGGTGTTGAAATTT
CCTGGAGGAATAACTATTATTCCTCCTCAGAGTGCATTTCCTCTGCAGGCGGCACAGATCTATTTCAATTTCACATTAAATTATTCTATTTATCAAATTCAAGTGAATTT
TGATGATCTTACCAGCCAGTTGAGGTCAGGATTACATCTATCTCCTTATGAGAATTTGTATGTTAGCCTATCGAACAAAAGAGGTTCAACAATGCATCCCCCCACTATTG
TCCAGTCATCTGTTCTGATGGCAATTGGGACTAACTCATTGAAACAAAGGCTAAAACAGTTGGCTCAAACCATCACAAGTTCTCATTCAGGAAACCTTGGCCTGAACAAC
ACTGTATTTGGTAAGGTCAAGCAGGTGCGTCTTTCATCAGTCCTAAACCACTCTCTTGGTGGTGGTGGCAGTGCTCGGTCACCTTCACCTGCGCCTCTGCCTCATTCTCA
CCACCACCACCACCATCACCACCACCATCACCACCATCACCACCATCACCACCACCACCACAATCAGGATGCTGCATACTCACCTAGTCCTGGAACAGAGGAGCACAAAC
ATGCACTGAAGAATGGGGTATCATCTGCTCCCGAAGCTGGTTCATCTCCAGTGGAAAGTCCAACTGCAAATAAAAGAAACCATGAAGCTACTCCGACTGCTTTTCAATAT
GGATATAAGAGGTCTTCAAGAAAAGTCAGAAAACAATCTCATTTAGGCCCTATTCCTTCTCGAAGCAGTCCTCCATTGCCACCATTCTTACGAGTAGGCCTGCCAGCACC
TGTTTCTGATTCTATTTCTGCTTCAAGTCCACTGTCAGGTGTAGTTCTATCTAGTGTACAGTCTCCAAATACAGGCAGTGGACATGCAGAAAATTTTGAAAGAAGTCCCC
CTTCAGTCTTACCACCTCAATTTTCTTGGGGAATTGTATATTTGGTTTTGGTTAGAATTACACAGAGGCACAAGGCTGCAACTGTTACAGTTTCTGATGGAAAATCAAGC
AAGTTTGGAGATGCATTTTTCCAGGTCAAAGTCGCAGAGGTGGCAGGCCTTTGTTCCTTCTTCTTATGTGCCATGGCGATCACCATCGTTGACCATCAAGTTCCCTCCGA
TTCCCACAATTTGTTCGACGTAACTTTCGATTCCGAGGAGCCAATTCTCACTCTTCTCACCACTTCACCATCCATGGTAGATGAGTGGATATCCGAAACCCTCGCCATTC
GAACTCCACCTCTCATCGTCGGCCTCGACATCGAATGGCGCCCTGATAATCGGTCCTACGACAACCCCGTCGCCACCTTGCAACTCTGCATCGGCCGCCGCTGCCTGATT
CTGCAACTGATCCACACACCTGAGATCCCTAAATCTCTGTTCGAGTTTCTGGAAAACGAATCCTTCACATTCGTAGGAGTGGGAATCGACGAGGATGCTGAAAAGCTCAA
CTGTGATTACGGATTGAAATTGGGGAAGAGAGTGGATCTGAGGAATTTGGCCGAGAGTGTAACGGGAAGAGGAGATTTGAAGAATGCGGGATTGAAGAGATTGGTGAAAG
AGGTTTTGGGGAAAGAGATTGAAAAGCCGAAGAGGGTGACGTTGAGTAGATGGGATCAACAGTGGCTTACTCTTAATCAGGTTAAGTATGCTTGTATTGATGCCTTTTTT
TCGTTTGAGATTGGAAGGTTTTTGCAATCTTCATCCAATTAA
Protein sequence
MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNLPDRPIDSAYRDHEIVASFHAWKPVPFMENHIFELEDNIF
GEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRLNASLFGNTSLFEVLKFPGGI
TIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTMHPPTIVQSSVLMAIGTNSLKQRLKQLAQTITSSHSGNLGLNNTVFG
KVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHNQDAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNHEATPTAFQYGYKR
SSRKVRKQSHLGPIPSRSSPPLPPFLRVGLPAPVSDSISASSPLSGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFSWGIVYLVLVRITQRHKAATVTVSDGKSSKFGD
AFFQVKVAEVAGLCSFFLCAMAITIVDHQVPSDSHNLFDVTFDSEEPILTLLTTSPSMVDEWISETLAIRTPPLIVGLDIEWRPDNRSYDNPVATLQLCIGRRCLILQLI
HTPEIPKSLFEFLENESFTFVGVGIDEDAEKLNCDYGLKLGKRVDLRNLAESVTGRGDLKNAGLKRLVKEVLGKEIEKPKRVTLSRWDQQWLTLNQVKYACIDAFFSFEI
GRFLQSSSN
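
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1). The names CODON_TABLE and translate are hypothetical, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()      # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGGAAAGAGTGAAGAAGAACAG"))        # start of the CDS
print(translate("GGAAGGTTTTTGCAATCTTCATCCAATTAA"))  # end of the CDS
```

Joining the wrapped CDS lines into one string and passing it to translate should reproduce the full protein sequence if the CDS and protein records are consistent.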