; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G15740 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G15740
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPolynucleotidyl transferase, ribonuclease H-like superfamily protein
Genome locationClcChr10:29468699..29477366
RNA-Seq ExpressionClc10G15740
SyntenyClc10G15740
Gene Ontology termsGO:0006139 - nucleobase-containing compound metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008408 - 3'-5' exonuclease activity (molecular function)
InterPro domainsIPR002562 - 3'-5' exonuclease domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053895.1 Filamentous hemagglutinin [Cucumis melo var. makuwa]5.1e-25188.85Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVGVSSSELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLS GNWPDRPIDSAYRDH+IVASFHAWKPVPF++N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKVSSSVSCF WLD S+FW FD IL  NQV ILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGST+ +PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT

Query:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHH---HHHYHHHHHHHN--QDAAY
        N  SSKQRLKQLA TITNSHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+A SPSPAPLPHSHHHHHHHHHHHHHH   HHH+HHHHHHHN  Q AAY
Subjt:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHH---HHHYHHHHHHHN--QDAAY

Query:  SPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVL
        SPSPGTEEHKHA KNGVSSAPEAGSSP+E PT+ KRNYEATPPAF+YGYKRSS K+RKQ HLGPIPS SS P SP+LRVGLPAPVSDSISASSPLSGVVL
Subjt:  SPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVL

Query:  SSVQSPNTGSGHAENFERSPPSVLPPQFS
        S+VQ PNTGSGHAENFERS PSVLPPQFS
Subjt:  SSVQSPNTGSGHAENFERSPPSVLPPQFS

TYK25511.1 Filamentous hemagglutinin [Cucumis melo var. makuwa]6.5e-21477.86Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVGVSSSELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLS GNWPDRPIDSAYRDH+IVASFHAWKPVPF++N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKV+                           ILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGST+ +PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT

Query:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAYSPSPG
        N  SSKQRLKQLA TITNSHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+A                                             PG
Subjt:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAYSPSPG

Query:  TEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVLSSVQS
        TEEHKHA KNGVSSAPEAGSSP+E PT+ KRNYEATPPAF+YGYKRSS K+RKQ HLGPIPS SS P SP+LRVGLPAPVSDSISASSPLSGVVLS+VQ 
Subjt:  TEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVLSSVQS

Query:  PNTGSGHAENFERSPPSVLPPQFS
        PNTGSGHAENFERS PSVLPPQFS
Subjt:  PNTGSGHAENFERSPPSVLPPQFS

XP_004136773.3 uncharacterized protein LOC101213172 isoform X1 [Cucumis sativus]1.2e-23184.73Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVG SSSELSDRNVE+RCGGGGCS IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLS GNWPDRP+DSAYRDH+IVASFHA KPVPF++ 
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKV+                           ILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGSTI +PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT

Query:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAYSPSPG
        N  SSKQRLKQLA TITNSHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+ARSPSPAPLPHSHHH HHHHHHHHHHHHH+HHHHHHH++DAAYSPSPG
Subjt:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAYSPSPG

Query:  TEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVLSSVQS
        TEEHKHA KNGVSSAPEAGSSP+E PT+ KRNYEATPPAF+YGYKRS  K+RK  +LGPIPS SS P SP+LRVG PAPVSDSISASSPLSGVVLS+VQ 
Subjt:  TEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVLSSVQS

Query:  PNTGSGHAENFERSPPSVLPPQFS
        PNTGSGHAENFERS PSVLPPQFS
Subjt:  PNTGSGHAENFERSPPSVLPPQFS

XP_008443610.1 PREDICTED: uncharacterized protein LOC103487165 [Cucumis melo]6.3e-23381.79Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVGVSSSELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLS GNWPDRPIDSAYRDH+IVASFHAWKPVPF++N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKV+                           ILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGST+ +PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT

Query:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHH------------
        N  SSKQRLKQLA TITNSHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+A SPSPAPLPHSHHHHHHHHHHHHHHHHH+HHHHHH            
Subjt:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHH------------

Query:  -------------HNQDAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVG
                     H+Q AAYSPSPGTEEHKHA KNGVSSAPEAGSSP+E PT+ KRNYEATPPAF+YGYKRSS K+RKQ HLGPIPS SS P SP+LRVG
Subjt:  -------------HNQDAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVG

Query:  LPAPVSDSISASSPLSGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFS
        LPAPVSDSISASSPLSGVVLS+VQ PNTGSGHAENFERS PSVLPPQFS
Subjt:  LPAPVSDSISASSPLSGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFS

XP_038904490.1 uncharacterized protein LOC120090859 [Benincasa hispida]2.2e-21478.74Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQ LPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLS AVFLSAIFWLPPFLS GNWPDRP+DSAYRDHEIVASFHAWKP P +EN
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIPVPFVKV+                           ILSLQSLGGPN TKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVIN+PPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT
        NASLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSN+RGST+H+PTIVQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT

Query:  NSSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAYSPSPGTE
        NSSKQRLKQLAQTITNSHS NLGLNNT+FGKVKQVRLSSVLNHSLGGGGSAR                                                
Subjt:  NSSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAYSPSPGTE

Query:  EHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVLSSVQSPN
        E +H LKNGVSSAPEAGSSPVESPT+  RNYEATPPAFQYGYKRSSRKVRKQ+HLGPIPS SS P SP+LRVGLPAPVSDSISASSPLSGVVLS+VQ PN
Subjt:  EHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVLSSVQSPN

Query:  TGSGHAENFERSPPSVLPPQFS
        +GS HAENF  S PSVLPPQFS
Subjt:  TGSGHAENFERSPPSVLPPQFS

TrEMBL top hitse value%identityAlignment
A0A0A0LHD1 Uncharacterized protein8.6e-22883.4Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVG SSSELSDRNVE+RCGGGGCS IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLS GNWPDRP+DSAYRDH+IVASFHA KPVPF++ 
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKV+                           ILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGSTI +PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT

Query:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAYSPSPG
        N  SSKQRLKQLA TITNSHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+ARSPSPAPLPHSHHH HHHHHHHHHHHHH+        +DAAYSPSPG
Subjt:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAYSPSPG

Query:  TEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVLSSVQS
        TEEHKHA KNGVSSAPEAGSSP+E PT+ KRNYEATPPAF+YGYKRS  K+RK  +LGPIPS SS P SP+LRVG PAPVSDSISASSPLSGVVLS+VQ 
Subjt:  TEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVLSSVQS

Query:  PNTGSGHAENFERSPPSVLPPQFS
        PNTGSGHAENFERS PSVLPPQFS
Subjt:  PNTGSGHAENFERSPPSVLPPQFS

A0A1S3B8E9 uncharacterized protein LOC1034871653.0e-23381.79Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVGVSSSELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLS GNWPDRPIDSAYRDH+IVASFHAWKPVPF++N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKV+                           ILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGST+ +PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT

Query:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHH------------
        N  SSKQRLKQLA TITNSHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+A SPSPAPLPHSHHHHHHHHHHHHHHHHH+HHHHHH            
Subjt:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHH------------

Query:  -------------HNQDAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVG
                     H+Q AAYSPSPGTEEHKHA KNGVSSAPEAGSSP+E PT+ KRNYEATPPAF+YGYKRSS K+RKQ HLGPIPS SS P SP+LRVG
Subjt:  -------------HNQDAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVG

Query:  LPAPVSDSISASSPLSGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFS
        LPAPVSDSISASSPLSGVVLS+VQ PNTGSGHAENFERS PSVLPPQFS
Subjt:  LPAPVSDSISASSPLSGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFS

A0A5A7UJM2 Filamentous hemagglutinin2.5e-25188.85Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVGVSSSELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLS GNWPDRPIDSAYRDH+IVASFHAWKPVPF++N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKVSSSVSCF WLD S+FW FD IL  NQV ILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGST+ +PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT

Query:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHH---HHHYHHHHHHHN--QDAAY
        N  SSKQRLKQLA TITNSHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+A SPSPAPLPHSHHHHHHHHHHHHHH   HHH+HHHHHHHN  Q AAY
Subjt:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHH---HHHYHHHHHHHN--QDAAY

Query:  SPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVL
        SPSPGTEEHKHA KNGVSSAPEAGSSP+E PT+ KRNYEATPPAF+YGYKRSS K+RKQ HLGPIPS SS P SP+LRVGLPAPVSDSISASSPLSGVVL
Subjt:  SPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVL

Query:  SSVQSPNTGSGHAENFERSPPSVLPPQFS
        S+VQ PNTGSGHAENFERS PSVLPPQFS
Subjt:  SSVQSPNTGSGHAENFERSPPSVLPPQFS

A0A5D3DPD6 Filamentous hemagglutinin3.2e-21477.86Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVGVSSSELSDRNVE+RCGGGGCS IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLS GNWPDRPIDSAYRDH+IVASFHAWKPVPF++N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIP+P VKV+                           ILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIK+TFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT
        N SLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDL+SQLRSGL LSPYENLYVSLSN+RGST+ +PT+VQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT

Query:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAYSPSPG
        N  SSKQRLKQLA TITNSHSGNLGLNNTVFGKVKQVRL S LNHSLGGGG+A                                             PG
Subjt:  N--SSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAYSPSPG

Query:  TEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVLSSVQS
        TEEHKHA KNGVSSAPEAGSSP+E PT+ KRNYEATPPAF+YGYKRSS K+RKQ HLGPIPS SS P SP+LRVGLPAPVSDSISASSPLSGVVLS+VQ 
Subjt:  TEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVLSSVQS

Query:  PNTGSGHAENFERSPPSVLPPQFS
        PNTGSGHAENFERS PSVLPPQFS
Subjt:  PNTGSGHAENFERSPPSVLPPQFS

A0A6J1F409 uncharacterized protein LOC1114419634.1e-21478.36Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN
        MGKSEEEQPLPVGVSSSELSD  V+SRCGGGGC  IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLS G+WPD+  DS YRDHEIVA F A KPVPF++N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMEN

Query:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL
        HIFELEDNIFGEIPVPFVKV+                           +LSLQSLGG NVT I+F+VD DAKYSKIPPTSQSLIK+TFETLVIN+PPLRL
Subjt:  HIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRL

Query:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT
        NASLFGNTSLFEVLKFPGGITIIPPQSAF LQ AQIYFNFTLNYSIYQIQVNFDDLTSQLRSGL LS YENLYVSLSN+RGST+ +PTIVQSSVLMAIGT
Subjt:  NASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGT

Query:  NSSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHS-----------HHHHHHHHHHHHHHHHHYHHHHHHHNQ
        NSS QRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSL  GG ARSPSPAPLPHS           HHHHH HHHHHHHHHHH+HHHHHHH+Q
Subjt:  NSSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHS-----------HHHHHHHHHHHHHHHHHYHHHHHHHNQ

Query:  DAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLS
        DA YSPSPGTEEHK+A KNG+SSAPEAGSSPVESP + KRNYEATPP F+YGYK  S KVRK+SHLG I S SSPP SP+LRVGLPAPV+ SISASSPL 
Subjt:  DAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLS

Query:  GVVLSSVQSPNTGSGHAENFERSPPSVLPPQFSWGI
        GV LS+VQ P  G       +RS PSVLPPQFS+ +
Subjt:  GVVLSSVQSPNTGSGHAENFERSPPSVLPPQFSWGI

SwissProt top hitse value%identityAlignment
Q84LH3 Werner Syndrome-like exonuclease4.3e-1939.22Show/hide
Query:  VGLDIEWRPNNRSYDNP--VATLQLCI-GRRCLILQLIHTPEIPKSLFEFLENESYTFVGVGIDEDAEKLTCDYGLKVGKRVDLRNLAESVTGRGDLKNA
        VGLDIEWRP+ R    P  VAT+Q+C+    C ++ + H+  IP+SL   +E+ +   VG+GID D+ KL  DYG+ +    DL +LA    G GD K  
Subjt:  VGLDIEWRPNNRSYDNP--VATLQLCI-GRRCLILQLIHTPEIPKSLFEFLENESYTFVGVGIDEDAEKLTCDYGLKVGKRVDLRNLAESVTGRGDLKNA

Query:  GLKRLGKEVLGKEIEKPKRVTLSRWDQQWLTLNQVKYACIDAFFSFEIGRFLQ
        GL  L + ++ KE+ KP R+ L  W+   L+  Q++YA  DA+ S+ + + L+
Subjt:  GLKRLGKEVLGKEIEKPKRVTLSRWDQQWLTLNQVKYACIDAFFSFEIGRFLQ

Q9VGN7 Exonuclease 3'-5' domain-containing protein 27.2e-1438.1Show/hide
Query:  IVGLDIEWRPNNRSYDNPVATLQLCIGR-RCLILQLIHTPEIPKSLFEFLENESYTFVGVGIDEDAEKLTCDYGLKVGKRVDLRNLAESVTGRGDLKNAG
        ++G D EW     S   PVA LQL   R  C + +L H  +IP+ L E LE++S   VGV   EDA KL+ DYG+ V   +DLR L   + G    K  G
Subjt:  IVGLDIEWRPNNRSYDNPVATLQLCIGR-RCLILQLIHTPEIPKSLFEFLENESYTFVGVGIDEDAEKLTCDYGLKVGKRVDLRNLAESVTGRGDLKNAG

Query:  LKRLGKEVLGKEIEKPKRVTLSRWDQQWLTLNQVKYACIDAFFSFEI
        L +L K  L   ++K  R+  S W+ + L   Q+ YA  DA  +  I
Subjt:  LKRLGKEVLGKEIEKPKRVTLSRWDQQWLTLNQVKYACIDAFFSFEI

Arabidopsis top hitse value%identityAlignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2)1.3e-3431.2Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGC-SGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFME
        M K  +E  L +   + +L +     R  G  C S   RL+ +RC+  L+LS A+ LSAIFWL P  S   +  +   +   +  + ASF   KPV  + 
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGC-SGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFME

Query:  NHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLR
         H  ++E +I   I                           + + ++V +LSL   G  N T + FAV       +I   S SL++ +F  L      L+
Subjt:  NHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLR

Query:  LNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIG
        L  S FG  + F+VLKFPGGIT+ P + A     A + F+ T+  SI  +Q   D L       L L PYE+++  L+NK+GSTI  P   Q  V   + 
Subjt:  LNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIG

Query:  TNSSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAP
             QRL    Q I  S + NLGL+  VFG+VK +  S+ L+  +       +P+P P
Subjt:  TNSSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein1.7e-9046.26Show/hide
Query:  MGKSEEEQPLPV--GVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFM
        MGK+E++  L V  G ++ + + RN  +RC  G C  I   +  +C+F LLLS A+FLSA+F L PF  D    D  +D  +R H IVASF   +   F+
Subjt:  MGKSEEEQPLPV--GVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFM

Query:  ENHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPL
          +  +L+++IF E+    +KV+                           IL+++     N+TK+VF +D D  Y +I P S S IK+ FE+++IN+  L
Subjt:  ENHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPL

Query:  RLNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAI
        +L  SLFG T LFEVLKFPGGIT+IPPQSAFPLQ  +I FNFTLNYSI+QIQ+NF+ L SQL++GL+L+PYENLYVSLSN  GST+  PT V SSVL+ +
Subjt:  RLNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAI

Query:  GTNSSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHS-HHHHHHHHHHHHHHHHHYHHHHHHHNQDAAYSPSP
        GT++S  RLKQL  TIT S S NLGLNNT+FGKVKQVRLSS L +S     S +SPSP+P PHS HHHHHHHHHHHHHHHHH HHHHHHHN     SP  
Subjt:  GTNSSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHS-HHHHHHHHHHHHHHHHHYHHHHHHHNQDAAYSPSP

Query:  GTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEA--TPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDS----ISASSPLSGV
                       APE   SPV SP  ++    A   PP    G +   ++ R Q    P P+ S+   +P  ++  PAP+S +    +  S+PL  V
Subjt:  GTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEA--TPPAFQYGYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDS----ISASSPLSGV

Query:  VLSSVQSP
        V +    P
Subjt:  VLSSVQSP

AT3G12410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.1e-2131.6Show/hide
Query:  DSHNLFDVTFDSEEPILTLLTTSPSMVDDWISETL---AIRTPPLIVGLDIEW---------RPNN--------RSY-DNPVATLQLCIGRRCLILQLIH
        ++H  + V F  +E I+T +T   S++  WI   L      + PL+VG+ ++W         RPNN        R Y DNP   LQLC+G RCLI+QL +
Subjt:  DSHNLFDVTFDSEEPILTLLTTSPSMVDDWISETL---AIRTPPLIVGLDIEW---------RPNN--------RSY-DNPVATLQLCIGRRCLILQLIH

Query:  TPEIPKSLFEFLENESYTFVGVGIDEDAEKLT-CDYGLKVGKRVDLRNLAESVTGRGDLKNAGLKRLGKEVLGKE-IEKPKRVTLSRWDQQWLTLNQVKY
          ++P +L  FL +   TFVGV   +DA KL  C + L++G+ +D+R       GR  ++ +  + + +E +G + +     +++S W    L L+Q+  
Subjt:  TPEIPKSLFEFLENESYTFVGVGIDEDAEKLT-CDYGLKVGKRVDLRNLAESVTGRGDLKNAGLKRLGKEVLGKE-IEKPKRVTLSRWDQQWLTLNQVKY

Query:  ACIDAFFSFEIG
        A +DA+   ++G
Subjt:  ACIDAFFSFEIG

AT3G56590.1 hydroxyproline-rich glycoprotein family protein2.2e-9044.91Show/hide
Query:  MGKSE-EEQPLPVGVSSSELSDRNVESRCGGGG------CSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWK
        MGK+  EEQ LP  VS    S RN     GGGG      C  I    ++RCV  L  SAAVFLSA+FWLPPFL   +  D  +D  ++DH IVASF   K
Subjt:  MGKSE-EEQPLPVGVSSSELSDRNVESRCGGGG------CSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWK

Query:  PVPFMENHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVI
        P+ FME+++ +LE++I  EI  P  K                           VV+L+L+ LG  N T ++FA+D + + SKIP   +SLIK  FETLV 
Subjt:  PVPFMENHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVI

Query:  NEPPLRLNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSS
         +   RL  SLFG    FEVLKFPGGIT+IPPQ  FPLQ AQ+ FNFTLN+SIYQIQ NF++L SQL+ G++L+ YENLY++LSN RGST+  PTIV SS
Subjt:  NEPPLRLNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSS

Query:  VLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAY
        VL+  G++S   RLKQLAQTIT+SHS NLGLN+TVFGKVKQVRLSS+L HS     ++ +PSP+P P +H + HHH HHHHHHH                
Subjt:  VLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAY

Query:  SPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLG---PIPSRSSP-PLSPFLRVGLPAPV-SDSISASSPL
        +P P              S P  G +P  +PT +       PP   Y  +R         H     P P RS P P +P      PAP    +I  SSPL
Subjt:  SPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLG---PIPSRSSP-PLSPFLRVGLPAPV-SDSISASSPL

Query:  SGVVLSSVQSPNTGSGHAENFERSPPSVLP
          VV + +  P+  S  +E      PS  P
Subjt:  SGVVLSSVQSPNTGSGHAENFERSPPSVLP

AT3G56590.2 hydroxyproline-rich glycoprotein family protein1.3e-9044.69Show/hide
Query:  MGKSE-EEQPLPVGVSSSELSDRNVESRCGGGG------CSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWK
        MGK+  EEQ LP  VS    S RN     GGGG      C  I    ++RCV  L  SAAVFLSA+FWLPPFL   +  D  +D  ++DH IVASF   K
Subjt:  MGKSE-EEQPLPVGVSSSELSDRNVESRCGGGG------CSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWK

Query:  PVPFMENHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVI
        P+ FME+++ +LE++I  EI  P  K                           VV+L+L+ LG  N T ++FA+D + + SKIP   +SLIK  FETLV 
Subjt:  PVPFMENHIFELEDNIFGEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVI

Query:  NEPPLRLNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSS
         +   RL  SLFG    FEVLKFPGGIT+IPPQ  FPLQ AQ+ FNFTLN+SIYQIQ NF++L SQL+ G++L+ YENLY++LSN RGST+  PTIV SS
Subjt:  NEPPLRLNASLFGNTSLFEVLKFPGGITIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSS

Query:  VLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAY
        VL+  G++S   RLKQLAQTIT+SHS NLGLN+TVFGKVKQVRLSS+L HS     ++ +PSP+P P +H + HHH HHHHHHH                
Subjt:  VLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAY

Query:  SPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLG---PIPSRSSP-PLSPFLRVGLPAPV-SDSISASSPL
        +P P              S P  G +P  +PT +       PP   Y  +R         H     P P RS P P +P      PAP    +I  SSPL
Subjt:  SPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQYGYKRSSRKVRKQSHLG---PIPSRSSP-PLSPFLRVGLPAPV-SDSISASSPL

Query:  SGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFSWGI
          VV + +  P+  S  +E      PS  P   S  I
Subjt:  SGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFSWGI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAGAGTGAAGAAGAACAGCCGCTGCCGGTTGGAGTGAGCTCCTCTGAGCTTTCTGACCGGAATGTGGAGAGCAGATGCGGCGGCGGTGGGTGCTCTGGGATTCG
TAGACTGATTGCGGTGAGATGTGTCTTCTTCCTGTTATTATCGGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCACCGTTCCTTTCCGATGGAAATTGGCCGGATC
GGCCTATTGATTCTGCTTATAGAGATCATGAAATAGTAGCAAGTTTTCATGCTTGGAAGCCAGTTCCTTTTATGGAAAACCATATTTTTGAGCTTGAGGATAACATTTTT
GGAGAAATACCCGTACCATTTGTCAAGGTATCCAGTTCAGTTTCTTGTTTCCTCTGGCTTGATGTGTCACATTTTTGGTCGTTTGATCAAATACTCTCTTTTAACCAGGT
GGTTATCCTCTCACTACAATCATTAGGTGGACCAAACGTAACAAAAATTGTTTTTGCGGTAGATTCTGATGCAAAGTATTCAAAAATTCCCCCAACATCTCAAAGTTTAA
TCAAGGATACCTTTGAAACATTGGTTATAAATGAACCTCCTCTGAGATTGAATGCATCATTATTTGGCAATACATCCTTATTCGAGGTGTTGAAATTTCCTGGAGGAATA
ACTATTATTCCTCCTCAGAGTGCATTTCCTCTGCAGGCGGCACAGATCTATTTCAACTTCACATTAAATTATTCTATTTATCAAATTCAAGTGAATTTTGATGATCTTAC
CAGCCAGTTGAGGTCAGGATTACATCTATCTCCTTATGAGAATTTGTATGTTAGCCTATCGAACAAAAGAGGTTCAACAATACATTCCCCCACTATTGTCCAGTCATCTG
TTCTGATGGCAATTGGGACTAACTCATCGAAACAAAGGCTAAAACAGTTGGCTCAAACCATCACAAATTCTCATTCAGGAAACCTTGGCCTGAACAACACTGTATTTGGT
AAGGTCAAGCAGGTGCGTCTTTCATCAGTCCTAAACCACTCTCTTGGTGGTGGTGGCAGTGCTCGGTCACCTTCACCTGCGCCTCTGCCTCATTCTCACCACCATCACCA
CCACCATCACCACCACCATCACCACCATCACCACCATTACCACCATCACCACCACCACCACAATCAGGATGCTGCATACTCACCTAGTCCTGGAACAGAGGAGCACAAAC
ATGCACTGAAGAATGGGGTCTCATCTGCTCCCGAAGCTGGTTCATCTCCAGTGGAAAGTCCAACTGCAAATAAAAGAAACTATGAAGCTACTCCGCCTGCTTTTCAATAT
GGATATAAGAGGTCTTCAAGAAAAGTCAGAAAACAATCTCATTTAGGCCCTATTCCTTCTCGAAGCAGTCCTCCATTGTCACCATTCTTACGAGTAGGCCTGCCAGCACC
TGTTTCTGATTCTATTTCTGCTTCAAGTCCACTGTCAGGTGTAGTTCTATCTAGTGTACAGTCTCCAAATACAGGCAGTGGACATGCAGAAAATTTTGAAAGAAGTCCCC
CTTCAGTCTTACCACCTCAATTTTCTTGGGGAATTGTATATTTGGTTTTGGTTAGAATTACACAGAGGCACAAGGCTGCTACTGTTACAGTTTCTGATGGAAAATCAAGC
AAGTTGGGAGATGCATTTTTCCAGGTCAAAGTCACAGAGGTGGCAGGCCTTTGTTCCTTCTTCTTCTCTGCCATGGCGATCACCATCGTTGACCATCAAGTTCCCTCCGA
TTCCCACAATTTGTTCGACGTAACTTTCGATTCCGAGGAGCCAATTCTCACTCTTCTCACCACTTCACCATCCATGGTAGATGATTGGATATCCGAAACCCTCGCCATTC
GAACTCCACCTCTCATCGTCGGCCTCGACATCGAATGGCGCCCTAATAATCGGTCCTACGACAACCCCGTCGCCACCTTGCAACTCTGCATCGGCCGCCGCTGCCTGATT
CTGCAACTGATCCACACACCTGAGATCCCTAAATCTCTGTTCGAGTTTCTGGAAAACGAATCCTACACATTCGTAGGAGTGGGAATCGACGAGGATGCTGAAAAGCTCAC
CTGTGATTACGGATTGAAAGTGGGGAAGAGAGTGGATCTGAGGAATTTGGCTGAGAGTGTAACGGGAAGAGGAGATTTGAAGAATGCGGGATTGAAGAGATTGGGGAAAG
AGGTTTTGGGGAAAGAGATTGAAAAGCCGAAGAGGGTGACGCTGAGTAGATGGGATCAACAGTGGCTTACTCTTAATCAGGTTAAGTATGCTTGTATTGATGCCTTTTTT
TCGTTTGAGATTGGAAGGTTTTTGCAATCTTCATCCTATTAA
mRNA sequenceShow/hide mRNA sequence
GAATTTGGTTCTGGGTTTGCGGTGGATTAGCTCCACGAGCTGTAATGGGTGAAGATAATTCAGACCCAATTGAGGGAGGTGACAATGGAGGTTATTAACCCCACTTCACA
TGCATTGCTTCCATGGGAAAGAGTGAAGAAGAACAGCCGCTGCCGGTTGGAGTGAGCTCCTCTGAGCTTTCTGACCGGAATGTGGAGAGCAGATGCGGCGGCGGTGGGTG
CTCTGGGATTCGTAGACTGATTGCGGTGAGATGTGTCTTCTTCCTGTTATTATCGGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCACCGTTCCTTTCCGATGGAA
ATTGGCCGGATCGGCCTATTGATTCTGCTTATAGAGATCATGAAATAGTAGCAAGTTTTCATGCTTGGAAGCCAGTTCCTTTTATGGAAAACCATATTTTTGAGCTTGAG
GATAACATTTTTGGAGAAATACCCGTACCATTTGTCAAGGTATCCAGTTCAGTTTCTTGTTTCCTCTGGCTTGATGTGTCACATTTTTGGTCGTTTGATCAAATACTCTC
TTTTAACCAGGTGGTTATCCTCTCACTACAATCATTAGGTGGACCAAACGTAACAAAAATTGTTTTTGCGGTAGATTCTGATGCAAAGTATTCAAAAATTCCCCCAACAT
CTCAAAGTTTAATCAAGGATACCTTTGAAACATTGGTTATAAATGAACCTCCTCTGAGATTGAATGCATCATTATTTGGCAATACATCCTTATTCGAGGTGTTGAAATTT
CCTGGAGGAATAACTATTATTCCTCCTCAGAGTGCATTTCCTCTGCAGGCGGCACAGATCTATTTCAACTTCACATTAAATTATTCTATTTATCAAATTCAAGTGAATTT
TGATGATCTTACCAGCCAGTTGAGGTCAGGATTACATCTATCTCCTTATGAGAATTTGTATGTTAGCCTATCGAACAAAAGAGGTTCAACAATACATTCCCCCACTATTG
TCCAGTCATCTGTTCTGATGGCAATTGGGACTAACTCATCGAAACAAAGGCTAAAACAGTTGGCTCAAACCATCACAAATTCTCATTCAGGAAACCTTGGCCTGAACAAC
ACTGTATTTGGTAAGGTCAAGCAGGTGCGTCTTTCATCAGTCCTAAACCACTCTCTTGGTGGTGGTGGCAGTGCTCGGTCACCTTCACCTGCGCCTCTGCCTCATTCTCA
CCACCATCACCACCACCATCACCACCACCATCACCACCATCACCACCATTACCACCATCACCACCACCACCACAATCAGGATGCTGCATACTCACCTAGTCCTGGAACAG
AGGAGCACAAACATGCACTGAAGAATGGGGTCTCATCTGCTCCCGAAGCTGGTTCATCTCCAGTGGAAAGTCCAACTGCAAATAAAAGAAACTATGAAGCTACTCCGCCT
GCTTTTCAATATGGATATAAGAGGTCTTCAAGAAAAGTCAGAAAACAATCTCATTTAGGCCCTATTCCTTCTCGAAGCAGTCCTCCATTGTCACCATTCTTACGAGTAGG
CCTGCCAGCACCTGTTTCTGATTCTATTTCTGCTTCAAGTCCACTGTCAGGTGTAGTTCTATCTAGTGTACAGTCTCCAAATACAGGCAGTGGACATGCAGAAAATTTTG
AAAGAAGTCCCCCTTCAGTCTTACCACCTCAATTTTCTTGGGGAATTGTATATTTGGTTTTGGTTAGAATTACACAGAGGCACAAGGCTGCTACTGTTACAGTTTCTGAT
GGAAAATCAAGCAAGTTGGGAGATGCATTTTTCCAGGTCAAAGTCACAGAGGTGGCAGGCCTTTGTTCCTTCTTCTTCTCTGCCATGGCGATCACCATCGTTGACCATCA
AGTTCCCTCCGATTCCCACAATTTGTTCGACGTAACTTTCGATTCCGAGGAGCCAATTCTCACTCTTCTCACCACTTCACCATCCATGGTAGATGATTGGATATCCGAAA
CCCTCGCCATTCGAACTCCACCTCTCATCGTCGGCCTCGACATCGAATGGCGCCCTAATAATCGGTCCTACGACAACCCCGTCGCCACCTTGCAACTCTGCATCGGCCGC
CGCTGCCTGATTCTGCAACTGATCCACACACCTGAGATCCCTAAATCTCTGTTCGAGTTTCTGGAAAACGAATCCTACACATTCGTAGGAGTGGGAATCGACGAGGATGC
TGAAAAGCTCACCTGTGATTACGGATTGAAAGTGGGGAAGAGAGTGGATCTGAGGAATTTGGCTGAGAGTGTAACGGGAAGAGGAGATTTGAAGAATGCGGGATTGAAGA
GATTGGGGAAAGAGGTTTTGGGGAAAGAGATTGAAAAGCCGAAGAGGGTGACGCTGAGTAGATGGGATCAACAGTGGCTTACTCTTAATCAGGTTAAGTATGCTTGTATT
GATGCCTTTTTTTCGTTTGAGATTGGAAGGTTTTTGCAATCTTCATCCTATTAA
Protein sequenceShow/hide protein sequence
MGKSEEEQPLPVGVSSSELSDRNVESRCGGGGCSGIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSDGNWPDRPIDSAYRDHEIVASFHAWKPVPFMENHIFELEDNIF
GEIPVPFVKVSSSVSCFLWLDVSHFWSFDQILSFNQVVILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKDTFETLVINEPPLRLNASLFGNTSLFEVLKFPGGI
TIIPPQSAFPLQAAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLHLSPYENLYVSLSNKRGSTIHSPTIVQSSVLMAIGTNSSKQRLKQLAQTITNSHSGNLGLNNTVFG
KVKQVRLSSVLNHSLGGGGSARSPSPAPLPHSHHHHHHHHHHHHHHHHHYHHHHHHHNQDAAYSPSPGTEEHKHALKNGVSSAPEAGSSPVESPTANKRNYEATPPAFQY
GYKRSSRKVRKQSHLGPIPSRSSPPLSPFLRVGLPAPVSDSISASSPLSGVVLSSVQSPNTGSGHAENFERSPPSVLPPQFSWGIVYLVLVRITQRHKAATVTVSDGKSS
KLGDAFFQVKVTEVAGLCSFFFSAMAITIVDHQVPSDSHNLFDVTFDSEEPILTLLTTSPSMVDDWISETLAIRTPPLIVGLDIEWRPNNRSYDNPVATLQLCIGRRCLI
LQLIHTPEIPKSLFEFLENESYTFVGVGIDEDAEKLTCDYGLKVGKRVDLRNLAESVTGRGDLKNAGLKRLGKEVLGKEIEKPKRVTLSRWDQQWLTLNQVKYACIDAFF
SFEIGRFLQSSSY