; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG07G006830 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG07G006830
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionHydroxyproline-rich glycoprotein family protein, putative isoform 2
Genome locationCG_Chr07:13987379..14000042
RNA-Seq ExpressionClCG07G006830
SyntenyClCG07G006830
Gene Ontology termsGO:0006402 - mRNA catabolic process (biological process)
GO:0070988 - demethylation (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0032451 - demethylase activity (molecular function)
InterPro domainsIPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
IPR044842 - RNA demethylase ALKBH9B/ALKBH10B-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056373.1 Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Cucumis melo var. makuwa]0.0e+0089.78Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLND
        QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHR EATVKE+ +TCAESCNGGNSSSFV SRKVEQVSNTCDESKA GEDEKL++
Subjt:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLND

Query:  KDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDD
        KDSGSAEDNKDTHGKDQSNSK KCAENLED+A NK+SQVEP DDGCSSSHRDKELQSVQSQNGKQ+AATTPRTFVANEMFDGKMVNVMDGLKL+E+L DD
Subjt:  KDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDD

Query:  GEVSKLLSLVNDLRASGKRGQFQGQ--TYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSC
         EVSKLLSLVNDLRASGKRGQFQG+  TYVVSKRP KGHGREMIQLGFPIADAP++DDNS  LSKVNFTDRRIE IPSLLQDLIDRLVG+QVMTVKPDSC
Subjt:  GEVSKLLSLVNDLRASGKRGQFQGQ--TYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSC

Query:  IIDFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPAD
        IIDFYNEGDHSQPHVWP WFGRPVGVL LTECE+TFGRVIGTDHSGNY+GA+KLSLTPG LLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRA+PAD
Subjt:  IIDFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPAD

Query:  GQRTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGT
        GQR+SL+VG FS WGPPSARSPNPRLSPGQKPY  VPSTGVLP PPIRPQMAPPNGI PLIVP VAPPMPF PV IPTGPSTWPTAH RHPPPRLP+PGT
Subjt:  GQRTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGT

Query:  GVFLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNA
        GVFLPPPGSSSAP+PSPQQQLPNS +E GSLSEKENG TKSDHNSG  PGEKPE K  RQECNG++DGSG+DKV EEEQQQQ+EE+     AQNA
Subjt:  GVFLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNA

XP_008454722.1 PREDICTED: uncharacterized protein LOC103495063 isoform X1 [Cucumis melo]0.0e+0090.19Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLND
        QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHR EATVKE+ +TCAESCNGGNSSSFV SRKVEQVSNTCDESKA GEDEKL++
Subjt:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLND

Query:  KDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDD
        KDSGSAEDNKDTHGKDQSNSK KCAENLED+A NK+SQVEP DDGCSSSHRDKELQSVQSQNGKQ+AATTPRTFVANEMFDGKMVNVMDGLKL+E+L DD
Subjt:  KDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDD

Query:  GEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCII
         EVSKLLSLVNDLRASGKRGQFQGQTYVVSKRP KGHGREMIQLGFPIADAP++DDNS  LSKVNFTDRRIE IPSLLQDLIDRLVG+QVMTVKPDSCII
Subjt:  GEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCII

Query:  DFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQ
        DFYNEGDHSQPHVWP WFGRPVGVL LTECE+TFGRVIGTDHSGNY+GA+KLSLTPG LLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRA+PADGQ
Subjt:  DFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQ

Query:  RTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGV
        R+SL+VG FS WGPPSARSPNPRLSPGQKPY  VPSTGVLP PPIRPQMAPPNGI PLIVP VAPPMPF PV IPTGPSTWPTAH RHPPPRLP+PGTGV
Subjt:  RTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGV

Query:  FLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNA
        FLPPPGSSSAP+PSPQQQLPNS +E GSLSEKENG TKSDHNSG  PGEKPE K  RQECNG++DGSG+DKV EEEQQQQ+EE+     AQNA
Subjt:  FLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNA

XP_011654491.1 RNA demethylase ALKBH10B isoform X1 [Cucumis sativus]0.0e+0090.17Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ
        MAMPSGNVGVPDKVSFQS GGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA+IDALCHHLRAVGEPGEYD+VIGCIQQRRCNWTPVLHMQ
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ

Query:  QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDK
        QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHR EATVKE+ +TCAESCNGGNSS+FV SRKVEQVSNTCDESKA GEDEKL++K
Subjt:  QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDK

Query:  DSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDG
        DSGSA DNKDTHGKDQSN K K AENLED+A NK+SQVEP DDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVA+EMFDGKMVNVMDGLKL+E+L DD 
Subjt:  DSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDG

Query:  EVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIID
        EVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPH+DDNS GLSKVNFTDRRIE IPSLLQDLIDRLVG+QVMTVKPDSCIID
Subjt:  EVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIID

Query:  FYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQR
        FYNEGDHSQPHVWP WFGRPVGVL LTECE+TFGRVIGTDHSGNY+GAMKLSLTPG LLVVQGKSADFAKHA+PAIRKQRILVTLTKSQPKRAAPADGQR
Subjt:  FYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQR

Query:  TSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGVF
        TSL+VG FS WGPPSARSPNPRLSPGQKPYPTVPSTGVLP PPIRPQMAPPNGI PLIVPPVA PMPF PV IPTGPS WPTAH RHPPPRLP+PGTGVF
Subjt:  TSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGVF

Query:  LPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNA
        LPPPGSSSAP PSPQQQLP S +ETGSLSEKENG TKSDH+SG  PGEKP+ K  RQECNGS+DGSG+DKV EEEQQQQ+EE+     AQNA
Subjt:  LPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNA

XP_038892010.1 RNA demethylase ALKBH10B isoform X1 [Benincasa hispida]0.0e+0093.86Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ
        MAMPSGNVGVPDKVS+QSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA+IDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ

Query:  QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDK
        QYFSVAEVMYALQQVTSRRQQRYMDPVKVG KLYRRPGPGFKQQQGHRVEATVKE+I TCAESCNG NSSS VG RKVEQVSNTCDESKA GED KLNDK
Subjt:  QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDK

Query:  DSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDG
        DS SAEDNKDTHGKDQSNSKPKCAENLED+ASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAA TPRTFVANEMFDGKMVNVMDGLKL+E+L DD 
Subjt:  DSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDG

Query:  EVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIID
        EVSKLLSLVNDLRASGKRGQFQGQTYVV KRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCI+D
Subjt:  EVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIID

Query:  FYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQR
        FYNEGDHSQPHVWPPWFGRPVGVL LTECEMTFGRVIGTDHSGNY+GAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRA PADGQR
Subjt:  FYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQR

Query:  TSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGVF
        TSL++G FSSWGPPS RSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGI PLIVPPVAPPMPFPPV IPTGPS WPTAHPRHPPPRLP+PGTGVF
Subjt:  TSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGVF

Query:  LPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPK-PHRQECNGSMDGSGSDKVAEEEQ--QQQEEEQSENLQAQNAGGGAV
        LPPPGSSSAPAPSPQQ  PNS VETGSLSEKENGSTKSDHNSG SPGEKPE K P RQECNGSMDGSGSDKV EEEQ  QQQ+EEQ+EN QAQNAGGGAV
Subjt:  LPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPK-PHRQECNGSMDGSGSDKVAEEEQ--QQQEEEQSENLQAQNAGGGAV

XP_038892011.1 RNA demethylase ALKBH10B isoform X2 [Benincasa hispida]0.0e+0093.29Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ
        MAMPSGNVGVPDKVS+QSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA+IDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ

Query:  QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDK
        QYFSVAEVMYALQQVTSRRQQRYMDPVKVG KLYRRPGPGFKQQQGHRVEATVKE+I TCAESCNG NSSS VG RKVEQVSNTCDESKA GED KLNDK
Subjt:  QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDK

Query:  DSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDG
        DS SAEDNKDTHGKDQSNSKPKCAENLED+ASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAA TPRTFVANEMFDGKMVNVMDGLKL+E+L DD 
Subjt:  DSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDG

Query:  EVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIID
        EVSKLLSLVNDLRASGKRGQFQGQTYVV KRPMKGHGREMIQLGFPIADAPHDDDNSSGLSK    DRRIESIPSLLQDLIDRLVGEQVMTVKPDSCI+D
Subjt:  EVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIID

Query:  FYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQR
        FYNEGDHSQPHVWPPWFGRPVGVL LTECEMTFGRVIGTDHSGNY+GAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRA PADGQR
Subjt:  FYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQR

Query:  TSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGVF
        TSL++G FSSWGPPS RSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGI PLIVPPVAPPMPFPPV IPTGPS WPTAHPRHPPPRLP+PGTGVF
Subjt:  TSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGVF

Query:  LPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPK-PHRQECNGSMDGSGSDKVAEEEQ--QQQEEEQSENLQAQNAGGGAV
        LPPPGSSSAPAPSPQQ  PNS VETGSLSEKENGSTKSDHNSG SPGEKPE K P RQECNGSMDGSGSDKV EEEQ  QQQ+EEQ+EN QAQNAGGGAV
Subjt:  LPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPK-PHRQECNGSMDGSGSDKVAEEEQ--QQQEEEQSENLQAQNAGGGAV

TrEMBL top hitse value%identityAlignment
A0A1S3C013 uncharacterized protein LOC103495063 isoform X10.0e+0090.19Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLND
        QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHR EATVKE+ +TCAESCNGGNSSSFV SRKVEQVSNTCDESKA GEDEKL++
Subjt:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLND

Query:  KDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDD
        KDSGSAEDNKDTHGKDQSNSK KCAENLED+A NK+SQVEP DDGCSSSHRDKELQSVQSQNGKQ+AATTPRTFVANEMFDGKMVNVMDGLKL+E+L DD
Subjt:  KDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDD

Query:  GEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCII
         EVSKLLSLVNDLRASGKRGQFQGQTYVVSKRP KGHGREMIQLGFPIADAP++DDNS  LSKVNFTDRRIE IPSLLQDLIDRLVG+QVMTVKPDSCII
Subjt:  GEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCII

Query:  DFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQ
        DFYNEGDHSQPHVWP WFGRPVGVL LTECE+TFGRVIGTDHSGNY+GA+KLSLTPG LLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRA+PADGQ
Subjt:  DFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQ

Query:  RTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGV
        R+SL+VG FS WGPPSARSPNPRLSPGQKPY  VPSTGVLP PPIRPQMAPPNGI PLIVP VAPPMPF PV IPTGPSTWPTAH RHPPPRLP+PGTGV
Subjt:  RTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGV

Query:  FLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNA
        FLPPPGSSSAP+PSPQQQLPNS +E GSLSEKENG TKSDHNSG  PGEKPE K  RQECNG++DGSG+DKV EEEQQQQ+EE+     AQNA
Subjt:  FLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNA

A0A1S4E0A0 uncharacterized protein LOC103495063 isoform X20.0e+0089.61Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLND
        QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHR EATVKE+ +TCAESCNGGNSSSFV SRKVEQVSNTCDESKA GEDEKL++
Subjt:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLND

Query:  KDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDD
        KDSGSAEDNKDTHGKDQSNSK KCAENLED+A NK+SQVEP DDGCSSSHRDKELQSVQSQNGKQ+AATTPRTFVANEMFDGKMVNVMDGLKL+E+L DD
Subjt:  KDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDD

Query:  GEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCII
         EVSKLLSLVNDLRASGKRGQFQGQTYVVSKRP KGHGREMIQLGFPIADAP++DDNS  LSK    DRRIE IPSLLQDLIDRLVG+QVMTVKPDSCII
Subjt:  GEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCII

Query:  DFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQ
        DFYNEGDHSQPHVWP WFGRPVGVL LTECE+TFGRVIGTDHSGNY+GA+KLSLTPG LLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRA+PADGQ
Subjt:  DFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQ

Query:  RTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGV
        R+SL+VG FS WGPPSARSPNPRLSPGQKPY  VPSTGVLP PPIRPQMAPPNGI PLIVP VAPPMPF PV IPTGPSTWPTAH RHPPPRLP+PGTGV
Subjt:  RTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGV

Query:  FLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNA
        FLPPPGSSSAP+PSPQQQLPNS +E GSLSEKENG TKSDHNSG  PGEKPE K  RQECNG++DGSG+DKV EEEQQQQ+EE+     AQNA
Subjt:  FLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNA

A0A5A7UKE8 Hydroxyproline-rich glycoprotein family protein, putative isoform 20.0e+0089.78Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLND
        QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHR EATVKE+ +TCAESCNGGNSSSFV SRKVEQVSNTCDESKA GEDEKL++
Subjt:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLND

Query:  KDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDD
        KDSGSAEDNKDTHGKDQSNSK KCAENLED+A NK+SQVEP DDGCSSSHRDKELQSVQSQNGKQ+AATTPRTFVANEMFDGKMVNVMDGLKL+E+L DD
Subjt:  KDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDD

Query:  GEVSKLLSLVNDLRASGKRGQFQGQ--TYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSC
         EVSKLLSLVNDLRASGKRGQFQG+  TYVVSKRP KGHGREMIQLGFPIADAP++DDNS  LSKVNFTDRRIE IPSLLQDLIDRLVG+QVMTVKPDSC
Subjt:  GEVSKLLSLVNDLRASGKRGQFQGQ--TYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSC

Query:  IIDFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPAD
        IIDFYNEGDHSQPHVWP WFGRPVGVL LTECE+TFGRVIGTDHSGNY+GA+KLSLTPG LLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRA+PAD
Subjt:  IIDFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPAD

Query:  GQRTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGT
        GQR+SL+VG FS WGPPSARSPNPRLSPGQKPY  VPSTGVLP PPIRPQMAPPNGI PLIVP VAPPMPF PV IPTGPSTWPTAH RHPPPRLP+PGT
Subjt:  GQRTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGT

Query:  GVFLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNA
        GVFLPPPGSSSAP+PSPQQQLPNS +E GSLSEKENG TKSDHNSG  PGEKPE K  RQECNG++DGSG+DKV EEEQQQQ+EE+     AQNA
Subjt:  GVFLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNA

A0A5D3E038 Hydroxyproline-rich glycoprotein family protein, putative isoform 20.0e+0089.61Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLND
        QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHR EATVKE+ +TCAESCNGGNSSSFV SRKVEQVSNTCDESKA GEDEKL++
Subjt:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLND

Query:  KDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDD
        KDSGSAEDNKDTHGKDQSNSK KCAENLED+A NK+SQVEP DDGCSSSHRDKELQSVQSQNGKQ+AATTPRTFVANEMFDGKMVNVMDGLKL+E+L DD
Subjt:  KDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDD

Query:  GEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCII
         EVSKLLSLVNDLRASGKRGQFQGQTYVVSKRP KGHGREMIQLGFPIADAP++DDNS  LSK    DRRIE IPSLLQDLIDRLVG+QVMTVKPDSCII
Subjt:  GEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCII

Query:  DFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQ
        DFYNEGDHSQPHVWP WFGRPVGVL LTECE+TFGRVIGTDHSGNY+GA+KLSLTPG LLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRA+PADGQ
Subjt:  DFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQ

Query:  RTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGV
        R+SL+VG FS WGPPSARSPNPRLSPGQKPY  VPSTGVLP PPIRPQMAPPNGI PLIVP VAPPMPF PV IPTGPSTWPTAH RHPPPRLP+PGTGV
Subjt:  RTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGV

Query:  FLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNA
        FLPPPGSSSAP+PSPQQQLPNS +E GSLSEKENG TKSDHNSG  PGEKPE K  RQECNG++DGSG+DKV EEEQQQQ+EE+     AQNA
Subjt:  FLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNA

A0A6J1J9C0 uncharacterized protein LOC1114846090.0e+0088.48Show/hide
Query:  MAMPSGNVGVPDKVSFQS-GGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MAMPSGNVGVPDKVSFQS GGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLR EFAA+NAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPV+HM
Subjt:  MAMPSGNVGVPDKVSFQS-GGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFK----QQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDE
        QQYFSVA+V Y+LQQV SRRQQRY+DPVKVGPK YRRPGPGFK    QQQGHR+EATVKE+++TCAESCNGGNSSSFVGSRKVEQVSNTC+ESKA GEDE
Subjt:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFK----QQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDE

Query:  KLNDKDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQ
         LNDKDSGSAED KDTHGKDQ NSKPKCAE+LED+ASNKES VEPTDDGCSSS+R+KELQSVQSQNGKQYAATTPRTFVANEM DGKMVNVMDGLKL+E 
Subjt:  KLNDKDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQ

Query:  LFDDGEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPD
          DD EVSKLLSLVNDLRASGKRGQFQG TYVVSKRPMKGHGREMIQLGFPIAD PHDDDNSSGLSK    DRRIESIPSLLQDLIDRLVGEQ+M+VKPD
Subjt:  LFDDGEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPD

Query:  SCIIDFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAP
        SCIIDFYNEGDHSQPHVWPPWFGRPVGVL LTECEMTFGRVIGTDHSGNYKGAMKLSL PGTLLVV+GKSADFAKHAIPAIRKQRILVTLTKSQPKRA P
Subjt:  SCIIDFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAP

Query:  ADGQRTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPF-PPVSIPTGPSTWPTAHPRHPPPRLPI
        +DGQRTSL+VGPF+SWGPPS RSPNPRLSPGQK Y +VPSTGVLPAPPIRPQM PPNGI PLIV PVA PMPF PPV IPTGP TWPTAHPRHPPPRLP+
Subjt:  ADGQRTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGILPLIVPPVAPPMPF-PPVSIPTGPSTWPTAHPRHPPPRLPI

Query:  PGTGVFLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNAGG
        PGTGVFLPPPGSSSAP+P+P QQLPNSTVETGSLSEKENGSTKSDHN+GAS GEK E KP RQECN    GS S+KV EEEQQ+Q +EQSENLQAQ+AG 
Subjt:  PGTGVFLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECNGSMDGSGSDKVAEEEQQQQEEEQSENLQAQNAGG

Query:  GAV
        GAV
Subjt:  GAV

SwissProt top hitse value%identityAlignment
Q9SL49 RNA demethylase ALKBH9B5.8e-3130.83Show/hide
Query:  ENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQ
        E  E+    ++S  +  D     +    +L   Q +N +       + F+  E   GK+VNV+DGL+L+  +F   E  +++  V  L+  G+RG+ + +
Subjt:  ENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQ

Query:  TYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVM--TVKPDSCIIDFYNEGDHSQPHVWPPWFGRP-V
        T+    + M+G GRE IQ G     AP    N  G+         ++ +P L + +I +L+   V+  T  PDSCI++ Y+EGD   PH+    F RP  
Subjt:  TYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVM--TVKPDSCIIDFYNEGDHSQPHVWPPWFGRP-V

Query:  GVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTK
         + FL+EC++ FG  +  +  G++ G+  + L  G++LV+ G  AD AKH +PA+  +RI +T  K
Subjt:  GVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTK

Q9ZT92 RNA demethylase ALKBH10B1.1e-5832.02Show/hide
Query:  RDGFISWLRGEFAASNAMIDALCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGF
        +D  ISW RGEFAA+NA+IDA+C HLR   E     EY+ V   I +RR NW PVL MQ+Y S+AEV   LQ+V +++ +                    
Subjt:  RDGFISWLRGEFAASNAMIDALCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGF

Query:  KQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDKD-SGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEP
        KQ+   + E   +ED+               V + + E+V   C         EK+ + D +G  ED +D    D   S      ++ DS S+++     
Subjt:  KQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDKD-SGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEP

Query:  TDDGCSS----SHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGH
          D        SH D + +S + +  K         F A E   G  VNV+ GLKLYE+L  + E+SKLL  V +LR +G  G+  G+++++  + +KG+
Subjt:  TDDGCSS----SHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGH

Query:  GREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFG
         RE+IQLG PI      D+NS+  +        IE IP LL+ +ID  V  +++    +P+ C+I+F+ EG++SQP + PP   +P+  L L+E  M +G
Subjt:  GREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFG

Query:  RVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQRTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVP
        R++ +D+ GN++G + LSL  G+LLV++G SAD A+H +   + +R+ +T  + +P          +  + G  + W  P   +P P L+        +P
Subjt:  RVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQRTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVP

Query:  STGVLPAPPIRPQMAPPNGILPLIVPP--VAPPMPFPPVSIPTGPSTWPTAHPRHPPPR------LPIPGTGVFLPPPGSSSAP
          GVL  P +   MAPP  + P+I+P   V        V +P         H +H PPR      LP+P      P  GS+S P
Subjt:  STGVLPAPPIRPQMAPPNGILPLIVPP--VAPPMPFPPVSIPTGPSTWPTAHPRHPPPR------LPIPGTGVFLPPPGSSSAP

Arabidopsis top hitse value%identityAlignment
AT1G14710.1 hydroxyproline-rich glycoprotein family protein7.7e-13246.75Show/hide
Query:  PRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQ-----QRYMDPVKVGP
        P  W PDERDGFISWLR EFAA+NA+ID+LC HL+AVG+  EY+ VIG I  RR  W+ VL MQQ+F VA+V Y LQQ+  +RQ     QR+ +  +VG 
Subjt:  PRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQ-----QRYMDPVKVGP

Query:  KLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDKDSGSAEDNKDTHGKDQSNSKPKCAENLEDSA
           RR GPGF +  G        + +     + NG NS       +VE      +E+K   + + L+      AE+ +D  G ++  S  K  + LE+S 
Subjt:  KLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDKDSGSAEDNKDTHGKDQSNSKPKCAENLEDSA

Query:  SNKESQVEPTDDGCSSSHRDKELQSVQSQ--NGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQTYVVS
        + +E      +  C+S  +D  L S Q Q  N K+  A+  +TFV  EM+D KMVNV++GLKLY+++ D  EVS+L+SLV +LR +G+RGQ Q + YV  
Subjt:  SNKESQVEPTDDGCSSSHRDKELQSVQSQ--NGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQTYVVS

Query:  KRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLFLTEC
        KRP +GHGREMIQLG PIAD P DDD        +  DRRIE IPS L D+I+RLV +Q++ VKPD+CIIDF++EGDHSQPH++ PWFGRP+ VL L+EC
Subjt:  KRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLFLTEC

Query:  EMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQRTSLDVGPFSSWGPPSARSPNPRL---SPG
        + TFGRVI +++ G+YKG++KLSLTPG++L+V+GKSA+ AK+AI A RKQRIL++  KS+P+                 S+WGPP +RSPN  +   +  
Subjt:  EMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQRTSLDVGPFSSWGPPSARSPNPRL---SPG

Query:  QKPYPTV-PSTGVLPAPPIRPQMAPPNG-ILPLIV---PPVAPPMPFPPVSIPTGPSTWP--TAHPRH---PPPRLPIPGTGVFLPPPGSSSAPAPSPQQ
         K YP V PSTGVLP P  R    PPNG + P+ +   PP+A PMPFP   +PTGP  WP    HPRH   P PR+PIPGTGVFLPP         S Q+
Subjt:  QKPYPTV-PSTGVLPAPPIRPQMAPPNG-ILPLIV---PPVAPPMPFPPVSIPTGPSTWP--TAHPRH---PPPRLPIPGTGVFLPPPGSSSAPAPSPQQ

Query:  QLPNSTVETGSLSEKENGSTKSDHNSGASPG
           NS    G L  K     ++    G   G
Subjt:  QLPNSTVETGSLSEKENGSTKSDHNSGASPG

AT1G14710.2 hydroxyproline-rich glycoprotein family protein7.7e-13246.75Show/hide
Query:  PRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQ-----QRYMDPVKVGP
        P  W PDERDGFISWLR EFAA+NA+ID+LC HL+AVG+  EY+ VIG I  RR  W+ VL MQQ+F VA+V Y LQQ+  +RQ     QR+ +  +VG 
Subjt:  PRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQ-----QRYMDPVKVGP

Query:  KLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDKDSGSAEDNKDTHGKDQSNSKPKCAENLEDSA
           RR GPGF +  G        + +     + NG NS       +VE      +E+K   + + L+      AE+ +D  G ++  S  K  + LE+S 
Subjt:  KLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDKDSGSAEDNKDTHGKDQSNSKPKCAENLEDSA

Query:  SNKESQVEPTDDGCSSSHRDKELQSVQSQ--NGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQTYVVS
        + +E      +  C+S  +D  L S Q Q  N K+  A+  +TFV  EM+D KMVNV++GLKLY+++ D  EVS+L+SLV +LR +G+RGQ Q + YV  
Subjt:  SNKESQVEPTDDGCSSSHRDKELQSVQSQ--NGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQTYVVS

Query:  KRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLFLTEC
        KRP +GHGREMIQLG PIAD P DDD        +  DRRIE IPS L D+I+RLV +Q++ VKPD+CIIDF++EGDHSQPH++ PWFGRP+ VL L+EC
Subjt:  KRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLFLTEC

Query:  EMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQRTSLDVGPFSSWGPPSARSPNPRL---SPG
        + TFGRVI +++ G+YKG++KLSLTPG++L+V+GKSA+ AK+AI A RKQRIL++  KS+P+                 S+WGPP +RSPN  +   +  
Subjt:  EMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQRTSLDVGPFSSWGPPSARSPNPRL---SPG

Query:  QKPYPTV-PSTGVLPAPPIRPQMAPPNG-ILPLIV---PPVAPPMPFPPVSIPTGPSTWP--TAHPRH---PPPRLPIPGTGVFLPPPGSSSAPAPSPQQ
         K YP V PSTGVLP P  R    PPNG + P+ +   PP+A PMPFP   +PTGP  WP    HPRH   P PR+PIPGTGVFLPP         S Q+
Subjt:  QKPYPTV-PSTGVLPAPPIRPQMAPPNG-ILPLIV---PPVAPPMPFPPVSIPTGPSTWP--TAHPRH---PPPRLPIPGTGVFLPPPGSSSAPAPSPQQ

Query:  QLPNSTVETGSLSEKENGSTKSDHNSGASPG
           NS    G L  K     ++    G   G
Subjt:  QLPNSTVETGSLSEKENGSTKSDHNSGASPG

AT2G17970.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.1e-3230.83Show/hide
Query:  ENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQ
        E  E+    ++S  +  D     +    +L   Q +N +       + F+  E   GK+VNV+DGL+L+  +F   E  +++  V  L+  G+RG+ + +
Subjt:  ENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQ

Query:  TYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVM--TVKPDSCIIDFYNEGDHSQPHVWPPWFGRP-V
        T+    + M+G GRE IQ G     AP    N  G+         ++ +P L + +I +L+   V+  T  PDSCI++ Y+EGD   PH+    F RP  
Subjt:  TYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVM--TVKPDSCIIDFYNEGDHSQPHVWPPWFGRP-V

Query:  GVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTK
         + FL+EC++ FG  +  +  G++ G+  + L  G++LV+ G  AD AKH +PA+  +RI +T  K
Subjt:  GVLFLTECEMTFGRVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTK

AT2G48080.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein3.7e-4927.98Show/hide
Query:  RDGFISWLRGEFAASNAMIDALCHHL-RAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQ
        +D  ++W RGEFAA+NA+IDALC HL +A G   +Y+ V+  + +RR NW PVL MQ+Y S+++V   LQQ  ++    ++D                  
Subjt:  RDGFISWLRGEFAASNAMIDALCHHL-RAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQ

Query:  QQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDKDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDD
                                                              +D D  S   +    G  +  +   C ++               +D
Subjt:  QQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDKDSGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEPTDD

Query:  GCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQL
         C S       QS              + F A E   G   NV+ GLKLY+ +F   ++SKLL  +N LR +G+  Q  G+T+V+  +  KG  RE++QL
Subjt:  GCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQL

Query:  GFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDH
        G PI     D+ +             +E IP+L+Q +ID L+  +++    +P+ C+I+F++E +HSQP   PP   +P+  L L+E  M FG  +G D+
Subjt:  GFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTDH

Query:  SGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRA--------------------APADGQRTSLDVGPFSSWGPPSARSPNP
         GN++G++ L L  G+LLV++G SAD A+H +     +R+ +T  K +P                       APA  +R     G F  W PP +R P  
Subjt:  SGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRA--------------------APADGQRTSLDVGPFSSWGPPSARSPNP

Query:  RLSP
         L P
Subjt:  RLSP

AT4G02940.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein7.9e-6032.02Show/hide
Query:  RDGFISWLRGEFAASNAMIDALCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGF
        +D  ISW RGEFAA+NA+IDA+C HLR   E     EY+ V   I +RR NW PVL MQ+Y S+AEV   LQ+V +++ +                    
Subjt:  RDGFISWLRGEFAASNAMIDALCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGF

Query:  KQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDKD-SGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEP
        KQ+   + E   +ED+               V + + E+V   C         EK+ + D +G  ED +D    D   S      ++ DS S+++     
Subjt:  KQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDKD-SGSAEDNKDTHGKDQSNSKPKCAENLEDSASNKESQVEP

Query:  TDDGCSS----SHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGH
          D        SH D + +S + +  K         F A E   G  VNV+ GLKLYE+L  + E+SKLL  V +LR +G  G+  G+++++  + +KG+
Subjt:  TDDGCSS----SHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGH

Query:  GREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFG
         RE+IQLG PI      D+NS+  +        IE IP LL+ +ID  V  +++    +P+ C+I+F+ EG++SQP + PP   +P+  L L+E  M +G
Subjt:  GREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFG

Query:  RVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQRTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVP
        R++ +D+ GN++G + LSL  G+LLV++G SAD A+H +   + +R+ +T  + +P          +  + G  + W  P   +P P L+        +P
Subjt:  RVIGTDHSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQRTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVP

Query:  STGVLPAPPIRPQMAPPNGILPLIVPP--VAPPMPFPPVSIPTGPSTWPTAHPRHPPPR------LPIPGTGVFLPPPGSSSAP
          GVL  P +   MAPP  + P+I+P   V        V +P         H +H PPR      LP+P      P  GS+S P
Subjt:  STGVLPAPPIRPQMAPPNGILPLIVPP--VAPPMPFPPVSIPTGPSTWPTAHPRHPPPR------LPIPGTGVFLPPPGSSSAP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATGCCATCGGGAAATGTGGGTGTACCGGATAAAGTTTCGTTTCAGAGTGGTGGTGGTGGAGTTGCGGTGAGTGGTGGCGGTGGCGAGATCCATCAGCACCACCC
CCGCCCCTGGTTTCCAGATGAGCGTGATGGGTTTATCTCATGGTTGCGAGGTGAATTTGCTGCCTCAAATGCTATGATTGATGCCCTTTGCCATCATTTGCGTGCTGTGG
GAGAGCCTGGGGAGTATGACGTGGTTATTGGATGTATACAGCAACGGCGGTGTAATTGGACGCCGGTGCTTCATATGCAGCAGTATTTTTCAGTGGCAGAAGTGATGTAT
GCCCTTCAGCAGGTCACCTCAAGGAGGCAGCAGCGGTATATGGATCCTGTGAAAGTGGGGCCGAAGTTGTATAGGAGACCTGGGCCAGGGTTTAAGCAGCAGCAAGGCCA
TCGGGTTGAAGCCACAGTCAAGGAAGATATACTCACTTGTGCAGAGTCATGTAATGGTGGGAATTCTTCAAGTTTTGTAGGCTCTAGGAAGGTGGAGCAAGTAAGTAATA
CGTGTGATGAAAGTAAGGCATTGGGGGAGGATGAAAAATTGAACGATAAAGATTCAGGGTCAGCTGAGGACAATAAAGATACTCATGGGAAGGACCAAAGTAATAGCAAA
CCAAAGTGTGCAGAAAATTTAGAAGACAGTGCAAGTAATAAAGAATCTCAAGTTGAACCTACTGATGATGGATGTTCTTCAAGTCATAGAGATAAGGAGTTGCAGTCTGT
TCAAAGCCAGAATGGAAAGCAGTATGCTGCCACAACCCCGAGAACCTTTGTTGCCAATGAGATGTTTGATGGAAAGATGGTTAATGTGATGGATGGATTGAAATTATATG
AACAATTATTCGATGATGGCGAGGTTTCAAAGCTTCTTTCACTGGTGAATGATTTGAGGGCTTCCGGAAAGAGAGGGCAATTTCAAGGTCAGACGTATGTGGTCTCAAAA
AGACCCATGAAGGGACATGGGAGAGAAATGATCCAACTAGGCTTTCCCATTGCAGATGCTCCTCATGATGATGACAATTCTTCAGGGCTCTCTAAAGTGAACTTTACAGA
TAGAAGAATAGAATCCATCCCCTCACTGCTTCAAGATCTCATTGATCGATTGGTTGGGGAGCAAGTGATGACAGTGAAACCAGATTCCTGCATCATTGATTTTTATAATG
AGGGTGATCATTCTCAGCCTCATGTCTGGCCACCATGGTTTGGGAGGCCCGTTGGTGTCCTCTTTTTGACTGAATGTGAAATGACCTTTGGTAGAGTGATTGGTACAGAC
CATTCTGGCAACTATAAAGGGGCTATGAAGTTGTCTCTCACACCCGGAACCCTTCTTGTGGTGCAAGGAAAATCTGCAGATTTTGCTAAGCATGCAATTCCCGCTATCCG
CAAGCAACGTATTCTTGTTACGTTGACCAAATCACAACCGAAAAGAGCTGCACCAGCTGATGGGCAACGCACATCTCTGGATGTAGGTCCATTTTCCAGTTGGGGCCCTC
CATCTGCTAGATCGCCCAACCCTCGTCTTTCCCCTGGACAGAAGCCATACCCTACGGTTCCATCAACTGGGGTGCTACCGGCGCCACCCATTCGTCCCCAAATGGCACCA
CCAAATGGCATCCTGCCCTTAATTGTCCCTCCTGTAGCACCACCTATGCCTTTCCCTCCTGTGTCAATCCCGACCGGTCCATCCACATGGCCCACTGCACATCCAAGGCA
TCCTCCACCTCGTCTCCCTATCCCAGGCACTGGAGTATTCCTTCCCCCTCCAGGTTCCTCCAGTGCTCCAGCTCCATCTCCTCAACAGCAGTTGCCAAACTCCACAGTTG
AGACGGGTTCCCTTTCAGAAAAGGAGAATGGTTCGACAAAATCTGATCACAATTCAGGGGCTTCTCCAGGAGAAAAACCGGAACCAAAGCCTCATAGACAAGAATGCAAT
GGAAGTATGGATGGAAGTGGGAGTGATAAGGTGGCAGAGGAAGAGCAGCAGCAGCAGGAGGAGGAGCAAAGTGAGAATCTGCAGGCCCAAAATGCAGGAGGTGGAGCAGT
TTAG
mRNA sequenceShow/hide mRNA sequence
CGCAGCAGTGAAGAGAGAGAGAGAGAGAAGCCACCATTTTAAAGTCTTAAGAACCCATGTACATTCGTGTTCATTAAAACCATCCCCAGTTTTCACTTTCCAAACCCTAG
ATCTTATATATCTAAATTCCCCTCAATTCCCTCAAATCCACCCCTTTTTCTTTTGTTTCTTCTACGTCACTGAATTTCACTGTGCGTTATTCGCAAGCACAGTCTAAAAA
TTGCTTTTTCTCTAACCCCATTTTGCAGATCTGATTCAATTGGGGTCCATCGCCGCCTTAATTCTTCTGTAGCTTGTTGTTTCAGTTGTCAGAATTTGGCCTCTTATTTT
TGAGGTTCAGATTTGAGGATATACTACTGTATATGCGGGTTTAGTGTTTTTTTCTGGTTAGAAGATTCTCATGGCAATGCCATCGGGAAATGTGGGTGTACCGGATAAAG
TTTCGTTTCAGAGTGGTGGTGGTGGAGTTGCGGTGAGTGGTGGCGGTGGCGAGATCCATCAGCACCACCCCCGCCCCTGGTTTCCAGATGAGCGTGATGGGTTTATCTCA
TGGTTGCGAGGTGAATTTGCTGCCTCAAATGCTATGATTGATGCCCTTTGCCATCATTTGCGTGCTGTGGGAGAGCCTGGGGAGTATGACGTGGTTATTGGATGTATACA
GCAACGGCGGTGTAATTGGACGCCGGTGCTTCATATGCAGCAGTATTTTTCAGTGGCAGAAGTGATGTATGCCCTTCAGCAGGTCACCTCAAGGAGGCAGCAGCGGTATA
TGGATCCTGTGAAAGTGGGGCCGAAGTTGTATAGGAGACCTGGGCCAGGGTTTAAGCAGCAGCAAGGCCATCGGGTTGAAGCCACAGTCAAGGAAGATATACTCACTTGT
GCAGAGTCATGTAATGGTGGGAATTCTTCAAGTTTTGTAGGCTCTAGGAAGGTGGAGCAAGTAAGTAATACGTGTGATGAAAGTAAGGCATTGGGGGAGGATGAAAAATT
GAACGATAAAGATTCAGGGTCAGCTGAGGACAATAAAGATACTCATGGGAAGGACCAAAGTAATAGCAAACCAAAGTGTGCAGAAAATTTAGAAGACAGTGCAAGTAATA
AAGAATCTCAAGTTGAACCTACTGATGATGGATGTTCTTCAAGTCATAGAGATAAGGAGTTGCAGTCTGTTCAAAGCCAGAATGGAAAGCAGTATGCTGCCACAACCCCG
AGAACCTTTGTTGCCAATGAGATGTTTGATGGAAAGATGGTTAATGTGATGGATGGATTGAAATTATATGAACAATTATTCGATGATGGCGAGGTTTCAAAGCTTCTTTC
ACTGGTGAATGATTTGAGGGCTTCCGGAAAGAGAGGGCAATTTCAAGGTCAGACGTATGTGGTCTCAAAAAGACCCATGAAGGGACATGGGAGAGAAATGATCCAACTAG
GCTTTCCCATTGCAGATGCTCCTCATGATGATGACAATTCTTCAGGGCTCTCTAAAGTGAACTTTACAGATAGAAGAATAGAATCCATCCCCTCACTGCTTCAAGATCTC
ATTGATCGATTGGTTGGGGAGCAAGTGATGACAGTGAAACCAGATTCCTGCATCATTGATTTTTATAATGAGGGTGATCATTCTCAGCCTCATGTCTGGCCACCATGGTT
TGGGAGGCCCGTTGGTGTCCTCTTTTTGACTGAATGTGAAATGACCTTTGGTAGAGTGATTGGTACAGACCATTCTGGCAACTATAAAGGGGCTATGAAGTTGTCTCTCA
CACCCGGAACCCTTCTTGTGGTGCAAGGAAAATCTGCAGATTTTGCTAAGCATGCAATTCCCGCTATCCGCAAGCAACGTATTCTTGTTACGTTGACCAAATCACAACCG
AAAAGAGCTGCACCAGCTGATGGGCAACGCACATCTCTGGATGTAGGTCCATTTTCCAGTTGGGGCCCTCCATCTGCTAGATCGCCCAACCCTCGTCTTTCCCCTGGACA
GAAGCCATACCCTACGGTTCCATCAACTGGGGTGCTACCGGCGCCACCCATTCGTCCCCAAATGGCACCACCAAATGGCATCCTGCCCTTAATTGTCCCTCCTGTAGCAC
CACCTATGCCTTTCCCTCCTGTGTCAATCCCGACCGGTCCATCCACATGGCCCACTGCACATCCAAGGCATCCTCCACCTCGTCTCCCTATCCCAGGCACTGGAGTATTC
CTTCCCCCTCCAGGTTCCTCCAGTGCTCCAGCTCCATCTCCTCAACAGCAGTTGCCAAACTCCACAGTTGAGACGGGTTCCCTTTCAGAAAAGGAGAATGGTTCGACAAA
ATCTGATCACAATTCAGGGGCTTCTCCAGGAGAAAAACCGGAACCAAAGCCTCATAGACAAGAATGCAATGGAAGTATGGATGGAAGTGGGAGTGATAAGGTGGCAGAGG
AAGAGCAGCAGCAGCAGGAGGAGGAGCAAAGTGAGAATCTGCAGGCCCAAAATGCAGGAGGTGGAGCAGTTTAGAGAGAAGAAGATACATTATTGAAGAGAAGAGAGAGA
GAGAAAGGAAACCAGGTAGGCTGCAGAGTTGAATGAGTTACAAGCAAAATGTAGAGAGCGGCAACATTCAAGACTGATGATTTTCATACACTAAACACTACTACTACCAA
GGAGAACAAGGGGAGTTCTCTTTCAAAATCCTTTAGTTCATTCCTTTTTTGTTGTCCTGAAAAATTATTGGTTAGATGGGAAAACTCATTCCTAATGAAACTTTGGATCT
TACTTATCATCTTTTTTAAAAAGAGATTTTTGAGAAGTCAGAACCAGTTGGGAATTTTCTTTTGGGTTTAAACAGTTGAAAAAAGAGAGAGGA
Protein sequenceShow/hide protein sequence
MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMY
ALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDILTCAESCNGGNSSSFVGSRKVEQVSNTCDESKALGEDEKLNDKDSGSAEDNKDTHGKDQSNSK
PKCAENLEDSASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVANEMFDGKMVNVMDGLKLYEQLFDDGEVSKLLSLVNDLRASGKRGQFQGQTYVVSK
RPMKGHGREMIQLGFPIADAPHDDDNSSGLSKVNFTDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLFLTECEMTFGRVIGTD
HSGNYKGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILVTLTKSQPKRAAPADGQRTSLDVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAP
PNGILPLIVPPVAPPMPFPPVSIPTGPSTWPTAHPRHPPPRLPIPGTGVFLPPPGSSSAPAPSPQQQLPNSTVETGSLSEKENGSTKSDHNSGASPGEKPEPKPHRQECN
GSMDGSGSDKVAEEEQQQQEEEQSENLQAQNAGGGAV