; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G011750 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G011750
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionHydroxyproline-rich glycoprotein family protein, putative isoform 2
Genome locationchr04:17229035..17246383
RNA-Seq ExpressionLsi04G011750
SyntenyLsi04G011750
Gene Ontology termsGO:0006402 - mRNA catabolic process (biological process)
GO:0070988 - demethylation (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0032451 - demethylase activity (molecular function)
InterPro domainsIPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
IPR044842 - RNA demethylase ALKBH9B/ALKBH10B-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142291.1 RNA demethylase ALKBH10B isoform X2 [Cucumis sativus]0.0e+0089.96Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ
        MAMPSGNVGVPDKVSFQS GGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA+IDALCHHLRAVGEPGEYD+VIGCIQQRRCNWTPVLHMQ
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ

Query:  QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLNDK
        QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHR EATVKE+ ITCAESCNGG       SRKVEQVSNTCDESKASGEDEKL++K
Subjt:  QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLNDK

Query:  DSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDA
        DSGSA DNKDTHGKDQSN K K AE  E+NA NK+SQVEP DDGCSSSHRDKELQSVQSQNGKQ AATTPRTFVA+EMFDGKMVNVMDGLKLFEEL DDA
Subjt:  DSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDA

Query:  EVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYNE
        EVSKL SLVNDLRASGKRGQ QGQTYVVSKRPMKGHGREMIQLGFPIADAPH+DDNS GLSKDRRIE IPSLLQDLIDRLVG+QVMTVKPDSCIIDFYNE
Subjt:  EVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYNE

Query:  GDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSLN
        GDHSQPHVWP WFGRPVGVLLLTECE+TFGRVIGTDHSGNYRGAMKLSLTPG LLVVQGKSADFAKHA+PAIRKQRIL+TLTKSQPKRAAPADGQRTSLN
Subjt:  GDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSLN

Query:  VGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVFLPPP
        VG FS WGPPSARSPNPRLSPGQKPYPTVPSTGVLP PPIRPQMAPPNGIPPLIVPPVA PMPF PVPIPTGPSAWPTAH RHPPPRLPVPGTGVFLPPP
Subjt:  VGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVFLPPP

Query:  GSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNA
        GSSSAP PSPQQQLP + +ETGSLSEKENG  KSDH+SG  PGEKP+AK QRQECNGS++GSG++KV EEEQQQQ+E+ ++  AQNA
Subjt:  GSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNA

XP_011654491.1 RNA demethylase ALKBH10B isoform X1 [Cucumis sativus]0.0e+0089.44Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ
        MAMPSGNVGVPDKVSFQS GGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA+IDALCHHLRAVGEPGEYD+VIGCIQQRRCNWTPVLHMQ
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ

Query:  QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLNDK
        QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHR EATVKE+ ITCAESCNGG       SRKVEQVSNTCDESKASGEDEKL++K
Subjt:  QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLNDK

Query:  DSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDA
        DSGSA DNKDTHGKDQSN K K AE  E+NA NK+SQVEP DDGCSSSHRDKELQSVQSQNGKQ AATTPRTFVA+EMFDGKMVNVMDGLKLFEEL DDA
Subjt:  DSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDA

Query:  EVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSK----DRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIID
        EVSKL SLVNDLRASGKRGQ QGQTYVVSKRPMKGHGREMIQLGFPIADAPH+DDNS GLSK    DRRIE IPSLLQDLIDRLVG+QVMTVKPDSCIID
Subjt:  EVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSK----DRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIID

Query:  FYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQR
        FYNEGDHSQPHVWP WFGRPVGVLLLTECE+TFGRVIGTDHSGNYRGAMKLSLTPG LLVVQGKSADFAKHA+PAIRKQRIL+TLTKSQPKRAAPADGQR
Subjt:  FYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQR

Query:  TSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVF
        TSLNVG FS WGPPSARSPNPRLSPGQKPYPTVPSTGVLP PPIRPQMAPPNGIPPLIVPPVA PMPF PVPIPTGPSAWPTAH RHPPPRLPVPGTGVF
Subjt:  TSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVF

Query:  LPPPGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNA
        LPPPGSSSAP PSPQQQLP + +ETGSLSEKENG  KSDH+SG  PGEKP+AK QRQECNGS++GSG++KV EEEQQQQ+E+ ++  AQNA
Subjt:  LPPPGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNA

XP_016901639.1 PREDICTED: uncharacterized protein LOC103495063 isoform X2 [Cucumis melo]0.0e+0089.53Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLND
        QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHR EATVKE+ ITCAESCNGG       SRKVEQVSNTCDESKASGEDEKL++
Subjt:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLND

Query:  KDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDD
        KDSGSAEDNKDTHGKDQSNSK K AE  E+NA NK+SQVEP DDGCSSSHRDKELQSVQSQNGKQ AATTPRTFVANEMFDGKMVNVMDGLKLFEEL DD
Subjt:  KDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDD

Query:  AEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYN
        AEVSKL SLVNDLRASGKRGQ QGQTYVVSKRP KGHGREMIQLGFPIADAP++DDNS  LSKDRRIE IPSLLQDLIDRLVG+QVMTVKPDSCIIDFYN
Subjt:  AEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYN

Query:  EGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSL
        EGDHSQPHVWP WFGRPVGVLLLTECE+TFGRVIGTDHSGNYRGA+KLSLTPG LLVVQGKSADFAKHAIPAIRKQRIL+TLTKSQPKRA+PADGQR+SL
Subjt:  EGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSL

Query:  NVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVFLPP
        NVG FS WGPPSARSPNPRLSPGQKPY  VPSTGVLP PPIRPQMAPPNGIPPLIVP VAPPMPF PVPIPTGPS WPTAH RHPPPRLPVPGTGVFLPP
Subjt:  NVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVFLPP

Query:  PGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNA
        PGSSSAP+PSPQQQLPN+ +E GSLSEKENG  KSDHNSG  PGEKPEAK QRQECNG+++GSG++KV EEEQQQQ+E+ ++  AQNA
Subjt:  PGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNA

XP_038892010.1 RNA demethylase ALKBH10B isoform X1 [Benincasa hispida]0.0e+0092.57Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ
        MAMPSGNVGVPDKVS+QSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA+IDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ

Query:  QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLNDK
        QYFSVAEVMYALQQVTSRRQQRYMDPVKVG KLYRRPGPGFKQQQGHRVEATVKE+I TCAESCNG       G RKVEQVSNTCDESKASGED KLNDK
Subjt:  QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLNDK

Query:  DSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDA
        DS SAEDNKDTHGKDQSNSKPK AE  E+NASNKESQVEP DDGCSSSHRDKELQSVQSQNGKQ AA TPRTFVANEMFDGKMVNVMDGLKLFEEL DDA
Subjt:  DSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDA

Query:  EVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSK----DRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIID
        EVSKL SLVNDLRASGKRGQ QGQTYVV KRPMKGHGREMIQLGFPIADAPHDDDNSSGLSK    DRRIESIPSLLQDLIDRLVGEQVMTVKPDSCI+D
Subjt:  EVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSK----DRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIID

Query:  FYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQR
        FYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRIL+TLTKSQPKRA PADGQR
Subjt:  FYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQR

Query:  TSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVF
        TSLN+G FSSWGPPS RSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVF
Subjt:  TSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVF

Query:  LPPPGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAK-PQRQECNGSMNGSGSNKVTEEE---QQQQEEQSENLQAQNAGGGAV
        LPPPGSSSAPAPSPQQ  PN+ VETGSLSEKENGS KSDHNSG  PGEKPEAK PQRQECNGSM+GSGS+KV EEE   QQQQEEQ+EN QAQNAGGGAV
Subjt:  LPPPGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAK-PQRQECNGSMNGSGSNKVTEEE---QQQQEEQSENLQAQNAGGGAV

XP_038892011.1 RNA demethylase ALKBH10B isoform X2 [Benincasa hispida]0.0e+0093.1Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ
        MAMPSGNVGVPDKVS+QSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNA+IDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ

Query:  QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLNDK
        QYFSVAEVMYALQQVTSRRQQRYMDPVKVG KLYRRPGPGFKQQQGHRVEATVKE+I TCAESCNG       G RKVEQVSNTCDESKASGED KLNDK
Subjt:  QYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLNDK

Query:  DSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDA
        DS SAEDNKDTHGKDQSNSKPK AE  E+NASNKESQVEP DDGCSSSHRDKELQSVQSQNGKQ AA TPRTFVANEMFDGKMVNVMDGLKLFEEL DDA
Subjt:  DSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDA

Query:  EVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYNE
        EVSKL SLVNDLRASGKRGQ QGQTYVV KRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCI+DFYNE
Subjt:  EVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYNE

Query:  GDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSLN
        GDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRIL+TLTKSQPKRA PADGQRTSLN
Subjt:  GDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSLN

Query:  VGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVFLPPP
        +G FSSWGPPS RSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVFLPPP
Subjt:  VGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVFLPPP

Query:  GSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAK-PQRQECNGSMNGSGSNKVTEEE---QQQQEEQSENLQAQNAGGGAV
        GSSSAPAPSPQQ  PN+ VETGSLSEKENGS KSDHNSG  PGEKPEAK PQRQECNGSM+GSGS+KV EEE   QQQQEEQ+EN QAQNAGGGAV
Subjt:  GSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAK-PQRQECNGSMNGSGSNKVTEEE---QQQQEEQSENLQAQNAGGGAV

TrEMBL top hitse value%identityAlignment
A0A1S3C013 uncharacterized protein LOC103495063 isoform X10.0e+0089.02Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLND
        QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHR EATVKE+ ITCAESCNGG       SRKVEQVSNTCDESKASGEDEKL++
Subjt:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLND

Query:  KDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDD
        KDSGSAEDNKDTHGKDQSNSK K AE  E+NA NK+SQVEP DDGCSSSHRDKELQSVQSQNGKQ AATTPRTFVANEMFDGKMVNVMDGLKLFEEL DD
Subjt:  KDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDD

Query:  AEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSK----DRRIESIPSLLQDLIDRLVGEQVMTVKPDSCII
        AEVSKL SLVNDLRASGKRGQ QGQTYVVSKRP KGHGREMIQLGFPIADAP++DDNS  LSK    DRRIE IPSLLQDLIDRLVG+QVMTVKPDSCII
Subjt:  AEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSK----DRRIESIPSLLQDLIDRLVGEQVMTVKPDSCII

Query:  DFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQ
        DFYNEGDHSQPHVWP WFGRPVGVLLLTECE+TFGRVIGTDHSGNYRGA+KLSLTPG LLVVQGKSADFAKHAIPAIRKQRIL+TLTKSQPKRA+PADGQ
Subjt:  DFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQ

Query:  RTSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGV
        R+SLNVG FS WGPPSARSPNPRLSPGQKPY  VPSTGVLP PPIRPQMAPPNGIPPLIVP VAPPMPF PVPIPTGPS WPTAH RHPPPRLPVPGTGV
Subjt:  RTSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGV

Query:  FLPPPGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNA
        FLPPPGSSSAP+PSPQQQLPN+ +E GSLSEKENG  KSDHNSG  PGEKPEAK QRQECNG+++GSG++KV EEEQQQQ+E+ ++  AQNA
Subjt:  FLPPPGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNA

A0A1S4E0A0 uncharacterized protein LOC103495063 isoform X20.0e+0089.53Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLND
        QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHR EATVKE+ ITCAESCNGG       SRKVEQVSNTCDESKASGEDEKL++
Subjt:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLND

Query:  KDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDD
        KDSGSAEDNKDTHGKDQSNSK K AE  E+NA NK+SQVEP DDGCSSSHRDKELQSVQSQNGKQ AATTPRTFVANEMFDGKMVNVMDGLKLFEEL DD
Subjt:  KDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDD

Query:  AEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYN
        AEVSKL SLVNDLRASGKRGQ QGQTYVVSKRP KGHGREMIQLGFPIADAP++DDNS  LSKDRRIE IPSLLQDLIDRLVG+QVMTVKPDSCIIDFYN
Subjt:  AEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYN

Query:  EGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSL
        EGDHSQPHVWP WFGRPVGVLLLTECE+TFGRVIGTDHSGNYRGA+KLSLTPG LLVVQGKSADFAKHAIPAIRKQRIL+TLTKSQPKRA+PADGQR+SL
Subjt:  EGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSL

Query:  NVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVFLPP
        NVG FS WGPPSARSPNPRLSPGQKPY  VPSTGVLP PPIRPQMAPPNGIPPLIVP VAPPMPF PVPIPTGPS WPTAH RHPPPRLPVPGTGVFLPP
Subjt:  NVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVFLPP

Query:  PGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNA
        PGSSSAP+PSPQQQLPN+ +E GSLSEKENG  KSDHNSG  PGEKPEAK QRQECNG+++GSG++KV EEEQQQQ+E+ ++  AQNA
Subjt:  PGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNA

A0A5A7UKE8 Hydroxyproline-rich glycoprotein family protein, putative isoform 20.0e+0088.62Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLND
        QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHR EATVKE+ ITCAESCNGG       SRKVEQVSNTCDESKASGEDEKL++
Subjt:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLND

Query:  KDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDD
        KDSGSAEDNKDTHGKDQSNSK K AE  E+NA NK+SQVEP DDGCSSSHRDKELQSVQSQNGKQ AATTPRTFVANEMFDGKMVNVMDGLKLFEEL DD
Subjt:  KDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDD

Query:  AEVSKLHSLVNDLRASGKRGQLQGQ--TYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSK----DRRIESIPSLLQDLIDRLVGEQVMTVKPDSC
        AEVSKL SLVNDLRASGKRGQ QG+  TYVVSKRP KGHGREMIQLGFPIADAP++DDNS  LSK    DRRIE IPSLLQDLIDRLVG+QVMTVKPDSC
Subjt:  AEVSKLHSLVNDLRASGKRGQLQGQ--TYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSK----DRRIESIPSLLQDLIDRLVGEQVMTVKPDSC

Query:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPAD
        IIDFYNEGDHSQPHVWP WFGRPVGVLLLTECE+TFGRVIGTDHSGNYRGA+KLSLTPG LLVVQGKSADFAKHAIPAIRKQRIL+TLTKSQPKRA+PAD
Subjt:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPAD

Query:  GQRTSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGT
        GQR+SLNVG FS WGPPSARSPNPRLSPGQKPY  VPSTGVLP PPIRPQMAPPNGIPPLIVP VAPPMPF PVPIPTGPS WPTAH RHPPPRLPVPGT
Subjt:  GQRTSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGT

Query:  GVFLPPPGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNA
        GVFLPPPGSSSAP+PSPQQQLPN+ +E GSLSEKENG  KSDHNSG  PGEKPEAK QRQECNG+++GSG++KV EEEQQQQ+E+ ++  AQNA
Subjt:  GVFLPPPGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNA

A0A5D3E038 Hydroxyproline-rich glycoprotein family protein, putative isoform 20.0e+0089.53Show/hide
Query:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQ-HHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLND
        QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHR EATVKE+ ITCAESCNGG       SRKVEQVSNTCDESKASGEDEKL++
Subjt:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDEKLND

Query:  KDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDD
        KDSGSAEDNKDTHGKDQSNSK K AE  E+NA NK+SQVEP DDGCSSSHRDKELQSVQSQNGKQ AATTPRTFVANEMFDGKMVNVMDGLKLFEEL DD
Subjt:  KDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDD

Query:  AEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYN
        AEVSKL SLVNDLRASGKRGQ QGQTYVVSKRP KGHGREMIQLGFPIADAP++DDNS  LSKDRRIE IPSLLQDLIDRLVG+QVMTVKPDSCIIDFYN
Subjt:  AEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYN

Query:  EGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSL
        EGDHSQPHVWP WFGRPVGVLLLTECE+TFGRVIGTDHSGNYRGA+KLSLTPG LLVVQGKSADFAKHAIPAIRKQRIL+TLTKSQPKRA+PADGQR+SL
Subjt:  EGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSL

Query:  NVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVFLPP
        NVG FS WGPPSARSPNPRLSPGQKPY  VPSTGVLP PPIRPQMAPPNGIPPLIVP VAPPMPF PVPIPTGPS WPTAH RHPPPRLPVPGTGVFLPP
Subjt:  NVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVFLPP

Query:  PGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNA
        PGSSSAP+PSPQQQLPN+ +E GSLSEKENG  KSDHNSG  PGEKPEAK QRQECNG+++GSG++KV EEEQQQQ+E+ ++  AQNA
Subjt:  PGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNA

A0A6J1J9C0 uncharacterized protein LOC1114846090.0e+0087.97Show/hide
Query:  MAMPSGNVGVPDKVSFQS-GGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MAMPSGNVGVPDKVSFQS GGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLR EFAA+NAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPV+HM
Subjt:  MAMPSGNVGVPDKVSFQS-GGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFK----QQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDE
        QQYFSVA+V Y+LQQV SRRQQRY+DPVKVGPK YRRPGPGFK    QQQGHR+EATVKE+++TCAESCNGG      GSRKVEQVSNTC+ESKASGEDE
Subjt:  QQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFK----QQQGHRVEATVKEDIITCAESCNGG------GSRKVEQVSNTCDESKASGEDE

Query:  KLNDKDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEE
         LNDKDSGSAED KDTHGKDQ NSKPK AE  E+NASNKES VEP DDGCSSS+R+KELQSVQSQNGKQ AATTPRTFVANEM DGKMVNVMDGLKLFE+
Subjt:  KLNDKDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEE

Query:  LFDDAEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCII
          DDAEVSKL SLVNDLRASGKRGQ QG TYVVSKRPMKGHGREMIQLGFPIAD PHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQ+M+VKPDSCII
Subjt:  LFDDAEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCII

Query:  DFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQ
        DFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNY+GAMKLSL PGTLLVV+GKSADFAKHAIPAIRKQRIL+TLTKSQPKRA P+DGQ
Subjt:  DFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQ

Query:  RTSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPF-PPVPIPTGPSAWPTAHPRHPPPRLPVPGTG
        RTSLNVGPF+SWGPPS RSPNPRLSPGQK Y +VPSTGVLPAPPIRPQM PPNGIPPLIV PVA PMPF PPVPIPTGP  WPTAHPRHPPPRLPVPGTG
Subjt:  RTSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVPPVAPPMPF-PPVPIPTGPSAWPTAHPRHPPPRLPVPGTG

Query:  VFLPPPGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNAGGGAV
        VFLPPPGSSSAP+P+P QQLPN+TVETGSLSEKENGS KSDHN+GA  GEK EAKPQRQEC    NGS S KV EEEQQ+Q+EQSENLQAQ+AG GAV
Subjt:  VFLPPPGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNKVTEEEQQQQEEQSENLQAQNAGGGAV

SwissProt top hitse value%identityAlignment
Q9SL49 RNA demethylase ALKBH9B1.0e-3231.7Show/hide
Query:  KYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQL
        ++ E+ EE    ++S  +  D     +    +L   Q +N +       + F+  E   GK+VNV+DGL+L   +F   E  ++   V  L+  G+RG+L
Subjt:  KYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQL

Query:  QGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVM--TVKPDSCIIDFYNEGDHSQPHVWPPWFGRP-VG
        + +T+    + M+G GRE IQ G     AP    N  G+ +   ++ +P L + +I +L+   V+  T  PDSCI++ Y+EGD   PH+    F RP   
Subjt:  QGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVM--TVKPDSCIIDFYNEGDHSQPHVWPPWFGRP-VG

Query:  VLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTK
        +  L+EC++ FG  +  +  G++ G+  + L  G++LV+ G  AD AKH +PA+  +RI IT  K
Subjt:  VLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTK

Q9ZT92 RNA demethylase ALKBH10B9.9e-6032.11Show/hide
Query:  RDGFISWLRGEFAASNAMIDALCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGF
        +D  ISW RGEFAA+NA+IDA+C HLR   E     EY+ V   I +RR NW PVL MQ+Y S+AEV   LQ+V +++ +                    
Subjt:  RDGFISWLRGEFAASNAMIDALCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGF

Query:  KQQQGHRVEATVKEDIITCAESCNGGGSRKVEQVSNTCDESKASGEDEKLNDKD-SGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCS
        +++     E  +KE + T           + E+V   C         EK+ + D +G  ED +D            + +  +   ++   Q+        
Subjt:  KQQQGHRVEATVKEDIITCAESCNGGGSRKVEQVSNTCDESKASGEDEKLNDKD-SGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCS

Query:  SSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFP
         SH D + +S        C     + F A E   G  VNV+ GLKL+EEL  + E+SKL   V +LR +G  G+L G+++++  + +KG+ RE+IQLG P
Subjt:  SSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFP

Query:  IADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGA
        I      D+NS+  +    IE IP LL+ +ID  V  +++    +P+ C+I+F+ EG++SQP + PP   +P+  L+L+E  M +GR++ +D+ GN+RG 
Subjt:  IADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGA

Query:  MKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQM
        + LSL  G+LLV++G SAD A+H +   + +R+ IT  + +P          +  N G  + W  P   +P P L+        +P  GVL  P +   M
Subjt:  MKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQM

Query:  APPNGIPPLIVPP--VAPPMPFPPVPIPTGPSAWPTAHPRHPPPR------LPVPGTGVFLPPPGSSSAP
        APP  + P+I+P   V        V +P         H +H PPR      LP+P      P  GS+S P
Subjt:  APPNGIPPLIVPP--VAPPMPFPPVPIPTGPSAWPTAHPRHPPPR------LPVPGTGVFLPPPGSSSAP

Arabidopsis top hitse value%identityAlignment
AT1G14710.1 hydroxyproline-rich glycoprotein family protein2.6e-13247.12Show/hide
Query:  PRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQ-----QRYMDPVKVGP
        P  W PDERDGFISWLR EFAA+NA+ID+LC HL+AVG+  EY+ VIG I  RR  W+ VL MQQ+F VA+V Y LQQ+  +RQ     QR+ +  +VG 
Subjt:  PRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQ-----QRYMDPVKVGP

Query:  KLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGGGSRKVEQVSNTCDESKASGEDEKLNDKDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQ
           RR GPGF +  G        + +     + NG  S +VE      +E+K + + + L+      AE+ +D   K +S+SK    EK  E +  +E  
Subjt:  KLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGGGSRKVEQVSNTCDESKASGEDEKLNDKDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQ

Query:  VEPADDGCSSSHRDKELQSVQSQ--NGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKG
        V+  +  C+S  +D  L S Q Q  N K+C A+  +TFV  EM+D KMVNV++GLKL++++ D  EVS+L SLV +LR +G+RGQLQ + YV  KRP +G
Subjt:  VEPADDGCSSSHRDKELQSVQSQ--NGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKG

Query:  HGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGT
        HGREMIQLG PIAD P DDD+     KDRRIE IPS L D+I+RLV +Q++ VKPD+CIIDF++EGDHSQPH++ PWFGRP+ VL L+EC+ TFGRVI +
Subjt:  HGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGT

Query:  DHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSLNVGPFSSWGPPSARSPNPRL---SPGQKPYPTV-PS
        ++ G+Y+G++KLSLTPG++L+V+GKSA+ AK+AI A RKQRILI+  KS+P+                 S+WGPP +RSPN  +   +   K YP V PS
Subjt:  DHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSLNVGPFSSWGPPSARSPNPRL---SPGQKPYPTV-PS

Query:  TGVLPAPPIRPQMAPPNG-IPPLIV---PPVAPPMPFPPVPIPTGPSAWP--TAHPRH---PPPRLPVPGTGVFLPPPGSSSAPAPSPQQQLPNTTVETG
        TGVLP P  R    PPNG + P+ +   PP+A PMPFP   +PTGP  WP    HPRH   P PR+P+PGTGVFLPP         S Q+   N+    G
Subjt:  TGVLPAPPIRPQMAPPNG-IPPLIV---PPVAPPMPFPPVPIPTGPSAWP--TAHPRH---PPPRLPVPGTGVFLPPPGSSSAPAPSPQQQLPNTTVETG

Query:  SLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSN
         L  K     ++    G              EC+GS NG  SN
Subjt:  SLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSN

AT1G14710.2 hydroxyproline-rich glycoprotein family protein2.6e-13247.12Show/hide
Query:  PRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQ-----QRYMDPVKVGP
        P  W PDERDGFISWLR EFAA+NA+ID+LC HL+AVG+  EY+ VIG I  RR  W+ VL MQQ+F VA+V Y LQQ+  +RQ     QR+ +  +VG 
Subjt:  PRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQ-----QRYMDPVKVGP

Query:  KLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGGGSRKVEQVSNTCDESKASGEDEKLNDKDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQ
           RR GPGF +  G        + +     + NG  S +VE      +E+K + + + L+      AE+ +D   K +S+SK    EK  E +  +E  
Subjt:  KLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGGGSRKVEQVSNTCDESKASGEDEKLNDKDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQ

Query:  VEPADDGCSSSHRDKELQSVQSQ--NGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKG
        V+  +  C+S  +D  L S Q Q  N K+C A+  +TFV  EM+D KMVNV++GLKL++++ D  EVS+L SLV +LR +G+RGQLQ + YV  KRP +G
Subjt:  VEPADDGCSSSHRDKELQSVQSQ--NGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKG

Query:  HGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGT
        HGREMIQLG PIAD P DDD+     KDRRIE IPS L D+I+RLV +Q++ VKPD+CIIDF++EGDHSQPH++ PWFGRP+ VL L+EC+ TFGRVI +
Subjt:  HGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGT

Query:  DHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSLNVGPFSSWGPPSARSPNPRL---SPGQKPYPTV-PS
        ++ G+Y+G++KLSLTPG++L+V+GKSA+ AK+AI A RKQRILI+  KS+P+                 S+WGPP +RSPN  +   +   K YP V PS
Subjt:  DHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSLNVGPFSSWGPPSARSPNPRL---SPGQKPYPTV-PS

Query:  TGVLPAPPIRPQMAPPNG-IPPLIV---PPVAPPMPFPPVPIPTGPSAWP--TAHPRH---PPPRLPVPGTGVFLPPPGSSSAPAPSPQQQLPNTTVETG
        TGVLP P  R    PPNG + P+ +   PP+A PMPFP   +PTGP  WP    HPRH   P PR+P+PGTGVFLPP         S Q+   N+    G
Subjt:  TGVLPAPPIRPQMAPPNG-IPPLIV---PPVAPPMPFPPVPIPTGPSAWP--TAHPRH---PPPRLPVPGTGVFLPPPGSSSAPAPSPQQQLPNTTVETG

Query:  SLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSN
         L  K     ++    G              EC+GS NG  SN
Subjt:  SLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSN

AT2G17970.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein7.3e-3431.7Show/hide
Query:  KYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQL
        ++ E+ EE    ++S  +  D     +    +L   Q +N +       + F+  E   GK+VNV+DGL+L   +F   E  ++   V  L+  G+RG+L
Subjt:  KYAEKSEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQL

Query:  QGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVM--TVKPDSCIIDFYNEGDHSQPHVWPPWFGRP-VG
        + +T+    + M+G GRE IQ G     AP    N  G+ +   ++ +P L + +I +L+   V+  T  PDSCI++ Y+EGD   PH+    F RP   
Subjt:  QGQTYVVSKRPMKGHGREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVM--TVKPDSCIIDFYNEGDHSQPHVWPPWFGRP-VG

Query:  VLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTK
        +  L+EC++ FG  +  +  G++ G+  + L  G++LV+ G  AD AKH +PA+  +RI IT  K
Subjt:  VLLLTECEMTFGRVIGTDHSGNYRGAMKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTK

AT2G48080.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein1.4e-5329.35Show/hide
Query:  RDGFISWLRGEFAASNAMIDALCHHL-RAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQ
        +D  ++W RGEFAA+NA+IDALC HL +A G   +Y+ V+  + +RR NW PVL MQ+Y S+++V   LQQ  ++    ++D                  
Subjt:  RDGFISWLRGEFAASNAMIDALCHHL-RAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQ

Query:  QQGHRVEATVKEDIITCAESCNGGGSRKVEQVSNTCDESKASGEDEKLNDKDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSH
                    D  + +     GGSR+ E +S  C                                                        +D C S  
Subjt:  QQGHRVEATVKEDIITCAESCNGGGSRKVEQVSNTCDESKASGEDEKLNDKDSGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCSSSH

Query:  RDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIAD
             QS              + F A E   G   NV+ GLKL++++F   ++SKL   +N LR +G+  QL G+T+V+  +  KG  RE++QLG PI  
Subjt:  RDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFPIAD

Query:  APHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKL
           D+ +         +E IP+L+Q +ID L+  +++    +P+ C+I+F++E +HSQP   PP   +P+  L+L+E  M FG  +G D+ GN+RG++ L
Subjt:  APHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKL

Query:  SLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRA--------------------APADGQRTSLNVGPFSSWGPPSARSPNPRLSP
         L  G+LLV++G SAD A+H +     +R+ IT  K +P                       APA  +R     G F  W PP +R P   L P
Subjt:  SLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRA--------------------APADGQRTSLNVGPFSSWGPPSARSPNPRLSP

AT4G02940.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein7.0e-6132.11Show/hide
Query:  RDGFISWLRGEFAASNAMIDALCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGF
        +D  ISW RGEFAA+NA+IDA+C HLR   E     EY+ V   I +RR NW PVL MQ+Y S+AEV   LQ+V +++ +                    
Subjt:  RDGFISWLRGEFAASNAMIDALCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGF

Query:  KQQQGHRVEATVKEDIITCAESCNGGGSRKVEQVSNTCDESKASGEDEKLNDKD-SGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCS
        +++     E  +KE + T           + E+V   C         EK+ + D +G  ED +D            + +  +   ++   Q+        
Subjt:  KQQQGHRVEATVKEDIITCAESCNGGGSRKVEQVSNTCDESKASGEDEKLNDKD-SGSAEDNKDTHGKDQSNSKPKYAEKSEENASNKESQVEPADDGCS

Query:  SSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFP
         SH D + +S        C     + F A E   G  VNV+ GLKL+EEL  + E+SKL   V +LR +G  G+L G+++++  + +KG+ RE+IQLG P
Subjt:  SSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGHGREMIQLGFP

Query:  IADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGA
        I      D+NS+  +    IE IP LL+ +ID  V  +++    +P+ C+I+F+ EG++SQP + PP   +P+  L+L+E  M +GR++ +D+ GN+RG 
Subjt:  IADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGA

Query:  MKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQM
        + LSL  G+LLV++G SAD A+H +   + +R+ IT  + +P          +  N G  + W  P   +P P L+        +P  GVL  P +   M
Subjt:  MKLSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQM

Query:  APPNGIPPLIVPP--VAPPMPFPPVPIPTGPSAWPTAHPRHPPPR------LPVPGTGVFLPPPGSSSAP
        APP  + P+I+P   V        V +P         H +H PPR      LP+P      P  GS+S P
Subjt:  APPNGIPPLIVPP--VAPPMPFPPVPIPTGPSAWPTAHPRHPPPR------LPVPGTGVFLPPPGSSSAP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATGCCATCGGGAAATGTGGGTGTACCGGATAAAGTTTCATTTCAGAGTGGTGGTGGTGGAGTTGCAGTGAGTGGTGGTGGTGGCGAGATCCATCAGCACCACCC
CCGTCCCTGGTTTCCTGATGAGCGCGATGGTTTTATCTCATGGTTGCGAGGTGAATTTGCTGCCTCAAATGCTATGATTGATGCCCTTTGCCATCATTTGCGTGCTGTGG
GGGAGCCTGGGGAGTATGACGTGGTTATTGGATGTATACAGCAACGGCGGTGTAATTGGACTCCGGTGCTTCATATGCAGCAGTATTTTTCAGTGGCAGAAGTGATGTAT
GCCCTTCAGCAGGTCACCTCGAGGAGGCAGCAGAGGTATATGGATCCTGTGAAAGTGGGGCCGAAGTTGTATAGGAGACCTGGGCCAGGATTTAAGCAGCAGCAGGGCCA
TCGGGTTGAAGCCACAGTCAAGGAAGATATAATCACTTGTGCAGAGTCATGTAATGGGGGAGGCTCTAGGAAGGTGGAGCAAGTAAGTAATACGTGTGATGAAAGTAAGG
CATCGGGGGAGGATGAAAAACTGAACGATAAAGATTCAGGGTCAGCTGAGGACAATAAAGATACTCATGGGAAGGACCAAAGTAATAGCAAACCAAAGTATGCAGAAAAA
TCAGAAGAGAATGCAAGTAATAAAGAATCTCAAGTTGAACCTGCTGATGATGGATGTTCTTCAAGCCATAGAGATAAGGAGTTGCAGTCTGTTCAAAGCCAGAATGGAAA
GCAGTGTGCTGCCACAACCCCGAGAACCTTTGTAGCCAATGAGATGTTTGATGGAAAGATGGTTAATGTGATGGATGGATTGAAATTATTTGAAGAATTATTTGATGATG
CTGAGGTTTCAAAGCTTCACTCACTGGTGAATGATTTGAGGGCTTCCGGAAAGAGAGGGCAACTTCAAGGTCAGACGTATGTGGTCTCAAAAAGACCCATGAAGGGACAT
GGGAGAGAGATGATCCAACTAGGCTTTCCCATTGCAGATGCTCCTCATGATGATGACAATTCTTCAGGGCTCTCTAAAGATAGAAGAATAGAATCCATCCCCTCACTGCT
TCAAGATCTCATTGATCGATTGGTTGGAGAGCAAGTGATGACAGTGAAACCAGATTCCTGCATCATTGATTTTTATAATGAGGGTGATCATTCTCAGCCTCATGTCTGGC
CACCATGGTTTGGTAGGCCCGTTGGTGTCCTCCTTTTGACTGAATGTGAAATGACCTTTGGTAGAGTGATTGGGACAGACCACTCTGGCAACTATAGAGGGGCTATGAAG
CTGTCTCTCACACCCGGAACCCTTCTTGTGGTGCAAGGAAAATCTGCAGATTTTGCTAAGCATGCAATTCCCGCTATCCGCAAGCAACGTATACTTATTACGTTGACCAA
ATCACAACCAAAAAGAGCTGCACCAGCTGATGGACAACGCACATCTCTGAATGTAGGTCCATTTTCCAGTTGGGGCCCTCCATCTGCTAGATCGCCCAACCCTCGTCTTT
CCCCTGGACAGAAGCCTTACCCTACTGTTCCTTCGACTGGGGTGCTACCGGCACCACCCATTCGTCCCCAAATGGCACCACCAAATGGCATCCCTCCCTTAATTGTCCCT
CCTGTAGCACCACCTATGCCTTTCCCTCCTGTGCCAATCCCAACTGGTCCATCTGCATGGCCCACTGCACATCCAAGGCATCCTCCACCTCGTCTCCCTGTTCCAGGCAC
TGGAGTATTCCTTCCCCCTCCAGGTTCTTCCAGTGCTCCAGCTCCATCTCCTCAACAACAATTGCCAAACACCACAGTTGAGACGGGTTCCCTTTCAGAAAAGGAGAATG
GTTCGATGAAATCTGATCACAATTCAGGTGCTCCTCCAGGAGAAAAACCAGAAGCAAAGCCTCAAAGACAAGAATGCAATGGAAGTATGAATGGAAGTGGGAGTAATAAA
GTGACAGAGGAAGAACAGCAGCAGCAGGAGGAGCAAAGTGAGAATCTGCAGGCCCAAAATGCAGGAGGTGGAGCAGTTTAG
mRNA sequenceShow/hide mRNA sequence
CAGAACCTTAAATTGAAGTTACCAAAATACCCTTTTGAGGCTGAAGAAGAATTTGAGAGTGTATCCTACGTTGGTCCCTCTTCGATTCCCAGTTTTTATTTTTCTCAGAG
TGAAGAGAGAGAGAGAGAGAGAAGCCACCATTTTAAAGTCTTAAGAAACCCATGTACATTCGTGTTCATTAAAACCATCCTCAGTTTTCAACTTTCCAAACCCTAGATCT
TATTTATCTAAATTTCCCCCAATTCCCTCAAATCCCACCTTTTTTTTTTGGTTTCTTCTACGTCACTGAATTTCACTGTGCGTTATTCACAAGCACAGTCTAAAAAGAGC
TTTTTTTCTCTAACCCCATTTTGCAGATCTGATACAATTGGGGTCCATCGCCGCCTAAATTCTTCTGTAGCTTGTTATTCAGTTGTCAGAATTTGGCTTTTTATTTTTTC
GGTTTAGATTTGAGGATATACTACTGTATATGCGGGTTTAGTGTATTTTTCTGGTTAGAAGATTCTCATGGCAATGCCATCGGGAAATGTGGGTGTACCGGATAAAGTTT
CATTTCAGAGTGGTGGTGGTGGAGTTGCAGTGAGTGGTGGTGGTGGCGAGATCCATCAGCACCACCCCCGTCCCTGGTTTCCTGATGAGCGCGATGGTTTTATCTCATGG
TTGCGAGGTGAATTTGCTGCCTCAAATGCTATGATTGATGCCCTTTGCCATCATTTGCGTGCTGTGGGGGAGCCTGGGGAGTATGACGTGGTTATTGGATGTATACAGCA
ACGGCGGTGTAATTGGACTCCGGTGCTTCATATGCAGCAGTATTTTTCAGTGGCAGAAGTGATGTATGCCCTTCAGCAGGTCACCTCGAGGAGGCAGCAGAGGTATATGG
ATCCTGTGAAAGTGGGGCCGAAGTTGTATAGGAGACCTGGGCCAGGATTTAAGCAGCAGCAGGGCCATCGGGTTGAAGCCACAGTCAAGGAAGATATAATCACTTGTGCA
GAGTCATGTAATGGGGGAGGCTCTAGGAAGGTGGAGCAAGTAAGTAATACGTGTGATGAAAGTAAGGCATCGGGGGAGGATGAAAAACTGAACGATAAAGATTCAGGGTC
AGCTGAGGACAATAAAGATACTCATGGGAAGGACCAAAGTAATAGCAAACCAAAGTATGCAGAAAAATCAGAAGAGAATGCAAGTAATAAAGAATCTCAAGTTGAACCTG
CTGATGATGGATGTTCTTCAAGCCATAGAGATAAGGAGTTGCAGTCTGTTCAAAGCCAGAATGGAAAGCAGTGTGCTGCCACAACCCCGAGAACCTTTGTAGCCAATGAG
ATGTTTGATGGAAAGATGGTTAATGTGATGGATGGATTGAAATTATTTGAAGAATTATTTGATGATGCTGAGGTTTCAAAGCTTCACTCACTGGTGAATGATTTGAGGGC
TTCCGGAAAGAGAGGGCAACTTCAAGGTCAGACGTATGTGGTCTCAAAAAGACCCATGAAGGGACATGGGAGAGAGATGATCCAACTAGGCTTTCCCATTGCAGATGCTC
CTCATGATGATGACAATTCTTCAGGGCTCTCTAAAGATAGAAGAATAGAATCCATCCCCTCACTGCTTCAAGATCTCATTGATCGATTGGTTGGAGAGCAAGTGATGACA
GTGAAACCAGATTCCTGCATCATTGATTTTTATAATGAGGGTGATCATTCTCAGCCTCATGTCTGGCCACCATGGTTTGGTAGGCCCGTTGGTGTCCTCCTTTTGACTGA
ATGTGAAATGACCTTTGGTAGAGTGATTGGGACAGACCACTCTGGCAACTATAGAGGGGCTATGAAGCTGTCTCTCACACCCGGAACCCTTCTTGTGGTGCAAGGAAAAT
CTGCAGATTTTGCTAAGCATGCAATTCCCGCTATCCGCAAGCAACGTATACTTATTACGTTGACCAAATCACAACCAAAAAGAGCTGCACCAGCTGATGGACAACGCACA
TCTCTGAATGTAGGTCCATTTTCCAGTTGGGGCCCTCCATCTGCTAGATCGCCCAACCCTCGTCTTTCCCCTGGACAGAAGCCTTACCCTACTGTTCCTTCGACTGGGGT
GCTACCGGCACCACCCATTCGTCCCCAAATGGCACCACCAAATGGCATCCCTCCCTTAATTGTCCCTCCTGTAGCACCACCTATGCCTTTCCCTCCTGTGCCAATCCCAA
CTGGTCCATCTGCATGGCCCACTGCACATCCAAGGCATCCTCCACCTCGTCTCCCTGTTCCAGGCACTGGAGTATTCCTTCCCCCTCCAGGTTCTTCCAGTGCTCCAGCT
CCATCTCCTCAACAACAATTGCCAAACACCACAGTTGAGACGGGTTCCCTTTCAGAAAAGGAGAATGGTTCGATGAAATCTGATCACAATTCAGGTGCTCCTCCAGGAGA
AAAACCAGAAGCAAAGCCTCAAAGACAAGAATGCAATGGAAGTATGAATGGAAGTGGGAGTAATAAAGTGACAGAGGAAGAACAGCAGCAGCAGGAGGAGCAAAGTGAGA
ATCTGCAGGCCCAAAATGCAGGAGGTGGAGCAGTTTAGACAGAGAAGATACATTATTGAGGAGAGAGAGAGAGAAAAAAGAAACCAGATAGGCTGCAGAGTTGAATGAGT
TACAAGCAAAATGTAGAGAGCGGCAACATTCAAGACTGATGATTTTCACACACTAAACACTACTACCAAGGAGAACAAGGGGAGTTCTCTTTCAAAATCCTTTAGTTCAT
TCCGTTTTTGTTCTGAAAAATTATTGGTTAGATGGGAAAACTCATTCCTGATGGAACTTTGGATCTTACTATCTTTTTTAAAAAGAGATTTTTGAGAATTCAGAACCAGT
TGGGAATTTTCTTTTGGATTTAAACAGTTGAAAAAAAAGAAAGAAAAAGCTAACCCCTTGAATTATCAGGGACTCGATCTCCCCCCCCC
Protein sequenceShow/hide protein sequence
MAMPSGNVGVPDKVSFQSGGGGVAVSGGGGEIHQHHPRPWFPDERDGFISWLRGEFAASNAMIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMY
ALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRVEATVKEDIITCAESCNGGGSRKVEQVSNTCDESKASGEDEKLNDKDSGSAEDNKDTHGKDQSNSKPKYAEK
SEENASNKESQVEPADDGCSSSHRDKELQSVQSQNGKQCAATTPRTFVANEMFDGKMVNVMDGLKLFEELFDDAEVSKLHSLVNDLRASGKRGQLQGQTYVVSKRPMKGH
GREMIQLGFPIADAPHDDDNSSGLSKDRRIESIPSLLQDLIDRLVGEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMK
LSLTPGTLLVVQGKSADFAKHAIPAIRKQRILITLTKSQPKRAAPADGQRTSLNVGPFSSWGPPSARSPNPRLSPGQKPYPTVPSTGVLPAPPIRPQMAPPNGIPPLIVP
PVAPPMPFPPVPIPTGPSAWPTAHPRHPPPRLPVPGTGVFLPPPGSSSAPAPSPQQQLPNTTVETGSLSEKENGSMKSDHNSGAPPGEKPEAKPQRQECNGSMNGSGSNK
VTEEEQQQQEEQSENLQAQNAGGGAV