; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g29320 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g29320
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionHydroxyproline-rich glycoprotein family protein, putative isoform 2
Genome locationchr9:22056700..22067334
RNA-Seq ExpressionMoc09g29320
SyntenyMoc09g29320
Gene Ontology termsGO:0006402 - mRNA catabolic process (biological process)
GO:0070988 - demethylation (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0032451 - demethylase activity (molecular function)
InterPro domainsIPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
IPR044842 - RNA demethylase ALKBH9B/ALKBH10B-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056373.1 Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Cucumis melo var. makuwa]4.0e-28176.72Show/hide
Query:  MAMPSGNVGVSNKVPFQSGG------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGV +KV FQSGG      GGGGEIH  HHPRPW+PDERDGFISWLRGEFAA+NA+ID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVSNKVPFQSGG------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLID
        QQYFSVAEVMY+LQQV SRRQQRY+DPVKVGPKLYRRPGP FKQ  GHR EA VKEE + CA+SCNGGNSS FV SRKVEQVSNTC+ESKA GEDEKL +
Subjt:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLID

Query:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD
        +DSGS ED KD HGKDQS+SK+KC ENLEDNA NK+SQVE  DDGCSSSHRDKELQSVQS+NGKQ+AA TPRTFVANEMFDGKMVNVM+GLKL+EEL DD
Subjt:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD

Query:  AEVSKLVSLVNDLRASGKRGQFQGKFQFIL---------------------------------------NRRIEPIPSLLQDVIDRLVGEQVMTLKPDSC
        AEVSKL+SLVNDLRASGKRGQFQGKF   +                                       +RRIEPIPSLLQD+IDRLVG+QVMT+KPDSC
Subjt:  AEVSKLVSLVNDLRASGKRGQFQGKFQFIL---------------------------------------NRRIEPIPSLLQDVIDRLVGEQVMTLKPDSC

Query:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALAD
        IIDFYNEGDHSQPHVWP WFGRPVGVLLLTECE+TFGRVIGTDHSGNYRGA+KL LTPG LL+VQG+SADFAKHAIPAIRKQRILVTLTKSQPK+A+ AD
Subjt:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALAD

Query:  GQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPG
        GQR+S NVG FS WGPPSARSPN RL PGQK Y  VPST VLP PPI  QM PPNGIPPLIVP VAPPMPF  PV IP GP  WP AH RHPPPRLPVPG
Subjt:  GQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPG

Query:  TGVFL-PPGSSSTP---PLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEG
        TGVFL PPGSSS P   P QQL +S +E GSL+EKENG  TKSDH++   PGEKPEAK +RQECNG ++G
Subjt:  TGVFL-PPGSSSTP---PLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEG

XP_022150017.1 uncharacterized protein LOC111018294 [Momordica charantia]0.0e+0093.87Show/hide
Query:  MAMPSGNVGVSNKVPFQSGGGGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSV
        MAMPSGNVGVSNKVPFQSGGGGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSV
Subjt:  MAMPSGNVGVSNKVPFQSGGGGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSV

Query:  AEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDEDSGSVED
        AEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDEDSGSVED
Subjt:  AEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDEDSGSVED

Query:  KKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVS
        KKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVS
Subjt:  KKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVS

Query:  LVNDLRASGKRGQFQGKFQFI---------------------------------LNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPH
        LVNDLRASGKRGQFQG+   +                                  +RRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPH
Subjt:  LVNDLRASGKRGQFQGKFQFI---------------------------------LNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPH

Query:  VWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSW
        VWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSW
Subjt:  VWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSW

Query:  GPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSTPP
        GPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSTPP
Subjt:  GPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSTPP

Query:  LQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEG
        LQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEG
Subjt:  LQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEG

XP_022942363.1 uncharacterized protein LOC111447430 [Cucurbita moschata]6.8e-28176.8Show/hide
Query:  MAMPSGNVGVSNKVPFQSGG--------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
        MAMPSGNVGV +KVPFQSGG        GGGGEI HQHHPRPW+PDERDGFISWLR EFAAANA+ID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPV+
Subjt:  MAMPSGNVGVSNKVPFQSGG--------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL

Query:  HMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FK---QPHGHRVE-AVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGED
        HMQQYFSVA+V YSLQQV SRRQQRYIDPVKVGPK YRRPGP FK   Q  GHR+E  VKEE+V CA+SCNGGNSS FVGSRKVE VSNTCE+SKA GED
Subjt:  HMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FK---QPHGHRVE-AVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGED

Query:  EKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYE
        E L D+DSGS ED+KD HGKDQS+SK KC E+LEDNASNKES VE TDDGCSSS+R+KELQSVQ++NGKQ AA TPRTFVANEM DGKMVNVM+GLKL+E
Subjt:  EKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYE

Query:  ELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFI---------------------------------LNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCI
        +  DDAEVSKL+SLVNDLRASGKRGQFQG+   +                                  +RRIE IPSLLQD+IDRLVGEQ+M++KPDSCI
Subjt:  ELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFI---------------------------------LNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCI

Query:  IDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADG
        IDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNY+GAMKL L PGTLL+V+G+SADFAKHAIPAIRKQRILVTLTKSQPK+A  +DG
Subjt:  IDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADG

Query:  QRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGT
        QRTS NVGPFSSWGPPS RSPN RL PG KHYP+VPST VLPAPPI  QMPPPNGIPPLIV PVA PMPFP PV IP GPP WP AHPRHPPPRLPVPGT
Subjt:  QRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGT

Query:  GVFLPPGSSSTPPL------QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGN
        GVFLPP  SS+ P       QQL +STVETGSL+EKENG STKSDH+T A  GEK EAK +RQECNG+
Subjt:  GVFLPPGSSSTPPL------QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGN

XP_038892010.1 RNA demethylase ALKBH10B isoform X1 [Benincasa hispida]4.6e-28578.74Show/hide
Query:  MAMPSGNVGVSNKVPFQSGG------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MAMPSGNVGV +KV +QSGG      GGGGEI HQHHPRPW+PDERDGFISWLRGEFAA+NAIID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVSNKVPFQSGG------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLID
        QQYFSVAEVMY+LQQV SRRQQRY+DPVKVG KLYRRPGP FKQ  GHRVEA VKEE+  CA+SCNG NSS  VG RKVEQVSNTC+ESKA GED KL D
Subjt:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLID

Query:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD
        +DS S ED KD HGKDQS+SK KC ENLEDNASNKESQVE TDDGCSSSHRDKELQSVQS+NGKQ AA TPRTFVANEMFDGKMVNVM+GLKL+EEL DD
Subjt:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD

Query:  AEVSKLVSLVNDLRASGKRGQFQGKFQFIL-------------------------------------NRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCII
        AEVSKL+SLVNDLRASGKRGQFQG+   +L                                     +RRIE IPSLLQD+IDRLVGEQVMT+KPDSCI+
Subjt:  AEVSKLVSLVNDLRASGKRGQFQGKFQFIL-------------------------------------NRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCII

Query:  DFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQ
        DFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKL LTPGTLL+VQG+SADFAKHAIPAIRKQRILVTLTKSQPK+A  ADGQ
Subjt:  DFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQ

Query:  RTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTG
        RTS N+G FSSWGPPS RSPN RL PGQK YPTVPST VLPAPPI  QM PPNGIPPLIVPPVAPPMPFP PV IP GP AWP AHPRHPPPRLPVPGTG
Subjt:  RTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTG

Query:  VFLPPGSSSTPPL---QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAK-AERQECNGNMEG
        VFLPP  SS+ P    QQ  +S VETGSL+EKENG STKSDH++  SPGEKPEAK  +RQECNG+M+G
Subjt:  VFLPPGSSSTPPL---QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAK-AERQECNGNMEG

XP_038892011.1 RNA demethylase ALKBH10B isoform X2 [Benincasa hispida]1.6e-28579.22Show/hide
Query:  MAMPSGNVGVSNKVPFQSGG------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MAMPSGNVGV +KV +QSGG      GGGGEI HQHHPRPW+PDERDGFISWLRGEFAA+NAIID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVSNKVPFQSGG------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLID
        QQYFSVAEVMY+LQQV SRRQQRY+DPVKVG KLYRRPGP FKQ  GHRVEA VKEE+  CA+SCNG NSS  VG RKVEQVSNTC+ESKA GED KL D
Subjt:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLID

Query:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD
        +DS S ED KD HGKDQS+SK KC ENLEDNASNKESQVE TDDGCSSSHRDKELQSVQS+NGKQ AA TPRTFVANEMFDGKMVNVM+GLKL+EEL DD
Subjt:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD

Query:  AEVSKLVSLVNDLRASGKRGQFQGKFQFIL---------------------------------NRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYN
        AEVSKL+SLVNDLRASGKRGQFQG+   +L                                 +RRIE IPSLLQD+IDRLVGEQVMT+KPDSCI+DFYN
Subjt:  AEVSKLVSLVNDLRASGKRGQFQGKFQFIL---------------------------------NRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYN

Query:  EGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSS
        EGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKL LTPGTLL+VQG+SADFAKHAIPAIRKQRILVTLTKSQPK+A  ADGQRTS 
Subjt:  EGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSS

Query:  NVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFLP
        N+G FSSWGPPS RSPN RL PGQK YPTVPST VLPAPPI  QM PPNGIPPLIVPPVAPPMPFP PV IP GP AWP AHPRHPPPRLPVPGTGVFLP
Subjt:  NVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFLP

Query:  PGSSSTPPL---QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAK-AERQECNGNMEG
        P  SS+ P    QQ  +S VETGSL+EKENG STKSDH++  SPGEKPEAK  +RQECNG+M+G
Subjt:  PGSSSTPPL---QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAK-AERQECNGNMEG

TrEMBL top hitse value%identityAlignment
A0A5A7UKE8 Hydroxyproline-rich glycoprotein family protein, putative isoform 21.9e-28176.72Show/hide
Query:  MAMPSGNVGVSNKVPFQSGG------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGV +KV FQSGG      GGGGEIH  HHPRPW+PDERDGFISWLRGEFAA+NA+ID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVSNKVPFQSGG------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLID
        QQYFSVAEVMY+LQQV SRRQQRY+DPVKVGPKLYRRPGP FKQ  GHR EA VKEE + CA+SCNGGNSS FV SRKVEQVSNTC+ESKA GEDEKL +
Subjt:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLID

Query:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD
        +DSGS ED KD HGKDQS+SK+KC ENLEDNA NK+SQVE  DDGCSSSHRDKELQSVQS+NGKQ+AA TPRTFVANEMFDGKMVNVM+GLKL+EEL DD
Subjt:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD

Query:  AEVSKLVSLVNDLRASGKRGQFQGKFQFIL---------------------------------------NRRIEPIPSLLQDVIDRLVGEQVMTLKPDSC
        AEVSKL+SLVNDLRASGKRGQFQGKF   +                                       +RRIEPIPSLLQD+IDRLVG+QVMT+KPDSC
Subjt:  AEVSKLVSLVNDLRASGKRGQFQGKFQFIL---------------------------------------NRRIEPIPSLLQDVIDRLVGEQVMTLKPDSC

Query:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALAD
        IIDFYNEGDHSQPHVWP WFGRPVGVLLLTECE+TFGRVIGTDHSGNYRGA+KL LTPG LL+VQG+SADFAKHAIPAIRKQRILVTLTKSQPK+A+ AD
Subjt:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALAD

Query:  GQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPG
        GQR+S NVG FS WGPPSARSPN RL PGQK Y  VPST VLP PPI  QM PPNGIPPLIVP VAPPMPF  PV IP GP  WP AH RHPPPRLPVPG
Subjt:  GQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPG

Query:  TGVFL-PPGSSSTP---PLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEG
        TGVFL PPGSSS P   P QQL +S +E GSL+EKENG  TKSDH++   PGEKPEAK +RQECNG ++G
Subjt:  TGVFL-PPGSSSTP---PLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEG

A0A5D3E038 Hydroxyproline-rich glycoprotein family protein, putative isoform 27.3e-28177.11Show/hide
Query:  MAMPSGNVGVSNKVPFQSGG------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGV +KV FQSGG      GGGGEIH  HHPRPW+PDERDGFISWLRGEFAA+NA+ID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVSNKVPFQSGG------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLID
        QQYFSVAEVMY+LQQV SRRQQRY+DPVKVGPKLYRRPGP FKQ  GHR EA VKEE + CA+SCNGGNSS FV SRKVEQVSNTC+ESKA GEDEKL +
Subjt:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLID

Query:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD
        +DSGS ED KD HGKDQS+SK+KC ENLEDNA NK+SQVE  DDGCSSSHRDKELQSVQS+NGKQ+AA TPRTFVANEMFDGKMVNVM+GLKL+EEL DD
Subjt:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD

Query:  AEVSKLVSLVNDLRASGKRGQFQGKFQFI---------------------------------LNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYN
        AEVSKL+SLVNDLRASGKRGQFQG+   +                                  +RRIEPIPSLLQD+IDRLVG+QVMT+KPDSCIIDFYN
Subjt:  AEVSKLVSLVNDLRASGKRGQFQGKFQFI---------------------------------LNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYN

Query:  EGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSS
        EGDHSQPHVWP WFGRPVGVLLLTECE+TFGRVIGTDHSGNYRGA+KL LTPG LL+VQG+SADFAKHAIPAIRKQRILVTLTKSQPK+A+ ADGQR+S 
Subjt:  EGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSS

Query:  NVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFL-
        NVG FS WGPPSARSPN RL PGQK Y  VPST VLP PPI  QM PPNGIPPLIVP VAPPMPF  PV IP GP  WP AH RHPPPRLPVPGTGVFL 
Subjt:  NVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFL-

Query:  PPGSSSTP---PLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEG
        PPGSSS P   P QQL +S +E GSL+EKENG  TKSDH++   PGEKPEAK +RQECNG ++G
Subjt:  PPGSSSTP---PLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEG

A0A6J1D9K3 uncharacterized protein LOC1110182940.0e+0093.87Show/hide
Query:  MAMPSGNVGVSNKVPFQSGGGGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSV
        MAMPSGNVGVSNKVPFQSGGGGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSV
Subjt:  MAMPSGNVGVSNKVPFQSGGGGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSV

Query:  AEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDEDSGSVED
        AEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDEDSGSVED
Subjt:  AEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDEDSGSVED

Query:  KKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVS
        KKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVS
Subjt:  KKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVS

Query:  LVNDLRASGKRGQFQGKFQFI---------------------------------LNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPH
        LVNDLRASGKRGQFQG+   +                                  +RRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPH
Subjt:  LVNDLRASGKRGQFQGKFQFI---------------------------------LNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPH

Query:  VWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSW
        VWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSW
Subjt:  VWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSW

Query:  GPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSTPP
        GPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSTPP
Subjt:  GPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSTPP

Query:  LQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEG
        LQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEG
Subjt:  LQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEG

A0A6J1FUM5 uncharacterized protein LOC1114474303.3e-28176.8Show/hide
Query:  MAMPSGNVGVSNKVPFQSGG--------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
        MAMPSGNVGV +KVPFQSGG        GGGGEI HQHHPRPW+PDERDGFISWLR EFAAANA+ID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPV+
Subjt:  MAMPSGNVGVSNKVPFQSGG--------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL

Query:  HMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FK---QPHGHRVE-AVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGED
        HMQQYFSVA+V YSLQQV SRRQQRYIDPVKVGPK YRRPGP FK   Q  GHR+E  VKEE+V CA+SCNGGNSS FVGSRKVE VSNTCE+SKA GED
Subjt:  HMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FK---QPHGHRVE-AVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGED

Query:  EKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYE
        E L D+DSGS ED+KD HGKDQS+SK KC E+LEDNASNKES VE TDDGCSSS+R+KELQSVQ++NGKQ AA TPRTFVANEM DGKMVNVM+GLKL+E
Subjt:  EKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYE

Query:  ELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFI---------------------------------LNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCI
        +  DDAEVSKL+SLVNDLRASGKRGQFQG+   +                                  +RRIE IPSLLQD+IDRLVGEQ+M++KPDSCI
Subjt:  ELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFI---------------------------------LNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCI

Query:  IDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADG
        IDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNY+GAMKL L PGTLL+V+G+SADFAKHAIPAIRKQRILVTLTKSQPK+A  +DG
Subjt:  IDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADG

Query:  QRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGT
        QRTS NVGPFSSWGPPS RSPN RL PG KHYP+VPST VLPAPPI  QMPPPNGIPPLIV PVA PMPFP PV IP GPP WP AHPRHPPPRLPVPGT
Subjt:  QRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGT

Query:  GVFLPPGSSSTPPL------QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGN
        GVFLPP  SS+ P       QQL +STVETGSL+EKENG STKSDH+T A  GEK EAK +RQECNG+
Subjt:  GVFLPPGSSSTPPL------QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGN

A0A6J1J9C0 uncharacterized protein LOC1114846095.6e-28177.59Show/hide
Query:  MAMPSGNVGVSNKVPFQSGG-------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLH
        MAMPSGNVGV +KV FQSGG       GGGGEI HQHHPRPW+PDERDGFISWLR EFAAANA+ID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPV+H
Subjt:  MAMPSGNVGVSNKVPFQSGG-------GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLH

Query:  MQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FK----QPHGHRVEA-VKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGED
        MQQYFSVA+V YSLQQV SRRQQRYIDPVKVGPK YRRPGP FK    Q  GHR+EA VKEE+V CA+SCNGGNSS FVGSRKVEQVSNTCEESKA GED
Subjt:  MQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FK----QPHGHRVEA-VKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGED

Query:  EKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYE
        E L D+DSGS ED+KD HGKDQ +SK KC E+LEDNASNKES VE TDDGCSSS+R+KELQSVQS+NGKQ AA TPRTFVANEM DGKMVNVM+GLKL+E
Subjt:  EKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYE

Query:  ELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFI---------------------------------LNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCI
        +  DDAEVSKL+SLVNDLRASGKRGQFQG    +                                  +RRIE IPSLLQD+IDRLVGEQ+M++KPDSCI
Subjt:  ELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFI---------------------------------LNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCI

Query:  IDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADG
        IDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNY+GAMKL L PGTLL+V+G+SADFAKHAIPAIRKQRILVTLTKSQPK+A  +DG
Subjt:  IDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADG

Query:  QRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGT
        QRTS NVGPF+SWGPPS RSPN RL PGQKHY +VPST VLPAPPI  QMPPPNGIPPLIV PVA PMPFP PV IP GPP WP AHPRHPPPRLPVPGT
Subjt:  QRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGT

Query:  GVFL-PPGSSS--TPPLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGN
        GVFL PPGSSS  +P  QQL +STVETGSL+EKENG STKSDH+  AS GEK EAK +RQECNG+
Subjt:  GVFL-PPGSSS--TPPLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGN

SwissProt top hitse value%identityAlignment
Q9SL49 RNA demethylase ALKBH9B2.4e-2328.84Show/hide
Query:  ENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQ----
        E  E+    ++S  +  D     +    +L   Q  N +       + F+  E   GK+VNV++GL+L+  +F   E  ++V  V  L+  G+RG+    
Subjt:  ENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQ----

Query:  --------FQGK----FQF----------------ILNR-RIEPIPSLLQDVIDRLVGEQVM--TLKPDSCIIDFYNEGDHSQPHVWPPWFGRP-VGVLL
                 +GK     QF                IL R  ++P+P L + +I +L+   V+  T  PDSCI++ Y+EGD   PH+    F RP   +  
Subjt:  --------FQGK----FQF----------------ILNR-RIEPIPSLLQDVIDRLVGEQVM--TLKPDSCIIDFYNEGDHSQPHVWPPWFGRP-VGVLL

Query:  LTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKK
        L+EC++ FG  +  +  G++ G+  +PL  G++L++ G  AD AKH +PA+  +RI +T  K    K
Subjt:  LTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKK

Q9ZT92 RNA demethylase ALKBH10B5.8e-4930.22Show/hide
Query:  RDGFISWLRGEFAAANAIIDSLCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFK
        +D  ISW RGEFAAANAIID++C HLR   E     EY+ V   I +RR NW PVL MQ+Y S+AEV   LQ+VA+++ +                   K
Subjt:  RDGFISWLRGEFAAANAIIDSLCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFK

Query:  QPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDED-SGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTD
        +      E +KE V                 + + E+V   C         EK+ + D +G VED +D       DS +    ++ D+ S+++    +  
Subjt:  QPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDED-SGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTD

Query:  DGCSS----SHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILNRR------
        D        SH D + +S + +  K         F A E   G  VNV++GLKLYEEL  + E+SKL+  V +LR +G  G+  G+   + N++      
Subjt:  DGCSS----SHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILNRR------

Query:  ---------------------------IEPIPSLLQDVIDRLVGEQVMT--LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTD
                                   IEPIP LL+ VID  V  +++    +P+ C+I+F+ EG++SQP + PP   +P+  L+L+E  M +GR++ +D
Subjt:  ---------------------------IEPIPSLLQDVIDRLVGEQVMT--LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTD

Query:  HSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLP
        + GN+RG + L L  G+LL+++G SAD A+H +   + +R+ +T  + +P          +  N G  + W  P   +P   L         +P   VL 
Subjt:  HSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLP

Query:  APPIHSQMPPPNGIPPLIVP-PVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPR------LPVPGTGVFLPPGSSSTPPL
         PP+    PPP  + P+I+P P          V +P         H +H PPR      LP+P      P G S++ P+
Subjt:  APPIHSQMPPPNGIPPLIVP-PVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPR------LPVPGTGVFLPPGSSSTPPL

Arabidopsis top hitse value%identityAlignment
AT1G14710.1 hydroxyproline-rich glycoprotein family protein3.2e-11946.3Show/hide
Query:  PRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQ-----QRYIDPVKVGP
        P  W PDERDGFISWLR EFAAANAIIDSLC HL+AVG+  EY+ VIG I  RR  W+ VL MQQ+F VA+V Y+LQQ+A +RQ     QR+ +  +VG 
Subjt:  PRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQ-----QRYIDPVKVGP

Query:  KLYRRPGP-FKQPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNAS
           RR GP F + HG        + +A     NG N +G V S +VE       E   +  D K +       E+K+D   K +SDSK      +E    
Subjt:  KLYRRPGP-FKQPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNAS

Query:  NKESQVELT-DDGCSSSHRDKEL--QSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKF----
          E+Q E+  +  C+S  +D  L  +  Q  N K+  A+  +TFV  EM+D KMVNV+EGLKLY+++ D  EVS+LVSLV +LR +G+RGQ Q +     
Subjt:  NKESQVELT-DDGCSSSHRDKEL--QSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKF----

Query:  -------------------------QFILNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVI
                                   I +RRIEPIPS L D+I+RLV +Q++ +KPD+CIIDF++EGDHSQPH++ PWFGRP+ VL L+EC+ TFGRVI
Subjt:  -------------------------QFILNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVI

Query:  GTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNH--RLPPG-QKHYPTV-
         +++ G+Y+G++KL LTPG++LLV+G+SA+ AK+AI A RKQRIL++  KS+P+                 S+WGPP +RSPN   R P G  KHYP V 
Subjt:  GTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNH--RLPPG-QKHYPTV-

Query:  PSTCVLPAPPIHSQMPPPNGIPPLIV---PPVAPPMPFPAPVSIPPGPPAWP--AAHPRH---PPPRLPVPGTGVFLPPGSSSTPPLQQLASST
        PST VLP P   S  PP   + P+ +   PP+A PMPFP    +P GPP WP    HPRH   P PR+P+PGTGVFLPPGS+     Q+LA ++
Subjt:  PSTCVLPAPPIHSQMPPPNGIPPLIV---PPVAPPMPFPAPVSIPPGPPAWP--AAHPRH---PPPRLPVPGTGVFLPPGSSSTPPLQQLASST

AT1G14710.2 hydroxyproline-rich glycoprotein family protein3.2e-11946.3Show/hide
Query:  PRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQ-----QRYIDPVKVGP
        P  W PDERDGFISWLR EFAAANAIIDSLC HL+AVG+  EY+ VIG I  RR  W+ VL MQQ+F VA+V Y+LQQ+A +RQ     QR+ +  +VG 
Subjt:  PRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQ-----QRYIDPVKVGP

Query:  KLYRRPGP-FKQPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNAS
           RR GP F + HG        + +A     NG N +G V S +VE       E   +  D K +       E+K+D   K +SDSK      +E    
Subjt:  KLYRRPGP-FKQPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNAS

Query:  NKESQVELT-DDGCSSSHRDKEL--QSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKF----
          E+Q E+  +  C+S  +D  L  +  Q  N K+  A+  +TFV  EM+D KMVNV+EGLKLY+++ D  EVS+LVSLV +LR +G+RGQ Q +     
Subjt:  NKESQVELT-DDGCSSSHRDKEL--QSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKF----

Query:  -------------------------QFILNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVI
                                   I +RRIEPIPS L D+I+RLV +Q++ +KPD+CIIDF++EGDHSQPH++ PWFGRP+ VL L+EC+ TFGRVI
Subjt:  -------------------------QFILNRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVI

Query:  GTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNH--RLPPG-QKHYPTV-
         +++ G+Y+G++KL LTPG++LLV+G+SA+ AK+AI A RKQRIL++  KS+P+                 S+WGPP +RSPN   R P G  KHYP V 
Subjt:  GTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNH--RLPPG-QKHYPTV-

Query:  PSTCVLPAPPIHSQMPPPNGIPPLIV---PPVAPPMPFPAPVSIPPGPPAWP--AAHPRH---PPPRLPVPGTGVFLPPGSSSTPPLQQLASST
        PST VLP P   S  PP   + P+ +   PP+A PMPFP    +P GPP WP    HPRH   P PR+P+PGTGVFLPPGS+     Q+LA ++
Subjt:  PSTCVLPAPPIHSQMPPPNGIPPLIV---PPVAPPMPFPAPVSIPPGPPAWP--AAHPRH---PPPRLPVPGTGVFLPPGSSSTPPLQQLASST

AT1G48980.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.9e-2630.77Show/hide
Query:  LEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQF-----
        LED  S +E   + + +   SS  + +L   Q  + +       R FV  E  +G++VN++EGL+L+ E+F+ AE  ++V  V +L+   ++G+      
Subjt:  LEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQF-----

Query:  -QGK----FQF-----------------ILNRRIEPIPSLLQDVIDRLVGEQVM--TLKPDSCIIDFYNEGDHSQPHVWPPWFGRP-VGVLLLTECEMTF
         QGK     QF                 + +  ++P+P L + +I RLV   V+  T  PD C+++ Y+EGD   PH+    F RP   V  L+EC + F
Subjt:  -QGK----FQF-----------------ILNRRIEPIPSLLQDVIDRLVGEQVM--TLKPDSCIIDFYNEGDHSQPHVWPPWFGRP-VGVLLLTECEMTF

Query:  GRVIGTDHSGNYR-GAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKK
        G  +  + +G Y  G+  LPL  G++L++ G  AD AKH +P +  +RI +T  K    K
Subjt:  GRVIGTDHSGNYR-GAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKK

AT2G48080.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein1.0e-4827.87Show/hide
Query:  RDGFISWLRGEFAAANAIIDSLCHHL-RAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQP
        +D  ++W RGEFAAANAIID+LC HL +A G   +Y+ V+  + +RR NW PVL MQ+Y S+++V   LQQ  ++    ++D                  
Subjt:  RDGFISWLRGEFAAANAIIDSLCHHL-RAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQP

Query:  HGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGC
        H               DS +   + G  GSR+ E +S  C+                                                       +D C
Subjt:  HGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGC

Query:  SSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILNRR-------------
         S       QS              + F A E   G   NV++GLKLY+++F   ++SKL+  +N LR +G+  Q  G+   + N+              
Subjt:  SSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILNRR-------------

Query:  -----------IEPIPSLLQDVIDRLVGEQVMT--LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPG
                   +EPIP+L+Q VID L+  +++    +P+ C+I+F++E +HSQP   PP   +P+  L+L+E  M FG  +G D+ GN+RG++ LPL  G
Subjt:  -----------IEPIPSLLQDVIDRLVGEQVMT--LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPG

Query:  TLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALAD-------------------GQRTSSNVGPFSSWGPPSARSPNHRLPP
        +LL+++G SAD A+H +     +R+ +T  K +P    +                      +R  +  G F  W PP +R P   LPP
Subjt:  TLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALAD-------------------GQRTSSNVGPFSSWGPPSARSPNHRLPP

AT4G02940.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein4.1e-5030.22Show/hide
Query:  RDGFISWLRGEFAAANAIIDSLCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFK
        +D  ISW RGEFAAANAIID++C HLR   E     EY+ V   I +RR NW PVL MQ+Y S+AEV   LQ+VA+++ +                   K
Subjt:  RDGFISWLRGEFAAANAIIDSLCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFK

Query:  QPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDED-SGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTD
        +      E +KE V                 + + E+V   C         EK+ + D +G VED +D       DS +    ++ D+ S+++    +  
Subjt:  QPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDED-SGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTD

Query:  DGCSS----SHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILNRR------
        D        SH D + +S + +  K         F A E   G  VNV++GLKLYEEL  + E+SKL+  V +LR +G  G+  G+   + N++      
Subjt:  DGCSS----SHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILNRR------

Query:  ---------------------------IEPIPSLLQDVIDRLVGEQVMT--LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTD
                                   IEPIP LL+ VID  V  +++    +P+ C+I+F+ EG++SQP + PP   +P+  L+L+E  M +GR++ +D
Subjt:  ---------------------------IEPIPSLLQDVIDRLVGEQVMT--LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTD

Query:  HSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLP
        + GN+RG + L L  G+LL+++G SAD A+H +   + +R+ +T  + +P          +  N G  + W  P   +P   L         +P   VL 
Subjt:  HSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLP

Query:  APPIHSQMPPPNGIPPLIVP-PVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPR------LPVPGTGVFLPPGSSSTPPL
         PP+    PPP  + P+I+P P          V +P         H +H PPR      LP+P      P G S++ P+
Subjt:  APPIHSQMPPPNGIPPLIVP-PVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPR------LPVPGTGVFLPPGSSSTPPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATGCCATCGGGAAATGTGGGTGTATCGAATAAAGTTCCGTTTCAGAGCGGTGGCGGAGGTGGTGGCGAGATCCATCATCAGCATCATCCACGGCCGTGGTATCC
GGATGAGCGTGATGGGTTTATCTCATGGTTGCGAGGGGAATTTGCTGCAGCCAATGCGATCATTGATTCCCTTTGCCATCATTTGCGCGCCGTGGGAGAGCCCGGGGAGT
ATGATGTGGTTATTGGGTGTATACAACAACGGCGGTGTAATTGGACGCCCGTGCTTCATATGCAGCAGTACTTTTCAGTGGCGGAAGTGATGTATTCCCTTCAACAGGTC
GCCTCGAGGAGGCAGCAGAGATATATTGATCCTGTGAAAGTGGGGCCGAAGTTGTATAGGAGACCTGGGCCGTTTAAGCAGCCCCATGGTCATCGTGTGGAAGCAGTCAA
AGAAGAGGTGGTTGCGTGTGCAGATTCTTGTAATGGAGGGAATTCGTCAGGTTTTGTAGGCTCTAGGAAGGTGGAGCAAGTAAGTAATACATGTGAGGAAAGTAAGGCAA
TGGGGGAGGATGAGAAATTGATAGATGAAGATTCAGGGTCAGTAGAGGACAAAAAAGATATTCACGGGAAGGACCAAAGTGATAGCAAATCGAAGTGTGGGGAAAATTTA
GAAGACAATGCAAGTAATAAAGAATCCCAAGTTGAACTTACTGATGATGGATGTTCTTCAAGTCATAGAGATAAGGAGTTGCAATCTGTTCAAAGCCGGAATGGAAAGCA
GAATGCTGCTGCGACACCAAGAACCTTTGTTGCGAATGAGATGTTTGATGGAAAGATGGTTAATGTGATGGAAGGATTGAAACTGTATGAAGAATTATTTGATGATGCTG
AGGTTTCAAAGCTTGTTTCATTGGTAAATGATTTGAGAGCTTCAGGGAAGAGGGGGCAATTTCAAGGCAAGTTTCAGTTCATTTTAAATAGAAGAATAGAACCAATCCCC
TCCTTGCTTCAAGATGTCATTGATCGCTTGGTTGGTGAGCAGGTGATGACATTGAAACCAGATTCCTGCATCATTGACTTTTATAATGAGGGAGATCATTCTCAGCCTCA
TGTCTGGCCACCATGGTTTGGGAGACCTGTCGGTGTTCTCCTTTTGACTGAATGCGAAATGACCTTTGGTAGAGTAATTGGAACAGACCATTCTGGCAACTATAGAGGGG
CTATGAAGCTGCCTCTTACACCCGGAACCCTTCTTTTGGTGCAAGGAAGATCTGCAGATTTTGCCAAGCATGCGATACCTGCTATCCGCAAGCAACGTATACTTGTTACC
TTGACCAAGTCACAGCCAAAGAAAGCTGCACTAGCTGATGGGCAACGCACATCATCGAACGTAGGTCCGTTTTCCAGTTGGGGCCCTCCATCTGCTCGATCACCAAACCA
TCGTCTTCCCCCTGGACAGAAGCATTATCCCACAGTTCCATCAACTTGCGTGTTACCCGCACCACCCATTCACTCCCAAATGCCCCCACCAAATGGCATCCCGCCCTTAA
TTGTCCCTCCTGTGGCACCACCTATGCCGTTCCCTGCTCCCGTGTCGATCCCACCTGGTCCACCCGCTTGGCCTGCTGCACACCCAAGGCATCCTCCGCCCCGCCTACCT
GTTCCAGGCACTGGAGTATTCCTCCCTCCAGGATCTTCCAGCACTCCACCCCTGCAACAGTTGGCGAGCTCCACGGTCGAGACAGGCTCCCTCACAGAAAAGGAAAATGG
TTGTTCGACGAAATCCGATCACAGTACAGCTGCTTCTCCAGGAGAGAAACCTGAAGCAAAAGCAGAAAGACAAGAGTGCAATGGAAACATGGAAGGAATGTTTACTATTG
GTGGCCTCACACTACTCCGGGGAGAAAGACACCTTGTCCTCAAGGTGGAATTGTGGAAATTGGCACTGGATGGTCGTGAGGAGTTCCCAAGTGGCTTCATATTCAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAATGCCATCGGGAAATGTGGGTGTATCGAATAAAGTTCCGTTTCAGAGCGGTGGCGGAGGTGGTGGCGAGATCCATCATCAGCATCATCCACGGCCGTGGTATCC
GGATGAGCGTGATGGGTTTATCTCATGGTTGCGAGGGGAATTTGCTGCAGCCAATGCGATCATTGATTCCCTTTGCCATCATTTGCGCGCCGTGGGAGAGCCCGGGGAGT
ATGATGTGGTTATTGGGTGTATACAACAACGGCGGTGTAATTGGACGCCCGTGCTTCATATGCAGCAGTACTTTTCAGTGGCGGAAGTGATGTATTCCCTTCAACAGGTC
GCCTCGAGGAGGCAGCAGAGATATATTGATCCTGTGAAAGTGGGGCCGAAGTTGTATAGGAGACCTGGGCCGTTTAAGCAGCCCCATGGTCATCGTGTGGAAGCAGTCAA
AGAAGAGGTGGTTGCGTGTGCAGATTCTTGTAATGGAGGGAATTCGTCAGGTTTTGTAGGCTCTAGGAAGGTGGAGCAAGTAAGTAATACATGTGAGGAAAGTAAGGCAA
TGGGGGAGGATGAGAAATTGATAGATGAAGATTCAGGGTCAGTAGAGGACAAAAAAGATATTCACGGGAAGGACCAAAGTGATAGCAAATCGAAGTGTGGGGAAAATTTA
GAAGACAATGCAAGTAATAAAGAATCCCAAGTTGAACTTACTGATGATGGATGTTCTTCAAGTCATAGAGATAAGGAGTTGCAATCTGTTCAAAGCCGGAATGGAAAGCA
GAATGCTGCTGCGACACCAAGAACCTTTGTTGCGAATGAGATGTTTGATGGAAAGATGGTTAATGTGATGGAAGGATTGAAACTGTATGAAGAATTATTTGATGATGCTG
AGGTTTCAAAGCTTGTTTCATTGGTAAATGATTTGAGAGCTTCAGGGAAGAGGGGGCAATTTCAAGGCAAGTTTCAGTTCATTTTAAATAGAAGAATAGAACCAATCCCC
TCCTTGCTTCAAGATGTCATTGATCGCTTGGTTGGTGAGCAGGTGATGACATTGAAACCAGATTCCTGCATCATTGACTTTTATAATGAGGGAGATCATTCTCAGCCTCA
TGTCTGGCCACCATGGTTTGGGAGACCTGTCGGTGTTCTCCTTTTGACTGAATGCGAAATGACCTTTGGTAGAGTAATTGGAACAGACCATTCTGGCAACTATAGAGGGG
CTATGAAGCTGCCTCTTACACCCGGAACCCTTCTTTTGGTGCAAGGAAGATCTGCAGATTTTGCCAAGCATGCGATACCTGCTATCCGCAAGCAACGTATACTTGTTACC
TTGACCAAGTCACAGCCAAAGAAAGCTGCACTAGCTGATGGGCAACGCACATCATCGAACGTAGGTCCGTTTTCCAGTTGGGGCCCTCCATCTGCTCGATCACCAAACCA
TCGTCTTCCCCCTGGACAGAAGCATTATCCCACAGTTCCATCAACTTGCGTGTTACCCGCACCACCCATTCACTCCCAAATGCCCCCACCAAATGGCATCCCGCCCTTAA
TTGTCCCTCCTGTGGCACCACCTATGCCGTTCCCTGCTCCCGTGTCGATCCCACCTGGTCCACCCGCTTGGCCTGCTGCACACCCAAGGCATCCTCCGCCCCGCCTACCT
GTTCCAGGCACTGGAGTATTCCTCCCTCCAGGATCTTCCAGCACTCCACCCCTGCAACAGTTGGCGAGCTCCACGGTCGAGACAGGCTCCCTCACAGAAAAGGAAAATGG
TTGTTCGACGAAATCCGATCACAGTACAGCTGCTTCTCCAGGAGAGAAACCTGAAGCAAAAGCAGAAAGACAAGAGTGCAATGGAAACATGGAAGGAATGTTTACTATTG
GTGGCCTCACACTACTCCGGGGAGAAAGACACCTTGTCCTCAAGGTGGAATTGTGGAAATTGGCACTGGATGGTCGTGAGGAGTTCCCAAGTGGCTTCATATTCAGGTAG
Protein sequenceShow/hide protein sequence
MAMPSGNVGVSNKVPFQSGGGGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQV
ASRRQQRYIDPVKVGPKLYRRPGPFKQPHGHRVEAVKEEVVACADSCNGGNSSGFVGSRKVEQVSNTCEESKAMGEDEKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENL
EDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILNRRIEPIP
SLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVT
LTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIHSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLP
VPGTGVFLPPGSSSTPPLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGMFTIGGLTLLRGERHLVLKVELWKLALDGREEFPSGFIFR