CuGenDBv2

MC09g0941 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC09g0941
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Hydroxyproline-rich glycoprotein family protein, putative isoform 2
Genome location: MC09:14170848..14181450
RNA-Seq Expression: MC09g0941
Synteny: MC09g0941
Gene Ontology terms:
GO:0006402 - mRNA catabolic process (biological process)
GO:0070988 - demethylation (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0032451 - demethylase activity (molecular function)
InterPro domains:
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
IPR044842 - RNA demethylase ALKBH9B/ALKBH10B-like


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004142291.1 RNA demethylase ALKBH10B isoform X2 [Cucumis sativus] (e value: 0.0; % identity: 79.77)
Query:  MAMPSGNVGVSNKVPFQSGGG---GGGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ
        MAMPSGNVGV +KV FQSGGG    GGGGEIH QHHPRPW+PDERDGFISWLRGEFAA+NAIID+LCHHLRAVGEPGEYD+VIGCIQQRRCNWTPVLHMQ
Subjt:  MAMPSGNVGVSNKVPFQSGGG---GGGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQ

Query:  QYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEE---------------------VEQVSNTCEESKAMGEDEKLIDE
        QYFSVAEVMY+LQQV SRRQQRY+DPVKVGPKLYRRPGP FKQ  GHR EA VKEE                     VEQVSNTC+ESKA GEDEKL ++
Subjt:  QYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEE---------------------VEQVSNTCEESKAMGEDEKLIDE

Query:  DSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDA
        DSGS  D KD HGKDQS+ K+K  ENLEDNA NK+SQVE  DDGCSSSHRDKELQSVQS+NGKQ AA TPRTFVA+EMFDGKMVNVM+GLKL+EEL DDA
Subjt:  DSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDA

Query:  EVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDSC
        EVSKL+SLVNDLRASGKRGQFQG       QTYVVSKRPMKGHGREMIQLGFPIADAPH+D+ S G+SKDRRIEPIPSLLQD+IDRLVG+QVMT+KPDSC
Subjt:  EVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDSC

Query:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALAD
        IIDFYNEGDHSQPHVWP WFGRPVGVLLLTECE+TFGRVIGTDHSGNYRGAMKL LTPG LL+VQG+SADFAKHA+PAIRKQRILVTLTKSQPK+AA AD
Subjt:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALAD

Query:  GQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPG
        GQRTS NVG FS WGPPSARSPN RL PGQK YPTVPST VLP PPIR QM PPNGIPPLIVPPVA PMPF  PV IP GP AWP AH RHPPPRLPVPG
Subjt:  GQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPG

Query:  TGVFLPP-GSSSTP---PLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQ
        TGVFLPP GSSS P   P QQL  S +ETGSL+EKENG  TKSDHS+   PGEKP+AKA+RQECNG+++GS + KV  EE+QQQQ++
Subjt:  TGVFLPP-GSSSTP---PLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQ

XP_022150017.1 uncharacterized protein LOC111018294 [Momordica charantia] (e value: 0.0; % identity: 95.45)
Query:  MAMPSGNVGVSNKVPFQSGGGGGGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYF
        MAMPSGNVGVSNKVPFQSGGGGG  GEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYF
Subjt:  MAMPSGNVGVSNKVPFQSGGGGGGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYF

Query:  SVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQPHGHRVEAVKEEV---------------------EQVSNTCEESKAMGEDEKLIDEDSGSV
        SVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQPHGHRVEAVKEEV                     EQVSNTCEESKAMGEDEKLIDEDSGSV
Subjt:  SVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQPHGHRVEAVKEEV---------------------EQVSNTCEESKAMGEDEKLIDEDSGSV

Query:  EDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKL
        EDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKL
Subjt:  EDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKL

Query:  VSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFY
        VSLVNDLRASGKRGQFQG       QTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFY
Subjt:  VSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFY

Query:  NEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTS
        NEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTS
Subjt:  NEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTS

Query:  SNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFL
        SNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPI SQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFL
Subjt:  SNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFL

Query:  PPGSSSTPPLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQSGGI
        PPGSSSTPPLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQSGGI
Subjt:  PPGSSSTPPLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQSGGI

XP_022987072.1 uncharacterized protein LOC111484609 [Cucurbita maxima] (e value: 0.0; % identity: 78.79)
Query:  MAMPSGNVGVSNKVPFQSGGGGG-----GGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLH
        MAMPSGNVGV +KV FQSGGGGG     GGGEIH QHHPRPW+PDERDGFISWLR EFAAANA+ID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPV+H
Subjt:  MAMPSGNVGVSNKVPFQSGGGGG-----GGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLH

Query:  MQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPH----GHRVEA-VKEE---------------------VEQVSNTCEESKAMGED
        MQQYFSVA+V YSLQQV SRRQQRYIDPVKVGPK YRRPGP FKQ      GHR+EA VKEE                     VEQVSNTCEESKA GED
Subjt:  MQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPH----GHRVEA-VKEE---------------------VEQVSNTCEESKAMGED

Query:  EKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYE
        E L D+DSGS ED+KD HGKDQ +SK KC E+LEDNASNKES VE TDDGCSSS+R+KELQSVQS+NGKQ AA TPRTFVANEM DGKMVNVM+GLKL+E
Subjt:  EKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYE

Query:  ELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMT
        +  DDAEVSKL+SLVNDLRASGKRGQFQG        TYVVSKRPMKGHGREMIQLGFPIAD PHDD+ S+G+SKDRRIE IPSLLQD+IDRLVGEQ+M+
Subjt:  ELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMT

Query:  LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPK
        +KPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNY+GAMKL L PGTLL+V+G+SADFAKHAIPAIRKQRILVTLTKSQPK
Subjt:  LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPK

Query:  KAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPP
        +A  +DGQRTS NVGPF+SWGPPS RSPN RL PGQKHY +VPST VLPAPPIR QMPPPNGIPPLIV PVA PMPFP PV IP GPP WP AHPRHPPP
Subjt:  KAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPP

Query:  RLPVPGTGVFLPPGSSSTPPL---QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQS
        RLPVPGTGVFLPP  SS+ P    QQL +STVETGSL+EKENG STKSDH+  AS GEK EAK +RQECNG    SES KV  EE+Q+QQ+QS
Subjt:  RLPVPGTGVFLPPGSSSTPPL---QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQS

XP_038892010.1 RNA demethylase ALKBH10B isoform X1 [Benincasa hispida] (e value: 0.0; % identity: 81.07)
Query:  MAMPSGNVGVSNKVPFQSGGGG----GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MAMPSGNVGV +KV +QSGGGG    GGGGEIH QHHPRPW+PDERDGFISWLRGEFAA+NAIID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVSNKVPFQSGGGG----GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEE---------------------VEQVSNTCEESKAMGEDEKLID
        QQYFSVAEVMY+LQQV SRRQQRY+DPVKVG KLYRRPGP FKQ  GHRVEA VKEE                     VEQVSNTC+ESKA GED KL D
Subjt:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEE---------------------VEQVSNTCEESKAMGEDEKLID

Query:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD
        +DS S ED KD HGKDQS+SK KC ENLEDNASNKESQVE TDDGCSSSHRDKELQSVQS+NGKQ AA TPRTFVANEMFDGKMVNVM+GLKL+EEL DD
Subjt:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD

Query:  AEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISK----DRRIEPIPSLLQDVIDRLVGEQVMTL
        AEVSKL+SLVNDLRASGKRGQFQG       QTYVV KRPMKGHGREMIQLGFPIADAPHDD+ S+G+SK    DRRIE IPSLLQD+IDRLVGEQVMT+
Subjt:  AEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISK----DRRIEPIPSLLQDVIDRLVGEQVMTL

Query:  KPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKK
        KPDSCI+DFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKL LTPGTLL+VQG+SADFAKHAIPAIRKQRILVTLTKSQPK+
Subjt:  KPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKK

Query:  AALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPR
        A  ADGQRTS N+G FSSWGPPS RSPN RL PGQK YPTVPST VLPAPPIR QM PPNGIPPLIVPPVAPPMPFP PV IP GP AWP AHPRHPPPR
Subjt:  AALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPR

Query:  LPVPGTGVFLPPGSSSTPPL---QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKA-ERQECNGNMEGSESGKVAAEEEQQQQQQ
        LPVPGTGVFLPP  SS+ P    QQ  +S VETGSL+EKENG STKSDH++  SPGEKPEAK  +RQECNG+M+GS S KV  EE+Q QQQQ
Subjt:  LPVPGTGVFLPPGSSSTPPL---QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKA-ERQECNGNMEGSESGKVAAEEEQQQQQQ

XP_038892011.1 RNA demethylase ALKBH10B isoform X2 [Benincasa hispida] (e value: 0.0; % identity: 81.54)
Query:  MAMPSGNVGVSNKVPFQSGGGG----GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MAMPSGNVGV +KV +QSGGGG    GGGGEIH QHHPRPW+PDERDGFISWLRGEFAA+NAIID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVSNKVPFQSGGGG----GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEE---------------------VEQVSNTCEESKAMGEDEKLID
        QQYFSVAEVMY+LQQV SRRQQRY+DPVKVG KLYRRPGP FKQ  GHRVEA VKEE                     VEQVSNTC+ESKA GED KL D
Subjt:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEE---------------------VEQVSNTCEESKAMGEDEKLID

Query:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD
        +DS S ED KD HGKDQS+SK KC ENLEDNASNKESQVE TDDGCSSSHRDKELQSVQS+NGKQ AA TPRTFVANEMFDGKMVNVM+GLKL+EEL DD
Subjt:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD

Query:  AEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDS
        AEVSKL+SLVNDLRASGKRGQFQG       QTYVV KRPMKGHGREMIQLGFPIADAPHDD+ S+G+SKDRRIE IPSLLQD+IDRLVGEQVMT+KPDS
Subjt:  AEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDS

Query:  CIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALA
        CI+DFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKL LTPGTLL+VQG+SADFAKHAIPAIRKQRILVTLTKSQPK+A  A
Subjt:  CIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALA

Query:  DGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVP
        DGQRTS N+G FSSWGPPS RSPN RL PGQK YPTVPST VLPAPPIR QM PPNGIPPLIVPPVAPPMPFP PV IP GP AWP AHPRHPPPRLPVP
Subjt:  DGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVP

Query:  GTGVFLPPGSSSTPPL---QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKA-ERQECNGNMEGSESGKVAAEEEQQQQQQ
        GTGVFLPP  SS+ P    QQ  +S VETGSL+EKENG STKSDH++  SPGEKPEAK  +RQECNG+M+GS S KV  EE+Q QQQQ
Subjt:  GTGVFLPPGSSSTPPL---QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKA-ERQECNGNMEGSESGKVAAEEEQQQQQQ

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S4E0A0 uncharacterized protein LOC103495063 isoform X2 (e value: 0.0; % identity: 78.63)
Query:  MAMPSGNVGVSNKVPFQSGGGG----GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGV +KV FQSGGGG    GGGGEIH  HHPRPW+PDERDGFISWLRGEFAA+NA+ID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVSNKVPFQSGGGG----GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEE---------------------VEQVSNTCEESKAMGEDEKLID
        QQYFSVAEVMY+LQQV SRRQQRY+DPVKVGPKLYRRPGP FKQ  GHR EA VKEE                     VEQVSNTC+ESKA GEDEKL +
Subjt:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEE---------------------VEQVSNTCEESKAMGEDEKLID

Query:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD
        +DSGS ED KD HGKDQS+SK+KC ENLEDNA NK+SQVE  DDGCSSSHRDKELQSVQS+NGKQ+AA TPRTFVANEMFDGKMVNVM+GLKL+EEL DD
Subjt:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD

Query:  AEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDS
        AEVSKL+SLVNDLRASGKRGQFQG       QTYVVSKRP KGHGREMIQLGFPIADAP++D+ S  +SKDRRIEPIPSLLQD+IDRLVG+QVMT+KPDS
Subjt:  AEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDS

Query:  CIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALA
        CIIDFYNEGDHSQPHVWP WFGRPVGVLLLTECE+TFGRVIGTDHSGNYRGA+KL LTPG LL+VQG+SADFAKHAIPAIRKQRILVTLTKSQPK+A+ A
Subjt:  CIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALA

Query:  DGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVP
        DGQR+S NVG FS WGPPSARSPN RL PGQK Y  VPST VLP PPIR QM PPNGIPPLIVP VAPPMPF  PV IP GP  WP AH RHPPPRLPVP
Subjt:  DGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVP

Query:  GTGVFLPP-GSSSTP---PLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQ
        GTGVFLPP GSSS P   P QQL +S +E GSL+EKENG  TKSDH++   PGEKPEAK +RQECNG ++GS + KV  EE+QQQQ++
Subjt:  GTGVFLPP-GSSSTP---PLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQ

A0A5D3E038 Hydroxyproline-rich glycoprotein family protein, putative isoform 2 (e value: 0.0; % identity: 78.63)
Query:  MAMPSGNVGVSNKVPFQSGGGG----GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
        MA+PSGNVGV +KV FQSGGGG    GGGGEIH  HHPRPW+PDERDGFISWLRGEFAA+NA+ID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM
Subjt:  MAMPSGNVGVSNKVPFQSGGGG----GGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHM

Query:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEE---------------------VEQVSNTCEESKAMGEDEKLID
        QQYFSVAEVMY+LQQV SRRQQRY+DPVKVGPKLYRRPGP FKQ  GHR EA VKEE                     VEQVSNTC+ESKA GEDEKL +
Subjt:  QQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPHGHRVEA-VKEE---------------------VEQVSNTCEESKAMGEDEKLID

Query:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD
        +DSGS ED KD HGKDQS+SK+KC ENLEDNA NK+SQVE  DDGCSSSHRDKELQSVQS+NGKQ+AA TPRTFVANEMFDGKMVNVM+GLKL+EEL DD
Subjt:  EDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDD

Query:  AEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDS
        AEVSKL+SLVNDLRASGKRGQFQG       QTYVVSKRP KGHGREMIQLGFPIADAP++D+ S  +SKDRRIEPIPSLLQD+IDRLVG+QVMT+KPDS
Subjt:  AEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDS

Query:  CIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALA
        CIIDFYNEGDHSQPHVWP WFGRPVGVLLLTECE+TFGRVIGTDHSGNYRGA+KL LTPG LL+VQG+SADFAKHAIPAIRKQRILVTLTKSQPK+A+ A
Subjt:  CIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALA

Query:  DGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVP
        DGQR+S NVG FS WGPPSARSPN RL PGQK Y  VPST VLP PPIR QM PPNGIPPLIVP VAPPMPF  PV IP GP  WP AH RHPPPRLPVP
Subjt:  DGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVP

Query:  GTGVFLPP-GSSSTP---PLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQ
        GTGVFLPP GSSS P   P QQL +S +E GSL+EKENG  TKSDH++   PGEKPEAK +RQECNG ++GS + KV  EE+QQQQ++
Subjt:  GTGVFLPP-GSSSTP---PLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQ

A0A6J1D9K3 uncharacterized protein LOC111018294 (e value: 0.0; % identity: 95.45)
Query:  MAMPSGNVGVSNKVPFQSGGGGGGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYF
        MAMPSGNVGVSNKVPFQSGGGGG  GEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYF
Subjt:  MAMPSGNVGVSNKVPFQSGGGGGGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYF

Query:  SVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQPHGHRVEAVKEEV---------------------EQVSNTCEESKAMGEDEKLIDEDSGSV
        SVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQPHGHRVEAVKEEV                     EQVSNTCEESKAMGEDEKLIDEDSGSV
Subjt:  SVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQPHGHRVEAVKEEV---------------------EQVSNTCEESKAMGEDEKLIDEDSGSV

Query:  EDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKL
        EDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKL
Subjt:  EDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKL

Query:  VSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFY
        VSLVNDLRASGKRGQFQG       QTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFY
Subjt:  VSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFY

Query:  NEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTS
        NEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTS
Subjt:  NEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTS

Query:  SNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFL
        SNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPI SQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFL
Subjt:  SNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPRLPVPGTGVFL

Query:  PPGSSSTPPLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQSGGI
        PPGSSSTPPLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQSGGI
Subjt:  PPGSSSTPPLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQSGGI

A0A6J1FUM5 uncharacterized protein LOC111447430 (e value: 0.0; % identity: 78.3)
Query:  MAMPSGNVGVSNKVPFQSGGGGGG------GGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
        MAMPSGNVGV +KVPFQSGGGGGG      GGEIH QHHPRPW+PDERDGFISWLR EFAAANA+ID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPV+
Subjt:  MAMPSGNVGVSNKVPFQSGGGGGG------GGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL

Query:  HMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPH---GHRVEA-VKEE---------------------VEQVSNTCEESKAMGED
        HMQQYFSVA+V YSLQQV SRRQQRYIDPVKVGPK YRRPGP FKQ     GHR+E  VKEE                     VE VSNTCE+SKA GED
Subjt:  HMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPH---GHRVEA-VKEE---------------------VEQVSNTCEESKAMGED

Query:  EKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYE
        E L D+DSGS ED+KD HGKDQS+SK KC E+LEDNASNKES VE TDDGCSSS+R+KELQSVQ++NGKQ AA TPRTFVANEM DGKMVNVM+GLKL+E
Subjt:  EKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYE

Query:  ELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMT
        +  DDAEVSKL+SLVNDLRASGKRGQFQG       QTYVVSKRPMKGHGREMIQLGFPIAD PHDD+ S+G+SKDRRIE IPSLLQD+IDRLVGEQ+M+
Subjt:  ELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMT

Query:  LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPK
        +KPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNY+GAMKL L PGTLL+V+G+SADFAKHAIPAIRKQRILVTLTKSQPK
Subjt:  LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPK

Query:  KAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPP
        +A  +DGQRTS NVGPFSSWGPPS RSPN RL PG KHYP+VPST VLPAPPIR QMPPPNGIPPLIV PVA PMPFP PV IP GPP WP AHPRHPPP
Subjt:  KAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPP

Query:  RLPVPGTGVFLPPGSSSTPPL------QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQS
        RLPVPGTGVFLPP  SS+ P       QQL +STVETGSL+EKENG STKSDH+T A  GEK EAK +RQECNG    SES KV  EE+Q +Q+QS
Subjt:  RLPVPGTGVFLPPGSSSTPPL------QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQS

A0A6J1J9C0 uncharacterized protein LOC111484609 (e value: 0.0; % identity: 78.79)
Query:  MAMPSGNVGVSNKVPFQSGGGGG-----GGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLH
        MAMPSGNVGV +KV FQSGGGGG     GGGEIH QHHPRPW+PDERDGFISWLR EFAAANA+ID+LCHHLRAVGEPGEYDVVIGCIQQRRCNWTPV+H
Subjt:  MAMPSGNVGVSNKVPFQSGGGGG-----GGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLH

Query:  MQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPH----GHRVEA-VKEE---------------------VEQVSNTCEESKAMGED
        MQQYFSVA+V YSLQQV SRRQQRYIDPVKVGPK YRRPGP FKQ      GHR+EA VKEE                     VEQVSNTCEESKA GED
Subjt:  MQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGP-FKQPH----GHRVEA-VKEE---------------------VEQVSNTCEESKAMGED

Query:  EKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYE
        E L D+DSGS ED+KD HGKDQ +SK KC E+LEDNASNKES VE TDDGCSSS+R+KELQSVQS+NGKQ AA TPRTFVANEM DGKMVNVM+GLKL+E
Subjt:  EKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYE

Query:  ELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMT
        +  DDAEVSKL+SLVNDLRASGKRGQFQG        TYVVSKRPMKGHGREMIQLGFPIAD PHDD+ S+G+SKDRRIE IPSLLQD+IDRLVGEQ+M+
Subjt:  ELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMT

Query:  LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPK
        +KPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNY+GAMKL L PGTLL+V+G+SADFAKHAIPAIRKQRILVTLTKSQPK
Subjt:  LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPK

Query:  KAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPP
        +A  +DGQRTS NVGPF+SWGPPS RSPN RL PGQKHY +VPST VLPAPPIR QMPPPNGIPPLIV PVA PMPFP PV IP GPP WP AHPRHPPP
Subjt:  KAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVSIPPGPPAWPAAHPRHPPP

Query:  RLPVPGTGVFLPPGSSSTPPL---QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQS
        RLPVPGTGVFLPP  SS+ P    QQL +STVETGSL+EKENG STKSDH+  AS GEK EAK +RQECNG    SES KV  EE+Q+QQ+QS
Subjt:  RLPVPGTGVFLPPGSSSTPPL---QQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQS

SwissProt top hits (columns: e value, % identity, alignment)
Q9SL49 RNA demethylase ALKBH9B (e value: 1.1e-31; % identity: 30.29)
Query:  ENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGK
        E  E+    ++S  +  D     +    +L   Q  N +       + F+  E   GK+VNV++GL+L+  +F   E  ++V  V  L+  G+RG+ +  
Subjt:  ENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGK

Query:  FQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVM--TLKPDSCIIDFYNEGDHSQPHVWPPWFGR
              +T+    + M+G GRE IQ G     AP       GI +   ++P+P L + +I +L+   V+  T  PDSCI++ Y+EGD   PH+    F R
Subjt:  FQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVM--TLKPDSCIIDFYNEGDHSQPHVWPPWFGR

Query:  P-VGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKK
        P   +  L+EC++ FG  +  +  G++ G+  +PL  G++L++ G  AD AKH +PA+  +RI +T  K    K
Subjt:  P-VGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKK

Q9ZT92 RNA demethylase ALKBH10B (e value: 3.3e-60; % identity: 33.16)
Query:  RDGFISWLRGEFAAANAIIDSLCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFK
        +D  ISW RGEFAAANAIID++C HLR   E     EY+ V   I +RR NW PVL MQ+Y S+AEV   LQ+VA+++ +                   K
Subjt:  RDGFISWLRGEFAAANAIIDSLCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFK

Query:  QPHGHRVEAVKEEVEQVSNTCEE--SKAMGEDEKLIDED-SGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSS----SHRDKELQSV
        Q      E  +E++++V  T EE   K     EK+ + D +G VED +D       DS +    ++ D+ S+++    +  D        SH D + +S 
Subjt:  QPHGHRVEAVKEEVEQVSNTCEE--SKAMGEDEKLIDED-SGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSS----SHRDKELQSV

Query:  QSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADA
        + +  K         F A E   G  VNV++GLKLYEEL  + E+SKL+  V +LR +G  G+  G       +++++  + +KG+ RE+IQLG PI   
Subjt:  QSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADA

Query:  PHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMT--LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLP
           DE S   +    IEPIP LL+ VID  V  +++    +P+ C+I+F+ EG++SQP + PP   +P+  L+L+E  M +GR++ +D+ GN+RG + L 
Subjt:  PHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMT--LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLP

Query:  LTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPN
        L  G+LL+++G SAD A+H +   + +R+ +T  + +P          +  N G  + W  P   +P   L         +P   VL  PP+    PPP 
Subjt:  LTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPN

Query:  GIPPLIVP-PVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPR------LPVPGTGVFLPPGSSSTPPL
         + P+I+P P          V +P         H +H PPR      LP+P      P G S++ P+
Subjt:  GIPPLIVP-PVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPR------LPVPGTGVFLPPGSSSTPPL

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G14710.1 hydroxyproline-rich glycoprotein family protein (e value: 2.7e-134; % identity: 49.07)
Query:  PRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQ-----QRYIDPVKVGP
        P  W PDERDGFISWLR EFAAANAIIDSLC HL+AVG+  EY+ VIG I  RR  W+ VL MQQ+F VA+V Y+LQQ+A +RQ     QR+ +  +VG 
Subjt:  PRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQ-----QRYIDPVKVGP

Query:  KLYRRPGP-FKQPHG-----HRVEAVKEEVEQVSNTCEESKAMGEDEKLIDEDS--GSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELT-DDGC
           RR GP F + HG        +++       +    +     E+ KL  +       E+K+D   K +SDSK      +E      E+Q E+  +  C
Subjt:  KLYRRPGP-FKQPHG-----HRVEAVKEEVEQVSNTCEESKAMGEDEKLIDEDS--GSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELT-DDGC

Query:  SSSHRDKEL--QSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGH
        +S  +D  L  +  Q  N K+  A+  +TFV  EM+D KMVNV+EGLKLY+++ D  EVS+LVSLV +LR +G+RGQ Q       S+ YV  KRP +GH
Subjt:  SSSHRDKEL--QSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGH

Query:  GREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTD
        GREMIQLG PIAD P DD++     KDRRIEPIPS L D+I+RLV +Q++ +KPD+CIIDF++EGDHSQPH++ PWFGRP+ VL L+EC+ TFGRVI ++
Subjt:  GREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTD

Query:  HSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNH--RLPPG-QKHYPTV-PST
        + G+Y+G++KL LTPG++LLV+G+SA+ AK+AI A RKQRIL++  KS+P+                 S+WGPP +RSPN   R P G  KHYP V PST
Subjt:  HSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNH--RLPPG-QKHYPTV-PST

Query:  CVLPAPPIRSQMPPPNGIPPLIV---PPVAPPMPFPAPVSIPPGPPAWP--AAHPRH---PPPRLPVPGTGVFLPPGSSSTPPLQQLASST
         VLP P   S  PP   + P+ +   PP+A PMPFP    +P GPP WP    HPRH   P PR+P+PGTGVFLPPGS+     Q+LA ++
Subjt:  CVLPAPPIRSQMPPPNGIPPLIV---PPVAPPMPFPAPVSIPPGPPAWP--AAHPRH---PPPRLPVPGTGVFLPPGSSSTPPLQQLASST

AT1G14710.2 hydroxyproline-rich glycoprotein family protein (e value: 2.7e-134; % identity: 49.07)
Query:  PRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQ-----QRYIDPVKVGP
        P  W PDERDGFISWLR EFAAANAIIDSLC HL+AVG+  EY+ VIG I  RR  W+ VL MQQ+F VA+V Y+LQQ+A +RQ     QR+ +  +VG 
Subjt:  PRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQ-----QRYIDPVKVGP

Query:  KLYRRPGP-FKQPHG-----HRVEAVKEEVEQVSNTCEESKAMGEDEKLIDEDS--GSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELT-DDGC
           RR GP F + HG        +++       +    +     E+ KL  +       E+K+D   K +SDSK      +E      E+Q E+  +  C
Subjt:  KLYRRPGP-FKQPHG-----HRVEAVKEEVEQVSNTCEESKAMGEDEKLIDEDS--GSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELT-DDGC

Query:  SSSHRDKEL--QSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGH
        +S  +D  L  +  Q  N K+  A+  +TFV  EM+D KMVNV+EGLKLY+++ D  EVS+LVSLV +LR +G+RGQ Q       S+ YV  KRP +GH
Subjt:  SSSHRDKEL--QSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGH

Query:  GREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTD
        GREMIQLG PIAD P DD++     KDRRIEPIPS L D+I+RLV +Q++ +KPD+CIIDF++EGDHSQPH++ PWFGRP+ VL L+EC+ TFGRVI ++
Subjt:  GREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTD

Query:  HSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNH--RLPPG-QKHYPTV-PST
        + G+Y+G++KL LTPG++LLV+G+SA+ AK+AI A RKQRIL++  KS+P+                 S+WGPP +RSPN   R P G  KHYP V PST
Subjt:  HSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNH--RLPPG-QKHYPTV-PST

Query:  CVLPAPPIRSQMPPPNGIPPLIV---PPVAPPMPFPAPVSIPPGPPAWP--AAHPRH---PPPRLPVPGTGVFLPPGSSSTPPLQQLASST
         VLP P   S  PP   + P+ +   PP+A PMPFP    +P GPP WP    HPRH   P PR+P+PGTGVFLPPGS+     Q+LA ++
Subjt:  CVLPAPPIRSQMPPPNGIPPLIV---PPVAPPMPFPAPVSIPPGPPAWP--AAHPRH---PPPRLPVPGTGVFLPPGSSSTPPLQQLASST

AT2G17970.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 7.8e-33; % identity: 30.29)
Query:  ENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGK
        E  E+    ++S  +  D     +    +L   Q  N +       + F+  E   GK+VNV++GL+L+  +F   E  ++V  V  L+  G+RG+ +  
Subjt:  ENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGK

Query:  FQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVM--TLKPDSCIIDFYNEGDHSQPHVWPPWFGR
              +T+    + M+G GRE IQ G     AP       GI +   ++P+P L + +I +L+   V+  T  PDSCI++ Y+EGD   PH+    F R
Subjt:  FQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVM--TLKPDSCIIDFYNEGDHSQPHVWPPWFGR

Query:  P-VGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKK
        P   +  L+EC++ FG  +  +  G++ G+  +PL  G++L++ G  AD AKH +PA+  +RI +T  K    K
Subjt:  P-VGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKK

AT2G48080.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein    e value: 5.0e-56    %identity: 30.02
Query:  RDGFISWLRGEFAAANAIIDSLCHHL-RAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQP
        +D  ++W RGEFAAANAIID+LC HL +A G   +Y+ V+  + +RR NW PVL MQ+Y S+++V   LQQ  ++    ++D                  
Subjt:  RDGFISWLRGEFAAANAIIDSLCHHL-RAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFKQP

Query:  HGHRVEAVKEEVEQVSNTCEESKAMGEDEKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNA
                                                   D H  D   S      ++ D  S +E  + +        H D E +S  +   KQ+ 
Subjt:  HGHRVEAVKEEVEQVSNTCEESKAMGEDEKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSSSHRDKELQSVQSRNGKQNA

Query:  AATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAG
            + F A E   G   NV++GLKLY+++F   ++SKL+  +N LR +G+  Q  G       +T+V+  +  KG  RE++QLG PI     D+ +   
Subjt:  AATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADAPHDDETSAG

Query:  ISKDRRIEPIPSLLQDVIDRLVGEQVMT--LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLV
              +EPIP+L+Q VID L+  +++    +P+ C+I+F++E +HSQP   PP   +P+  L+L+E  M FG  +G D+ GN+RG++ LPL  G+LL++
Subjt:  ISKDRRIEPIPSLLQDVIDRLVGEQVMT--LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLV

Query:  QGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALAD-------------------GQRTSSNVGPFSSWGPPSARSPNHRLPP
        +G SAD A+H +     +R+ +T  K +P    +                      +R  +  G F  W PP +R P   LPP
Subjt:  QGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALAD-------------------GQRTSSNVGPFSSWGPPSARSPNHRLPP

AT4G02940.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein    e value: 2.3e-61    %identity: 33.16
Query:  RDGFISWLRGEFAAANAIIDSLCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFK
        +D  ISW RGEFAAANAIID++C HLR   E     EY+ V   I +RR NW PVL MQ+Y S+AEV   LQ+VA+++ +                   K
Subjt:  RDGFISWLRGEFAAANAIIDSLCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQQVASRRQQRYIDPVKVGPKLYRRPGPFK

Query:  QPHGHRVEAVKEEVEQVSNTCEE--SKAMGEDEKLIDED-SGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSS----SHRDKELQSV
        Q      E  +E++++V  T EE   K     EK+ + D +G VED +D       DS +    ++ D+ S+++    +  D        SH D + +S 
Subjt:  QPHGHRVEAVKEEVEQVSNTCEE--SKAMGEDEKLIDED-SGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCSS----SHRDKELQSV

Query:  QSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADA
        + +  K         F A E   G  VNV++GLKLYEEL  + E+SKL+  V +LR +G  G+  G       +++++  + +KG+ RE+IQLG PI   
Subjt:  QSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIADA

Query:  PHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMT--LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLP
           DE S   +    IEPIP LL+ VID  V  +++    +P+ C+I+F+ EG++SQP + PP   +P+  L+L+E  M +GR++ +D+ GN+RG + L 
Subjt:  PHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMT--LKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLP

Query:  LTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPN
        L  G+LL+++G SAD A+H +   + +R+ +T  + +P          +  N G  + W  P   +P   L         +P   VL  PP+    PPP 
Subjt:  LTPGTLLLVQGRSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPN

Query:  GIPPLIVP-PVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPR------LPVPGTGVFLPPGSSSTPPL
         + P+I+P P          V +P         H +H PPR      LP+P      P G S++ P+
Subjt:  GIPPLIVP-PVAPPMPFPAPVSIPPGPPAWPAAHPRHPPPR------LPVPGTGVFLPPGSSSTPPL


Sequences
CDS sequence
ATGGCAATGCCATCGGGAAATGTGGGTGTATCGAATAAAGTTCCGTTTCAGAGCGGTGGCGGAGGTGGTGGCGGTGGCGAGATCCATCATCAGCATCATCCACGGCCGTG
GTATCCGGATGAGCGTGATGGGTTTATCTCATGGTTGCGAGGGGAATTTGCTGCAGCCAATGCGATCATTGATTCCCTTTGCCATCATTTGCGCGCCGTGGGAGAGCCCG
GGGAGTATGATGTGGTTATTGGGTGTATACAACAACGGCGGTGTAATTGGACGCCCGTGCTTCATATGCAGCAGTACTTTTCAGTGGCGGAAGTGATGTATTCCCTTCAA
CAGGTCGCCTCGAGGAGGCAGCAGAGATATATTGATCCTGTGAAAGTGGGGCCGAAGTTGTATAGGAGACCTGGGCCGTTTAAGCAGCCCCATGGTCATCGTGTGGAAGC
AGTCAAAGAAGAGGTGGAGCAAGTAAGTAATACATGTGAGGAAAGTAAGGCAATGGGGGAGGATGAGAAATTGATAGATGAAGATTCAGGGTCAGTAGAGGATAAAAAAG
ATATTCACGGGAAGGACCAAAGTGATAGCAAATCGAAGTGTGGGGAAAATTTAGAAGACAATGCAAGTAATAAAGAATCTCAAGTTGAACTTACTGATGATGGATGTTCT
TCAAGTCATAGAGATAAGGAGTTGCAATCTGTTCAAAGCCGGAATGGAAAGCAGAATGCTGCTGCGACACCAAGAACCTTTGTTGCGAATGAGATGTTTGATGGAAAGAT
GGTTAATGTGATGGAAGGATTGAAACTGTATGAAGAATTATTTGATGATGCTGAGGTTTCAAAGCTTGTTTCATTGGTAAATGATTTGAGAGCTTCAGGGAAGAGGGGGC
AATTTCAAGGCAAGTTTCAGTTCATTTTAAGTCAGACATATGTGGTCTCAAAAAGACCTATGAAGGGACATGGGAGAGAGATGATCCAACTAGGCTTTCCCATTGCAGAT
GCGCCTCATGATGATGAAACTTCTGCAGGGATCTCAAAAGATAGAAGAATAGAACCAATCCCCTCCTTGCTTCAAGATGTCATTGATCGCTTGGTTGGTGAACAGGTGAT
GACATTGAAACCAGATTCCTGCATCATTGACTTTTATAATGAGGGAGATCATTCTCAGCCTCATGTCTGGCCACCATGGTTTGGGAGACCTGTCGGTGTTCTCCTTTTGA
CTGAATGCGAAATGACCTTTGGTAGAGTAATTGGAACAGACCATTCTGGCAACTATAGAGGGGCTATGAAGCTGCCTCTTACACCCGGAACCCTTCTTTTGGTGCAAGGA
AGATCTGCAGATTTTGCCAAGCATGCGATACCTGCTATCCGCAAGCAACGTATACTTGTTACCTTGACCAAGTCACAGCCAAAGAAAGCTGCACTAGCTGATGGGCAACG
CACATCATCGAACGTAGGTCCGTTTTCCAGTTGGGGCCCTCCATCTGCTCGATCACCAAACCATCGTCTTCCCCCTGGACAGAAGCATTATCCCACAGTTCCATCAACTT
GCGTGTTACCCGCACCACCCATTCGCTCCCAAATGCCCCCACCAAATGGCATCCCGCCCTTAATTGTCCCTCCTGTGGCACCACCTATGCCGTTCCCTGCTCCCGTGTCG
ATCCCACCTGGTCCACCCGCTTGGCCTGCTGCACACCCAAGGCATCCTCCGCCCCGCCTACCTGTTCCAGGCACTGGAGTATTCCTCCCTCCAGGATCTTCCAGCACTCC
ACCCCTGCAACAGTTGGCGAGCTCCACGGTCGAGACAGGCTCCCTCACAGAAAAGGAAAATGGTTGTTCGACGAAATCCGATCACAGTACAGCTGCTTCTCCAGGAGAGA
AACCTGAAGCAAAAGCAGAAAGACAAGAGTGCAATGGAAACATGGAAGGAAGTGAGAGTGGTAAAGTAGCAGCAGAGGAAGAGCAGCAGCAGCAGCAGCAAAGTGGAGGA
ATTTAG
mRNA sequence
GGAAGTTCAAAGAGAAATGGCCAAAGTACCCCTATGCTCAAGAATTTTAAGAGTATATCCCACTCTTCGGTCCCTCCAATACCCAGTTTTCCCCAGCAGTGGAGAGAGGG
AGGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGCCACCATTTAAGCCCCAATTTACAAGCAACCGTGTTCATTAAACCATCCCCAGTTCCCCACTTTCCCA
AACCCTAGATCTTATTTTATCTAAATTCTCCCAAATCCCCCACTTTTTGGTTTTTGTTTTTGCTTTTGTTTTTGTTTTCTACGTCACTGAATTTCACTGTGCGTTTTTCA
CAAGCACAAACTCTTACAACCAACAAAACCCCCTTTTCTCCTTTCCGATTTTTACTTCTTTAGACCCCATTTTGCAGATCTGATTCAATTGGGGTCCATCGTCTGCTGAA
TTCTTCTGTAGCTCCTCGTTTAATTCCCTGAATTCTGCTTCTTTAATTATCGTTCAGATTTGAGGATATACTGTATATGCGGTTTTAGTGTTGCTCTGGTTAGAGGATTC
TCATGGCAATGCCATCGGGAAATGTGGGTGTATCGAATAAAGTTCCGTTTCAGAGCGGTGGCGGAGGTGGTGGCGGTGGCGAGATCCATCATCAGCATCATCCACGGCCG
TGGTATCCGGATGAGCGTGATGGGTTTATCTCATGGTTGCGAGGGGAATTTGCTGCAGCCAATGCGATCATTGATTCCCTTTGCCATCATTTGCGCGCCGTGGGAGAGCC
CGGGGAGTATGATGTGGTTATTGGGTGTATACAACAACGGCGGTGTAATTGGACGCCCGTGCTTCATATGCAGCAGTACTTTTCAGTGGCGGAAGTGATGTATTCCCTTC
AACAGGTCGCCTCGAGGAGGCAGCAGAGATATATTGATCCTGTGAAAGTGGGGCCGAAGTTGTATAGGAGACCTGGGCCGTTTAAGCAGCCCCATGGTCATCGTGTGGAA
GCAGTCAAAGAAGAGGTGGAGCAAGTAAGTAATACATGTGAGGAAAGTAAGGCAATGGGGGAGGATGAGAAATTGATAGATGAAGATTCAGGGTCAGTAGAGGATAAAAA
AGATATTCACGGGAAGGACCAAAGTGATAGCAAATCGAAGTGTGGGGAAAATTTAGAAGACAATGCAAGTAATAAAGAATCTCAAGTTGAACTTACTGATGATGGATGTT
CTTCAAGTCATAGAGATAAGGAGTTGCAATCTGTTCAAAGCCGGAATGGAAAGCAGAATGCTGCTGCGACACCAAGAACCTTTGTTGCGAATGAGATGTTTGATGGAAAG
ATGGTTAATGTGATGGAAGGATTGAAACTGTATGAAGAATTATTTGATGATGCTGAGGTTTCAAAGCTTGTTTCATTGGTAAATGATTTGAGAGCTTCAGGGAAGAGGGG
GCAATTTCAAGGCAAGTTTCAGTTCATTTTAAGTCAGACATATGTGGTCTCAAAAAGACCTATGAAGGGACATGGGAGAGAGATGATCCAACTAGGCTTTCCCATTGCAG
ATGCGCCTCATGATGATGAAACTTCTGCAGGGATCTCAAAAGATAGAAGAATAGAACCAATCCCCTCCTTGCTTCAAGATGTCATTGATCGCTTGGTTGGTGAACAGGTG
ATGACATTGAAACCAGATTCCTGCATCATTGACTTTTATAATGAGGGAGATCATTCTCAGCCTCATGTCTGGCCACCATGGTTTGGGAGACCTGTCGGTGTTCTCCTTTT
GACTGAATGCGAAATGACCTTTGGTAGAGTAATTGGAACAGACCATTCTGGCAACTATAGAGGGGCTATGAAGCTGCCTCTTACACCCGGAACCCTTCTTTTGGTGCAAG
GAAGATCTGCAGATTTTGCCAAGCATGCGATACCTGCTATCCGCAAGCAACGTATACTTGTTACCTTGACCAAGTCACAGCCAAAGAAAGCTGCACTAGCTGATGGGCAA
CGCACATCATCGAACGTAGGTCCGTTTTCCAGTTGGGGCCCTCCATCTGCTCGATCACCAAACCATCGTCTTCCCCCTGGACAGAAGCATTATCCCACAGTTCCATCAAC
TTGCGTGTTACCCGCACCACCCATTCGCTCCCAAATGCCCCCACCAAATGGCATCCCGCCCTTAATTGTCCCTCCTGTGGCACCACCTATGCCGTTCCCTGCTCCCGTGT
CGATCCCACCTGGTCCACCCGCTTGGCCTGCTGCACACCCAAGGCATCCTCCGCCCCGCCTACCTGTTCCAGGCACTGGAGTATTCCTCCCTCCAGGATCTTCCAGCACT
CCACCCCTGCAACAGTTGGCGAGCTCCACGGTCGAGACAGGCTCCCTCACAGAAAAGGAAAATGGTTGTTCGACGAAATCCGATCACAGTACAGCTGCTTCTCCAGGAGA
GAAACCTGAAGCAAAAGCAGAAAGACAAGAGTGCAATGGAAACATGGAAGGAAGTGAGAGTGGTAAAGTAGCAGCAGAGGAAGAGCAGCAGCAGCAGCAGCAAAGTGGAG
GAATTTAGAGAGAGAAGCACATATAAATATATTGTTTTATGAAGAAAGAAAAGCAGGTAGGCTGCGGAGTTGAATGAGTTACAAGCAAAAATGTAAAGGGCGGCAACATT
CAAGACTGATGATTTTCACACACTAGACACTACACTACACTTCTACTGTCAAGGAGAACAGAACAAGGGGAAGTTCTCTTTGAAAATCCTTTTAGTTCATTCCTTTCTCT
TTTTTGTTGTCCTCAAAATTCTTGATTACATCCCAAAACTCATTCCTAATGCAACTTTGGATCTTTCTTATCACAGATTTTGACCACTCAGAATCAGTTTTTTGAATTAT
CAGGGAAAGTGTGGAATTCATCATCAGTTTTATACTTTGTGGTTTTTTGTTATTTTTTTTTTTAAATTATTT
Protein sequence
MAMPSGNVGVSNKVPFQSGGGGGGGGEIHHQHHPRPWYPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMYSLQ
QVASRRQQRYIDPVKVGPKLYRRPGPFKQPHGHRVEAVKEEVEQVSNTCEESKAMGEDEKLIDEDSGSVEDKKDIHGKDQSDSKSKCGENLEDNASNKESQVELTDDGCS
SSHRDKELQSVQSRNGKQNAAATPRTFVANEMFDGKMVNVMEGLKLYEELFDDAEVSKLVSLVNDLRASGKRGQFQGKFQFILSQTYVVSKRPMKGHGREMIQLGFPIAD
APHDDETSAGISKDRRIEPIPSLLQDVIDRLVGEQVMTLKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGTDHSGNYRGAMKLPLTPGTLLLVQG
RSADFAKHAIPAIRKQRILVTLTKSQPKKAALADGQRTSSNVGPFSSWGPPSARSPNHRLPPGQKHYPTVPSTCVLPAPPIRSQMPPPNGIPPLIVPPVAPPMPFPAPVS
IPPGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSTPPLQQLASSTVETGSLTEKENGCSTKSDHSTAASPGEKPEAKAERQECNGNMEGSESGKVAAEEEQQQQQQSGG
I
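
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the variable names are illustrative assumptions, and the test fragments are the first 39 and last 15 nucleotides of the CDS as printed above.

```python
# Standard genetic code (NCBI translation table 1), built from the
# conventional TCAG ordering of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Hypothetical helper for illustration; whitespace and line breaks
    are removed so the wrapped sequence can be pasted in directly.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 39 nt and last 15 nt of the CDS shown above.
cds_start = "ATGGCAATGCCATCGGGAAATGTGGGTGTATCGAATAAA"
cds_end = "AGTGGAGGAATTTAG"

print(translate(cds_start))
print(translate(cds_end))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick way to confirm that the two records were copied without truncation.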