; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg21990 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg21990
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionHydroxyproline-rich glycoprotein family protein, putative isoform 2
Genome locationCarg_Chr04:230738..234524
RNA-Seq ExpressionCarg21990
SyntenyCarg21990
Gene Ontology termsGO:0006402 - mRNA catabolic process (biological process)
GO:0070988 - demethylation (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0032451 - demethylase activity (molecular function)
InterPro domainsIPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
IPR044842 - RNA demethylase ALKBH9B/ALKBH10B-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599881.1 RNA demethylase ALKBH10B, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0093.18Show/hide
Query:  GILMAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWT
        GILMAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWT
Subjt:  GILMAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWT

Query:  PVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAM
        PVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNA 
Subjt:  PVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAM

Query:  GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLK
        GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLK
Subjt:  GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLK

Query:  LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPD
        LYEELLDDIEVS+LLSLVNDLRASGKRGQLQG                                         +IESIPSLLQDLIDCLVREQVMTVKPD
Subjt:  LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPD

Query:  SCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR
        SCIIDFYNE   GDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR
Subjt:  SCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR

Query:  AGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPR
        AGPADGQRTSL VGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPR
Subjt:  AGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPR

Query:  LPVPGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGENVEGQNAG
        LPVPGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGENVEGQNAG
Subjt:  LPVPGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGENVEGQNAG

Query:  GGEA
        GGEA
Subjt:  GGEA

KAG7030566.1 hypothetical protein SDJN02_04603 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MVQIFYCKCGFSVFLSLGILMAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPG
        MVQIFYCKCGFSVFLSLGILMAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPG
Subjt:  MVQIFYCKCGFSVFLSLGILMAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPG

Query:  EYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVG
        EYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVG
Subjt:  EYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVG

Query:  SRKVEQVSNTCEESNAMGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFV
        SRKVEQVSNTCEESNAMGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFV
Subjt:  SRKVEQVSNTCEESNAMGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFV

Query:  ANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGKIESIPSLLQDLIDCLVREQVMTVKPDSCIIDFYNEYSLGDHSQPHVWPPW
        ANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGKIESIPSLLQDLIDCLVREQVMTVKPDSCIIDFYNEYSLGDHSQPHVWPPW
Subjt:  ANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGKIESIPSLLQDLIDCLVREQVMTVKPDSCIIDFYNEYSLGDHSQPHVWPPW

Query:  FGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGPADGQRTSLTVGSYSSWGPPSA
        FGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGPADGQRTSLTVGSYSSWGPPSA
Subjt:  FGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGPADGQRTSLTVGSYSSWGPPSA

Query:  RSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSSPSPQQMP
        RSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSSPSPQQMP
Subjt:  RSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSSPSPQQMP

Query:  NSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGENVEGQNAGGGEA
        NSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGENVEGQNAGGGEA
Subjt:  NSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGENVEGQNAGGGEA

XP_022942436.1 uncharacterized protein LOC111447480 [Cucurbita moschata]0.0e+0089.73Show/hide
Query:  MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
        MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
Subjt:  MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL

Query:  HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGED
        HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHR EATVKEEMVTCAESCNGGNSS FVGSRKVEQVSNTC+ES A GED
Subjt:  HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGED

Query:  GKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYE
        GKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYE
Subjt:  GKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYE

Query:  ELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPDSCI
        ELLDDIEVSKLLSLVNDLRASGKRGQLQG                                         +IESIPSLLQDLIDCLVREQVMTVKPDSCI
Subjt:  ELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPDSCI

Query:  IDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGP
        IDFYNE   GDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQP RAGP
Subjt:  IDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGP

Query:  ADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPRLPV
        ADGQRTSL VGSYSSWGPPSARSPNAR CPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIP IMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPRLPV
Subjt:  ADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPRLPV

Query:  PGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGENVEGQNAGGGE
        PGTGVFLPPGSSS+PSPQQMPNSAVETSSLAEKENGPTE+DHNAGASPGEK             MDGSGSCKKTEEE+ KQQEEEEKGENVE QNAGGGE
Subjt:  PGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGENVEGQNAGGGE

Query:  A
        A
Subjt:  A

XP_022978263.1 uncharacterized protein LOC111478296 isoform X1 [Cucurbita maxima]0.0e+0088.53Show/hide
Query:  MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
        MAMPSGNVGVSDKVPFQS GGVAVS      GGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
Subjt:  MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL

Query:  HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQH---QQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAM
        HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQH   QQHGHR EATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNA 
Subjt:  HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQH---QQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAM

Query:  GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLK
        GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENL+DNASNKESQVEPTDDGCSSSQRDK LQSVQSRN +QYAATAPRTF ANEIFDGKTVNVMDGLK
Subjt:  GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLK

Query:  LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPD
        LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG                                         +IESIPSLLQDLIDCLV EQVMTVKPD
Subjt:  LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPD

Query:  SCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR
        SCIIDFYNE   GDHSQPHVWPPWFGRPVGVLLLTECEM+FGRV+GSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQP R
Subjt:  SCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR

Query:  AGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPP-VPIPTGPPAWPAAHPRHPPP
        AGPADGQRTSL +GSYSSWGPPSARSPNAR CPGQKHYPMGPSTGVLPVPPIRPQLPP NGIP IMVAPVAPPPMPFPP VPIPTGPPAWPAAHPRHPPP
Subjt:  AGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPP-VPIPTGPPAWPAAHPRHPPP

Query:  RLPVPGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAK-QQEEEEKGENVEGQN
        RLPVPGTGVFLPPG+SS+PSPQQMPNSAVETSSLAEKENGPTESDHN GASPGEKSE+KPQRQECNGSMDGSGSCKKTEEE+ K QQEEEEK ENVE QN
Subjt:  RLPVPGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAK-QQEEEEKGENVEGQN

Query:  AGGGEA
        AGGGEA
Subjt:  AGGGEA

XP_023542424.1 uncharacterized protein LOC111802330 [Cucurbita pepo subsp. pepo]0.0e+0090.75Show/hide
Query:  MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
        MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
Subjt:  MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL

Query:  HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGED
        HMQQYFSVAEVM ALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHR EATVKEEMVTCAESCNGGNSSSFVG RKVEQVSNTCEESNA GED
Subjt:  HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGED

Query:  GKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYE
        GKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKT+NVMDGLKLYE
Subjt:  GKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYE

Query:  ELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPDSCI
        ELLDDIEVSKLLSLVNDLRASGKRGQLQG                                         +IESIPSLLQDLIDCLVREQ MTVKPDSCI
Subjt:  ELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPDSCI

Query:  IDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGP
        IDFYNE   GDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGA TLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQP RAGP
Subjt:  IDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGP

Query:  ADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVA-PPPMPFPP-VPIPTGPPAWPAAHPRHPPPRL
        ADGQRTSL VGSYSSWGPPSARSPNAR CPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIP IMVAPVA PPPMPFPP VPIPTGPPAWPAAHPRHPPPRL
Subjt:  ADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVA-PPPMPFPP-VPIPTGPPAWPAAHPRHPPPRL

Query:  PVPGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGENVEGQNAGG
        PVPGTGVFLPPGSSS+PSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSE+KPQRQECNGSMDGSGSCKKTEEE+ KQQ+EEEK ENVE QNAGG
Subjt:  PVPGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGENVEGQNAGG

Query:  GEA
        GEA
Subjt:  GEA

TrEMBL top hitse value%identityAlignment
A0A6J1FRA9 uncharacterized protein LOC1114474800.0e+0089.73Show/hide
Query:  MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
        MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
Subjt:  MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL

Query:  HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGED
        HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHR EATVKEEMVTCAESCNGGNSS FVGSRKVEQVSNTC+ES A GED
Subjt:  HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGED

Query:  GKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYE
        GKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYE
Subjt:  GKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYE

Query:  ELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPDSCI
        ELLDDIEVSKLLSLVNDLRASGKRGQLQG                                         +IESIPSLLQDLIDCLVREQVMTVKPDSCI
Subjt:  ELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPDSCI

Query:  IDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGP
        IDFYNE   GDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQP RAGP
Subjt:  IDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGP

Query:  ADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPRLPV
        ADGQRTSL VGSYSSWGPPSARSPNAR CPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIP IMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPRLPV
Subjt:  ADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPRLPV

Query:  PGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGENVEGQNAGGGE
        PGTGVFLPPGSSS+PSPQQMPNSAVETSSLAEKENGPTE+DHNAGASPGEK             MDGSGSCKKTEEE+ KQQEEEEKGENVE QNAGGGE
Subjt:  PGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGENVEGQNAGGGE

Query:  A
        A
Subjt:  A

A0A6J1FUM5 uncharacterized protein LOC1114474302.1e-28675.07Show/hide
Query:  MAMPSGNVGVSDKVPFQS---GGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWT
        MAMPSGNVGV DKVPFQS   GGGVAVSG    GGGEIHQH PRPW+PDERDG ISW R EFAA+NA+IDALCHHLRAVGEPGEYDVVIGCIQQRRCNWT
Subjt:  MAMPSGNVGVSDKVPFQS---GGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWT

Query:  PVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAM
        PV+HMQQYFSVA+V ++LQQV SRRQQR++DP+KVG K +RRPGP FKQQ QQ GHR E TVKEEMVTCAESCNGGNSSSFVGSRKVE VSNTCE+S A 
Subjt:  PVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAM

Query:  GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLK
        GED  LNDKDSGSAED KDTHGKDQSNSKPKCAE+L+DNASNKES VEPTDDGCSSS R+KELQSVQ++NGKQYAAT PRTFVANE+ DGK VNVMDGLK
Subjt:  GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLK

Query:  LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPD
        L+E+ LDD EVSKLLSLVNDLRASGKRGQ QG                                         +IESIPSLLQDLID LV EQ+M+VKPD
Subjt:  LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPD

Query:  SCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR
        SCIIDFYNE   GDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIG+DHSGNY+GA  LSL PG+LLVV+GKSADFAKHAIPA+RKQRILVTLTKSQP R
Subjt:  SCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR

Query:  AGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPF-PPVPIPTGPPAWPAAHPRHPPP
        A P+DGQRTSL VG +SSWGPPS RSPN R  PG KHYP  PSTGVLP PPIRPQ+PPPNGIP ++V+PVA  PMPF PPVPIPTGPP WP AHPRHPPP
Subjt:  AGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPF-PPVPIPTGPPAWPAAHPRHPPP

Query:  RLPVPGTGVFLPPGSSSS------PSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGEN
        RLPVPGTGVFLPP  SSS      PSPQQ+PNS VET SL+EKENG T+SDHN GA  GEKSE+KPQRQECNGS       +K  EE  +QQ+++E+ EN
Subjt:  RLPVPGTGVFLPPGSSSS------PSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGEN

Query:  V-EGQNAGGG
        + E Q+AG G
Subjt:  V-EGQNAGGG

A0A6J1IMA3 uncharacterized protein LOC111478296 isoform X20.0e+0086.97Show/hide
Query:  MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
        MAMPSGNVGVSDKVPFQS GGVAVS      GGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
Subjt:  MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL

Query:  HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQH---QQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAM
        HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQH   QQHGHR EATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNA 
Subjt:  HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQH---QQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAM

Query:  GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLK
        GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENL+DNASNKESQVEPTDDGCSSSQRDK LQSVQSRN +QYAATAPRTF ANEIFDGKTVNVMDGLK
Subjt:  GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLK

Query:  LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPD
        LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG                                         +IESIPSLLQDLIDCLV EQVMTVKPD
Subjt:  LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPD

Query:  SCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR
        SCIIDFYNE   GDHSQPHVWPPWFGRPVGVLLLTECEM+FGRV+GSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQP R
Subjt:  SCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR

Query:  AGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPP-VPIPTGPPAWPAAHPRHPPP
        AGPADGQRTSL +GSYSSWGPPSARSPNAR CPGQKHYPMGPSTGVLPVPPIRPQLPP NGIP IMVAPVAPPPMPFPP VPIPTGPPAWPAAHPRHPPP
Subjt:  AGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPP-VPIPTGPPAWPAAHPRHPPP

Query:  RLPVPGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAK-QQEEEEKGENVEGQN
        RLPVPGTGVFLPPG+SS+PSPQQMPNSAVETSSLAEKENGPTESDHN GASPGEKS+           MDGSGSCKKTEEE+ K QQEEEEK ENVE QN
Subjt:  RLPVPGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAK-QQEEEEKGENVEGQN

Query:  AGGGEA
        AGGGEA
Subjt:  AGGGEA

A0A6J1ISJ2 uncharacterized protein LOC111478296 isoform X10.0e+0088.53Show/hide
Query:  MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
        MAMPSGNVGVSDKVPFQS GGVAVS      GGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL
Subjt:  MAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVL

Query:  HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQH---QQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAM
        HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQH   QQHGHR EATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNA 
Subjt:  HMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQH---QQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAM

Query:  GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLK
        GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENL+DNASNKESQVEPTDDGCSSSQRDK LQSVQSRN +QYAATAPRTF ANEIFDGKTVNVMDGLK
Subjt:  GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLK

Query:  LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPD
        LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG                                         +IESIPSLLQDLIDCLV EQVMTVKPD
Subjt:  LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPD

Query:  SCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR
        SCIIDFYNE   GDHSQPHVWPPWFGRPVGVLLLTECEM+FGRV+GSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQP R
Subjt:  SCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR

Query:  AGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPP-VPIPTGPPAWPAAHPRHPPP
        AGPADGQRTSL +GSYSSWGPPSARSPNAR CPGQKHYPMGPSTGVLPVPPIRPQLPP NGIP IMVAPVAPPPMPFPP VPIPTGPPAWPAAHPRHPPP
Subjt:  AGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPP-VPIPTGPPAWPAAHPRHPPP

Query:  RLPVPGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAK-QQEEEEKGENVEGQN
        RLPVPGTGVFLPPG+SS+PSPQQMPNSAVETSSLAEKENGPTESDHN GASPGEKSE+KPQRQECNGSMDGSGSCKKTEEE+ K QQEEEEK ENVE QN
Subjt:  RLPVPGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAK-QQEEEEKGENVEGQN

Query:  AGGGEA
        AGGGEA
Subjt:  AGGGEA

A0A6J1J9C0 uncharacterized protein LOC1114846096.4e-28876.2Show/hide
Query:  MAMPSGNVGVSDKVPFQS--GGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTP
        MAMPSGNVGV DKV FQS  GGGVAVSG    GGGEIHQH PRPW+PDERDG ISW R EFAA+NA+IDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTP
Subjt:  MAMPSGNVGVSDKVPFQS--GGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTP

Query:  VLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFK-QQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAM
        V+HMQQYFSVA+V ++LQQV SRRQQR++DP+KVG K +RRPGP FK QQ QQ GHR EATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEES A 
Subjt:  VLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFK-QQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAM

Query:  GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLK
        GED  LNDKDSGSAED KDTHGKDQ NSKPKCAE+L+DNASNKES VEPTDDGCSSS R+KELQSVQS+NGKQYAAT PRTFVANE+ DGK VNVMDGLK
Subjt:  GEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLK

Query:  LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPD
        L+E+ LDD EVSKLLSLVNDLRASGKRGQ QG                                         +IESIPSLLQDLID LV EQ+M+VKPD
Subjt:  LYEELLDDIEVSKLLSLVNDLRASGKRGQLQG-----------------------------------------KIESIPSLLQDLIDCLVREQVMTVKPD

Query:  SCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR
        SCIIDFYNE   GDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIG+DHSGNY+GA  LSLAPG+LLVV+GKSADFAKHAIPA+RKQRILVTLTKSQP R
Subjt:  SCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR

Query:  AGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPF-PPVPIPTGPPAWPAAHPRHPPP
        A P+DGQRTSL VG ++SWGPPS RSPN R  PGQKHY   PSTGVLP PPIRPQ+PPPNGIP ++V+PVA  PMPF PPVPIPTGPP WP AHPRHPPP
Subjt:  AGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPF-PPVPIPTGPPAWPAAHPRHPPP

Query:  RLPVPGTGVFL-PPGSSS--SPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGENVEG
        RLPVPGTGVFL PPGSSS  SP+PQQ+PNS VET SL+EKENG T+SDHNAGAS GEKSE+KPQRQECNGS       +K  EE  +QQE++E+ EN++ 
Subjt:  RLPVPGTGVFL-PPGSSS--SPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQQEEEEKGENVEG

Query:  QNAGGG
        Q+AG G
Subjt:  QNAGGG

SwissProt top hitse value%identityAlignment
Q9SL49 RNA demethylase ALKBH9B9.3e-1825.93Show/hide
Query:  ENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQL---
        E  ++    ++S  +  D     +    +L   Q  N +       + F+  E   GK VNV+DGL+L+  +   +E  +++  V  L+  G+RG+L   
Subjt:  ENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQL---

Query:  ---------QGK-----------------------------IESIPSLLQDLIDCLVREQVM--TVKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRP-VG
                 +GK                             ++ +P L + +I  L++  V+  T  PDSCI++ Y+E   GD   PH+    F RP   
Subjt:  ---------QGK-----------------------------IESIPSLLQDLIDCLVREQVM--TVKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRP-VG

Query:  VLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR
        +  L+EC++ FG  +  +  G++ G+ ++ L  GS+LV+ G  AD AKH +PA+  +RI +T  K   ++
Subjt:  VLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR

Q9ZT92 RNA demethylase ALKBH10B8.1e-4630.96Show/hide
Query:  RDGLISWFRGEFAASNAIIDALCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAF
        +D LISWFRGEFAA+NAIIDA+C HLR   E     EY+ V   I +RR NW PVL MQ+Y S+AEV   LQ+V +++ +                    
Subjt:  RDGLISWFRGEFAASNAIIDALCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAF

Query:  KQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGEDGKLND-KDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQ
        KQ+  +    AE  +KE + T  E         F G +  E   N   E   + +D   +D  DSGS +D+  T   D ++   +   +  ++   +  +
Subjt:  KQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGEDGKLND-KDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQ

Query:  VEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGK-------------
        ++P                              + F A E   G TVNV+ GLKLYEELL + E+SKLL  V +LR +G  G+L G+             
Subjt:  VEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGK-------------

Query:  ----------------------------IESIPSLLQDLIDCLVREQVMT--VKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRV
                                    IE IP LL+ +ID  V  +++    +P+ C+I+F+ E   G++SQP + PP   +P+  L+L+E  M +GR+
Subjt:  ----------------------------IESIPSLLQDLIDCLVREQVMT--VKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRV

Query:  IGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPST
        + SD+ GN+RG  TLSL  GSLLV++G SAD A+H +   + +R+ +T  + +P+         +    G  + W  P   +P            M P  
Subjt:  IGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPST

Query:  GVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSSPS
        GVL  PP+    PPP   P I+ +P          V +P         H +H PPR       + LPP +SSSP+
Subjt:  GVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSSPS

Arabidopsis top hitse value%identityAlignment
AT1G14710.1 hydroxyproline-rich glycoprotein family protein1.3e-10442.03Show/hide
Query:  PRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQ-----QRFVDPMKVGS
        P  W PDERDG ISW R EFAA+NAIID+LC HL+AVG+  EY+ VIG I  RR  W+ VL MQQ+F VA+V + LQQ+  +RQ     QR  +  +VG 
Subjt:  PRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQ-----QRFVDPMKVGS

Query:  KLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLK
           RR GP F + H   G    A   + M     + NG NS       + +  S+                K    AE+ +D   K +S+SK    E   
Subjt:  KLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLK

Query:  DNASNKESQVEPTDDGCSSSQRDKEL--QSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQG---
        + +  +E  V+  +  C+S  +D  L  +  Q  N K+  A+  +TFV  E++D K VNV++GLKLY+++LD  EVS+L+SLV +LR +G+RGQLQ    
Subjt:  DNASNKESQVEPTDDGCSSSQRDKEL--QSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQG---

Query:  ----------------------------------KIESIPSLLQDLIDCLVREQVMTVKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEM
                                          +IE IPS L D+I+ LV +Q++ VKPD+CIIDF++E   GDHSQPH++ PWFGRP+ VL L+EC+ 
Subjt:  ----------------------------------KIESIPSLLQDLIDCLVREQVMTVKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEM

Query:  TFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGPADGQRTSLTVGSYSSWGPPSARSPN--ARSCPG-QK
        TFGRVI S++ G+Y+G+  LSL PGS+L+V+GKSA+ AK+AI A RKQRIL++  KS+P                  S+WGPP +RSPN   R   G  K
Subjt:  TFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGPADGQRTSLTVGSYSSWGPPSARSPN--ARSCPG-QK

Query:  HYPMG-PSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPP---PMPFPPVPIPTGPPAWP--AAHPRH---PPPRLPVPGTGVFLPPGSSSSPSPQQMPNSA
        HYP+  PSTGVLP P  R    PPNG  Q +  P +PP   PMPFP   +PTGPP WP    HPRH   P PR+P+PGTGVFLPPGS+            
Subjt:  HYPMG-PSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPP---PMPFPPVPIPTGPPAWP--AAHPRH---PPPRLPVPGTGVFLPPGSSSSPSPQQMPNSA

Query:  VETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGS
             LA+  NG TE   +  A   E++ +     EC+GS
Subjt:  VETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGS

AT1G14710.2 hydroxyproline-rich glycoprotein family protein1.3e-10442.03Show/hide
Query:  PRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQ-----QRFVDPMKVGS
        P  W PDERDG ISW R EFAA+NAIID+LC HL+AVG+  EY+ VIG I  RR  W+ VL MQQ+F VA+V + LQQ+  +RQ     QR  +  +VG 
Subjt:  PRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQ-----QRFVDPMKVGS

Query:  KLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLK
           RR GP F + H   G    A   + M     + NG NS       + +  S+                K    AE+ +D   K +S+SK    E   
Subjt:  KLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLK

Query:  DNASNKESQVEPTDDGCSSSQRDKEL--QSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQG---
        + +  +E  V+  +  C+S  +D  L  +  Q  N K+  A+  +TFV  E++D K VNV++GLKLY+++LD  EVS+L+SLV +LR +G+RGQLQ    
Subjt:  DNASNKESQVEPTDDGCSSSQRDKEL--QSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQG---

Query:  ----------------------------------KIESIPSLLQDLIDCLVREQVMTVKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEM
                                          +IE IPS L D+I+ LV +Q++ VKPD+CIIDF++E   GDHSQPH++ PWFGRP+ VL L+EC+ 
Subjt:  ----------------------------------KIESIPSLLQDLIDCLVREQVMTVKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEM

Query:  TFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGPADGQRTSLTVGSYSSWGPPSARSPN--ARSCPG-QK
        TFGRVI S++ G+Y+G+  LSL PGS+L+V+GKSA+ AK+AI A RKQRIL++  KS+P                  S+WGPP +RSPN   R   G  K
Subjt:  TFGRVIGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGPADGQRTSLTVGSYSSWGPPSARSPN--ARSCPG-QK

Query:  HYPMG-PSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPP---PMPFPPVPIPTGPPAWP--AAHPRH---PPPRLPVPGTGVFLPPGSSSSPSPQQMPNSA
        HYP+  PSTGVLP P  R    PPNG  Q +  P +PP   PMPFP   +PTGPP WP    HPRH   P PR+P+PGTGVFLPPGS+            
Subjt:  HYPMG-PSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPP---PMPFPPVPIPTGPPAWP--AAHPRH---PPPRLPVPGTGVFLPPGSSSSPSPQQMPNSA

Query:  VETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGS
             LA+  NG TE   +  A   E++ +     EC+GS
Subjt:  VETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGS

AT1G48980.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.1e-1927.38Show/hide
Query:  LKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQL-----
        L+D  S +E   + + +   SS  + +L   Q  + +       R FV  E  +G+ VN+++GL+L+ E+ +  E  +++  V +L+   ++G+L     
Subjt:  LKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQL-----

Query:  -QGK-----------------------------IESIPSLLQDLIDCLVREQVM--TVKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRP-VGVLLLTECE
         QGK                             ++ +P L + +I  LV+  V+  T  PD C+++ Y+E   GD   PH+    F RP   V  L+EC 
Subjt:  -QGK-----------------------------IESIPSLLQDLIDCLVREQVM--TVKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRP-VGVLLLTECE

Query:  MTFGRVIGSDHSGNYRGAK-TLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR
        + FG  +  + +G Y G   +L L  GS+LV+ G  AD AKH +P +  +RI +T  K   ++
Subjt:  MTFGRVIGSDHSGNYRGAK-TLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNR

AT2G48080.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein7.3e-4227.49Show/hide
Query:  RDGLISWFRGEFAASNAIIDALCHHL-RAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQ
        +D +++WFRGEFAA+NAIIDALC HL +A G   +Y+ V+  + +RR NW PVL MQ+Y S+++V   LQQ  ++     +D                  
Subjt:  RDGLISWFRGEFAASNAIIDALCHHL-RAVGEPGEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQ

Query:  QHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEP
         H      ++ T            +G       GSR+ E +S  C+                                                      
Subjt:  QHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGEDGKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEP

Query:  TDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGK----------------
         +D C S       QS              + F A E   G T NV+ GLKLY+++    ++SKLL  +N LR +G+  QL G+                
Subjt:  TDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGK----------------

Query:  ----------------IESIPSLLQDLIDCLVREQVMT--VKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGA
                        +E IP+L+Q +ID L++ +++    +P+ C+I+F++E    +HSQP   PP   +P+  L+L+E  M FG  +G D+ GN+RG+
Subjt:  ----------------IESIPSLLQDLIDCLVREQVMT--VKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGA

Query:  KTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRA--------------------GPADGQRTSLTVGSYSSWGPPSARSP
         TL L  GSLLV++G SAD A+H +     +R+ +T  K +P+                       PA  +R     G +  W PP +R P
Subjt:  KTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRA--------------------GPADGQRTSLTVGSYSSWGPPSARSP

AT4G02940.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein5.7e-4730.96Show/hide
Query:  RDGLISWFRGEFAASNAIIDALCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAF
        +D LISWFRGEFAA+NAIIDA+C HLR   E     EY+ V   I +RR NW PVL MQ+Y S+AEV   LQ+V +++ +                    
Subjt:  RDGLISWFRGEFAASNAIIDALCHHLRAVGEP---GEYDVVIGCIQQRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAF

Query:  KQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGEDGKLND-KDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQ
        KQ+  +    AE  +KE + T  E         F G +  E   N   E   + +D   +D  DSGS +D+  T   D ++   +   +  ++   +  +
Subjt:  KQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGEDGKLND-KDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQ

Query:  VEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGK-------------
        ++P                              + F A E   G TVNV+ GLKLYEELL + E+SKLL  V +LR +G  G+L G+             
Subjt:  VEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSKLLSLVNDLRASGKRGQLQGK-------------

Query:  ----------------------------IESIPSLLQDLIDCLVREQVMT--VKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRV
                                    IE IP LL+ +ID  V  +++    +P+ C+I+F+ E   G++SQP + PP   +P+  L+L+E  M +GR+
Subjt:  ----------------------------IESIPSLLQDLIDCLVREQVMT--VKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRV

Query:  IGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPST
        + SD+ GN+RG  TLSL  GSLLV++G SAD A+H +   + +R+ +T  + +P+         +    G  + W  P   +P            M P  
Subjt:  IGSDHSGNYRGAKTLSLAPGSLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPST

Query:  GVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSSPS
        GVL  PP+    PPP   P I+ +P          V +P         H +H PPR       + LPP +SSSP+
Subjt:  GVLPVPPIRPQLPPPNGIPQIMVAPVAPPPMPFPPVPIPTGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSSPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAGATATTCTACTGTAAATGCGGGTTTAGTGTTTTTCTCTCGTTAGGGATTCTCATGGCAATGCCATCGGGAAATGTGGGTGTTTCCGATAAAGTTCCGTTTCA
GAGCGGTGGCGGAGTTGCGGTGAGCGGCGGTGGAGGCAATGGCGGTGGCGAGATCCATCAGCACCGCCCCCGTCCCTGGTATCCTGATGAGCGTGATGGGCTTATCTCAT
GGTTCCGAGGCGAATTTGCTGCTTCGAATGCGATCATTGATGCCCTTTGCCATCATTTGCGTGCTGTGGGAGAGCCTGGGGAATATGATGTTGTTATTGGGTGTATACAG
CAACGGCGGTGCAATTGGACGCCGGTGCTTCATATGCAGCAGTACTTTTCAGTGGCAGAAGTGATGTTTGCACTTCAGCAGGTCACCTCTAGAAGGCAGCAGAGGTTTGT
TGATCCTATGAAAGTGGGGTCGAAGTTGTTTAGGAGACCTGGGCCAGCATTTAAGCAGCAGCATCAGCAGCATGGCCATCGCGCTGAAGCCACAGTCAAGGAAGAGATGG
TCACTTGTGCAGAGTCTTGTAATGGTGGGAATTCTTCAAGTTTTGTAGGCTCTAGGAAGGTTGAGCAAGTAAGTAATACATGCGAGGAAAGTAATGCAATGGGGGAGGAT
GGAAAATTGAACGATAAAGATTCAGGGTCAGCTGAGGACATAAAAGATACTCATGGGAAGGACCAAAGTAATAGCAAACCCAAGTGTGCAGAAAATTTAAAAGACAATGC
AAGCAATAAAGAATCTCAAGTTGAACCTACTGATGATGGGTGTTCTTCAAGTCAAAGAGATAAGGAGCTGCAGTCTGTTCAAAGCCGGAATGGAAAGCAGTATGCTGCCA
CAGCCCCAAGAACCTTTGTTGCCAATGAGATATTTGATGGAAAGACGGTTAATGTGATGGATGGATTGAAATTGTATGAAGAATTATTGGATGATATTGAGGTTTCAAAG
CTTCTTTCGTTGGTGAATGATTTGAGGGCTTCTGGAAAGAGAGGGCAACTCCAAGGCAAAATAGAATCCATCCCCTCATTGCTTCAAGATCTCATTGATTGCTTGGTTCG
GGAGCAAGTGATGACAGTGAAACCAGATTCCTGCATCATTGACTTTTATAACGAGTATTCGTTAGGTGATCATTCTCAGCCTCATGTCTGGCCACCATGGTTTGGGAGGC
CTGTTGGTGTCCTCCTTTTGACTGAATGTGAAATGACCTTTGGTAGAGTAATTGGTTCAGACCATTCTGGCAATTATAGAGGGGCTAAGACATTGTCTCTTGCACCGGGG
AGCCTCCTTGTGGTGCAAGGAAAATCTGCAGATTTTGCTAAGCATGCAATTCCTGCTATGCGCAAGCAACGGATACTTGTTACCTTGACCAAATCACAACCAAATAGAGC
AGGACCAGCTGATGGGCAACGCACATCTTTGACTGTAGGATCATATTCCAGTTGGGGCCCTCCATCTGCTAGATCACCCAATGCTCGTTCTTGCCCGGGACAGAAGCATT
ACCCTATGGGTCCATCGACAGGCGTTCTACCTGTGCCACCCATTCGTCCCCAATTGCCACCACCAAATGGCATCCCACAAATAATGGTGGCTCCTGTAGCACCACCACCT
ATGCCTTTCCCTCCCGTGCCAATTCCAACTGGTCCACCTGCATGGCCTGCTGCTCATCCAAGGCATCCTCCGCCTCGTCTCCCTGTTCCTGGCACTGGAGTATTCCTTCC
TCCAGGTTCTTCCAGTTCTCCATCTCCTCAACAGATGCCAAACTCCGCAGTGGAGACGAGTTCCCTTGCAGAAAAGGAAAATGGTCCGACGGAATCTGATCACAATGCAG
GTGCTTCTCCAGGGGAAAAATCTGAATCAAAGCCTCAAAGACAAGAATGCAATGGAAGCATGGATGGAAGTGGGAGTTGTAAAAAGACGGAGGAAGAACGAGCAAAGCAG
CAGGAGGAGGAGGAGAAAGGTGAGAACGTAGAGGGCCAAAATGCAGGAGGTGGAGAAGCTTAA
mRNA sequenceShow/hide mRNA sequence
CCCATTTTGCAGATCTTATACAATTACGATCCATGGCGGCTAAATTCTTCTGTGCCTCGTTATTCAATTCCCAGAATCAAGCTTCTTAATTTATGGTTCAGATATTCTAC
TGTAAATGCGGGTTTAGTGTTTTTCTCTCGTTAGGGATTCTCATGGCAATGCCATCGGGAAATGTGGGTGTTTCCGATAAAGTTCCGTTTCAGAGCGGTGGCGGAGTTGC
GGTGAGCGGCGGTGGAGGCAATGGCGGTGGCGAGATCCATCAGCACCGCCCCCGTCCCTGGTATCCTGATGAGCGTGATGGGCTTATCTCATGGTTCCGAGGCGAATTTG
CTGCTTCGAATGCGATCATTGATGCCCTTTGCCATCATTTGCGTGCTGTGGGAGAGCCTGGGGAATATGATGTTGTTATTGGGTGTATACAGCAACGGCGGTGCAATTGG
ACGCCGGTGCTTCATATGCAGCAGTACTTTTCAGTGGCAGAAGTGATGTTTGCACTTCAGCAGGTCACCTCTAGAAGGCAGCAGAGGTTTGTTGATCCTATGAAAGTGGG
GTCGAAGTTGTTTAGGAGACCTGGGCCAGCATTTAAGCAGCAGCATCAGCAGCATGGCCATCGCGCTGAAGCCACAGTCAAGGAAGAGATGGTCACTTGTGCAGAGTCTT
GTAATGGTGGGAATTCTTCAAGTTTTGTAGGCTCTAGGAAGGTTGAGCAAGTAAGTAATACATGCGAGGAAAGTAATGCAATGGGGGAGGATGGAAAATTGAACGATAAA
GATTCAGGGTCAGCTGAGGACATAAAAGATACTCATGGGAAGGACCAAAGTAATAGCAAACCCAAGTGTGCAGAAAATTTAAAAGACAATGCAAGCAATAAAGAATCTCA
AGTTGAACCTACTGATGATGGGTGTTCTTCAAGTCAAAGAGATAAGGAGCTGCAGTCTGTTCAAAGCCGGAATGGAAAGCAGTATGCTGCCACAGCCCCAAGAACCTTTG
TTGCCAATGAGATATTTGATGGAAAGACGGTTAATGTGATGGATGGATTGAAATTGTATGAAGAATTATTGGATGATATTGAGGTTTCAAAGCTTCTTTCGTTGGTGAAT
GATTTGAGGGCTTCTGGAAAGAGAGGGCAACTCCAAGGCAAAATAGAATCCATCCCCTCATTGCTTCAAGATCTCATTGATTGCTTGGTTCGGGAGCAAGTGATGACAGT
GAAACCAGATTCCTGCATCATTGACTTTTATAACGAGTATTCGTTAGGTGATCATTCTCAGCCTCATGTCTGGCCACCATGGTTTGGGAGGCCTGTTGGTGTCCTCCTTT
TGACTGAATGTGAAATGACCTTTGGTAGAGTAATTGGTTCAGACCATTCTGGCAATTATAGAGGGGCTAAGACATTGTCTCTTGCACCGGGGAGCCTCCTTGTGGTGCAA
GGAAAATCTGCAGATTTTGCTAAGCATGCAATTCCTGCTATGCGCAAGCAACGGATACTTGTTACCTTGACCAAATCACAACCAAATAGAGCAGGACCAGCTGATGGGCA
ACGCACATCTTTGACTGTAGGATCATATTCCAGTTGGGGCCCTCCATCTGCTAGATCACCCAATGCTCGTTCTTGCCCGGGACAGAAGCATTACCCTATGGGTCCATCGA
CAGGCGTTCTACCTGTGCCACCCATTCGTCCCCAATTGCCACCACCAAATGGCATCCCACAAATAATGGTGGCTCCTGTAGCACCACCACCTATGCCTTTCCCTCCCGTG
CCAATTCCAACTGGTCCACCTGCATGGCCTGCTGCTCATCCAAGGCATCCTCCGCCTCGTCTCCCTGTTCCTGGCACTGGAGTATTCCTTCCTCCAGGTTCTTCCAGTTC
TCCATCTCCTCAACAGATGCCAAACTCCGCAGTGGAGACGAGTTCCCTTGCAGAAAAGGAAAATGGTCCGACGGAATCTGATCACAATGCAGGTGCTTCTCCAGGGGAAA
AATCTGAATCAAAGCCTCAAAGACAAGAATGCAATGGAAGCATGGATGGAAGTGGGAGTTGTAAAAAGACGGAGGAAGAACGAGCAAAGCAGCAGGAGGAGGAGGAGAAA
GGTGAGAACGTAGAGGGCCAAAATGCAGGAGGTGGAGAAGCTTAA
Protein sequenceShow/hide protein sequence
MVQIFYCKCGFSVFLSLGILMAMPSGNVGVSDKVPFQSGGGVAVSGGGGNGGGEIHQHRPRPWYPDERDGLISWFRGEFAASNAIIDALCHHLRAVGEPGEYDVVIGCIQ
QRRCNWTPVLHMQQYFSVAEVMFALQQVTSRRQQRFVDPMKVGSKLFRRPGPAFKQQHQQHGHRAEATVKEEMVTCAESCNGGNSSSFVGSRKVEQVSNTCEESNAMGED
GKLNDKDSGSAEDIKDTHGKDQSNSKPKCAENLKDNASNKESQVEPTDDGCSSSQRDKELQSVQSRNGKQYAATAPRTFVANEIFDGKTVNVMDGLKLYEELLDDIEVSK
LLSLVNDLRASGKRGQLQGKIESIPSLLQDLIDCLVREQVMTVKPDSCIIDFYNEYSLGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRVIGSDHSGNYRGAKTLSLAPG
SLLVVQGKSADFAKHAIPAMRKQRILVTLTKSQPNRAGPADGQRTSLTVGSYSSWGPPSARSPNARSCPGQKHYPMGPSTGVLPVPPIRPQLPPPNGIPQIMVAPVAPPP
MPFPPVPIPTGPPAWPAAHPRHPPPRLPVPGTGVFLPPGSSSSPSPQQMPNSAVETSSLAEKENGPTESDHNAGASPGEKSESKPQRQECNGSMDGSGSCKKTEEERAKQ
QEEEEKGENVEGQNAGGGEA