; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0019403 (gene) of Chayote v1 genome

Gene IDSed0019403
OrganismSechium edule (Chayote v1)
DescriptionHydroxyproline-rich glycoprotein family protein, putative isoform 2
Genome locationLG04:41593526..41600582
RNA-Seq ExpressionSed0019403
SyntenySed0019403
Gene Ontology termsGO:0006402 - mRNA catabolic process (biological process)
GO:0070988 - demethylation (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0032451 - demethylase activity (molecular function)
InterPro domainsIPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
IPR044842 - RNA demethylase ALKBH9B/ALKBH10B-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599881.1 RNA demethylase ALKBH10B, partial [Cucurbita argyrosperma subsp. sororia]9.1e-30579.97Show/hide
Query:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPVI
        MAMPSGNVGVSDKVPF SGG V VS GGG GGG+I QH P PW+PDERD LISW R EFAASNAIIDALC HLR+VGEPGEYD VIGCIQQRR NW PV+
Subjt:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPVI

Query:  HMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMRE
        HMQQ+F VAEV++ LQQV++RRQ R++DP+KVG K++RRPGP FKQQ+Q Q GHR E TVKEEMVT AESCNGGNSSSFVGSRKVEQ SNTCEES A  E
Subjt:  HMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMRE

Query:  DEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSY
        D KLNDKDS SAE  KDTHGKDQSNSKPKCAENL+DNASNKESQVEPTDDGCSSS RDKELQSVQS+NGK YAAT PRTFV NEIFDGKTVNVMDGLK Y
Subjt:  DEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSY

Query:  EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVKPDSC
        EELLDD EVS+LLSLVNDLRASGKRGQ QG TY+ SKRPM+GHGREMIQLGFPIADA HDDDNSSG SKDRRIESIPSLL+DLID LV EQVMTVKPDSC
Subjt:  EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVKPDSC

Query:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPAD
        IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGR+I SDHSGN+RGA TLSL PG+LLV QGKSADFAKHAIPA+RKQRIL+TLTKS P RA PAD
Subjt:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPAD

Query:  GQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHYI--PSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPRLPVP
        GQR+SLNVGS+SSWGP SARSPN R  P  KHY   PSTGVLP+  IR Q+PPP+GIP I+V P VAPPPMPFPP PIPTGP AW AAH RHPPPRLPVP
Subjt:  GQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHYI--PSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPRLPVP

Query:  GTGVFLPPGSSSSPSSQQLPNSAAETNSPAEKENGST--------GRISKSEAKPQRQECNGSMDGSGSGKVKEEE---QQEEKQ-----QVQNDGSGE
        GTGVFLPPGSSSSPS QQ+PNSA ET+S AEKENG T            KSE+KPQRQECNGSMDGSGS K  EEE   QQEE++     + QN G GE
Subjt:  GTGVFLPPGSSSSPSSQQLPNSAAETNSPAEKENGST--------GRISKSEAKPQRQECNGSMDGSGSGKVKEEE---QQEEKQ-----QVQNDGSGE

KAG6600812.1 RNA demethylase ALKBH10B, partial [Cucurbita argyrosperma subsp. sororia]2.6e-29978.48Show/hide
Query:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGG----GGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNW
        MAMPSGNVGV DKVPF SGG      GGG    GGGG+I QHHP PWFPDERD  ISWLR+EFAA+NA+IDALC HLR+VGEPGEYD VIGCIQQRR NW
Subjt:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGG----GGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNW

Query:  APVIHMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESK
         PVIHMQQ+F VA+V Y+LQQV +RRQHRYIDPVKVGPK YRRPGPGFKQ  Q QQGHR+E TVKEEMVT AESCNGGNSSSFVGSRKVE  SNTCE+SK
Subjt:  APVIHMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESK

Query:  AMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDG
        A  +DE LNDKDS SAE +KDTHGKDQSNSKPKCAE+LEDNASNKES VEPTDDGCSSS+R+KELQSVQ+QNGK YAAT PRTFV NE+ DGK VNVMDG
Subjt:  AMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDG

Query:  LKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVK
        LK +E+ LDDAEVSKLLSLVNDLRASGKRGQFQGQTYV SKRPM+GHGREMIQLGFPIAD PHDDDNSSG SKDRRIESIPSLL+DLIDRLV EQ+M+VK
Subjt:  LKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVK

Query:  PDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRA
        PDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGR+I +DHSGN++GAM LSL PGTLLV +GKSADFAKHAIPAIRKQRIL+TLTKS PKRA
Subjt:  PDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRA

Query:  APADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHY--IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPR
         P+DGQR+SLNVG FSSWGP S RSPNPRLSP HKHY  +PSTGVLP   IR QMPPP+GIPP+IV PV +  PMPFPP PIPTGP  W  AH RHPPPR
Subjt:  APADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHY--IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPR

Query:  LPVPGTGVFLPPGSSSS-------PSSQQLPNSAAETNSPAEKENGST--------GRISKSEAKPQRQECNGSMDGSGSGKVKEEEQQEEKQQVQN
        LPVPGTGVFLPP  SSS       PS QQLPNS  ET S +EKENGST         +  KSEAKPQRQECN    GS S KV+EEEQQ++++Q +N
Subjt:  LPVPGTGVFLPPGSSSS-------PSSQQLPNSAAETNSPAEKENGST--------GRISKSEAKPQRQECNGSMDGSGSGKVKEEEQQEEKQQVQN

XP_023542424.1 uncharacterized protein LOC111802330 [Cucurbita pepo subsp. pepo]3.1e-30579.57Show/hide
Query:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPVI
        MAMPSGNVGVSDKVPF SGG V VS GGG GGG+I QH P PW+PDERD LISW R EFAASNAIIDALC HLR+VGEPGEYD VIGCIQQRR NW PV+
Subjt:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPVI

Query:  HMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMRE
        HMQQ+F VAEV+  LQQV++RRQ R++DP+KVG K++RRPGP FKQQ+Q Q GHRVE TVKEEMVT AESCNGGNSSSFVG RKVEQ SNTCEES A  E
Subjt:  HMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMRE

Query:  DEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSY
        D KLNDKDS SAE  KDTHGKDQSNSKPKCAENL+DNASNKESQVEPTDDGCSSS RDKELQSVQS+NGK YAAT PRTFV NEIFDGKT+NVMDGLK Y
Subjt:  DEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSY

Query:  EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVKPDSC
        EELLDD EVSKLLSLVNDLRASGKRGQ QG TY+ SKRPM+GHGREMIQLGFPIADA HDDDNSSG SKDRRIESIPSLL+DLID LV EQ MTVKPDSC
Subjt:  EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVKPDSC

Query:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPAD
        IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGR+I SDHSGN+RGA TLSL PG+LLV QGKSADFAKHAIPA+RKQRIL+TLTKS PKRA PAD
Subjt:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPAD

Query:  GQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHYI--PSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPG-PIPTGPSAWTAAHSRHPPPRLPV
        GQR+SLNVGS+SSWGP SARSPN R  P  KHY   PSTGVLP+  IR Q+PPP+GIPPI+V PV  PPPMPFPP  PIPTGP AW AAH RHPPPRLPV
Subjt:  GQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHYI--PSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPG-PIPTGPSAWTAAHSRHPPPRLPV

Query:  PGTGVFLPPGSSSSPSSQQLPNSAAETNSPAEKENGST--------GRISKSEAKPQRQECNGSMDGSGSGKVKEEEQQEEKQ--------QVQNDGSGE
        PGTGVFLPPGSSS+PS QQ+PNSA ET+S AEKENG T            KSEAKPQRQECNGSMDGSGS K  EEEQ +++Q        + QN G GE
Subjt:  PGTGVFLPPGSSSSPSSQQLPNSAAETNSPAEKENGST--------GRISKSEAKPQRQECNGSMDGSGSGKVKEEEQQEEKQ--------QVQNDGSGE

XP_038892010.1 RNA demethylase ALKBH10B isoform X1 [Benincasa hispida]8.5e-30379.66Show/hide
Query:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPVI
        MAMPSGNVGV DKV + SGG  V  S   GGGG+I QHHP PWFPDERD  ISWLR EFAASNAIIDALC HLR+VGEPGEYD VIGCIQQRR NW PV+
Subjt:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPVI

Query:  HMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMRE
        HMQQ+F VAEV+Y LQQV++RRQ RY+DPVKVG K+YRRPGPGFKQ    QQGHRVE TVKEE+ T AESCNG NSSS VG RKVEQ SNTC+ESKA  E
Subjt:  HMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMRE

Query:  DEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSY
        D KLNDKDS+SAE  KDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGK YAA TPRTFV NE+FDGK VNVMDGLK +
Subjt:  DEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSY

Query:  EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSK----DRRIESIPSLLKDLIDRLVEEQVMTVK
        EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYV  KRPM+GHGREMIQLGFPIADAPHDDDNSSG SK    DRRIESIPSLL+DLIDRLV EQVMTVK
Subjt:  EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSK----DRRIESIPSLLKDLIDRLVEEQVMTVK

Query:  PDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRA
        PDSCI+DFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGR+I +DHSGN+RGAM LSLTPGTLLV QGKSADFAKHAIPAIRKQRIL+TLTKS PKRA
Subjt:  PDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRA

Query:  APADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHY--IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPR
         PADGQR+SLN+GSFSSWGP S RSPNPRLSP  K Y  +PSTGVLP   IR QM PP+GIPP+IVPPV   PPMPFPP PIPTGPSAW  AH RHPPPR
Subjt:  APADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHY--IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPR

Query:  LPVPGTGVFLPPGSSSS---PSSQQLPNSAAETNSPAEKENGS--------TGRISKSEAK-PQRQECNGSMDGSGSGKVKEEEQQEEKQ--------QV
        LPVPGTGVFLPP  SSS   PS QQ PNSA ET S +EKENGS        T    K EAK PQRQECNGSMDGSGS KVKEEEQQ ++Q        Q 
Subjt:  LPVPGTGVFLPPGSSSS---PSSQQLPNSAAETNSPAEKENGS--------TGRISKSEAK-PQRQECNGSMDGSGSGKVKEEEQQEEKQ--------QV

Query:  QNDGSGEV
        QN G G V
Subjt:  QNDGSGEV

XP_038892011.1 RNA demethylase ALKBH10B isoform X2 [Benincasa hispida]1.6e-30480.11Show/hide
Query:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPVI
        MAMPSGNVGV DKV + SGG  V  S   GGGG+I QHHP PWFPDERD  ISWLR EFAASNAIIDALC HLR+VGEPGEYD VIGCIQQRR NW PV+
Subjt:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPVI

Query:  HMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMRE
        HMQQ+F VAEV+Y LQQV++RRQ RY+DPVKVG K+YRRPGPGFKQ    QQGHRVE TVKEE+ T AESCNG NSSS VG RKVEQ SNTC+ESKA  E
Subjt:  HMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMRE

Query:  DEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSY
        D KLNDKDS+SAE  KDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGK YAA TPRTFV NE+FDGK VNVMDGLK +
Subjt:  DEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSY

Query:  EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVKPDSC
        EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYV  KRPM+GHGREMIQLGFPIADAPHDDDNSSG SKDRRIESIPSLL+DLIDRLV EQVMTVKPDSC
Subjt:  EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVKPDSC

Query:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPAD
        I+DFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGR+I +DHSGN+RGAM LSLTPGTLLV QGKSADFAKHAIPAIRKQRIL+TLTKS PKRA PAD
Subjt:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPAD

Query:  GQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHY--IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPRLPVP
        GQR+SLN+GSFSSWGP S RSPNPRLSP  K Y  +PSTGVLP   IR QM PP+GIPP+IVPPV   PPMPFPP PIPTGPSAW  AH RHPPPRLPVP
Subjt:  GQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHY--IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPRLPVP

Query:  GTGVFLPPGSSSS---PSSQQLPNSAAETNSPAEKENGS--------TGRISKSEAK-PQRQECNGSMDGSGSGKVKEEEQQEEKQ--------QVQNDG
        GTGVFLPP  SSS   PS QQ PNSA ET S +EKENGS        T    K EAK PQRQECNGSMDGSGS KVKEEEQQ ++Q        Q QN G
Subjt:  GTGVFLPPGSSSS---PSSQQLPNSAAETNSPAEKENGS--------TGRISKSEAK-PQRQECNGSMDGSGSGKVKEEEQQEEKQ--------QVQNDG

Query:  SGEV
         G V
Subjt:  SGEV

TrEMBL top hitse value%identityAlignment
A0A5D3E038 Hydroxyproline-rich glycoprotein family protein, putative isoform 22.8e-29178Show/hide
Query:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQ-HHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPV
        MA+PSGNVGV DKV F SGG  V  S   GGGG+I Q HHP PWFPDERD  ISWLR EFAASNA+IDALC HLR+VGEPGEYD VIGCIQQRR NW PV
Subjt:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQ-HHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPV

Query:  IHMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMR
        +HMQQ+F VAEV+Y LQQV++RRQ RY+DPVKVGPK+YRRPGPGFKQ    QQGHR E TVKEE +T AESCNGGNSSSFV SRKVEQ SNTC+ESKA  
Subjt:  IHMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMR

Query:  EDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKS
        EDEKL++KDS SAE  KDTHGKDQSNSK KCAENLEDNA NK+SQVEP DDGCSSSHRDKELQSVQSQNGK +AATTPRTFV NE+FDGK VNVMDGLK 
Subjt:  EDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKS

Query:  YEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVKPDS
        +EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYV SKRP +GHGREMIQLGFPIADAP++DDNS   SKDRRIE IPSLL+DLIDRLV +QVMTVKPDS
Subjt:  YEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVKPDS

Query:  CIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPA
        CIIDFYNEGDHSQPHVWP WFGRPVGVLLLTECE+TFGR+I +DHSGN+RGA+ LSLTPG LLV QGKSADFAKHAIPAIRKQRIL+TLTKS PKRA+PA
Subjt:  CIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPA

Query:  DGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHY--IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPRLPV
        DGQRSSLNVG+FS WGP SARSPNPRLSP  K Y  +PSTGVLP+  IR QM PP+GIPP+IVP V   PPMPF P PIPTGPS W  AH+RHPPPRLPV
Subjt:  DGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHY--IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPRLPV

Query:  PGTGVFL-PPGSSSSPS---SQQLPNSAAETNSPAEKENGST------GRI--SKSEAKPQRQECNGSMDGSGSGKVKEEEQQEEKQQVQN
        PGTGVFL PPGSSS+PS    QQLPNS  E  S +EKENG T      G     K EAK QRQECNG++DGSG+ KVKEEEQQ+++++ Q+
Subjt:  PGTGVFL-PPGSSSSPS---SQQLPNSAAETNSPAEKENGST------GRI--SKSEAKPQRQECNGSMDGSGSGKVKEEEQQEEKQQVQN

A0A6J1FRA9 uncharacterized protein LOC1114474806.2e-29979.31Show/hide
Query:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPVI
        MAMPSGNVGVSDKVPF SGG V VS GGG GGG+I QH P PW+PDERD LISW R EFAASNAIIDALC HLR+VGEPGEYD VIGCIQQRR NW PV+
Subjt:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPVI

Query:  HMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMRE
        HMQQ+F VAEV++ LQQV++RRQ R++DP+KVG K++RRPGP FKQQ+Q Q GHRVE TVKEEMVT AESCNGGNSS FVGSRKVEQ SNTC+ES A  E
Subjt:  HMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMRE

Query:  DEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSY
        D KLNDKDS SAE  KDTHGKDQSNSKPKCAENL+DNASNKESQVEPTDDGCSSS RDKELQSVQS+NGK YAAT PRTFV NEIFDGKTVNVMDGLK Y
Subjt:  DEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSY

Query:  EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVKPDSC
        EELLDD EVSKLLSLVNDLRASGKRGQ QG TY+ SKRPM+GHGREMIQLGFPIADA HDDDNSSG SKDRRIESIPSLL+DLID LV EQVMTVKPDSC
Subjt:  EELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVKPDSC

Query:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPAD
        IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGR+I SDHSGN+RGA TLSL PG+LLV QGKSADFAKHAIPA+RKQRIL+TLTKS P+RA PAD
Subjt:  IIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPAD

Query:  GQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHYI--PSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPRLPVP
        GQR+SLNVGS+SSWGP SARSPN R  P  KHY   PSTGVLP+  IR Q+PPP+GIPPI+V P VAPPPMPFPP PIPTGP AW AAH RHPPPRLPVP
Subjt:  GQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHYI--PSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPRLPVP

Query:  GTGVFLPPGSSSSPSSQQLPNSAAETNSPAEKENGSTGRISKSEAKPQRQECNGSMDGSGSGKVKEEE---QQEEKQ-----QVQNDGSGE
        GTGVFLPPGSSS+PS QQ+PNSA ET+S AEKENG T     + A P  +     MDGSGS K  EEE   QQEE++     + QN G GE
Subjt:  GTGVFLPPGSSSSPSSQQLPNSAAETNSPAEKENGSTGRISKSEAKPQRQECNGSMDGSGSGKVKEEE---QQEEKQ-----QVQNDGSGE

A0A6J1FUM5 uncharacterized protein LOC1114474301.1e-29878.74Show/hide
Query:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGG----GGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNW
        MAMPSGNVGV DKVPF SGG      GGG    GGGG+I QHHP PWFPDERD  ISWLR+EFAA+NA+IDALC HLR+VGEPGEYD VIGCIQQRR NW
Subjt:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGG----GGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNW

Query:  APVIHMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESK
         PVIHMQQ+F VA+V Y+LQQV +RRQ RYIDPVKVGPK YRRPGPGFKQQ Q QQGHR+E TVKEEMVT AESCNGGNSSSFVGSRKVE  SNTCE+SK
Subjt:  APVIHMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESK

Query:  AMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDG
        A  EDE LNDKDS SAE +KDTHGKDQSNSKPKCAE+LEDNASNKES VEPTDDGCSSS+R+KELQSVQ+QNGK YAATTPRTFV NE+ DGK VNVMDG
Subjt:  AMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDG

Query:  LKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVK
        LK +E+ LDDAEVSKLLSLVNDLRASGKRGQFQGQTYV SKRPM+GHGREMIQLGFPIAD PHDDDNSSG SKDRRIESIPSLL+DLIDRLV EQ+M+VK
Subjt:  LKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVK

Query:  PDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRA
        PDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGR+I +DHSGN++GAM LSL PGTLLV +GKSADFAKHAIPAIRKQRIL+TLTKS PKRA
Subjt:  PDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRA

Query:  APADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHY--IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPR
         P+DGQR+SLNVG FSSWGP S RSPNPRLSP HKHY  +PSTGVLP   IR QMPPP+GIPP+IV PV +P P P PP PIPTGP  W  AH RHPPPR
Subjt:  APADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHY--IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPR

Query:  LPVPGTGVFLPPGSSSS------PSSQQLPNSAAETNSPAEKENGST--------GRISKSEAKPQRQECNGSMDGSGSGKVKEEEQQEEKQQVQN
        LPVPGTGVFLPP  SSS      PS QQLPNS  ET S +EKENGST         +  KSEAKPQRQECN    GS S KV+EEEQQ++++Q +N
Subjt:  LPVPGTGVFLPPGSSSS------PSSQQLPNSAAETNSPAEKENGST--------GRISKSEAKPQRQECNGSMDGSGSGKVKEEEQQEEKQQVQN

A0A6J1ISJ2 uncharacterized protein LOC111478296 isoform X13.8e-29677.95Show/hide
Query:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPVI
        MAMPSGNVGVSDKVPF S G V VS      GG+I QH P PW+PDERD LISW R EFAASNAIIDALC HLR+VGEPGEYD VIGCIQQRR NW PV+
Subjt:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPVI

Query:  HMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQ--PQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAM
        HMQQ+F VAEV++ LQQV++RRQ R++DP+KVG K++RRPGP FKQQ+Q   Q GHR+E TVKEEMVT AESCNGGNSSSFVGSRKVEQ SNTCEES A 
Subjt:  HMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQ--PQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAM

Query:  REDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLK
         ED KLNDKDS SAE  KDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSS RDK LQSVQS+N + YAAT PRTF  NEIFDGKTVNVMDGLK
Subjt:  REDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLK

Query:  SYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVKPD
         YEELLDD EVSKLLSLVNDLRASGKRGQ QG TY+ SKRPM+GHGREMIQLGFPIADA HDD NSSG SKDRRIESIPSLL+DLID LV EQVMTVKPD
Subjt:  SYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVKPD

Query:  SCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAP
        SCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEM+FGR++ SDHSGN+RGA TLSL PG+LLV QGKSADFAKHAIPA+RKQRIL+TLTKS PKRA P
Subjt:  SCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAP

Query:  ADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHYI--PSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPG-PIPTGPSAWTAAHSRHPPPRL
        ADGQR+SLN+GS+SSWGP SARSPN R  P  KHY   PSTGVLP+  IR Q+PP +GIPPI+V P VAPPPMPFPP  PIPTGP AW AAH RHPPPRL
Subjt:  ADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHYI--PSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPG-PIPTGPSAWTAAHSRHPPPRL

Query:  PVPGTGVFLPPGSSSSPSSQQLPNSAAETNSPAEKENGST-----GRIS---KSEAKPQRQECNGSMDGSGSGKVKEEEQQEEKQ---------QVQNDG
        PVPGTGVFLPPG+SS+PS QQ+PNSA ET+S AEKENG T     G  S   KSEAKPQRQECNGSMDGSGS K  EEEQ +++Q         + QN G
Subjt:  PVPGTGVFLPPGSSSSPSSQQLPNSAAETNSPAEKENGST-----GRIS---KSEAKPQRQECNGSMDGSGSGKVKEEEQQEEKQ---------QVQNDG

Query:  SGE
         GE
Subjt:  SGE

A0A6J1J9C0 uncharacterized protein LOC1114846098.1e-29979.08Show/hide
Query:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGG----GGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNW
        MAMPSGNVGV DKV F SGG      GGG    GGGG+I QHHP PWFPDERD  ISWLR+EFAA+NA+IDALC HLR+VGEPGEYD VIGCIQQRR NW
Subjt:  MAMPSGNVGVSDKVPFPSGGAVVVSSGGG----GGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNW

Query:  APVIHMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESK
         PVIHMQQ+F VA+V Y+LQQV +RRQ RYIDPVKVGPK YRRPGPGFKQQ Q QQGHR+E TVKEEMVT AESCNGGNSSSFVGSRKVEQ SNTCEESK
Subjt:  APVIHMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESK

Query:  AMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDG
        A  EDE LNDKDS SAE +KDTHGKDQ NSKPKCAE+LEDNASNKES VEPTDDGCSSS+R+KELQSVQSQNGK YAATTPRTFV NE+ DGK VNVMDG
Subjt:  AMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDG

Query:  LKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVK
        LK +E+ LDDAEVSKLLSLVNDLRASGKRGQFQG TYV SKRPM+GHGREMIQLGFPIAD PHDDDNSSG SKDRRIESIPSLL+DLIDRLV EQ+M+VK
Subjt:  LKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVK

Query:  PDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRA
        PDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGR+I +DHSGN++GAM LSL PGTLLV +GKSADFAKHAIPAIRKQRIL+TLTKS PKRA
Subjt:  PDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRA

Query:  APADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHY--IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPR
         P+DGQR+SLNVG F+SWGP S RSPNPRLSP  KHY  +PSTGVLP   IR QMPPP+GIPP+IV PV +P P P PP PIPTGP  W  AH RHPPPR
Subjt:  APADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHY--IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPR

Query:  LPVPGTGVFL-PPGSSS--SPSSQQLPNSAAETNSPAEKENGST--------GRISKSEAKPQRQECNGSMDGSGSGKVKEEEQQEEKQQVQN
        LPVPGTGVFL PPGSSS  SP+ QQLPNS  ET S +EKENGST         +  KSEAKPQRQECN    GS S KV+EEEQQE+++Q +N
Subjt:  LPVPGTGVFL-PPGSSS--SPSSQQLPNSAAETNSPAEKENGST--------GRISKSEAKPQRQECNGSMDGSGSGKVKEEEQQEEKQQVQN

SwissProt top hitse value%identityAlignment
Q9SL49 RNA demethylase ALKBH9B1.2e-2828.78Show/hide
Query:  EEMVTRAESCNGGNSSSFVGSRKV-----EQASNTCEESKAMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSH
        +E     E C+  +  S+ GS  V     E  S+  +   A+ E      + +D A+             + +  E  E+    ++S  +  D     + 
Subjt:  EEMVTRAESCNGGNSSSFVGSRKV-----EQASNTCEESKAMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSH

Query:  RDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIAD
           +L   Q +N +       + F+  E   GK VNV+DGL+ +  +    E  +++  V  L+  G+RG+ + +T+ A  + MRG GRE IQ G     
Subjt:  RDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIAD

Query:  APHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVM--TVKPDSCIIDFYNEGDHSQPHVWPPWFGRP-VGVLLLTECEMTFGRLINSDHSGNFRGAMT
        AP    N  G  +   ++ +P L K +I +L++  V+  T  PDSCI++ Y+EGD   PH+    F RP   +  L+EC++ FG  +  +  G+F G+ +
Subjt:  APHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVM--TVKPDSCIIDFYNEGDHSQPHVWPPWFGRP-VGVLLLTECEMTFGRLINSDHSGNFRGAMT

Query:  LSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTK
        + L  G++LV  G  AD AKH +PA+  +RI IT  K
Subjt:  LSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTK

Q9ZT92 RNA demethylase ALKBH10B6.4e-5932.08Show/hide
Query:  RDALISWLRAEFAASNAIIDALCQHLRSVGEP---GEYDAVIGCIQQRRYNWAPVIHMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGF
        +DALISW R EFAA+NAIIDA+C HLR   E     EY+AV   I +RR NW PV+ MQ++  +AEV   LQ+V+ ++                      
Subjt:  RDALISWLRAEFAASNAIIDALCQHLRSVGEP---GEYDAVIGCIQQRRYNWAPVIHMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGF

Query:  KQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQ
        +   Q +     E  +KE + T                 + E+    C   + + E    ND + D  + + D+   D           + D+ S+++  
Subjt:  KQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQ

Query:  VEPTDDGCSS----SHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPM
             D        SH D + +S + +  K + A         E   G TVNV+ GLK YEELL + E+SKLL  V +LR +G  G+  G++++   + +
Subjt:  VEPTDDGCSS----SHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPM

Query:  RGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGR
        +G+ RE+IQLG PI      D+NS+  +    IE IP LL+ +ID  V  +++    +P+ C+I+F+ EG++SQP + PP   +P+  L+L+E  M +GR
Subjt:  RGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGR

Query:  LINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHYI---P
        +++SD+ GNFRG +TLSL  G+LLV +G SAD A+H +   + +R+ IT  +  P          S  N G  + W P    +P P L+  + H I   P
Subjt:  LINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHYI---P

Query:  STGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGP-----IPTGPSAWTAAHSRHPPPRLPVPGTGVFLPPGSSSSPS
          GVL         PP   + P  V P++ P P     G      +P      +  H +H PPR       + LPP +SSSP+
Subjt:  STGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGP-----IPTGPSAWTAAHSRHPPPRLPVPGTGVFLPPGSSSSPS

Arabidopsis top hitse value%identityAlignment
AT1G14710.1 hydroxyproline-rich glycoprotein family protein8.4e-13145.75Show/hide
Query:  MAM-PSGNVGV-SDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAP
        MAM P GNV   S+K+ FP                      P  W PDERD  ISWLRAEFAA+NAIID+LCQHL++VG+  EY++VIG I  RR  W+ 
Subjt:  MAM-PSGNVGV-SDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAP

Query:  VIHMQQFFPVAEVVYTLQQVSTRRQ-----HRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCE
        V+ MQQFFPVA+V Y LQQ++ +RQ      R+ +  +VG    RR GPGF + +    G+R      + M     + NG NS       +VE      E
Subjt:  VIHMQQFFPVAEVVYTLQQVSTRRQ-----HRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCE

Query:  ESKAMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVE-PTDDGCSSSHRDKELQSVQSQ--NGKHYAATTPRTFVVNEIFDGKT
        E+K   + + L+      AE K+D   K +S+SK      +E      E+Q E   +  C+S  +D  L S Q Q  N K   A+  +TFVV E++D K 
Subjt:  ESKAMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVE-PTDDGCSSSHRDKELQSVQSQ--NGKHYAATTPRTFVVNEIFDGKT

Query:  VNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEE
        VNV++GLK Y+++LD  EVS+L+SLV +LR +G+RGQ Q + YV  KRP RGHGREMIQLG PIAD P DDD+     KDRRIE IPS L D+I+RLV +
Subjt:  VNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEE

Query:  QVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTK
        Q++ VKPD+CIIDF++EGDHSQPH++ PWFGRP+ VL L+EC+ TFGR+I S++ G+++G++ LSLTPG++L+ +GKSA+ AK+AI A RKQRILI+  K
Subjt:  QVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTK

Query:  SIPKRAAPADGQRSSLNVGSFSSWGPSSARSPNPRL---SPVHKHY---IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPP---PMPFPPGPIPTGPS
        S P+                 S+WGP  +RSPN  +   +   KHY   IPSTGVLP  S R   PP   + PI +PP  +PP   PMPF PG +PTGP 
Subjt:  SIPKRAAPADGQRSSLNVGSFSSWGPSSARSPNPRL---SPVHKHY---IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPP---PMPFPPGPIPTGPS

Query:  AW--TAAHSRH---PPPRLPVPGTGVFLPPGSSSSPSSQQLPNSAAETNSPAEKENGSTGRI---SKSEAKP--QRQECNGS
         W     H RH   P PR+P+PGTGVFLPPG     S+Q+L          A+  NG+ G++   +K EA+      EC+GS
Subjt:  AW--TAAHSRH---PPPRLPVPGTGVFLPPGSSSSPSSQQLPNSAAETNSPAEKENGSTGRI---SKSEAKP--QRQECNGS

AT1G14710.2 hydroxyproline-rich glycoprotein family protein8.4e-13145.75Show/hide
Query:  MAM-PSGNVGV-SDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAP
        MAM P GNV   S+K+ FP                      P  W PDERD  ISWLRAEFAA+NAIID+LCQHL++VG+  EY++VIG I  RR  W+ 
Subjt:  MAM-PSGNVGV-SDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAP

Query:  VIHMQQFFPVAEVVYTLQQVSTRRQ-----HRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCE
        V+ MQQFFPVA+V Y LQQ++ +RQ      R+ +  +VG    RR GPGF + +    G+R      + M     + NG NS       +VE      E
Subjt:  VIHMQQFFPVAEVVYTLQQVSTRRQ-----HRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCE

Query:  ESKAMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVE-PTDDGCSSSHRDKELQSVQSQ--NGKHYAATTPRTFVVNEIFDGKT
        E+K   + + L+      AE K+D   K +S+SK      +E      E+Q E   +  C+S  +D  L S Q Q  N K   A+  +TFVV E++D K 
Subjt:  ESKAMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVE-PTDDGCSSSHRDKELQSVQSQ--NGKHYAATTPRTFVVNEIFDGKT

Query:  VNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEE
        VNV++GLK Y+++LD  EVS+L+SLV +LR +G+RGQ Q + YV  KRP RGHGREMIQLG PIAD P DDD+     KDRRIE IPS L D+I+RLV +
Subjt:  VNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEE

Query:  QVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTK
        Q++ VKPD+CIIDF++EGDHSQPH++ PWFGRP+ VL L+EC+ TFGR+I S++ G+++G++ LSLTPG++L+ +GKSA+ AK+AI A RKQRILI+  K
Subjt:  QVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTK

Query:  SIPKRAAPADGQRSSLNVGSFSSWGPSSARSPNPRL---SPVHKHY---IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPP---PMPFPPGPIPTGPS
        S P+                 S+WGP  +RSPN  +   +   KHY   IPSTGVLP  S R   PP   + PI +PP  +PP   PMPF PG +PTGP 
Subjt:  SIPKRAAPADGQRSSLNVGSFSSWGPSSARSPNPRL---SPVHKHY---IPSTGVLPLASIRSQMPPPSGIPPIIVPPVVAPP---PMPFPPGPIPTGPS

Query:  AW--TAAHSRH---PPPRLPVPGTGVFLPPGSSSSPSSQQLPNSAAETNSPAEKENGSTGRI---SKSEAKP--QRQECNGS
         W     H RH   P PR+P+PGTGVFLPPG     S+Q+L          A+  NG+ G++   +K EA+      EC+GS
Subjt:  AW--TAAHSRH---PPPRLPVPGTGVFLPPGSSSSPSSQQLPNSAAETNSPAEKENGSTGRI---SKSEAKP--QRQECNGS

AT2G17970.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein8.3e-3028.78Show/hide
Query:  EEMVTRAESCNGGNSSSFVGSRKV-----EQASNTCEESKAMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSH
        +E     E C+  +  S+ GS  V     E  S+  +   A+ E      + +D A+             + +  E  E+    ++S  +  D     + 
Subjt:  EEMVTRAESCNGGNSSSFVGSRKV-----EQASNTCEESKAMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSH

Query:  RDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIAD
           +L   Q +N +       + F+  E   GK VNV+DGL+ +  +    E  +++  V  L+  G+RG+ + +T+ A  + MRG GRE IQ G     
Subjt:  RDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPMRGHGREMIQLGFPIAD

Query:  APHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVM--TVKPDSCIIDFYNEGDHSQPHVWPPWFGRP-VGVLLLTECEMTFGRLINSDHSGNFRGAMT
        AP    N  G  +   ++ +P L K +I +L++  V+  T  PDSCI++ Y+EGD   PH+    F RP   +  L+EC++ FG  +  +  G+F G+ +
Subjt:  APHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVM--TVKPDSCIIDFYNEGDHSQPHVWPPWFGRP-VGVLLLTECEMTFGRLINSDHSGNFRGAMT

Query:  LSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTK
        + L  G++LV  G  AD AKH +PA+  +RI IT  K
Subjt:  LSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTK

AT2G48080.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein2.3e-4827.8Show/hide
Query:  PQHHPWPWFPDE--RDALISWLRAEFAASNAIIDALCQHL-RSVGEPGEYDAVIGCIQQRRYNWAPVIHMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKV
        P H P     D   +DA+++W R EFAA+NAIIDALC HL ++ G   +Y++V+  + +RR NW PV+ MQ++  +++V   LQQ   +  H ++D    
Subjt:  PQHHPWPWFPDE--RDALISWLRAEFAASNAIIDALCQHL-RSVGEPGEYDAVIGCIQQRRYNWAPVIHMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKV

Query:  GPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAE
                                                                                +D D DS        G  +  +   C +
Subjt:  GPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAE

Query:  NLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQT
        +               +D C S       QS              + F   E   G T NV+ GLK Y+++    ++SKLL  +N LR +G+  Q  G+T
Subjt:  NLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQT

Query:  YVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLT
        +V   +  +G  RE++QLG PI     D+     HS    +E IP+L++ +ID L++ +++    +P+ C+I+F++E +HSQP   PP   +P+  L+L+
Subjt:  YVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLT

Query:  ECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRA--------------------APADGQRSSLNVGS
        E  M FG  +  D+ GNFRG++TL L  G+LLV +G SAD A+H +     +R+ IT  K  P                       APA  +R     G 
Subjt:  ECEMTFGRLINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRA--------------------APADGQRSSLNVGS

Query:  FSSWGPSSARSPNPRLSP
        F  W P  +R P   L P
Subjt:  FSSWGPSSARSPNPRLSP

AT4G02940.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein4.5e-6032.08Show/hide
Query:  RDALISWLRAEFAASNAIIDALCQHLRSVGEP---GEYDAVIGCIQQRRYNWAPVIHMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGF
        +DALISW R EFAA+NAIIDA+C HLR   E     EY+AV   I +RR NW PV+ MQ++  +AEV   LQ+V+ ++                      
Subjt:  RDALISWLRAEFAASNAIIDALCQHLRSVGEP---GEYDAVIGCIQQRRYNWAPVIHMQQFFPVAEVVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGF

Query:  KQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQ
        +   Q +     E  +KE + T                 + E+    C   + + E    ND + D  + + D+   D           + D+ S+++  
Subjt:  KQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMREDEKLNDKDSDSAEGKKDTHGKDQSNSKPKCAENLEDNASNKESQ

Query:  VEPTDDGCSS----SHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPM
             D        SH D + +S + +  K + A         E   G TVNV+ GLK YEELL + E+SKLL  V +LR +G  G+  G++++   + +
Subjt:  VEPTDDGCSS----SHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVASKRPM

Query:  RGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGR
        +G+ RE+IQLG PI      D+NS+  +    IE IP LL+ +ID  V  +++    +P+ C+I+F+ EG++SQP + PP   +P+  L+L+E  M +GR
Subjt:  RGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMT--VKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGR

Query:  LINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHYI---P
        +++SD+ GNFRG +TLSL  G+LLV +G SAD A+H +   + +R+ IT  +  P          S  N G  + W P    +P P L+  + H I   P
Subjt:  LINSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHYI---P

Query:  STGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGP-----IPTGPSAWTAAHSRHPPPRLPVPGTGVFLPPGSSSSPS
          GVL         PP   + P  V P++ P P     G      +P      +  H +H PPR       + LPP +SSSP+
Subjt:  STGVLPLASIRSQMPPPSGIPPIIVPPVVAPPPMPFPPGP-----IPTGPSAWTAAHSRHPPPRLPVPGTGVFLPPGSSSSPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATGCCATCGGGAAATGTGGGTGTATCCGATAAAGTTCCGTTTCCGAGCGGTGGCGCTGTTGTGGTGAGCAGTGGTGGAGGCGGCGGTGGTGGTGATATTCCTCA
ACACCATCCCTGGCCCTGGTTTCCTGATGAGCGTGATGCACTTATCTCATGGTTGCGAGCCGAATTCGCTGCTTCGAATGCGATCATTGATGCCCTTTGCCAGCATTTGC
GGTCTGTGGGAGAGCCTGGTGAATATGACGCTGTTATTGGGTGTATACAGCAACGACGGTATAATTGGGCGCCTGTGATTCATATGCAGCAGTTCTTTCCAGTGGCAGAA
GTGGTGTATACCCTTCAGCAGGTTAGCACGAGGAGGCAGCACAGGTATATTGATCCTGTGAAAGTTGGGCCGAAGATGTATAGGAGACCTGGGCCGGGGTTTAAGCAGCA
GTATCAGCCGCAGCAGGGCCATCGTGTTGAGCCCACAGTCAAGGAAGAGATGGTGACTCGAGCCGAGTCTTGTAATGGAGGGAATTCTTCAAGTTTTGTAGGCTCTAGGA
AGGTGGAGCAAGCAAGTAATACATGTGAGGAAAGCAAGGCAATGAGGGAGGATGAAAAATTGAACGATAAAGATTCAGATTCAGCTGAGGGTAAAAAAGATACTCATGGG
AAGGACCAAAGTAATAGCAAACCGAAGTGTGCAGAAAATTTAGAAGACAATGCAAGTAATAAAGAATCTCAAGTGGAACCTACTGATGATGGATGCTCTTCTAGTCATAG
AGATAAAGAATTGCAATCTGTTCAAAGCCAGAATGGAAAGCATTATGCTGCCACGACCCCAAGAACCTTTGTTGTCAATGAGATATTTGATGGAAAGACGGTTAATGTGA
TGGATGGACTGAAATCATATGAAGAATTATTGGATGATGCTGAGGTTTCAAAGCTTCTTTCATTGGTGAATGATTTGAGGGCTTCAGGAAAGAGAGGGCAATTTCAAGGT
CAGACATATGTAGCCTCAAAAAGACCAATGAGGGGGCATGGGAGAGAGATGATCCAACTAGGCTTTCCAATTGCAGATGCTCCTCATGATGATGACAATTCTTCAGGGCA
CTCAAAAGATAGAAGAATAGAATCCATCCCTTCATTGCTTAAAGATCTCATTGATCGCTTGGTGGAGGAGCAAGTGATGACAGTGAAACCTGATTCCTGCATCATTGACT
TCTATAACGAGGGTGATCATTCTCAGCCTCATGTCTGGCCACCATGGTTTGGGAGGCCTGTTGGTGTCCTCCTTCTGACTGAATGTGAAATGACCTTTGGAAGACTTATT
AATTCAGACCATTCTGGCAACTTTAGAGGGGCTATGACATTGTCCCTCACACCTGGAACCCTCCTTGTGGCGCAAGGAAAATCTGCAGATTTTGCTAAGCACGCAATTCC
TGCTATCCGCAAGCAACGGATACTTATTACCTTGACCAAATCAATACCAAAAAGAGCTGCACCAGCTGATGGGCAACGCTCATCTTTGAATGTTGGTTCATTTTCCAGTT
GGGGCCCTTCATCTGCCAGATCACCAAACCCTCGTCTTTCCCCGGTACACAAGCATTACATTCCATCGACAGGGGTGCTACCTTTGGCATCCATTCGTTCCCAAATGCCA
CCACCAAGCGGCATCCCGCCCATAATTGTTCCTCCTGTAGTAGCACCACCACCTATGCCTTTCCCTCCTGGGCCAATTCCAACAGGTCCATCCGCATGGACCGCTGCACA
TTCAAGACATCCCCCGCCTCGTCTCCCTGTTCCTGGCACGGGCGTATTCCTTCCTCCAGGTTCTTCCAGTTCTCCATCTTCTCAACAGTTACCAAACTCCGCAGCCGAGA
CGAATTCCCCTGCAGAAAAGGAAAATGGTTCAACAGGCCGCATATCGAAATCTGAAGCAAAACCTCAAAGACAAGAATGCAATGGAAGCATGGATGGAAGTGGAAGTGGT
AAAGTGAAAGAGGAAGAACAACAAGAAGAGAAGCAGCAGGTCCAAAATGATGGAAGTGGAGAAGTTTAG
mRNA sequenceShow/hide mRNA sequence
CTGAAGAATATGAGAGTATATCCTACTACTCTCCCAGTTCTTGAGCAGTGAAGAGAGACCATTTTAGACCCATGTTCACTAAAACCATTCTCAGTTTTTCACTTTCCAAA
CCCTAGATCTTAATTCCCAATCTCCATTGATTTCTCTTTTGTTTTTGTTTTTGTCACTGAATTCACTGTGCGTTTTTCACAAGAACAGACTCTGAAAAGAGCTTTTTCGC
CATAATATAATCCCATTTTGCAGATCTGTTACGGTTGGGGTGAATCGCAGGCTCAATTCTTCTCTAGCTCGTTGTTCGATTCCCAGAATTTAGCGGTTCAGATTTGAAGA
TATACTTGTATATGCGGTCTTAGTATTTTTGCTGGCTAGAGGATTCTCATGGCAATGCCATCGGGAAATGTGGGTGTATCCGATAAAGTTCCGTTTCCGAGCGGTGGCGC
TGTTGTGGTGAGCAGTGGTGGAGGCGGCGGTGGTGGTGATATTCCTCAACACCATCCCTGGCCCTGGTTTCCTGATGAGCGTGATGCACTTATCTCATGGTTGCGAGCCG
AATTCGCTGCTTCGAATGCGATCATTGATGCCCTTTGCCAGCATTTGCGGTCTGTGGGAGAGCCTGGTGAATATGACGCTGTTATTGGGTGTATACAGCAACGACGGTAT
AATTGGGCGCCTGTGATTCATATGCAGCAGTTCTTTCCAGTGGCAGAAGTGGTGTATACCCTTCAGCAGGTTAGCACGAGGAGGCAGCACAGGTATATTGATCCTGTGAA
AGTTGGGCCGAAGATGTATAGGAGACCTGGGCCGGGGTTTAAGCAGCAGTATCAGCCGCAGCAGGGCCATCGTGTTGAGCCCACAGTCAAGGAAGAGATGGTGACTCGAG
CCGAGTCTTGTAATGGAGGGAATTCTTCAAGTTTTGTAGGCTCTAGGAAGGTGGAGCAAGCAAGTAATACATGTGAGGAAAGCAAGGCAATGAGGGAGGATGAAAAATTG
AACGATAAAGATTCAGATTCAGCTGAGGGTAAAAAAGATACTCATGGGAAGGACCAAAGTAATAGCAAACCGAAGTGTGCAGAAAATTTAGAAGACAATGCAAGTAATAA
AGAATCTCAAGTGGAACCTACTGATGATGGATGCTCTTCTAGTCATAGAGATAAAGAATTGCAATCTGTTCAAAGCCAGAATGGAAAGCATTATGCTGCCACGACCCCAA
GAACCTTTGTTGTCAATGAGATATTTGATGGAAAGACGGTTAATGTGATGGATGGACTGAAATCATATGAAGAATTATTGGATGATGCTGAGGTTTCAAAGCTTCTTTCA
TTGGTGAATGATTTGAGGGCTTCAGGAAAGAGAGGGCAATTTCAAGGTCAGACATATGTAGCCTCAAAAAGACCAATGAGGGGGCATGGGAGAGAGATGATCCAACTAGG
CTTTCCAATTGCAGATGCTCCTCATGATGATGACAATTCTTCAGGGCACTCAAAAGATAGAAGAATAGAATCCATCCCTTCATTGCTTAAAGATCTCATTGATCGCTTGG
TGGAGGAGCAAGTGATGACAGTGAAACCTGATTCCTGCATCATTGACTTCTATAACGAGGGTGATCATTCTCAGCCTCATGTCTGGCCACCATGGTTTGGGAGGCCTGTT
GGTGTCCTCCTTCTGACTGAATGTGAAATGACCTTTGGAAGACTTATTAATTCAGACCATTCTGGCAACTTTAGAGGGGCTATGACATTGTCCCTCACACCTGGAACCCT
CCTTGTGGCGCAAGGAAAATCTGCAGATTTTGCTAAGCACGCAATTCCTGCTATCCGCAAGCAACGGATACTTATTACCTTGACCAAATCAATACCAAAAAGAGCTGCAC
CAGCTGATGGGCAACGCTCATCTTTGAATGTTGGTTCATTTTCCAGTTGGGGCCCTTCATCTGCCAGATCACCAAACCCTCGTCTTTCCCCGGTACACAAGCATTACATT
CCATCGACAGGGGTGCTACCTTTGGCATCCATTCGTTCCCAAATGCCACCACCAAGCGGCATCCCGCCCATAATTGTTCCTCCTGTAGTAGCACCACCACCTATGCCTTT
CCCTCCTGGGCCAATTCCAACAGGTCCATCCGCATGGACCGCTGCACATTCAAGACATCCCCCGCCTCGTCTCCCTGTTCCTGGCACGGGCGTATTCCTTCCTCCAGGTT
CTTCCAGTTCTCCATCTTCTCAACAGTTACCAAACTCCGCAGCCGAGACGAATTCCCCTGCAGAAAAGGAAAATGGTTCAACAGGCCGCATATCGAAATCTGAAGCAAAA
CCTCAAAGACAAGAATGCAATGGAAGCATGGATGGAAGTGGAAGTGGTAAAGTGAAAGAGGAAGAACAACAAGAAGAGAAGCAGCAGGTCCAAAATGATGGAAGTGGAGA
AGTTTAGAGAGAGAAGAAGATAAAATGATTTAAGATAAAAACAAAGAGAGAAGAAAAGCAGATAGACTGCAAAATGTAGAGAGCGGCAACACATTCAAAGTCTGATGATT
TTCACACACTAGAGATGTTACTTACTGGCCACTACTACTACCAAGGAGAACAAGATAAGTTCCCTTTCAAAATCCCTTACTTCATTCCTTTTTTTGTTGCCCTAAAAAAT
TATTGATTAGATGGGAAAACTCTTTCCTAATGCAACTTTGGATCTTTTCTTTTTTAAGAGATTTCAAAAGCCAGAATCTGTTCTCTTTTTTTATTTTATTGTTGTTTGGT
TTTAACTGTTGGCAAGAAAAAAAAACTAACCTTTTGGGATTAGGGAATAGTTATCGGAGGAACATGAATATGAGAGAATGATTATCAATGTTTAGAATGAAGGAAATGGT
TATGAAGAACCATAACTTGTACTCGTGGTACAAGAAATGAATAAGTGGACAAAACATTAGTCATGGAAAGAATGAATAGTGATTGTTGGTTTTCCAAAGGTGGGTGTAGG
GTTGCTGTGAGGATAGTTTAGGGATAGTTTAGTTAATACTTGAGGGAATTTGTTAGACATTTGGCTGTAGATAGAGGGTAATGAGTATGGAAGAGGCATAAAATATTGTG
ACCATATTGGTATTGGC
Protein sequenceShow/hide protein sequence
MAMPSGNVGVSDKVPFPSGGAVVVSSGGGGGGGDIPQHHPWPWFPDERDALISWLRAEFAASNAIIDALCQHLRSVGEPGEYDAVIGCIQQRRYNWAPVIHMQQFFPVAE
VVYTLQQVSTRRQHRYIDPVKVGPKMYRRPGPGFKQQYQPQQGHRVEPTVKEEMVTRAESCNGGNSSSFVGSRKVEQASNTCEESKAMREDEKLNDKDSDSAEGKKDTHG
KDQSNSKPKCAENLEDNASNKESQVEPTDDGCSSSHRDKELQSVQSQNGKHYAATTPRTFVVNEIFDGKTVNVMDGLKSYEELLDDAEVSKLLSLVNDLRASGKRGQFQG
QTYVASKRPMRGHGREMIQLGFPIADAPHDDDNSSGHSKDRRIESIPSLLKDLIDRLVEEQVMTVKPDSCIIDFYNEGDHSQPHVWPPWFGRPVGVLLLTECEMTFGRLI
NSDHSGNFRGAMTLSLTPGTLLVAQGKSADFAKHAIPAIRKQRILITLTKSIPKRAAPADGQRSSLNVGSFSSWGPSSARSPNPRLSPVHKHYIPSTGVLPLASIRSQMP
PPSGIPPIIVPPVVAPPPMPFPPGPIPTGPSAWTAAHSRHPPPRLPVPGTGVFLPPGSSSSPSSQQLPNSAAETNSPAEKENGSTGRISKSEAKPQRQECNGSMDGSGSG
KVKEEEQQEEKQQVQNDGSGEV