; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh08G003610 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh08G003610
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionoxidoreductase, 2OG-Fe(II) oxygenase family protein
Genome locationCmo_Chr08:2253008..2258523
RNA-Seq ExpressionCmoCh08G003610
SyntenyCmoCh08G003610
Gene Ontology termsGO:0006402 - mRNA catabolic process (biological process)
GO:0070988 - demethylation (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0032451 - demethylase activity (molecular function)
InterPro domainsIPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
IPR044842 - RNA demethylase ALKBH9B/ALKBH10B-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593138.1 RNA demethylase ALKBH10B, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0098.29Show/hide
Query:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
        MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
Subjt:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV

Query:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVED--------DGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDP
        TAEKKRKKKN+DEEEEEQKGGDEAAVVVVED        DGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDP
Subjt:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVED--------DGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDP

Query:  IEEEDSIRSEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGE
        IEEEDSIRSEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGE
Subjt:  IEEEDSIRSEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGE

Query:  SFVLFNQQVKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSE
        SFVLFNQQVKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIP LLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSE
Subjt:  SFVLFNQQVKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSE

Query:  STMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYE
        STMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYE
Subjt:  STMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYE

Query:  AMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
        AMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
Subjt:  AMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV

KAG7025538.1 hypothetical protein SDJN02_12034, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0095.48Show/hide
Query:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
        MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
Subjt:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV

Query:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVED--------DGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDP
        TAEKKRKKKN+DEEEEEQKGGDEAAVVVVED        DGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDP
Subjt:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVED--------DGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDP

Query:  IEEEDSIRSEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHM------------VNVVKGLKCYEDVFTESELAKLDGFVDDL
        IEEEDSIRSEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEP    +            VNVVKGLKCYEDVFTESELAKLDGFVDDL
Subjt:  IEEEDSIRSEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHM------------VNVVKGLKCYEDVFTESELAKLDGFVDDL

Query:  RSAAKNGELSGESFVLFNQQVKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPH
        RSAAKNGELSGESFVLFNQQVKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIP LLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPH
Subjt:  RSAAKNGELSGESFVLFNQQVKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPH

Query:  LEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGA
        LEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGA
Subjt:  LEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGA

Query:  CALPNGVPYAYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
        CALPNGVPYAYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
Subjt:  CALPNGVPYAYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV

XP_022960250.1 uncharacterized protein LOC111461049 [Cucurbita moschata]0.0e+00100Show/hide
Query:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
        MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
Subjt:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV

Query:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIR
        TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIR
Subjt:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIR

Query:  SEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ
        SEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ
Subjt:  SEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ

Query:  VKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS
        VKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS
Subjt:  VKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS

Query:  IVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW
        IVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW
Subjt:  IVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW

Query:  GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
        GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
Subjt:  GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV

XP_023004498.1 uncharacterized protein LOC111497786 [Cucurbita maxima]0.0e+0097.92Show/hide
Query:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
        MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
Subjt:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV

Query:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIR
        TAEKK+KKKN+DEEEEE+KGGDEAAVVVVED  DGDGDGDVEMEEKKNEIKKMKEEEEN+GKICSDEKEIVEE +IEINETDGGRNEALLDPIEEEDSIR
Subjt:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIR

Query:  SEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ
        SEITDSGSHQGV PTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ
Subjt:  SEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ

Query:  VKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS
        VKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIP LLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS
Subjt:  VKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS

Query:  IVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW
        IVSDNEGNYKGPLMLS+KEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW
Subjt:  IVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW

Query:  GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
        GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQ+QPGISV
Subjt:  GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV

XP_023514846.1 uncharacterized protein LOC111779033 [Cucurbita pepo subsp. pepo]0.0e+0098.09Show/hide
Query:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
        MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
Subjt:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV

Query:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIR
        TAEKKRKKKN+DEEEEE+KGGDEAAVVVVED  DGDGDGDVEMEEKK EIKKMKEEEENDGKICSDEKEIVEET+IEINETDGGRNEA+LDPIEEEDSIR
Subjt:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIR

Query:  SEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ
        SEITDSGSHQGV PTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ
Subjt:  SEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ

Query:  VKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS
        VKGKRREMIQLGVPIFGQIREESANN+QTSNIEPIP LLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS
Subjt:  VKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS

Query:  IVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW
        IVSDNEGNYKGPLMLS+KEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW
Subjt:  IVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW

Query:  GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
        GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
Subjt:  GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV

TrEMBL top hitse value%identityAlignment
A0A0A0K544 Uncharacterized protein6.9e-25981.02Show/hide
Query:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
        MAAGAT+RARPVV+P AAA+TVTD + K+AVL WFRGEFAAANAIIDALCGH+AQVS+ GGSEYE+VF AIHRRRLNWIPVLQMQKYHPIADVA+ELRKV
Subjt:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV

Query:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETT-----------IEINETDGGRNEAL
        TA KK KK N+++EEE + GG+  AV V   +GDGDG GDVEM     E+KKM EE+E +  +  DEKEIVEE T           IEINE DGGRNE +
Subjt:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETT-----------IEINETDGGRNEAL

Query:  LDPIEEEDSIRSEITDSGSHQG--VQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNG
        L PIEEEDSI SEITDSGS  G  VQ  SA VEICSNH ECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYED+FT+SEL +L+ FVDDLRSAA NG
Subjt:  LDPIEEEDSIRSEITDSGSHQG--VQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNG

Query:  ELSGESFVLFNQQVKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPIST
        ELSG +F+LFN+QVKG RREMIQLGVPIF QI EES NNSQTSNIEPIP +LMTVIDHLIQWQLIPEYKRPNGCL NFFEEGEYSQPFQKPPHLEQPIST
Subjt:  ELSGESFVLFNQQVKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPIST

Query:  LFLSESTMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGV
        L LSESTMAFGRSIVSDNEGNYKGPL LS+KEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRP+ DQ Q PT QMSNAMTLWQP VAG CALPNG 
Subjt:  LFLSESTMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGV

Query:  PYAYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
         Y YEAMEV+PKWGILRAPVVMLAPVRP+VMSPGRSQRDGTGVFLPWAVN+RKPAKHLPPRARKGRFLAL   VETRLPDSS E PGISV
Subjt:  PYAYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV

A0A6J1GNJ1 uncharacterized protein LOC1114560416.0e-26381.56Show/hide
Query:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
        MAAGATDRARPV++P AAA  VTD + K+AVL WFRGEFAAANAIIDALCGHLAQVSD GG EYE+VF AIHRRRLNWIPVLQMQKYHPI DVA+ELRKV
Subjt:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV

Query:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEI----------VEETTIEINETDGGRNEALL
        TAEKK+KKK +++EEEE    +E A  V ED      DGDVEME KK       E +EN GK+ SDE+ +          +EE +IEINET+GGRNE L 
Subjt:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEI----------VEETTIEINETDGGRNEALL

Query:  DPIEEEDSIRSEITDSGSH---QGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNG
         PIEEEDSI SEITDSGS     GVQ +SAEVEICSNHGECEARPG MKLTKGFSAKEPVKGHMVNVVKGLKCYED+FTESEL KL+ FVDDLRSAAKNG
Subjt:  DPIEEEDSIRSEITDSGSH---QGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNG

Query:  ELSGESFVLFNQQVKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPIST
        ELSGE+FVLFNQQVKG RREMIQLGVPIFGQIR++SANN++TSNIEPIP LL TVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPIST
Subjt:  ELSGESFVLFNQQVKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPIST

Query:  LFLSESTMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPT-SQMSNAMTLWQPGVAGACALPNG
        LFLSESTMAFGRSIVSDNEGNYKGPLMLS+KEGSLLVMRGNSADVARHV+CASPNKRVTITFFRVRPD DQ Q PT  QMSNA+TLWQPGVAG C LPNG
Subjt:  LFLSESTMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPT-SQMSNAMTLWQPGVAGACALPNG

Query:  VPYAYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
          Y YEAMEV+PKWGIL APVVMLAPVRP+VMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLAL SPVETR PDSS EQPGISV
Subjt:  VPYAYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV

A0A6J1H8J8 uncharacterized protein LOC1114610490.0e+00100Show/hide
Query:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
        MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
Subjt:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV

Query:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIR
        TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIR
Subjt:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIR

Query:  SEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ
        SEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ
Subjt:  SEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ

Query:  VKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS
        VKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS
Subjt:  VKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS

Query:  IVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW
        IVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW
Subjt:  IVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW

Query:  GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
        GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
Subjt:  GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV

A0A6J1JXR1 uncharacterized protein LOC1114888194.2e-26481.9Show/hide
Query:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
        MAAGATDRARPV++P AAA  VTD + K+AVL WFRGEFAAANAIIDALCGHLAQVSD GG EYE+VF AIHRRRLNWIPVLQMQKYHPI DVA+ELRKV
Subjt:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV

Query:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEI----------VEETTIEINETDGGRNEALL
        TAEKK+KKK   EEEE      EAA  V ED      D DVEME KK       E +EN GK+CS+E+ +          +EE +IEINET+GGRNE L 
Subjt:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEI----------VEETTIEINETDGGRNEALL

Query:  DPIEEEDSIRSEITDSGSH---QGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNG
         PIEEEDSI SEITDSGS     GVQ +SAEVEICSNHGECEARPG MKLTKGFSAKEPVKGHMVNVVKGLKCYED+FTESEL KL+ FVDDLRSAAKNG
Subjt:  DPIEEEDSIRSEITDSGSH---QGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNG

Query:  ELSGESFVLFNQQVKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPIST
        ELSGE+FVLFNQQVKG RREMIQLGVPIFGQIR++SANNS+TSNIEPIP LL+TVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPIST
Subjt:  ELSGESFVLFNQQVKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPIST

Query:  LFLSESTMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPT-SQMSNAMTLWQPGVAGACALPNG
        LFLSESTMAFGRSIVSDNEGNYKGPLMLS+KEGSLLVMRGNSADVARHV+CASPNKRVTITFFRVRPD DQ Q PT  Q+SN +TLWQPGVAG CALPNG
Subjt:  LFLSESTMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPT-SQMSNAMTLWQPGVAGACALPNG

Query:  VPYAYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
        V Y YEAMEV+PKWGIL APVVMLAPVRP+VMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLAL SPVETRLPDSS EQPGISV
Subjt:  VPYAYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV

A0A6J1KZQ2 uncharacterized protein LOC1114977860.0e+0097.92Show/hide
Query:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
        MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV
Subjt:  MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKV

Query:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIR
        TAEKK+KKKN+DEEEEE+KGGDEAAVVVVED  DGDGDGDVEMEEKKNEIKKMKEEEEN+GKICSDEKEIVEE +IEINETDGGRNEALLDPIEEEDSIR
Subjt:  TAEKKRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIR

Query:  SEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ
        SEITDSGSHQGV PTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ
Subjt:  SEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ

Query:  VKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS
        VKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIP LLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS
Subjt:  VKGKRREMIQLGVPIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRS

Query:  IVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW
        IVSDNEGNYKGPLMLS+KEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW
Subjt:  IVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKW

Query:  GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
        GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQ+QPGISV
Subjt:  GILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV

SwissProt top hitse value%identityAlignment
Q9SL49 RNA demethylase ALKBH9B2.9e-2828.9Show/hide
Query:  ENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIRSEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKG
        E+   +      +VE  +  +   D  + +   D  EEE+    + +  G       T  + ++  +  E   R   +K  K F   E VKG +VNV+ G
Subjt:  ENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIRSEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKG

Query:  LKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQVKGKRREMIQLGVPIFGQIREESANNS---QTSNIEPIPSLLMTVIDHLIQWQLIPE
        L+ +  VF+  E  ++   V  L+   + GEL   +F   ++ ++GK RE IQ G   +    + + N     Q   ++P+P L   +I  LI+W ++P 
Subjt:  LKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQVKGKRREMIQLGVPIFGQIREESANNS---QTSNIEPIPSLLMTVIDHLIQWQLIPE

Query:  YKRPNGCLVNFFEEGEYSQPFQKPPHLE-----QPISTL-FLSESTMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTIT
           P+ C+VN ++EG+       PPH++     +P  T+ FLSE  + FG ++  +  G++ G   + +  GS+LV+ GN ADVA+H + A P KR++IT
Subjt:  YKRPNGCLVNFFEEGEYSQPFQKPPHLE-----QPISTL-FLSESTMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTIT

Query:  F
        F
Subjt:  F

Q9ZT92 RNA demethylase ALKBH10B7.9e-15152.3Show/hide
Query:  TDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDD-GGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKVTAEK
        T +A  V V    A  V++ +GK+A+++WFRGEFAAANAIIDA+C HL    +   GSEYE+VFAAIHRRRLNWIPVLQMQKYH IA+VA+EL+KV A+K
Subjt:  TDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDD-GGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKVTAEK

Query:  KRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIRSEIT
            K +  EE                          E EE   E+   +EEE    K C + +++ E      N+ +G   +       E+DS  S+IT
Subjt:  KRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIRSEIT

Query:  DSGSHQGVQPT----SAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ
        DSGSHQ V  T    +A   IC +H +C+AR  ++K  KGF AKE VKGH VNVVKGLK YE++  E E++KL  FV +LR A  NG+L+GESF+LFN+Q
Subjt:  DSGSHQGVQPT----SAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ

Query:  VKGKRREMIQLGVPIFGQIR-EESANNSQTS-NIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFG
        +KG +RE+IQLGVPIFG ++ +E++N++  S NIEPIP LL +VIDH + W+LIPEYKRPNGC++NFFEEGEYSQPF KPPHLEQPISTL LSESTMA+G
Subjt:  VKGKRREMIQLGVPIFGQIR-EESANNSQTS-NIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFG

Query:  RSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDC--DQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEV
        R + SDNEGN++GPL LS+K+GSLLVMRGNSAD+ARHVMC S NKRV+ITFFR+RPD   +  QP + +    MT+WQP         NG  +   ++++
Subjt:  RSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDC--DQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEV

Query:  VPKWGILRAPVVMLA--PVRPVVM-SPG-RSQRDGTGVFLPWAV--NSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
        +PK G+LR P+VM+A  PV+P+++ SP       GTGVFLPWA   +SRK  KHLPPRA+K R L L     +     S  +P I+V
Subjt:  VPKWGILRAPVVMLA--PVRPVVM-SPG-RSQRDGTGVFLPWAV--NSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV

Arabidopsis top hitse value%identityAlignment
AT1G14710.1 hydroxyproline-rich glycoprotein family protein2.0e-6133.27Show/hide
Query:  PMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKVTAEKKRK---KKNRDEEEEEQKGG
        P  ++  ++W R EFAAANAIID+LC HL  V D   +EYESV  +IH RRL W  VL MQ++ P+ADV+  L+++  +++++   +++ + ++  + G 
Subjt:  PMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKVTAEKKRK---KKNRDEEEEEQKGG

Query:  DEAAVVVVEDDGDGDG----DGDVEMEEKKNEIKKMKEEEENDGKICSDEK--EIVEETT--IEINETDGGRNEALLDPIEEEDSIRSEITDSGSHQGVQ
          +     +  G G G    D         N +   + E   + K+ SD K   + EE     E   +D    + L +   +E+ +++   +SGS     
Subjt:  DEAAVVVVEDDGDGDG----DGDVEMEEKKNEIKKMKEEEENDGKICSDEK--EIVEETT--IEINETDGGRNEALLDPIEEEDSIRSEITDSGSHQGVQ

Query:  PTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQVKGKRREMIQLGV
         +  + E   N  EC A      + K F  +E     MVNVV+GLK Y+ +   +E+++L   V +LR A + G+L  E++V + +  +G  REMIQLG+
Subjt:  PTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQVKGKRREMIQLGV

Query:  PIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPL
        PI     ++  ++ +   IEPIPS L  +I+ L+  Q+IP   +P+ C+++FF EG++SQP    P   +PIS L LSE    FGR IVS+N G+YKG L
Subjt:  PIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPL

Query:  MLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKWGILRAP
         LS+  GS+L++ G SA++A++ + A+  +R+ I+F + +P    + PP S+  N               P G P  Y    V+P  G+L  P
Subjt:  MLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKWGILRAP

AT1G14710.2 hydroxyproline-rich glycoprotein family protein2.0e-6133.27Show/hide
Query:  PMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKVTAEKKRK---KKNRDEEEEEQKGG
        P  ++  ++W R EFAAANAIID+LC HL  V D   +EYESV  +IH RRL W  VL MQ++ P+ADV+  L+++  +++++   +++ + ++  + G 
Subjt:  PMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKVTAEKKRK---KKNRDEEEEEQKGG

Query:  DEAAVVVVEDDGDGDG----DGDVEMEEKKNEIKKMKEEEENDGKICSDEK--EIVEETT--IEINETDGGRNEALLDPIEEEDSIRSEITDSGSHQGVQ
          +     +  G G G    D         N +   + E   + K+ SD K   + EE     E   +D    + L +   +E+ +++   +SGS     
Subjt:  DEAAVVVVEDDGDGDG----DGDVEMEEKKNEIKKMKEEEENDGKICSDEK--EIVEETT--IEINETDGGRNEALLDPIEEEDSIRSEITDSGSHQGVQ

Query:  PTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQVKGKRREMIQLGV
         +  + E   N  EC A      + K F  +E     MVNVV+GLK Y+ +   +E+++L   V +LR A + G+L  E++V + +  +G  REMIQLG+
Subjt:  PTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQVKGKRREMIQLGV

Query:  PIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPL
        PI     ++  ++ +   IEPIPS L  +I+ L+  Q+IP   +P+ C+++FF EG++SQP    P   +PIS L LSE    FGR IVS+N G+YKG L
Subjt:  PIFGQIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPL

Query:  MLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKWGILRAP
         LS+  GS+L++ G SA++A++ + A+  +R+ I+F + +P    + PP S+  N               P G P  Y    V+P  G+L  P
Subjt:  MLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKWGILRAP

AT2G48080.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein6.0e-12245.99Show/hide
Query:  VTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKVTAEKKRKKKNRDEEEEEQK
        V ++D   K+A+L WFRGEFAAANAIIDALC HL Q S  G ++YESV AA+HRRRLNWIPVLQMQKYH I+ V L+L++  A+                
Subjt:  VTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKVTAEKKRKKKNRDEEEEEQK

Query:  GGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIRSEITDSGSHQGVQPTSAEV
                                                                       G  +   LD   ++DS  S+ITD GS +        +
Subjt:  GGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIRSEITDSGSHQGVQPTSAEV

Query:  EICSNH-GECEARPGQ-MKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQVKGKRREMIQLGVPIFG
         IC  H  ECE+R    +K +K FSAKE V+GH  NVVKGLK Y+DVFT  +L+KL   ++ LR A +N +LSGE+FVLFN+  KG +RE++QLGVPIFG
Subjt:  EICSNH-GECEARPGQ-MKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQVKGKRREMIQLGVPIFG

Query:  QIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSM
               N +   ++EPIP+L+ +VIDHL+QW+LIPEYKRPNGC++NFF+E E+SQPFQKPPH++QPISTL LSESTM FG  +  DN+GN++G L L +
Subjt:  QIREESANNSQTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSM

Query:  KEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKWGILRAPVVMLAPVRPVV
        KEGSLLVMRGNSAD+ARHVMC SPNKRV ITFF+++PD  + QPP        TLW+PG                            +P+VMLAP     
Subjt:  KEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKWGILRAPVVMLAPVRPVV

Query:  MSPGRSQRDGTGVFLPWAVN-SRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
         +P R    GTGVFLPW    SRKPAKHLPPR ++ R L+ S  V     DS    P I V
Subjt:  MSPGRSQRDGTGVFLPWAVN-SRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV

AT4G02940.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein5.6e-15252.3Show/hide
Query:  TDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDD-GGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKVTAEK
        T +A  V V    A  V++ +GK+A+++WFRGEFAAANAIIDA+C HL    +   GSEYE+VFAAIHRRRLNWIPVLQMQKYH IA+VA+EL+KV A+K
Subjt:  TDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDD-GGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKVTAEK

Query:  KRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIRSEIT
            K +  EE                          E EE   E+   +EEE    K C + +++ E      N+ +G   +       E+DS  S+IT
Subjt:  KRKKKNRDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIRSEIT

Query:  DSGSHQGVQPT----SAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ
        DSGSHQ V  T    +A   IC +H +C+AR  ++K  KGF AKE VKGH VNVVKGLK YE++  E E++KL  FV +LR A  NG+L+GESF+LFN+Q
Subjt:  DSGSHQGVQPT----SAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQ

Query:  VKGKRREMIQLGVPIFGQIR-EESANNSQTS-NIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFG
        +KG +RE+IQLGVPIFG ++ +E++N++  S NIEPIP LL +VIDH + W+LIPEYKRPNGC++NFFEEGEYSQPF KPPHLEQPISTL LSESTMA+G
Subjt:  VKGKRREMIQLGVPIFGQIR-EESANNSQTS-NIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFG

Query:  RSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDC--DQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEV
        R + SDNEGN++GPL LS+K+GSLLVMRGNSAD+ARHVMC S NKRV+ITFFR+RPD   +  QP + +    MT+WQP         NG  +   ++++
Subjt:  RSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITFFRVRPDC--DQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEV

Query:  VPKWGILRAPVVMLA--PVRPVVM-SPG-RSQRDGTGVFLPWAV--NSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV
        +PK G+LR P+VM+A  PV+P+++ SP       GTGVFLPWA   +SRK  KHLPPRA+K R L L     +     S  +P I+V
Subjt:  VPKWGILRAPVVMLA--PVRPVVM-SPG-RSQRDGTGVFLPWAV--NSRKPAKHLPPRARKGRFLALSSPVETRLPDSSQEQPGISV

AT4G36090.2 oxidoreductase, 2OG-Fe(II) oxygenase family protein3.4e-3228.39Show/hide
Query:  SDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIRSEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDV
        S +  + E  + +++  D G  + L +  ++E+ + S   D     G    + E    S       R   +K  K FS  E V+G  VN+++GL+ +  V
Subjt:  SDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIRSEITDSGSHQGVQPTSAEVEICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDV

Query:  FTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQVKGKRREMIQLGVPIFGQIREESANNS---QTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGC
        F+  E  K+  FV +L+   + GEL   +F   ++ ++GK R  IQ G   +    +++ N     Q  +++P+PS+   +I  L+ W ++P    P+ C
Subjt:  FTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQVKGKRREMIQLGVPIFGQIREESANNS---QTSNIEPIPSLLMTVIDHLIQWQLIPEYKRPNGC

Query:  LVNFFEEGEYSQPFQKPPHLE-----QPISTL-FLSESTMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITF------
        +VN +EE +       PPH++     +P  T+ FLSE  + FG ++     G + G   + +  GS+LV++GN ADVA+H + A P KR++ITF      
Subjt:  LVNFFEEGEYSQPFQKPPHLE-----QPISTL-FLSESTMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCASPNKRVTITF------

Query:  -----FRVRPDCDQYQP
             F   PD ++ +P
Subjt:  -----FRVRPDCDQYQP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCGGGGGCAACTGATCGAGCGCGGCCGGTGGTGGTGCCAACGGCGGCGGCTGTGACGGTGACGGACCCAATGGGGAAGGAAGCGGTGTTGGCGTGGTTCAGAGG
GGAGTTCGCGGCGGCGAACGCGATTATTGATGCGCTGTGTGGACATCTGGCGCAGGTGAGTGACGATGGAGGATCGGAGTACGAATCAGTGTTCGCTGCGATTCATAGAC
GGCGGCTGAATTGGATTCCGGTCCTGCAAATGCAGAAGTATCATCCGATCGCTGACGTTGCCTTGGAGCTACGGAAAGTGACGGCGGAGAAAAAGAGAAAGAAGAAGAAT
CGGGATGAGGAAGAGGAGGAGCAGAAAGGAGGCGATGAGGCGGCGGTAGTGGTGGTCGAGGACGACGGCGACGGCGATGGCGACGGTGATGTCGAAATGGAGGAAAAGAA
GAACGAGATTAAGAAAATGAAGGAAGAGGAGGAAAATGACGGAAAGATTTGTTCGGATGAGAAGGAAATCGTCGAAGAGACAACGATCGAGATTAACGAAACCGATGGCG
GAAGAAATGAAGCTCTTCTGGATCCAATCGAAGAGGAGGATTCAATTCGAAGCGAAATAACTGATTCAGGATCTCATCAAGGAGTGCAGCCCACTTCAGCAGAAGTTGAG
ATATGCAGTAATCATGGGGAGTGTGAAGCACGTCCAGGACAGATGAAATTGACAAAAGGTTTTTCCGCCAAGGAGCCAGTAAAAGGGCACATGGTGAATGTTGTGAAGGG
ATTGAAGTGTTATGAAGACGTTTTTACTGAGTCTGAATTGGCTAAGTTGGATGGGTTTGTTGATGATCTTCGCTCTGCTGCTAAGAATGGAGAGCTCTCTGGAGAGTCAT
TTGTTTTATTCAATCAGCAAGTGAAGGGCAAACGACGAGAGATGATCCAGCTTGGTGTCCCCATTTTTGGACAGATCAGAGAGGAATCGGCCAATAACAGCCAAACAAGT
AATATCGAGCCAATTCCATCTCTTCTTATGACGGTGATAGATCATCTTATTCAGTGGCAACTGATTCCCGAGTATAAAAGACCAAATGGGTGTCTCGTTAATTTCTTTGA
AGAGGGAGAGTACTCACAGCCATTCCAGAAACCTCCACACTTGGAACAGCCCATTTCCACTCTGTTTCTCTCTGAATCAACCATGGCCTTTGGGCGTTCTATTGTCAGTG
ATAACGAAGGCAACTATAAGGGGCCACTCATGCTGTCCATGAAGGAAGGGTCTCTTTTGGTCATGAGGGGGAACAGTGCAGACGTTGCACGTCATGTCATGTGTGCATCG
CCTAACAAACGGGTCACCATCACGTTCTTCCGAGTTCGACCAGACTGTGATCAATACCAACCACCGACGTCTCAGATGTCGAACGCGATGACTCTCTGGCAACCAGGAGT
TGCAGGTGCATGTGCCTTGCCTAATGGAGTGCCCTACGCCTATGAAGCAATGGAGGTAGTGCCAAAATGGGGGATTCTCCGTGCTCCTGTTGTCATGTTAGCTCCTGTCC
GCCCTGTGGTGATGAGCCCTGGAAGATCTCAACGTGATGGCACTGGAGTGTTCTTACCATGGGCTGTTAATTCAAGAAAACCAGCCAAACATCTTCCTCCTCGTGCTCGA
AAAGGGCGGTTCCTGGCACTATCTTCCCCTGTCGAAACTCGTCTACCAGATTCATCTCAGGAGCAGCCAGGCATAAGTGTTTGA
mRNA sequenceShow/hide mRNA sequence
TTTACACTTTTGCACACAAATTTCCATATTTGTCTATTATATATATATATATATATATTGCATTTTTGTGTGTGTATATATATATATTGAATTGATTTGAGGCTGTGGGG
AATCGAAGAGGGTAAGAATTCGGTTCCGAAATTTCGGTGAAGTCGGAATCGGGGAGGAGAGACTGAGAGAGGGGAGATTTTTATATTGGGATAAATTTTTTTGGTTTGGT
ATTGATATTGGTGGATGCCATGGCGGCGGGGGCAACTGATCGAGCGCGGCCGGTGGTGGTGCCAACGGCGGCGGCTGTGACGGTGACGGACCCAATGGGGAAGGAAGCGG
TGTTGGCGTGGTTCAGAGGGGAGTTCGCGGCGGCGAACGCGATTATTGATGCGCTGTGTGGACATCTGGCGCAGGTGAGTGACGATGGAGGATCGGAGTACGAATCAGTG
TTCGCTGCGATTCATAGACGGCGGCTGAATTGGATTCCGGTCCTGCAAATGCAGAAGTATCATCCGATCGCTGACGTTGCCTTGGAGCTACGGAAAGTGACGGCGGAGAA
AAAGAGAAAGAAGAAGAATCGGGATGAGGAAGAGGAGGAGCAGAAAGGAGGCGATGAGGCGGCGGTAGTGGTGGTCGAGGACGACGGCGACGGCGATGGCGACGGTGATG
TCGAAATGGAGGAAAAGAAGAACGAGATTAAGAAAATGAAGGAAGAGGAGGAAAATGACGGAAAGATTTGTTCGGATGAGAAGGAAATCGTCGAAGAGACAACGATCGAG
ATTAACGAAACCGATGGCGGAAGAAATGAAGCTCTTCTGGATCCAATCGAAGAGGAGGATTCAATTCGAAGCGAAATAACTGATTCAGGATCTCATCAAGGAGTGCAGCC
CACTTCAGCAGAAGTTGAGATATGCAGTAATCATGGGGAGTGTGAAGCACGTCCAGGACAGATGAAATTGACAAAAGGTTTTTCCGCCAAGGAGCCAGTAAAAGGGCACA
TGGTGAATGTTGTGAAGGGATTGAAGTGTTATGAAGACGTTTTTACTGAGTCTGAATTGGCTAAGTTGGATGGGTTTGTTGATGATCTTCGCTCTGCTGCTAAGAATGGA
GAGCTCTCTGGAGAGTCATTTGTTTTATTCAATCAGCAAGTGAAGGGCAAACGACGAGAGATGATCCAGCTTGGTGTCCCCATTTTTGGACAGATCAGAGAGGAATCGGC
CAATAACAGCCAAACAAGTAATATCGAGCCAATTCCATCTCTTCTTATGACGGTGATAGATCATCTTATTCAGTGGCAACTGATTCCCGAGTATAAAAGACCAAATGGGT
GTCTCGTTAATTTCTTTGAAGAGGGAGAGTACTCACAGCCATTCCAGAAACCTCCACACTTGGAACAGCCCATTTCCACTCTGTTTCTCTCTGAATCAACCATGGCCTTT
GGGCGTTCTATTGTCAGTGATAACGAAGGCAACTATAAGGGGCCACTCATGCTGTCCATGAAGGAAGGGTCTCTTTTGGTCATGAGGGGGAACAGTGCAGACGTTGCACG
TCATGTCATGTGTGCATCGCCTAACAAACGGGTCACCATCACGTTCTTCCGAGTTCGACCAGACTGTGATCAATACCAACCACCGACGTCTCAGATGTCGAACGCGATGA
CTCTCTGGCAACCAGGAGTTGCAGGTGCATGTGCCTTGCCTAATGGAGTGCCCTACGCCTATGAAGCAATGGAGGTAGTGCCAAAATGGGGGATTCTCCGTGCTCCTGTT
GTCATGTTAGCTCCTGTCCGCCCTGTGGTGATGAGCCCTGGAAGATCTCAACGTGATGGCACTGGAGTGTTCTTACCATGGGCTGTTAATTCAAGAAAACCAGCCAAACA
TCTTCCTCCTCGTGCTCGAAAAGGGCGGTTCCTGGCACTATCTTCCCCTGTCGAAACTCGTCTACCAGATTCATCTCAGGAGCAGCCAGGCATAAGTGTTTGAGTTTAAA
ATCAGGCTGTGGCGATTCCGAAGTCGAACCAACACGACTCGACTCGACTAAGTAGTGTTCTTGTCCTTTTACAGATCTAAGTGAGTTTTTCCCATTCCATTCCATCCCCA
CTCCCCACTCCCTACACCACAAAACCACTTATTTATTTGGATTTCATGTTCTTAGGG
Protein sequenceShow/hide protein sequence
MAAGATDRARPVVVPTAAAVTVTDPMGKEAVLAWFRGEFAAANAIIDALCGHLAQVSDDGGSEYESVFAAIHRRRLNWIPVLQMQKYHPIADVALELRKVTAEKKRKKKN
RDEEEEEQKGGDEAAVVVVEDDGDGDGDGDVEMEEKKNEIKKMKEEEENDGKICSDEKEIVEETTIEINETDGGRNEALLDPIEEEDSIRSEITDSGSHQGVQPTSAEVE
ICSNHGECEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDVFTESELAKLDGFVDDLRSAAKNGELSGESFVLFNQQVKGKRREMIQLGVPIFGQIREESANNSQTS
NIEPIPSLLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSMKEGSLLVMRGNSADVARHVMCAS
PNKRVTITFFRVRPDCDQYQPPTSQMSNAMTLWQPGVAGACALPNGVPYAYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNSRKPAKHLPPRAR
KGRFLALSSPVETRLPDSSQEQPGISV