; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003182 (gene) of Snake gourd v1 genome

Gene IDTan0003182
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationLG06:7945409..7950864
RNA-Seq ExpressionTan0003182
SyntenyTan0003182
Gene Ontology termsGO:0006402 - mRNA catabolic process (biological process)
GO:0070988 - demethylation (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0032451 - demethylase activity (molecular function)
InterPro domainsIPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
IPR044842 - RNA demethylase ALKBH9B/ALKBH10B-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593138.1 RNA demethylase ALKBH10B, partial [Cucurbita argyrosperma subsp. sororia]1.6e-28185.26Show/hide
Query:  MAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MAAGATDRARPVVVP   TA AV VTDPMGK+A+LAWFRGEFAAANAIIDALCGHLA VSD GGSEYE+VF AIHRRRLNWIPVLQMQKYHPIADVA+EL
Subjt:  MAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  RKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVV---------------AEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEM
        RKVTAEKK KKKNQ+EE EEE+KGG    EAAVV                +GDGD+EMEEKK+E+K M EEEEN+GKICSDEKE V           EE 
Subjt:  RKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVV---------------AEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEM

Query:  SIEINETDGGRNDEALAPIEEEDSIGSEITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKL
        +IEINETDGGRN+  L PIEEEDSI SEITDSGS   GVQ TSAEVEIC+NHG+CEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYED+FTESELAKL
Subjt:  SIEINETDGGRNDEALAPIEEEDSIGSEITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKL

Query:  NYFVDDIRSAAKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQ
        + FVDD+RSAAKNGELSGESFVLFNQQVKG RREMIQ GVPIFGQIREESANNSQTSNIEPIPPLL+TVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQ
Subjt:  NYFVDDIRSAAKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQ

Query:  PFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLW
        PFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLS+KEGSLLV RGNSADVARHVMCASPNKRVTITFFRVRPD DQ Q PT  MSNAMTLW
Subjt:  PFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLW

Query:  QPGVAAACTLPNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHE-P
        QPGVA AC LPNG PY YEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVN+RKPAKHLPPRARKGRFLALSSPVETRLPDSS E P
Subjt:  QPGVAAACTLPNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHE-P

Query:  GISV
        GISV
Subjt:  GISV

XP_022960250.1 uncharacterized protein LOC111461049 [Cucurbita moschata]4.7e-28186.07Show/hide
Query:  MAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MAAGATDRARPVVVP   TA AV VTDPMGK+A+LAWFRGEFAAANAIIDALCGHLA VSD GGSEYE+VF AIHRRRLNWIPVLQMQKYHPIADVA+EL
Subjt:  MAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  RKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVV-------AEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETD
        RKVTAEKK KKKN++EE EEE+KGG    EAAVV        +GDGD+EMEEKK+E+K M EEEEN+GKICSDEKE V           EE +IEINETD
Subjt:  RKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVV-------AEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETD

Query:  GGRNDEALAPIEEEDSIGSEITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIR
        GGRN+  L PIEEEDSI SEITDSGS   GVQ TSAEVEIC+NHG+CEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYED+FTESELAKL+ FVDD+R
Subjt:  GGRNDEALAPIEEEDSIGSEITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIR

Query:  SAAKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHL
        SAAKNGELSGESFVLFNQQVKG RREMIQ GVPIFGQIREESANNSQTSNIEPIP LL+TVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHL
Subjt:  SAAKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHL

Query:  EQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAAAC
        EQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLS+KEGSLLV RGNSADVARHVMCASPNKRVTITFFRVRPD DQ Q PT  MSNAMTLWQPGVA AC
Subjt:  EQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAAAC

Query:  TLPNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHE-PGISV
         LPNG PY YEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVN+RKPAKHLPPRARKGRFLALSSPVETRLPDSS E PGISV
Subjt:  TLPNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHE-PGISV

XP_023004498.1 uncharacterized protein LOC111497786 [Cucurbita maxima]1.0e-28387.21Show/hide
Query:  MAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MAAGATDRARPVVVP   TA AV VTDPMGK+A+LAWFRGEFAAANAIIDALCGHLA VSD GGSEYE+VF AIHRRRLNWIPVLQMQKYHPIADVA+EL
Subjt:  MAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  RKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVV-----AEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGG
        RKVTAEKK KKKNQ+EE EEEEKGG    EAAVV      +GDGD+EMEEKK+E+K M EEEENEGKICSDEKE V           EEMSIEINETDGG
Subjt:  RKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVV-----AEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGG

Query:  RNDEALAPIEEEDSIGSEITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSA
        RN+  L PIEEEDSI SEITDSGS   GV  TSAEVEIC+NHG+CEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYED+FTESELAKL+ FVDD+RSA
Subjt:  RNDEALAPIEEEDSIGSEITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSA

Query:  AKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQ
        AKNGELSGESFVLFNQQVKG RREMIQ GVPIFGQIREESANNSQTSNIEPIPPLL+TVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQ
Subjt:  AKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQ

Query:  PISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAAACTL
        PISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLV RGNSADVARHVMCASPNKRVTITFFRVRPD DQ Q PT  MSNAMTLWQPGVA AC L
Subjt:  PISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAAACTL

Query:  PNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSS-HEPGISV
        PNG PY YEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVN+RKPAKHLPPRARKGRFLALSSPVETRLPDSS  +PGISV
Subjt:  PNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSS-HEPGISV

XP_023514846.1 uncharacterized protein LOC111779033 [Cucurbita pepo subsp. pepo]1.1e-28286.87Show/hide
Query:  MAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MAAGATDRARPVVVP   TA AV VTDPMGK+A+LAWFRGEFAAANAIIDALCGHLA VSD GGSEYE+VF AIHRRRLNWIPVLQMQKYHPIADVA+EL
Subjt:  MAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  RKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVV-----AEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGG
        RKVTAEKK KKKNQ+EE EEEEKGG    EAAVV      +GDGD+EMEEKK E+K M EEEEN+GKICSDEKE V           EE SIEINETDGG
Subjt:  RKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVV-----AEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGG

Query:  RNDEALAPIEEEDSIGSEITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSA
        RN+  L PIEEEDSI SEITDSGS   GV  TSAEVEIC+NHG+CEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYED+FTESELAKL+ FVDD+RSA
Subjt:  RNDEALAPIEEEDSIGSEITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSA

Query:  AKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQ
        AKNGELSGESFVLFNQQVKG RREMIQ GVPIFGQIREESANN+QTSNIEPIPPLL+TVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQ
Subjt:  AKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQ

Query:  PISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAAACTL
        PISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLV RGNSADVARHVMCASPNKRVTITFFRVRPD DQ Q PT  MSNAMTLWQPGVA AC L
Subjt:  PISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAAACTL

Query:  PNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHE-PGISV
        PNG PY YEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVN+RKPAKHLPPRARKGRFLALSSPVETRLPDSS E PGISV
Subjt:  PNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHE-PGISV

XP_023549480.1 uncharacterized protein LOC111807830 [Cucurbita pepo subsp. pepo]1.5e-27985.4Show/hide
Query:  MPMAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAV
        MPMAAGATDRARPV++P    A A AVTD + KDA+L WFRGEFAAANAIIDALCGHLA VSD GG EYEAVFGAIHRRRLNWIPVLQMQKYHPI DVAV
Subjt:  MPMAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAV

Query:  ELRKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGRND
        ELRKVTAEKK KKK ++ ++EEEE+      EAA VAE DGD+EME KK+     +E +EN GK+CSDE EFVEE+AN+E+ KIEEMSIEINET+GGRN+
Subjt:  ELRKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGRND

Query:  EALAPIEEEDSIGSEITDSGSQ----GGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRS
        + LAPIEEEDSIGSEITDSGSQ    GGGVQA+SAEVEIC+NHG+CEARPG MKLTKGFSAKEPVKGHMVNVVKGLKCYE+IFTESEL KLN FVDD+RS
Subjt:  EALAPIEEEDSIGSEITDSGSQ----GGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRS

Query:  AAKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLE
        AAKNGELSGE+FVLFNQQVKGNRREMIQ GVPIFGQIR++SANNS+TSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLE
Subjt:  AAKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLE

Query:  QPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTP-HMSNAMTLWQPGVAAAC
        QPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLV RGNSADVARHV+CASPNKRVTITFFRVRPDYDQCQSPTP  MSNA+TLWQPGVA  C
Subjt:  QPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTP-HMSNAMTLWQPGVAAAC

Query:  TLPNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHE-PGISV
        TLPNGA YGYEAMEV+PKWGIL APVVMLAPVRP+VMSPGRSQRDGTGVFLPWAVN+RKPAKHLPPRARKGRFLAL SPVETRLPDSS+E PGISV
Subjt:  TLPNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHE-PGISV

TrEMBL top hitse value%identityAlignment
A0A1S3BQT2 uncharacterized protein LOC1034927031.2e-26983.08Show/hide
Query:  MPMAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAV
        MPMAAGATDR RPVV+PA   A A+ VTD + KDA+L WFRGEFAAANAIIDALCGHLA VS+ GGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAV
Subjt:  MPMAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAV

Query:  ELRKVTAEKKMKKKNQEEEQEEEEKGGQT-ATE-AAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGR
        ELRKVTA KK KK N+ +E+EEE KGG+  A E AA VAEGDGD+EME KK     M+EE         DEKEFVEE+ N    KIEE+SIEINE DGGR
Subjt:  ELRKVTAEKKMKKKNQEEEQEEEEKGGQT-ATE-AAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGR

Query:  NDEALAPIEEEDSIGSEITDSGSQGGG-----VQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDD
        N E LAPIEEEDSIGSEITDSGSQGGG     VQA  A+VEIC+NH +CEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFT+SELA+LN FVD 
Subjt:  NDEALAPIEEEDSIGSEITDSGSQGGG-----VQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDD

Query:  IRSAAKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPP
        +RSAA NGELSG +F+LFN+QVKG+RREMIQ GVPIF QI EES NNSQTSNIEPIP +L+TVIDHLIQWQLIPEYKRPNGCL NFFEEGEYSQPFQKPP
Subjt:  IRSAAKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPP

Query:  HLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAA
        HLEQPISTL LSESTMAFGRSIVSDNEGNYKGPL LSLKEGSLLV RGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTP MSNAMTLWQP VA 
Subjt:  HLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAA

Query:  ACTLPNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHEPGISV
         C LPNGA YGYEAMEV+PKWGILRAPVVMLAPVRP+VMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLAL   VETRLPDSSHEPGISV
Subjt:  ACTLPNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHEPGISV

A0A6J1GNJ1 uncharacterized protein LOC1114560411.0e-27885.52Show/hide
Query:  MPMAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAV
        MPMAAGATDRARPV++P    A A AVTD + KDA+L WFRGEFAAANAIIDALCGHLA VSD GG EYEAVFGAIHRRRLNWIPVLQMQKYHPI DVAV
Subjt:  MPMAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAV

Query:  ELRKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGRND
        ELRKVTAEKK KKK Q +E+EEEE       EAA VAE DGD+EME KK+     +E +EN GK+ SDE EFVEE+AN+E+ KIEEMSIEINET+GGRN+
Subjt:  ELRKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGRND

Query:  EALAPIEEEDSIGSEITDSGSQ--GGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAA
        + LAPIEEEDSIGSEITDSGSQ  GGGVQA+SAEVEIC+NHG+CEARPG MKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESEL KLN FVDD+RSAA
Subjt:  EALAPIEEEDSIGSEITDSGSQ--GGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAA

Query:  KNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQP
        KNGELSGE+FVLFNQQVKGNRREMIQ GVPIFGQIR++SANN++TSNIEPIPPLL TVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQP
Subjt:  KNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQP

Query:  ISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTP-HMSNAMTLWQPGVAAACTL
        ISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLV RGNSADVARHV+CASPNKRVTITFFRVRPDYDQCQSPTP  MSNA+TLWQPGVA  CTL
Subjt:  ISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTP-HMSNAMTLWQPGVAAACTL

Query:  PNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHE-PGISV
        PNGA YGYEAMEV+PKWGIL APVVMLAPVRP+VMSPGRSQRDGTGVFLPWAVN+RKPAKHLPPRARKGRFLAL SPVETR PDSS+E PGISV
Subjt:  PNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHE-PGISV

A0A6J1H8J8 uncharacterized protein LOC1114610492.3e-28186.07Show/hide
Query:  MAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MAAGATDRARPVVVP   TA AV VTDPMGK+A+LAWFRGEFAAANAIIDALCGHLA VSD GGSEYE+VF AIHRRRLNWIPVLQMQKYHPIADVA+EL
Subjt:  MAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  RKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVV-------AEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETD
        RKVTAEKK KKKN++EE EEE+KGG    EAAVV        +GDGD+EMEEKK+E+K M EEEEN+GKICSDEKE V           EE +IEINETD
Subjt:  RKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVV-------AEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETD

Query:  GGRNDEALAPIEEEDSIGSEITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIR
        GGRN+  L PIEEEDSI SEITDSGS   GVQ TSAEVEIC+NHG+CEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYED+FTESELAKL+ FVDD+R
Subjt:  GGRNDEALAPIEEEDSIGSEITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIR

Query:  SAAKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHL
        SAAKNGELSGESFVLFNQQVKG RREMIQ GVPIFGQIREESANNSQTSNIEPIP LL+TVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHL
Subjt:  SAAKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHL

Query:  EQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAAAC
        EQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLS+KEGSLLV RGNSADVARHVMCASPNKRVTITFFRVRPD DQ Q PT  MSNAMTLWQPGVA AC
Subjt:  EQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAAAC

Query:  TLPNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHE-PGISV
         LPNG PY YEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVN+RKPAKHLPPRARKGRFLALSSPVETRLPDSS E PGISV
Subjt:  TLPNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHE-PGISV

A0A6J1JXR1 uncharacterized protein LOC1114888191.4e-27584.51Show/hide
Query:  MPMAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAV
        MPMAAGATDRARPV++P    A A AVTD + KDA+L WFRGEFAAANAIIDALCGHLA VSD GG EYEAVFGAIHRRRLNWIPVLQMQKYHPI DVAV
Subjt:  MPMAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAV

Query:  ELRKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGRND
        ELRKVTAEKK KKK  +EE+EE          AA VAE D D+EME KK+     +E +EN GK+CS+E EFVEE+AN+ + KIEEMSIEINET+GGRN+
Subjt:  ELRKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGRND

Query:  EALAPIEEEDSIGSEITDSGSQ--GGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAA
        + LAPIEEEDSIGSEITDSGSQ  GGGVQA+SAEVEIC+NHG+CEARPG MKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESEL KLN FVDD+RSAA
Subjt:  EALAPIEEEDSIGSEITDSGSQ--GGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAA

Query:  KNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQP
        KNGELSGE+FVLFNQQVKGNRREMIQ GVPIFGQIR++SANNS+TSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQP
Subjt:  KNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQP

Query:  ISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTP-HMSNAMTLWQPGVAAACTL
        ISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLV RGNSADVARHV+CASPNKRVTITFFRVRPDYDQCQSPTP  +SN +TLWQPGVA  C L
Subjt:  ISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTP-HMSNAMTLWQPGVAAACTL

Query:  PNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHE-PGISV
        PNG  YGYEAMEV+PKWGIL APVVMLAPVRP+VMSPGRSQRDGTGVFLPWAVN+RKPAKHLPPRARKGRFLAL SPVETRLPDSS+E PGISV
Subjt:  PNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHE-PGISV

A0A6J1KZQ2 uncharacterized protein LOC1114977864.9e-28487.21Show/hide
Query:  MAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MAAGATDRARPVVVP   TA AV VTDPMGK+A+LAWFRGEFAAANAIIDALCGHLA VSD GGSEYE+VF AIHRRRLNWIPVLQMQKYHPIADVA+EL
Subjt:  MAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  RKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVV-----AEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGG
        RKVTAEKK KKKNQ+EE EEEEKGG    EAAVV      +GDGD+EMEEKK+E+K M EEEENEGKICSDEKE V           EEMSIEINETDGG
Subjt:  RKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVV-----AEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGG

Query:  RNDEALAPIEEEDSIGSEITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSA
        RN+  L PIEEEDSI SEITDSGS   GV  TSAEVEIC+NHG+CEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYED+FTESELAKL+ FVDD+RSA
Subjt:  RNDEALAPIEEEDSIGSEITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSA

Query:  AKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQ
        AKNGELSGESFVLFNQQVKG RREMIQ GVPIFGQIREESANNSQTSNIEPIPPLL+TVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQ
Subjt:  AKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQ

Query:  PISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAAACTL
        PISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLV RGNSADVARHVMCASPNKRVTITFFRVRPD DQ Q PT  MSNAMTLWQPGVA AC L
Subjt:  PISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAAACTL

Query:  PNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSS-HEPGISV
        PNG PY YEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVN+RKPAKHLPPRARKGRFLALSSPVETRLPDSS  +PGISV
Subjt:  PNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAVNTRKPAKHLPPRARKGRFLALSSPVETRLPDSS-HEPGISV

SwissProt top hitse value%identityAlignment
Q9SL49 RNA demethylase ALKBH9B9.2e-2225.74Show/hide
Query:  NWIPVLQMQKYHPIADVAVELRKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEK-----------
        NW+P L         D+  +   + + +             ++  G+  T +  V E  G    E   + V +  +E+   G+ C +             
Subjt:  NWIPVLQMQKYHPIADVAVELRKVTAEKKMKKKNQEEEQEEEEKGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEK-----------

Query:  -EFVEEKAN---SEEEKIEEMSIEINETDGGRNDEALAPIEEEDSIGSEITDSGSQG--GGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGH
          F E+ ++   S    +E  S  +   D  + DE     +EE+    E  DS  +G       T  + ++  +  +   R   +K  K F   E VKG 
Subjt:  -EFVEEKAN---SEEEKIEEMSIEINETDGGRNDEALAPIEEEDSIGSEITDSGSQG--GGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGH

Query:  MVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAAKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNS---QTSNIEPIPPLLVTVIDHLI
        +VNV+ GL+ +  +F+  E  ++   V  ++   + GEL   +F   ++ ++G  RE IQFG   +    + + N     Q   ++P+P L   +I  LI
Subjt:  MVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAAKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNS---QTSNIEPIPPLLVTVIDHLI

Query:  QWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLE-----QPISTL-FLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASP
        +W ++P    P+ C+VN ++EG+       PPH++     +P  T+ FLSE  + FG ++  +  G++ G   + L  GS+LV  GN ADVA+H + A P
Subjt:  QWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLE-----QPISTL-FLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASP

Query:  NKRVTITF
         KR++ITF
Subjt:  NKRVTITF

Q9ZT92 RNA demethylase ALKBH10B3.4e-14952.51Show/hide
Query:  ARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDG-GGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAEK
        A+ V VP      A  V++ +GKDAL++WFRGEFAAANAIIDA+C HL    +   GSEYEAVF AIHRRRLNWIPVLQMQKYH IA+VA+EL+KV A+K
Subjt:  ARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDG-GGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAEK

Query:  --KMKKKNQEEEQEEEEKGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGRNDEALAPIE
           +K+K  EEE EE+ K         VVA                    EEE   K C + ++  E   N + E +                       
Subjt:  --KMKKKNQEEEQEEEEKGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGRNDEALAPIE

Query:  EEDSIGSEITDSGSQGG---GVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAAKNGELS
        E+DS  S+ITDSGS       V A +A   IC++H DC+AR  ++K  KGF AKE VKGH VNVVKGLK YE++  E E++KL  FV ++R A  NG+L+
Subjt:  EEDSIGSEITDSGSQGG---GVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAAKNGELS

Query:  GESFVLFNQQVKGNRREMIQFGVPIFGQIR-EESANNSQTS-NIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTL
        GESF+LFN+Q+KGN+RE+IQ GVPIFG ++ +E++N++  S NIEPIPPLL +VIDH + W+LIPEYKRPNGC++NFFEEGEYSQPF KPPHLEQPISTL
Subjt:  GESFVLFNQQVKGNRREMIQFGVPIFGQIR-EESANNSQTS-NIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTL

Query:  FLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPD--YDQCQSPTPHMSNAMTLWQPGVAAACTLPNG
         LSESTMA+GR + SDNEGN++GPL LSLK+GSLLV RGNSAD+ARHVMC S NKRV+ITFFR+RPD  ++  Q  +P     MT+WQP         NG
Subjt:  FLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPD--YDQCQSPTPHMSNAMTLWQPGVAAACTLPNG

Query:  APYGYEAMEVVPKWGILRAPVVMLA--PVRPVVM-SPG-RSQRDGTGVFLPWAV--NTRKPAKHLPPRARKGRFLALSSPVETRLP--DSSHEPGISV
          +   +++++PK G+LR P+VM+A  PV+P+++ SP       GTGVFLPWA   ++RK  KHLPPRA+K R L L  P  +  P   S+ EP I+V
Subjt:  APYGYEAMEVVPKWGILRAPVVMLA--PVRPVVM-SPG-RSQRDGTGVFLPWAV--NTRKPAKHLPPRARKGRFLALSSPVETRLP--DSSHEPGISV

Arabidopsis top hitse value%identityAlignment
AT1G14710.1 hydroxyproline-rich glycoprotein family protein1.2e-5631.96Show/hide
Query:  PMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAEKKMKKKNQEEEQEEE--EKGG
        P  +D  ++W R EFAAANAIID+LC HL  V D   +EYE+V G+IH RRL W  VL MQ++ P+ADV+  L+++  +++ +   Q     ++  + G 
Subjt:  PMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAEKKMKKKNQEEEQEEE--EKGG

Query:  QTATEAAVVAEGDGD-----IEMEEKKSEVKNMNE---EEENEGKICSDEK--EFVEEKANSEEEKIEEMSIE--INETDGGRNDEALAPIEEEDSIGSE
        + +        G G        M         +N    E   E K+ SD K     EEK +  E+   +  +E  + E++            +E+ + + 
Subjt:  QTATEAAVVAEGDGD-----IEMEEKKSEVKNMNE---EEENEGKICSDEK--EFVEEKANSEEEKIEEMSIE--INETDGGRNDEALAPIEEEDSIGSE

Query:  ITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAAKNGELSGESFVLFNQQV
          +SGS+   + +   + E   N  +C A      + K F  +E     MVNVV+GLK Y+ +   +E+++L   V ++R A + G+L  E++V + +  
Subjt:  ITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAAKNGELSGESFVLFNQQV

Query:  KGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSI
        +G+ REMIQ G+PI     ++  ++ +   IEPIP  L  +I+ L+  Q+IP   +P+ C+++FF EG++SQP    P   +PIS L LSE    FGR I
Subjt:  KGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSI

Query:  VSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQ-----CQSPTPHMSNAMTLWQPGVAAACTLPNGAPYGYEAMEV
        VS+N G+YKG L LSL  GS+L+  G SA++A++ + A+  +R+ I+F + +P          +SP  H+ +               P G P  Y    V
Subjt:  VSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQ-----CQSPTPHMSNAMTLWQPGVAAACTLPNGAPYGYEAMEV

Query:  VPKWGILRAP
        +P  G+L  P
Subjt:  VPKWGILRAP

AT1G14710.2 hydroxyproline-rich glycoprotein family protein1.2e-5631.96Show/hide
Query:  PMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAEKKMKKKNQEEEQEEE--EKGG
        P  +D  ++W R EFAAANAIID+LC HL  V D   +EYE+V G+IH RRL W  VL MQ++ P+ADV+  L+++  +++ +   Q     ++  + G 
Subjt:  PMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAEKKMKKKNQEEEQEEE--EKGG

Query:  QTATEAAVVAEGDGD-----IEMEEKKSEVKNMNE---EEENEGKICSDEK--EFVEEKANSEEEKIEEMSIE--INETDGGRNDEALAPIEEEDSIGSE
        + +        G G        M         +N    E   E K+ SD K     EEK +  E+   +  +E  + E++            +E+ + + 
Subjt:  QTATEAAVVAEGDGD-----IEMEEKKSEVKNMNE---EEENEGKICSDEK--EFVEEKANSEEEKIEEMSIE--INETDGGRNDEALAPIEEEDSIGSE

Query:  ITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAAKNGELSGESFVLFNQQV
          +SGS+   + +   + E   N  +C A      + K F  +E     MVNVV+GLK Y+ +   +E+++L   V ++R A + G+L  E++V + +  
Subjt:  ITDSGSQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAAKNGELSGESFVLFNQQV

Query:  KGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSI
        +G+ REMIQ G+PI     ++  ++ +   IEPIP  L  +I+ L+  Q+IP   +P+ C+++FF EG++SQP    P   +PIS L LSE    FGR I
Subjt:  KGNRREMIQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSI

Query:  VSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQ-----CQSPTPHMSNAMTLWQPGVAAACTLPNGAPYGYEAMEV
        VS+N G+YKG L LSL  GS+L+  G SA++A++ + A+  +R+ I+F + +P          +SP  H+ +               P G P  Y    V
Subjt:  VSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQ-----CQSPTPHMSNAMTLWQPGVAAACTLPNGAPYGYEAMEV

Query:  VPKWGILRAP
        +P  G+L  P
Subjt:  VPKWGILRAP

AT2G17970.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.0e-2829.9Show/hide
Query:  IEEMSIEINETDGGRNDEALAPIEEEDSIGSEITDSGSQG--GGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTE
        +E  S  +   D  + DE     +EE+    E  DS  +G       T  + ++  +  +   R   +K  K F   E VKG +VNV+ GL+ +  +F+ 
Subjt:  IEEMSIEINETDGGRNDEALAPIEEEDSIGSEITDSGSQG--GGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTE

Query:  SELAKLNYFVDDIRSAAKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNS---QTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVN
         E  ++   V  ++   + GEL   +F   ++ ++G  RE IQFG   +    + + N     Q   ++P+P L   +I  LI+W ++P    P+ C+VN
Subjt:  SELAKLNYFVDDIRSAAKNGELSGESFVLFNQQVKGNRREMIQFGVPIFGQIREESANNS---QTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVN

Query:  FFEEGEYSQPFQKPPHLE-----QPISTL-FLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITF
         ++EG+       PPH++     +P  T+ FLSE  + FG ++  +  G++ G   + L  GS+LV  GN ADVA+H + A P KR++ITF
Subjt:  FFEEGEYSQPFQKPPHLE-----QPISTL-FLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITF

AT2G48080.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein6.6e-11643.84Show/hide
Query:  VAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAEKKMKKKNQEEEQEEEE
        V ++D   KDA+L WFRGEFAAANAIIDALC HL   S GG ++YE+V  A+HRRRLNWIPVLQMQKYH I+ V ++L++  A                 
Subjt:  VAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAEKKMKKKNQEEEQEEEE

Query:  KGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGRNDEALAPIEEEDSIGSEITDSGSQGG
                                    K  +   +++                                             ++DS  S+ITD GS+  
Subjt:  KGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGRNDEALAPIEEEDSIGSEITDSGSQGG

Query:  GVQATSAEVEICNNHGD-CEARPGQ-MKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAAKNGELSGESFVLFNQQVKGNRREM
                + IC  H D CE+R    +K +K FSAKE V+GH  NVVKGLK Y+D+FT  +L+KL   ++ +R A +N +LSGE+FVLFN+  KG +RE+
Subjt:  GVQATSAEVEICNNHGD-CEARPGQ-MKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAAKNGELSGESFVLFNQQVKGNRREM

Query:  IQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGN
        +Q GVPIFG       N +   ++EPIP L+ +VIDHL+QW+LIPEYKRPNGC++NFF+E E+SQPFQKPPH++QPISTL LSESTM FG  +  DN+GN
Subjt:  IQFGVPIFGQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGN

Query:  YKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAAACTLPNGAPYGYEAMEVVPKWGILRAPVV
        ++G L L LKEGSLLV RGNSAD+ARHVMC SPNKRV ITFF+++PD  + Q P        TLW+PG                            +P+V
Subjt:  YKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAAACTLPNGAPYGYEAMEVVPKWGILRAPVV

Query:  MLAPVRPVVMSPGRSQRDGTGVFLPWAVN-TRKPAKHLPPRARKGRFLALSSPVETRLPDSSHEPGIS
        MLAP      +P R    GTGVFLPW    +RKPAKHLPPR ++ R L+ S  V      SS E G+S
Subjt:  MLAPVRPVVMSPGRSQRDGTGVFLPWAVN-TRKPAKHLPPRARKGRFLALSSPVETRLPDSSHEPGIS

AT4G02940.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein2.4e-15052.51Show/hide
Query:  ARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDG-GGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAEK
        A+ V VP      A  V++ +GKDAL++WFRGEFAAANAIIDA+C HL    +   GSEYEAVF AIHRRRLNWIPVLQMQKYH IA+VA+EL+KV A+K
Subjt:  ARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDG-GGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAEK

Query:  --KMKKKNQEEEQEEEEKGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGRNDEALAPIE
           +K+K  EEE EE+ K         VVA                    EEE   K C + ++  E   N + E +                       
Subjt:  --KMKKKNQEEEQEEEEKGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGRNDEALAPIE

Query:  EEDSIGSEITDSGSQGG---GVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAAKNGELS
        E+DS  S+ITDSGS       V A +A   IC++H DC+AR  ++K  KGF AKE VKGH VNVVKGLK YE++  E E++KL  FV ++R A  NG+L+
Subjt:  EEDSIGSEITDSGSQGG---GVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAAKNGELS

Query:  GESFVLFNQQVKGNRREMIQFGVPIFGQIR-EESANNSQTS-NIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTL
        GESF+LFN+Q+KGN+RE+IQ GVPIFG ++ +E++N++  S NIEPIPPLL +VIDH + W+LIPEYKRPNGC++NFFEEGEYSQPF KPPHLEQPISTL
Subjt:  GESFVLFNQQVKGNRREMIQFGVPIFGQIR-EESANNSQTS-NIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTL

Query:  FLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPD--YDQCQSPTPHMSNAMTLWQPGVAAACTLPNG
         LSESTMA+GR + SDNEGN++GPL LSLK+GSLLV RGNSAD+ARHVMC S NKRV+ITFFR+RPD  ++  Q  +P     MT+WQP         NG
Subjt:  FLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKRGNSADVARHVMCASPNKRVTITFFRVRPD--YDQCQSPTPHMSNAMTLWQPGVAAACTLPNG

Query:  APYGYEAMEVVPKWGILRAPVVMLA--PVRPVVM-SPG-RSQRDGTGVFLPWAV--NTRKPAKHLPPRARKGRFLALSSPVETRLP--DSSHEPGISV
          +   +++++PK G+LR P+VM+A  PV+P+++ SP       GTGVFLPWA   ++RK  KHLPPRA+K R L L  P  +  P   S+ EP I+V
Subjt:  APYGYEAMEVVPKWGILRAPVVMLA--PVRPVVM-SPG-RSQRDGTGVFLPWAV--NTRKPAKHLPPRARKGRFLALSSPVETRLP--DSSHEPGISV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGATGGCGGCCGGGGCAACTGATCGAGCGCGGCCGGTGGTGGTGCCTGCGGCGGGGACGGCGACGGCGGTGGCGGTGACGGATCCAATGGGGAAGGATGCGTTGTT
GGCGTGGTTCAGAGGGGAGTTCGCGGCGGCGAACGCGATTATTGATGCGCTGTGTGGGCATCTGGCGCATGTGAGTGACGGCGGAGGATCGGAGTACGAAGCAGTGTTCG
GTGCGATTCATAGACGGCGGCTGAATTGGATCCCGGTCCTGCAAATGCAGAAGTATCATCCGATCGCGGACGTCGCGGTGGAGCTACGGAAAGTGACGGCGGAGAAGAAG
ATGAAAAAGAAGAACCAGGAGGAGGAGCAGGAGGAGGAGGAGAAAGGAGGCCAGACGGCCACGGAGGCGGCGGTGGTGGCCGAGGGCGACGGCGATATCGAAATGGAGGA
GAAAAAGAGCGAGGTGAAGAACATGAATGAAGAGGAGGAAAATGAGGGAAAGATTTGTTCCGATGAGAAGGAATTCGTCGAAGAGAAAGCGAATAGTGAAGAAGAGAAGA
TCGAAGAGATGTCGATCGAGATTAACGAAACCGATGGCGGAAGAAATGATGAGGCTCTGGCTCCAATCGAAGAGGAGGATTCAATCGGAAGCGAAATAACTGATTCAGGA
TCTCAAGGAGGAGGAGTGCAGGCCACTTCAGCAGAAGTTGAGATATGTAATAATCATGGAGATTGTGAAGCACGTCCAGGACAAATGAAGTTGACAAAAGGTTTTTCCGC
CAAGGAGCCAGTAAAAGGCCACATGGTGAATGTTGTGAAAGGATTGAAGTGTTATGAAGACATTTTTACCGAGTCTGAATTGGCCAAATTGAATTATTTTGTTGATGATA
TTCGCTCTGCTGCAAAGAATGGAGAGCTCTCTGGAGAGTCATTTGTTTTATTCAATCAACAGGTGAAAGGCAACCGGCGAGAGATGATCCAGTTTGGCGTGCCCATTTTT
GGACAGATAAGAGAGGAATCGGCCAATAACAGCCAAACAAGCAACATTGAGCCAATTCCACCTCTTCTTGTGACGGTCATAGATCATCTTATTCAGTGGCAACTGATTCC
AGAGTATAAAAGACCAAATGGATGTTTAGTCAATTTCTTTGAAGAGGGTGAGTACTCACAGCCATTCCAGAAACCTCCACACTTGGAACAGCCCATTTCCACTCTCTTCC
TTTCTGAATCAACTATGGCTTTTGGGCGCTCTATTGTCAGTGATAACGAAGGCAACTATAAGGGGCCACTTATGCTGTCCTTGAAGGAAGGGTCTCTTTTGGTCAAGAGA
GGGAACAGTGCAGATGTTGCACGCCATGTCATGTGTGCATCTCCTAACAAAAGGGTCACCATCACATTCTTCCGAGTTCGGCCAGACTACGATCAATGCCAATCACCGAC
CCCTCACATGTCGAATGCCATGACTCTCTGGCAACCGGGAGTTGCAGCTGCATGCACCTTGCCCAATGGAGCCCCCTACGGCTATGAAGCAATGGAGGTAGTGCCGAAAT
GGGGGATCCTTCGTGCACCGGTAGTCATGTTAGCTCCTGTGCGCCCTGTGGTGATGAGCCCTGGAAGATCTCAACGTGATGGCACAGGAGTGTTCTTACCATGGGCTGTT
AATACAAGAAAACCAGCAAAACATCTTCCTCCCCGTGCTCGAAAAGGGCGATTCCTCGCACTATCTTCCCCTGTCGAAACTCGTCTACCAGATTCATCTCACGAGCCAGG
CATAAGTGTTTGA
mRNA sequenceShow/hide mRNA sequence
AGAAGAACCAAACGCTTTCTCCTTTTACACTTTGACACATAAATTTTCATATTTATAGTTTTATATATATATACACATTTTTCATTTTGTATATATATATATATTTTGAA
TTGATTTGAGGCTGTGGAGAATTAAGGAGTGTAACGATTCGGTTCCGAAATTTGGGTGAATTCGGAATCGGGAAGGGGAGAACTGGGAGATTTTTCTTGGGGAGAGACTG
CGAGAGGGGAGGTTTTTGAAATTGGGATATATATACATATATATATTTTTTGGGTTTGGTATTGATATTGGTGGATGCCGATGGCGGCCGGGGCAACTGATCGAGCGCGG
CCGGTGGTGGTGCCTGCGGCGGGGACGGCGACGGCGGTGGCGGTGACGGATCCAATGGGGAAGGATGCGTTGTTGGCGTGGTTCAGAGGGGAGTTCGCGGCGGCGAACGC
GATTATTGATGCGCTGTGTGGGCATCTGGCGCATGTGAGTGACGGCGGAGGATCGGAGTACGAAGCAGTGTTCGGTGCGATTCATAGACGGCGGCTGAATTGGATCCCGG
TCCTGCAAATGCAGAAGTATCATCCGATCGCGGACGTCGCGGTGGAGCTACGGAAAGTGACGGCGGAGAAGAAGATGAAAAAGAAGAACCAGGAGGAGGAGCAGGAGGAG
GAGGAGAAAGGAGGCCAGACGGCCACGGAGGCGGCGGTGGTGGCCGAGGGCGACGGCGATATCGAAATGGAGGAGAAAAAGAGCGAGGTGAAGAACATGAATGAAGAGGA
GGAAAATGAGGGAAAGATTTGTTCCGATGAGAAGGAATTCGTCGAAGAGAAAGCGAATAGTGAAGAAGAGAAGATCGAAGAGATGTCGATCGAGATTAACGAAACCGATG
GCGGAAGAAATGATGAGGCTCTGGCTCCAATCGAAGAGGAGGATTCAATCGGAAGCGAAATAACTGATTCAGGATCTCAAGGAGGAGGAGTGCAGGCCACTTCAGCAGAA
GTTGAGATATGTAATAATCATGGAGATTGTGAAGCACGTCCAGGACAAATGAAGTTGACAAAAGGTTTTTCCGCCAAGGAGCCAGTAAAAGGCCACATGGTGAATGTTGT
GAAAGGATTGAAGTGTTATGAAGACATTTTTACCGAGTCTGAATTGGCCAAATTGAATTATTTTGTTGATGATATTCGCTCTGCTGCAAAGAATGGAGAGCTCTCTGGAG
AGTCATTTGTTTTATTCAATCAACAGGTGAAAGGCAACCGGCGAGAGATGATCCAGTTTGGCGTGCCCATTTTTGGACAGATAAGAGAGGAATCGGCCAATAACAGCCAA
ACAAGCAACATTGAGCCAATTCCACCTCTTCTTGTGACGGTCATAGATCATCTTATTCAGTGGCAACTGATTCCAGAGTATAAAAGACCAAATGGATGTTTAGTCAATTT
CTTTGAAGAGGGTGAGTACTCACAGCCATTCCAGAAACCTCCACACTTGGAACAGCCCATTTCCACTCTCTTCCTTTCTGAATCAACTATGGCTTTTGGGCGCTCTATTG
TCAGTGATAACGAAGGCAACTATAAGGGGCCACTTATGCTGTCCTTGAAGGAAGGGTCTCTTTTGGTCAAGAGAGGGAACAGTGCAGATGTTGCACGCCATGTCATGTGT
GCATCTCCTAACAAAAGGGTCACCATCACATTCTTCCGAGTTCGGCCAGACTACGATCAATGCCAATCACCGACCCCTCACATGTCGAATGCCATGACTCTCTGGCAACC
GGGAGTTGCAGCTGCATGCACCTTGCCCAATGGAGCCCCCTACGGCTATGAAGCAATGGAGGTAGTGCCGAAATGGGGGATCCTTCGTGCACCGGTAGTCATGTTAGCTC
CTGTGCGCCCTGTGGTGATGAGCCCTGGAAGATCTCAACGTGATGGCACAGGAGTGTTCTTACCATGGGCTGTTAATACAAGAAAACCAGCAAAACATCTTCCTCCCCGT
GCTCGAAAAGGGCGATTCCTCGCACTATCTTCCCCTGTCGAAACTCGTCTACCAGATTCATCTCACGAGCCAGGCATAAGTGTTTGAGTTTGAAATCCAGATTGTTGTGG
CAGTTCTGAAGTCGAAGCAACACGACTCGTCTAAGTAGTGTTCTTGTCCTTTTACTGAGCTGAGTGAGTTTTTTCCCATTCCATTCCATTCCATTCCACCCCCTCCCCCT
AATACCACAAACCACTTATTTATTTGGATGCCATGTTCTTAGGTTTTTTTTTTCTTCTTCTTAGAAGAAGTTGAGGTTCAAGTTTGAAGAGTGAAACTAGAGGAGTTAGG
TTCGAATTTTGTATTTACTAATTGTATTTAAATCTTTCCTTTTCCTTCTAGAGGTGAAACTTGAACTATTTTCTACTGGCGTTTATATGGATTTTCTTCTTCCTGTTTTC
CTCTAAAGACAACAGTTTGGAAGGAAGCAATCTGCATCAAAGCATGATTGATTACTAAGAAAAATCAATTTGAAATATTAATT
Protein sequenceShow/hide protein sequence
MPMAAGATDRARPVVVPAAGTATAVAVTDPMGKDALLAWFRGEFAAANAIIDALCGHLAHVSDGGGSEYEAVFGAIHRRRLNWIPVLQMQKYHPIADVAVELRKVTAEKK
MKKKNQEEEQEEEEKGGQTATEAAVVAEGDGDIEMEEKKSEVKNMNEEEENEGKICSDEKEFVEEKANSEEEKIEEMSIEINETDGGRNDEALAPIEEEDSIGSEITDSG
SQGGGVQATSAEVEICNNHGDCEARPGQMKLTKGFSAKEPVKGHMVNVVKGLKCYEDIFTESELAKLNYFVDDIRSAAKNGELSGESFVLFNQQVKGNRREMIQFGVPIF
GQIREESANNSQTSNIEPIPPLLVTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVKR
GNSADVARHVMCASPNKRVTITFFRVRPDYDQCQSPTPHMSNAMTLWQPGVAAACTLPNGAPYGYEAMEVVPKWGILRAPVVMLAPVRPVVMSPGRSQRDGTGVFLPWAV
NTRKPAKHLPPRARKGRFLALSSPVETRLPDSSHEPGISV