; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g0685 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g0685
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionoxidoreductase, 2OG-Fe(II) oxygenase family protein
Genome locationMC09:6301488..6309729
RNA-Seq ExpressionMC09g0685
SyntenyMC09g0685
Gene Ontology termsGO:0006402 - mRNA catabolic process (biological process)
GO:0070988 - demethylation (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0032451 - demethylase activity (molecular function)
InterPro domainsIPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
IPR044842 - RNA demethylase ALKBH9B/ALKBH10B-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575886.1 RNA demethylase ALKBH10B, partial [Cucurbita argyrosperma subsp. sororia]1.33e-31580.1Show/hide
Query:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MPMAAGATDR R +++P A  A  VTD +AKDAVL WFRGEFAAANAIIDALCGHL QVS   G EYEAVF AIHRRRLNWIPVLQMQKYHPI DVAVEL
Subjt:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  KKVTAKKKQEKGD------------AAGEGDVQMVEEEEEEKMNEEEENGGEMCLDER----EFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGS
        +KVTA+KK++K              A  +GDV+M    E +K +E +ENGG+MC DE     E   EK KIEEM +E NE +GGRNE LAPIEEEDSIGS
Subjt:  KKVTAKKKQEKGD------------AAGEGDVQMVEEEEEEKMNEEEENGGEMCLDER----EFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGS

Query:  EITDSGSQGG----VQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVL
        EITDSGSQGG    VQA+SAEV EICSNHG+CEARP  MKLTKGFSAKEPV     NVVKGLKCYEDIFTESEL KLNDFV+DLRSAAKNGELSGETFVL
Subjt:  EITDSGSQGG----VQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVL

Query:  FNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMA
        FNQQVKGNRREMIQLGVPIFGQIR++SANNS+TSNIEPIP LL+ VIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMA
Subjt:  FNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMA

Query:  FGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQ-MSNAMSLWQAGVTGACTLPSGAPYGYEALE
        FGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHV+ ASPNKRVTITFFRVRPDYDQCQSPTPQ MSNA++LWQ GV G CTLP+GA YGYEA+E
Subjt:  FGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQ-MSNAMSLWQAGVTGACTLPSGAPYGYEALE

Query:  VMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV
        VMPKWGIL APVVMLAPVRPMVMSPG R QRDGTGVFLPWAV SRKPAKHLPPRARKGR+L L SPVETR+   S  +PGISV
Subjt:  VMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV

KAG7014419.1 hypothetical protein SDJN02_24596 [Cucurbita argyrosperma subsp. argyrosperma]2.67e-31580.41Show/hide
Query:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MPMAAGATDR R +++P A  A  VTD +AKDAVL WFRGEFAAANAIIDALCGHL QVS   G EYEAVF AIHRRRLNWIPVLQMQKYHPI DVAVEL
Subjt:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  KKVTAKKK-------QEKGDAA----GEGDVQMVEEEEEEKMNEEEENGGEMCLDER----EFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSE
        +KVTA+KK       +E+ DAA     +GDV+M    E +K +E +ENGG+MC DE     E   EK KIEEM +E NE +GGRNE LAPIEEEDSIGSE
Subjt:  KKVTAKKK-------QEKGDAA----GEGDVQMVEEEEEEKMNEEEENGGEMCLDER----EFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSE

Query:  ITDSGSQGG----VQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLF
        ITDSGSQGG    VQA+SAEV EICSNHG+CEARP  MKLTKGFSAKEPV     NVVKGLKCYEDIFTESEL KLNDFV+DLRSAAKNGEL GETFVLF
Subjt:  ITDSGSQGG----VQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLF

Query:  NQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAF
        NQQVKGNRREMIQLGVPIFGQIR++SANNS+TSNIEPIP LL+ VIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAF
Subjt:  NQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAF

Query:  GRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQ-MSNAMSLWQAGVTGACTLPSGAPYGYEALEV
        GRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHV+ ASPNKRVTITFFRVRPDYDQCQSPTPQ MSNA++LWQ GV G CTLP+GA YGYEA+EV
Subjt:  GRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQ-MSNAMSLWQAGVTGACTLPSGAPYGYEALEV

Query:  MPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV
        MPKWGIL APVVMLAPVRPMVMSPG R QRDGTGVFLPWAV SRKPAKHLPPRARKGR+L L SPVETR+   S  +PGISV
Subjt:  MPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV

XP_022150239.1 uncharacterized protein LOC111018453 [Momordica charantia]0.099.11Show/hide
Query:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL
Subjt:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  KKVTAKKKQEKGDAAGEGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQATSA
        KKVTAKKKQEKGDAAGEGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQATSA
Subjt:  KKVTAKKKQEKGDAAGEGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQATSA

Query:  EVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIF
        EVDEICSNHGDCEARPAQMKLTKGFSAKEPV     NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIF
Subjt:  EVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIF

Query:  GQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLS
        GQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLS
Subjt:  GQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLS

Query:  LKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYEALEVMPKWGILRAPVVMLAPVRPM
        LKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYEALEVMPKWGILRAPVVMLAPVRPM
Subjt:  LKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYEALEVMPKWGILRAPVVMLAPVRPM

Query:  VMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV
        VMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV
Subjt:  VMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV

XP_022953518.1 uncharacterized protein LOC111456041 [Cucurbita moschata]6.26e-31479.79Show/hide
Query:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MPMAAGATDR R +++P A  A  VTD +AKDAVL WFRGEFAAANAIIDALCGHL QVS   G EYEAVF AIHRRRLNWIPVLQMQKYHPI DVAVEL
Subjt:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  KKVTAKKKQEKGD--------------AAGEGDVQMVEEEEEEKMNEEEENGGEMCLDER----EFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSI
        +KVTA+KK++K                A  +GDV+M    E +K +E +ENGG+M  DE     E   EK KIEEM +E NE +GGRNE LAPIEEEDSI
Subjt:  KKVTAKKKQEKGD--------------AAGEGDVQMVEEEEEEKMNEEEENGGEMCLDER----EFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSI

Query:  GSEITDSGSQGG---VQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFV
        GSEITDSGSQGG   VQA+SAEV EICSNHG+CEARP  MKLTKGFSAKEPV     NVVKGLKCYEDIFTESEL KLNDFV+DLRSAAKNGELSGETFV
Subjt:  GSEITDSGSQGG---VQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFV

Query:  LFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTM
        LFNQQVKGNRREMIQLGVPIFGQIR++SANN++TSNIEPIP LL TVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTM
Subjt:  LFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTM

Query:  AFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQ-MSNAMSLWQAGVTGACTLPSGAPYGYEAL
        AFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHV+ ASPNKRVTITFFRVRPDYDQCQSPTPQ MSNA++LWQ GV G CTLP+GA YGYEA+
Subjt:  AFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQ-MSNAMSLWQAGVTGACTLPSGAPYGYEAL

Query:  EVMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV
        EVMPKWGIL APVVMLAPVRPMVMSPG R QRDGTGVFLPWAV SRKPAKHLPPRARKGR+L L SPVETR+   S  +PGISV
Subjt:  EVMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV

XP_023549480.1 uncharacterized protein LOC111807830 [Cucurbita pepo subsp. pepo]1.19e-31479.97Show/hide
Query:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MPMAAGATDR R +++P A  A  VTD +AKDAVL WFRGEFAAANAIIDALCGHL QVS   G EYEAVF AIHRRRLNWIPVLQMQKYHPI DVAVEL
Subjt:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  KKVTAKKKQEKGD----------------AAGEGDVQMVEEEEEEKMNEEEENGGEMCLDER----EFGEEKAKIEEMPMENNEIDGGRNEALAPIEEED
        +KVTA+KK++K                  A  +GDV+M    E +K +E +ENGG+MC DE     E   EK KIEEM +E NE +GGRNE LAPIEEED
Subjt:  KKVTAKKKQEKGD----------------AAGEGDVQMVEEEEEEKMNEEEENGGEMCLDER----EFGEEKAKIEEMPMENNEIDGGRNEALAPIEEED

Query:  SIGSEITDSGSQGG-----VQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSG
        SIGSEITDSGSQGG     VQA+SAEV EICSNHG+CEARP  MKLTKGFSAKEPV     NVVKGLKCYE+IFTESEL KLNDFV+DLRSAAKNGELSG
Subjt:  SIGSEITDSGSQGG-----VQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSG

Query:  ETFVLFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLS
        ETFVLFNQQVKGNRREMIQLGVPIFGQIR++SANNS+TSNIEPIP LL+TVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLS
Subjt:  ETFVLFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLS

Query:  ESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQ-MSNAMSLWQAGVTGACTLPSGAPYG
        ESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHV+ ASPNKRVTITFFRVRPDYDQCQSPTPQ MSNA++LWQ GV G CTLP+GA YG
Subjt:  ESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQ-MSNAMSLWQAGVTGACTLPSGAPYG

Query:  YEALEVMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPE-PGISV
        YEA+EVMPKWGIL APVVMLAPVRPMVMSPG R QRDGTGVFLPWAV SRKPAKHLPPRARKGR+L L SPVETR LP+SS E PGISV
Subjt:  YEALEVMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPE-PGISV

TrEMBL top hitse value%identityAlignment
A0A6J1D9E3 uncharacterized protein LOC1110184530.099.11Show/hide
Query:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL
Subjt:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  KKVTAKKKQEKGDAAGEGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQATSA
        KKVTAKKKQEKGDAAGEGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQATSA
Subjt:  KKVTAKKKQEKGDAAGEGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQATSA

Query:  EVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIF
        EVDEICSNHGDCEARPAQMKLTKGFSAKEPV     NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIF
Subjt:  EVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIF

Query:  GQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLS
        GQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLS
Subjt:  GQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLS

Query:  LKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYEALEVMPKWGILRAPVVMLAPVRPM
        LKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYEALEVMPKWGILRAPVVMLAPVRPM
Subjt:  LKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYEALEVMPKWGILRAPVVMLAPVRPM

Query:  VMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV
        VMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV
Subjt:  VMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV

A0A6J1GNJ1 uncharacterized protein LOC1114560413.03e-31479.79Show/hide
Query:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MPMAAGATDR R +++P A  A  VTD +AKDAVL WFRGEFAAANAIIDALCGHL QVS   G EYEAVF AIHRRRLNWIPVLQMQKYHPI DVAVEL
Subjt:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  KKVTAKKKQEKGD--------------AAGEGDVQMVEEEEEEKMNEEEENGGEMCLDER----EFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSI
        +KVTA+KK++K                A  +GDV+M    E +K +E +ENGG+M  DE     E   EK KIEEM +E NE +GGRNE LAPIEEEDSI
Subjt:  KKVTAKKKQEKGD--------------AAGEGDVQMVEEEEEEKMNEEEENGGEMCLDER----EFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSI

Query:  GSEITDSGSQGG---VQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFV
        GSEITDSGSQGG   VQA+SAEV EICSNHG+CEARP  MKLTKGFSAKEPV     NVVKGLKCYEDIFTESEL KLNDFV+DLRSAAKNGELSGETFV
Subjt:  GSEITDSGSQGG---VQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFV

Query:  LFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTM
        LFNQQVKGNRREMIQLGVPIFGQIR++SANN++TSNIEPIP LL TVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTM
Subjt:  LFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTM

Query:  AFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQ-MSNAMSLWQAGVTGACTLPSGAPYGYEAL
        AFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHV+ ASPNKRVTITFFRVRPDYDQCQSPTPQ MSNA++LWQ GV G CTLP+GA YGYEA+
Subjt:  AFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQ-MSNAMSLWQAGVTGACTLPSGAPYGYEAL

Query:  EVMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV
        EVMPKWGIL APVVMLAPVRPMVMSPG R QRDGTGVFLPWAV SRKPAKHLPPRARKGR+L L SPVETR+   S  +PGISV
Subjt:  EVMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV

A0A6J1H8J8 uncharacterized protein LOC1114610496.06e-31479.73Show/hide
Query:  MAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVELKK
        MAAGATDR R +VVP A  AVTVTD + K+AVLAWFRGEFAAANAIIDALCGHL QVS + GSEYE+VFAAIHRRRLNWIPVLQMQKYHPIADVA+EL+K
Subjt:  MAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVELKK

Query:  VTAKKKQEK------------------------GDAAGEGDVQMVEEEEE-EKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALA-PI
        VTA+KK++K                        GD  G+GDV+M E++ E +KM EEEEN G++C DE+E       +EE  +E NE DGGRNEAL  PI
Subjt:  VTAKKKQEK------------------------GDAAGEGDVQMVEEEEE-EKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALA-PI

Query:  EEEDSIGSEITDSGSQGGVQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGE
        EEEDSI SEITDSGS  GVQ TSAEV EICSNHG+CEARP QMKLTKGFSAKEPV     NVVKGLKCYED+FTESELAKL+ FV+DLRSAAKNGELSGE
Subjt:  EEEDSIGSEITDSGSQGGVQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGE

Query:  TFVLFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSE
        +FVLFNQQVKG RREMIQLGVPIFGQIREESANNSQTSNIEPIP+LLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSE
Subjt:  TFVLFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSE

Query:  STMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYE
        STMAFGRSIVSDNEGNYKGPLMLS+KEGSLLVMRGNSADVARHVM ASPNKRVTITFFRVRPD DQ Q PT QMSNAM+LWQ GV GAC LP+G PY YE
Subjt:  STMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYE

Query:  ALEVMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPE-PGISV
        A+EV+PKWGILRAPVVMLAPVRP+VMSPG R QRDGTGVFLPWAV SRKPAKHLPPRARKGR+L LSSPVETR LP+SS E PGISV
Subjt:  ALEVMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPE-PGISV

A0A6J1JXR1 uncharacterized protein LOC1114888191.08e-31180.14Show/hide
Query:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MPMAAGATDR R +++P A  A  VTD +AKDAVL WFRGEFAAANAIIDALCGHL QVS   G EYEAVF AIHRRRLNWIPVLQMQKYHPI DVAVEL
Subjt:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  KKVTAKKKQEKGD------------AAGEGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKA-----KIEEMPMENNEIDGGRNEALAPIEEEDSIG
        +KVTA+KK++K              A  + DV+M    E +K +E +ENGG+MC +E EF EE+A     KIEEM +E NE +GGRNE LAPIEEEDSIG
Subjt:  KKVTAKKKQEKGD------------AAGEGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKA-----KIEEMPMENNEIDGGRNEALAPIEEEDSIG

Query:  SEITDSGSQGG---VQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVL
        SEITDSGSQGG   VQA+SAEV EICSNHG+CEARP  MKLTKGFSAKEPV     NVVKGLKCYEDIFTESEL KLNDFV+DLRSAAKNGELSGETFVL
Subjt:  SEITDSGSQGG---VQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVL

Query:  FNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMA
        FNQQVKGNRREMIQLGVPIFGQIR++SANNS+TSNIEPIP LL+TVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMA
Subjt:  FNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMA

Query:  FGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQ-MSNAMSLWQAGVTGACTLPSGAPYGYEALE
        FGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHV+ ASPNKRVTITFFRVRPDYDQCQSPTPQ +SN ++LWQ GV G C LP+G  YGYEA+E
Subjt:  FGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQ-MSNAMSLWQAGVTGACTLPSGAPYGYEALE

Query:  VMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPE-PGISV
        VMPKWGIL APVVMLAPVRPMVMSPG R QRDGTGVFLPWAV SRKPAKHLPPRARKGR+L L SPVETR LP+SS E PGISV
Subjt:  VMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPE-PGISV

A0A6J1KZQ2 uncharacterized protein LOC1114977864.62e-31379.79Show/hide
Query:  MAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVELKK
        MAAGATDR R +VVP A  AVTVTD + K+AVLAWFRGEFAAANAIIDALCGHL QVS + GSEYE+VFAAIHRRRLNWIPVLQMQKYHPIADVA+EL+K
Subjt:  MAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVELKK

Query:  VTAKKKQEK------------GDAA----------GEGDVQMVEEEEE-EKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALA-PIEE
        VTA+KK++K            GD A          G+GDV+M E++ E +KM EEEEN G++C DE+E       +EEM +E NE DGGRNEAL  PIEE
Subjt:  VTAKKKQEK------------GDAA----------GEGDVQMVEEEEE-EKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALA-PIEE

Query:  EDSIGSEITDSGSQGGVQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETF
        EDSI SEITDSGS  GV  TSAEV EICSNHG+CEARP QMKLTKGFSAKEPV     NVVKGLKCYED+FTESELAKL+ FV+DLRSAAKNGELSGE+F
Subjt:  EDSIGSEITDSGSQGGVQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETF

Query:  VLFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSEST
        VLFNQQVKG RREMIQLGVPIFGQIREESANNSQTSNIEPIP LLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSEST
Subjt:  VLFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSEST

Query:  MAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYEAL
        MAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVM ASPNKRVTITFFRVRPD DQ Q PT QMSNAM+LWQ GV GAC LP+G PY YEA+
Subjt:  MAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYEAL

Query:  EVMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV
        EV+PKWGILRAPVVMLAPVRP+VMSPG R QRDGTGVFLPWAV SRKPAKHLPPRARKGR+L LSSPVETR    S  +PGISV
Subjt:  EVMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV

SwissProt top hitse value%identityAlignment
Q9SL49 RNA demethylase ALKBH9B1.2e-2631.72Show/hide
Query:  EEEDSIGSEITDSGSQGGVQATSAEVDE--ICSNHGDCEARPAQMKLTKGFSAKEP-----VNVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELS
        +EE+    E  DS S+ G  A+S +  E    S       R   +K  K F   E      VNV+ GL+ +  +F+  E  ++ D V  L+   + GEL 
Subjt:  EEEDSIGSEITDSGSQGGVQATSAEVDE--ICSNHGDCEARPAQMKLTKGFSAKEP-----VNVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELS

Query:  GETFVLFNQQVKGNRREMIQLGVPIFGQIREESANNS---QTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLE-----
          TF   ++ ++G  RE IQ G   +    + + N     Q   ++P+P L   +I  LI+W ++P    P+ C+VN ++EG+       PPH++     
Subjt:  GETFVLFNQQVKGNRREMIQLGVPIFGQIREESANNS---QTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLE-----

Query:  QPISTL-FLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITF
        +P  T+ FLSE  + FG ++  +  G++ G   + L  GS+LV+ GN ADVA+H + A P KR++ITF
Subjt:  QPISTL-FLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITF

Q9ZT92 RNA demethylase ALKBH10B2.4e-14954.45Show/hide
Query:  PTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSE--SGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVELKKVTAKK----KQEKGD
        P A  V++ + KDA+++WFRGEFAAANAIIDA+C HL +++ E  SGSEYEAVFAAIHRRRLNWIPVLQMQKYH IA+VA+EL+KV AKK    KQ+K +
Subjt:  PTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSE--SGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVELKKVTAKK----KQEKGD

Query:  AAGEGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQAT---SAEVDEICSNHG
           E D++ V   EEE++ +E  NG               K+ E     N+++G   +      E+DS  S+ITDSGS   V  T         IC +H 
Subjt:  AAGEGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQAT---SAEVDEICSNHG

Query:  DCEARPAQMKLTKGFSAKE-----PVNVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIFGQIR-EESAN
        DC+AR  ++K  KGF AKE      VNVVKGLK YE++  E E++KL DFV +LR A  NG+L+GE+F+LFN+Q+KGN+RE+IQLGVPIFG ++ +E++N
Subjt:  DCEARPAQMKLTKGFSAKE-----PVNVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIFGQIR-EESAN

Query:  NSQTS-NIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLV
        ++  S NIEPIP LL +VIDH + W+LIPEYKRPNGC++NFFEEGEYSQPF KPPHLEQPISTL LSESTMA+GR + SDNEGN++GPL LSLK+GSLLV
Subjt:  NSQTS-NIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLV

Query:  MRGNSADVARHVMSASPNKRVTITFFRVRPD--YDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYE-ALEVMPKWGILRAPVVMLA--PVRPMVM-
        MRGNSAD+ARHVM  S NKRV+ITFFR+RPD  ++  Q  +P+    M++WQ         P+    GY+ ++++MPK G+LR P+VM+A  PV+PM++ 
Subjt:  MRGNSADVARHVMSASPNKRVTITFFRVRPD--YDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYE-ALEVMPKWGILRAPVVMLA--PVRPMVM-

Query:  SPGGRPQRDGTGVFLPWAV--TSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV
        SP       GTGVFLPWA   +SRK  KHLPPRA+K R L L     +     S+ EP I+V
Subjt:  SPGGRPQRDGTGVFLPWAV--TSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV

Arabidopsis top hitse value%identityAlignment
AT1G14710.1 hydroxyproline-rich glycoprotein family protein1.1e-5630.28Show/hide
Query:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MP     T    +L  P  P A  + D   +D  ++W R EFAAANAIID+LC HL  V     +EYE+V  +IH RRL W  VL MQ++ P+ADV+  L
Subjt:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  KKVTAKKKQE-------------------------KGDAAGEG-----------------DVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEM
        +++  K++Q+                         K    G G                 +   VE  EE K+  +        +      EEK    E 
Subjt:  KKVTAKKKQE-------------------------KGDAAGEG-----------------DVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEM

Query:  PMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPVNVVKGLKCYEDIFTESELAKLNDFVED
        P  +++++    E+     +E+ + +   +SGS+     +  + +E   N  +C A  A+  + +     + VNVV+GLK Y+ +   +E+++L   V +
Subjt:  PMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPVNVVKGLKCYEDIFTESELAKLNDFVED

Query:  LRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPP
        LR A + G+L  E +V + +  +G+ REMIQLG+PI     ++  ++ +   IEPIP+ L  +I+ L+  Q+IP   +P+ C+++FF EG++SQP    P
Subjt:  LRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPP

Query:  HLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTG
           +PIS L LSE    FGR IVS+N G+YKG L LSL  GS+L++ G SA++A++ + A+  +R+ I+F + +P       P  +  N           
Subjt:  HLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTG

Query:  ACTLPSGAPYGYEALEVMPKWGILRAP----------VVMLAPVRPMVMS---PGGRPQRDGTGVFLP
            P+G P  Y    V+P  G+L  P           + + P  P+      PGG P        LP
Subjt:  ACTLPSGAPYGYEALEVMPKWGILRAP----------VVMLAPVRPMVMS---PGGRPQRDGTGVFLP

AT1G14710.2 hydroxyproline-rich glycoprotein family protein1.1e-5630.28Show/hide
Query:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL
        MP     T    +L  P  P A  + D   +D  ++W R EFAAANAIID+LC HL  V     +EYE+V  +IH RRL W  VL MQ++ P+ADV+  L
Subjt:  MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVEL

Query:  KKVTAKKKQE-------------------------KGDAAGEG-----------------DVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEM
        +++  K++Q+                         K    G G                 +   VE  EE K+  +        +      EEK    E 
Subjt:  KKVTAKKKQE-------------------------KGDAAGEG-----------------DVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEM

Query:  PMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPVNVVKGLKCYEDIFTESELAKLNDFVED
        P  +++++    E+     +E+ + +   +SGS+     +  + +E   N  +C A  A+  + +     + VNVV+GLK Y+ +   +E+++L   V +
Subjt:  PMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQATSAEVDEICSNHGDCEARPAQMKLTKGFSAKEPVNVVKGLKCYEDIFTESELAKLNDFVED

Query:  LRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPP
        LR A + G+L  E +V + +  +G+ REMIQLG+PI     ++  ++ +   IEPIP+ L  +I+ L+  Q+IP   +P+ C+++FF EG++SQP    P
Subjt:  LRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPP

Query:  HLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTG
           +PIS L LSE    FGR IVS+N G+YKG L LSL  GS+L++ G SA++A++ + A+  +R+ I+F + +P       P  +  N           
Subjt:  HLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTG

Query:  ACTLPSGAPYGYEALEVMPKWGILRAP----------VVMLAPVRPMVMS---PGGRPQRDGTGVFLP
            P+G P  Y    V+P  G+L  P           + + P  P+      PGG P        LP
Subjt:  ACTLPSGAPYGYEALEVMPKWGILRAP----------VVMLAPVRPMVMS---PGGRPQRDGTGVFLP

AT2G17970.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein8.3e-2831.72Show/hide
Query:  EEEDSIGSEITDSGSQGGVQATSAEVDE--ICSNHGDCEARPAQMKLTKGFSAKEP-----VNVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELS
        +EE+    E  DS S+ G  A+S +  E    S       R   +K  K F   E      VNV+ GL+ +  +F+  E  ++ D V  L+   + GEL 
Subjt:  EEEDSIGSEITDSGSQGGVQATSAEVDE--ICSNHGDCEARPAQMKLTKGFSAKEP-----VNVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELS

Query:  GETFVLFNQQVKGNRREMIQLGVPIFGQIREESANNS---QTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLE-----
          TF   ++ ++G  RE IQ G   +    + + N     Q   ++P+P L   +I  LI+W ++P    P+ C+VN ++EG+       PPH++     
Subjt:  GETFVLFNQQVKGNRREMIQLGVPIFGQIREESANNS---QTSNIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLE-----

Query:  QPISTL-FLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITF
        +P  T+ FLSE  + FG ++  +  G++ G   + L  GS+LV+ GN ADVA+H + A P KR++ITF
Subjt:  QPISTL-FLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITF

AT2G48080.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein1.4e-12047.09Show/hide
Query:  PAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVELKKVTAKKKQEKGDAAG
        P  P  V ++D+ AKDA+L WFRGEFAAANAIIDALC HL Q S  S ++YE+V AA+HRRRLNWIPVLQMQKYH I+ V ++L++  AK          
Subjt:  PAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVELKKVTAKKKQEKGDAAG

Query:  EGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQATSAEVDEICSNHGD-CEAR
                                                           G +  L    ++DS  S+ITD GS+        E   IC  H D CE+R
Subjt:  EGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQATSAEVDEICSNHGD-CEAR

Query:  PAQ-MKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIFGQIREESANNSQTS
         A  +K +K FSAKE V     NVVKGLK Y+D+FT  +L+KL D +  LR A +N +LSGETFVLFN+  KG +RE++QLGVPIFG       N +   
Subjt:  PAQ-MKLTKGFSAKEPV-----NVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIFGQIREESANNSQTS

Query:  NIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSA
        ++EPIP L+ +VIDHL+QW+LIPEYKRPNGC++NFF+E E+SQPFQKPPH++QPISTL LSESTM FG  +  DN+GN++G L L LKEGSLLVMRGNSA
Subjt:  NIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSA

Query:  DVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYEALEVMPKWGILRAPVVMLAPVRPMVMSPGGRPQR--DG
        D+ARHVM  SPNKRV ITFF+++PD  + Q P        +LW+ G                            +P+VMLAP           P+R   G
Subjt:  DVARHVMSASPNKRVTITFFRVRPDYDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYEALEVMPKWGILRAPVVMLAPVRPMVMSPGGRPQR--DG

Query:  TGVFLPWA-VTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGIS
        TGVFLPW    SRKPAKHLPPR ++ R L+ S  V   +   SSPE G+S
Subjt:  TGVFLPWA-VTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGIS

AT4G02940.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein1.7e-15054.45Show/hide
Query:  PTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSE--SGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVELKKVTAKK----KQEKGD
        P A  V++ + KDA+++WFRGEFAAANAIIDA+C HL +++ E  SGSEYEAVFAAIHRRRLNWIPVLQMQKYH IA+VA+EL+KV AKK    KQ+K +
Subjt:  PTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSE--SGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVELKKVTAKK----KQEKGD

Query:  AAGEGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQAT---SAEVDEICSNHG
           E D++ V   EEE++ +E  NG               K+ E     N+++G   +      E+DS  S+ITDSGS   V  T         IC +H 
Subjt:  AAGEGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQAT---SAEVDEICSNHG

Query:  DCEARPAQMKLTKGFSAKE-----PVNVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIFGQIR-EESAN
        DC+AR  ++K  KGF AKE      VNVVKGLK YE++  E E++KL DFV +LR A  NG+L+GE+F+LFN+Q+KGN+RE+IQLGVPIFG ++ +E++N
Subjt:  DCEARPAQMKLTKGFSAKE-----PVNVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIFGQIR-EESAN

Query:  NSQTS-NIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLV
        ++  S NIEPIP LL +VIDH + W+LIPEYKRPNGC++NFFEEGEYSQPF KPPHLEQPISTL LSESTMA+GR + SDNEGN++GPL LSLK+GSLLV
Subjt:  NSQTS-NIEPIPALLMTVIDHLIQWQLIPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLV

Query:  MRGNSADVARHVMSASPNKRVTITFFRVRPD--YDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYE-ALEVMPKWGILRAPVVMLA--PVRPMVM-
        MRGNSAD+ARHVM  S NKRV+ITFFR+RPD  ++  Q  +P+    M++WQ         P+    GY+ ++++MPK G+LR P+VM+A  PV+PM++ 
Subjt:  MRGNSADVARHVMSASPNKRVTITFFRVRPD--YDQCQSPTPQMSNAMSLWQAGVTGACTLPSGAPYGYE-ALEVMPKWGILRAPVVMLA--PVRPMVM-

Query:  SPGGRPQRDGTGVFLPWAV--TSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV
        SP       GTGVFLPWA   +SRK  KHLPPRA+K R L L     +     S+ EP I+V
Subjt:  SPGGRPQRDGTGVFLPWAV--TSRKPAKHLPPRARKGRYLTLSSPVETRQLPESSPEPGISV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGATGGCGGCGGGGGCAACTGATCGAGGGCGGCAGTTGGTGGTGCCGGCGGCGCCGACGGCGGTGACGGTGACGGACGCAATCGCGAAGGATGCGGTCCTGGCGTG
GTTTCGAGGGGAGTTCGCGGCGGCGAACGCGATTATTGATGCGCTGTGTGGCCACTTGGGGCAGGTGAGCAGTGAGAGTGGATCGGAGTACGAAGCAGTGTTCGCTGCGA
TTCATAGACGGCGGCTGAATTGGATCCCGGTCCTGCAAATGCAGAAGTATCATCCGATCGCCGACGTCGCGGTGGAGCTCAAGAAAGTGACGGCGAAGAAGAAGCAGGAG
AAAGGAGACGCGGCCGGCGAGGGCGATGTTCAAATGGTGGAGGAGGAGGAGGAGGAGAAGATGAACGAAGAGGAGGAAAATGGAGGTGAGATGTGTTTGGATGAGAGAGA
ATTTGGTGAAGAGAAGGCGAAGATCGAGGAGATGCCGATGGAGAACAACGAAATCGATGGCGGAAGAAATGAGGCACTGGCTCCAATTGAAGAGGAAGATTCAATCGGCA
GCGAGATTACTGATTCAGGATCTCAAGGAGGAGTGCAGGCCACATCAGCAGAAGTTGATGAGATATGCAGCAATCATGGTGACTGTGAAGCACGTCCGGCGCAGATGAAG
TTGACAAAAGGTTTTTCCGCCAAGGAGCCAGTGAATGTTGTGAAAGGATTGAAGTGTTATGAAGACATTTTCACCGAATCTGAATTGGCCAAGTTGAATGATTTTGTGGA
GGACCTTCGCTCTGCTGCAAAGAATGGGGAGCTCTCTGGAGAGACATTTGTTTTATTCAATCAGCAGGTGAAGGGCAACCGGCGAGAGATGATCCAACTTGGAGTGCCCA
TTTTTGGACAGATAAGAGAGGAATCCGCCAATAACAGCCAAACAAGCAACATAGAGCCAATTCCAGCTCTTCTTATGACCGTCATAGATCATCTTATTCAGTGGCAACTG
ATTCCAGAGTACAAAAGGCCAAATGGATGTCTCGTCAACTTTTTCGAAGAGGGTGAGTACTCGCAGCCATTCCAGAAACCTCCACATTTGGAACAACCCATTTCCACTCT
CTTCCTTTCTGAATCAACTATGGCTTTTGGGCGTTCTATTGTAAGTGATAACGAAGGCAACTACAAGGGGCCACTCATGCTGTCCTTGAAAGAAGGGTCTCTTTTGGTGA
TGAGGGGGAACAGTGCAGATGTTGCACGCCATGTCATGTCTGCATCTCCTAACAAAAGGGTCACGATTACGTTCTTCCGAGTCCGGCCAGACTACGATCAATGCCAATCA
CCGACCCCTCAAATGTCGAACGCCATGTCTCTATGGCAAGCCGGAGTTACAGGTGCATGCACTCTGCCTAGCGGAGCCCCATATGGGTACGAGGCGCTGGAAGTAATGCC
CAAATGGGGGATTCTTCGTGCACCGGTTGTGATGTTAGCTCCGGTACGCCCTATGGTGATGAGCCCCGGAGGAAGGCCTCAGCGTGATGGTACTGGAGTGTTCTTACCAT
GGGCAGTTACTTCAAGAAAACCTGCAAAACACCTCCCACCCCGTGCTCGTAAAGGTCGGTATCTCACATTATCTTCTCCTGTTGAAACTCGCCAGCTACCGGAGTCGTCT
CCCGAGCCAGGCATAAGCGTTTGA
mRNA sequenceShow/hide mRNA sequence
GAAATGGAACACCATGATTAGAATTCGCTACATTTTTTTGCAGCACACATTAATTATTTGTAAACAAAAGTACAATATAAAAATTAAAAGAGTTTAATCACTTTATTTAA
TTCCTAGTTTATTCAAATTAACAATTAAAAAGAAAAAGAAAAAGACCCTGCCACGTCACCACCCCTCAAAATGCGCCACGTCATACAGTTCCCATATATAAAAGGCGCCT
TAACGCTCCTTCCTCATTTCTCTGCCAATTACAGAGAAGAAACAAACGATTTCTCCTTTTACACTTTGACACATAAATTTTCACATATATATATTTTTAGATATATATAT
GAAATATATATATAAATAAATATACACATTTACATATATATATTGATTTGAGGGTGTGCTGAATTAAGAAGTATAACGATTTGTTCCGAAATTTGGGAGAATTCGAAATC
GGAAAGGGGGATTTTTCTGAGGAGTAGGGGGGGGGGGGGGGTTTTTGAAATTGGGATTTATTGATGCCGATGGCGGCGGGGGCAACTGATCGAGGGCGGCAGTTGGTGGT
GCCGGCGGCGCCGACGGCGGTGACGGTGACGGACGCAATCGCGAAGGATGCGGTCCTGGCGTGGTTTCGAGGGGAGTTCGCGGCGGCGAACGCGATTATTGATGCGCTGT
GTGGCCACTTGGGGCAGGTGAGCAGTGAGAGTGGATCGGAGTACGAAGCAGTGTTCGCTGCGATTCATAGACGGCGGCTGAATTGGATCCCGGTCCTGCAAATGCAGAAG
TATCATCCGATCGCCGACGTCGCGGTGGAGCTCAAGAAAGTGACGGCGAAGAAGAAGCAGGAGAAAGGAGACGCGGCCGGCGAGGGCGATGTTCAAATGGTGGAGGAGGA
GGAGGAGGAGAAGATGAACGAAGAGGAGGAAAATGGAGGTGAGATGTGTTTGGATGAGAGAGAATTTGGTGAAGAGAAGGCGAAGATCGAGGAGATGCCGATGGAGAACA
ACGAAATCGATGGCGGAAGAAATGAGGCACTGGCTCCAATTGAAGAGGAAGATTCAATCGGCAGCGAGATTACTGATTCAGGATCTCAAGGAGGAGTGCAGGCCACATCA
GCAGAAGTTGATGAGATATGCAGCAATCATGGTGACTGTGAAGCACGTCCGGCGCAGATGAAGTTGACAAAAGGTTTTTCCGCCAAGGAGCCAGTGAATGTTGTGAAAGG
ATTGAAGTGTTATGAAGACATTTTCACCGAATCTGAATTGGCCAAGTTGAATGATTTTGTGGAGGACCTTCGCTCTGCTGCAAAGAATGGGGAGCTCTCTGGAGAGACAT
TTGTTTTATTCAATCAGCAGGTGAAGGGCAACCGGCGAGAGATGATCCAACTTGGAGTGCCCATTTTTGGACAGATAAGAGAGGAATCCGCCAATAACAGCCAAACAAGC
AACATAGAGCCAATTCCAGCTCTTCTTATGACCGTCATAGATCATCTTATTCAGTGGCAACTGATTCCAGAGTACAAAAGGCCAAATGGATGTCTCGTCAACTTTTTCGA
AGAGGGTGAGTACTCGCAGCCATTCCAGAAACCTCCACATTTGGAACAACCCATTTCCACTCTCTTCCTTTCTGAATCAACTATGGCTTTTGGGCGTTCTATTGTAAGTG
ATAACGAAGGCAACTACAAGGGGCCACTCATGCTGTCCTTGAAAGAAGGGTCTCTTTTGGTGATGAGGGGGAACAGTGCAGATGTTGCACGCCATGTCATGTCTGCATCT
CCTAACAAAAGGGTCACGATTACGTTCTTCCGAGTCCGGCCAGACTACGATCAATGCCAATCACCGACCCCTCAAATGTCGAACGCCATGTCTCTATGGCAAGCCGGAGT
TACAGGTGCATGCACTCTGCCTAGCGGAGCCCCATATGGGTACGAGGCGCTGGAAGTAATGCCCAAATGGGGGATTCTTCGTGCACCGGTTGTGATGTTAGCTCCGGTAC
GCCCTATGGTGATGAGCCCCGGAGGAAGGCCTCAGCGTGATGGTACTGGAGTGTTCTTACCATGGGCAGTTACTTCAAGAAAACCTGCAAAACACCTCCCACCCCGTGCT
CGTAAAGGTCGGTATCTCACATTATCTTCTCCTGTTGAAACTCGCCAGCTACCGGAGTCGTCTCCCGAGCCAGGCATAAGCGTTTGAGTTGAAATCAATCATGTTGTGGC
GGGTGTGGAGTCGAAACAAAATGAGTCTTGTCATTTTACTGATCTGAGTGAGTATTTTTCCCCTTCACCCCCAACACCAAAAACCACTTTATTTATTTGGATGCCATGCC
ATGTTCTTAGGGTTTTTTTGGGTTTCTTAGATGAAGTTGAGGTTCAAGTTTGAAGAGTGAACTACTAGAGGAGTTAGGTTCGAATTTTGTATTTACTAATTGTATTTAAA
TCTTTCTTTTCCTTCTAGAGATCAAAAAAAAAAAAAAAAAAAAACTTGAACTATTTTCTTCTGGCATGACAAGGATTTTAATTTCTCTTCTTCCTGTTTTCCCCTGAAGA
AAATAGCTCACACCATTCTAATTTGTACACCAACTGAGAAAAATAAAAAGCTCAATCAAAAAGTTTTAAGGGAAGGCTTTATATATGTTTTACTCCCTATAGTTTAAAAA
GCTTTAATTTGGACCCTTATATTTTGTAATTTTTCAATTTAGTCTTTTGCAGTTGGTAAATCGATGATGAACTTACACTTTCGAGTTTTCTATTCTTGGTAACATTTTTC
TCTTTAAATCACTATAAGTTTATCATCGGCCAATTTTAACATTTACATATACCAAAATATATATGTAGCCTAAAGGAAAATATAGCACTTGAGAAATGATCCAAATTTCG
GAAATGATAGCTTACAGATAGAAGAACAACAAAAGGCTCAAAATCTAGAGTTTCAGAGGAAAAAAAAAAGAGAGAAAAAAGTACTACCAAACATACAGTAAAAATCAATG
TGAGGTGCAGAAAAATGTTGAAGGAAATGAATTCAGCTCTTATGATTAGGAAATGAAGTACAATTTTGAACATGATTCCTTCAAGGACAGTTTTGAAGATATACAATAAT
CACAGCACATTGCATTTAAATGATAAGAAATAACACTTGAAAAAAAAAAATATGAAAGAGAACATAGGAATTTAGTTCATAACCTTGCATTCCATCCTTCTTTCCATCCA
CAGATTTGATCTATCAAAGCCTTTACTTTACCAGTCTCGTCGTTCATTCATTCGATATCCCAGCTCAAATCTTCGTCGTCTTCGTCAGCTGTACTCAGCCGTTTTCGCAT
TTCCTCTCGATTTGCTTTGTCACCTTGAGTTACTATCATTTTCTTTTCCTCAATGCTGCTCAGATCCTCAATCTCATCCCACCCAAGATCCTCTTCCTCGGGCATCGAGG
GCTGTGTCGAAACAATCGAAACATCACTGTCTTTGGAGGATTCACCATGATCAGACTTGGCCTCCGCCACAGGCACAACCTTCTCGTCCGGTCTCAATCCTCCTTCCAAA
TGGACTTTTTGATCAGACCCTTTATCTCCATCTAATTCCTCACCATTTCCACCACTCTTGTTCTCCCCCTTATTATCATCACCACCAGAACTTCCTGCATTCAATTCCTC
CACAGGAACATCCCTCTTCACAACTTCAGCAGATTCTTTGCCACCCATTTCCTCCTTCGATCCCACATCGGCGACATTCTCCTCTTTCACCACCTCACCGACACTCACTT
CCCCACTCTTCGCAGCTTTCACCGCATCCCCTTTCGACCCGGCATTGATCTTACTGTACTCTTCATCATCATCATCAACATCCCAACTCAAGTCCTCCTCTTCTTCACTT
GCAATAGCTCTCTTCACAAGCTTGGCCCTGACATTCTCAGCTTGCTTAAGCTTATGCACTCTATAAAAATACCTATGCCAGAAAGTTTCATTATCAATAACATTTGGAAC
AACTTTTTTGTGCATATTATCAATTGCTCCATTTTCTTCAAGCAAGTTCTCAATTTCCTCGCTCTTATCATTCAGCTGGAATTGCGATTTCCACTTATCAAAATCATCCA
AATCTTCAGGTTCCTCACAGTAGGTGGCCAAATCGCCTTGAATCGAACGAACCTGAGCATCGAACCTGCTGTAGGGCTTGGAAGTGGAGCTTCTTTGATTACTGTTGTTC
TGGTTATTACTATTATCAGAATCAGATTCCTGATCTACAGCGAGAATCGCATCCTTACCTTGCGCGATGATCTGAGCCGTTCCTTTCAAAACGGAGCTACCGAATTCATC
GAAGGCGTGCCCGACCGTCTCCAGCGATCCCTGAGCGACCTCGATTTCCTTCTTCAAACCGGAACCAAATTCCTGGAGATCGCGCCGGTAGGTCTCGATCACGGATTCCG
ACCTCTGGCTCAGAGTCTTGATGAGACCGCCGAAACTCCATGCTCCGGCAGCCGCGGAGGAATCGTCGATGGGACCTGGCTCCGGGGAACTCGGATCATCCGATTCTGCT
CCGTCCTCCTTCAGTAGCGCTTTTTCAGGCGATCGCGGTCCAGATTCGGACGGCGGCGTGGAATCTGGCTCGTCGGAGAATACCGATCTGAAGAAATTCATTTGAGGGAT
GCGATACAAACAGCGATCAAAATTGGGAAATGGGAAACCTAGTTACAGAAGGAAGAATCGAGAAACGCGTGGATTTGAAGAGTGCAGTGAAGAAATGAAGTTCGTGAAGA
CGGTGGAAAGTCTTTATCTGGAAGAACGCAGGGCCACACCATGATCGGTGAACTTCCAAATCTCAATCTTCAATCCCCATTACAAAGTTCAGTTTTTTCTTCAGCAAATT
TCCAATTTAGGGCTCATATATTATATTGTTTTTTTTAATTAGAATTATTATTTAACTATTTTATTGATATTTATTAATAGTGACAAAGTAAAATTAGATAATTGATAAAT
TAATTCGAAATTTATAATATTTTAC
Protein sequenceShow/hide protein sequence
MPMAAGATDRGRQLVVPAAPTAVTVTDAIAKDAVLAWFRGEFAAANAIIDALCGHLGQVSSESGSEYEAVFAAIHRRRLNWIPVLQMQKYHPIADVAVELKKVTAKKKQE
KGDAAGEGDVQMVEEEEEEKMNEEEENGGEMCLDEREFGEEKAKIEEMPMENNEIDGGRNEALAPIEEEDSIGSEITDSGSQGGVQATSAEVDEICSNHGDCEARPAQMK
LTKGFSAKEPVNVVKGLKCYEDIFTESELAKLNDFVEDLRSAAKNGELSGETFVLFNQQVKGNRREMIQLGVPIFGQIREESANNSQTSNIEPIPALLMTVIDHLIQWQL
IPEYKRPNGCLVNFFEEGEYSQPFQKPPHLEQPISTLFLSESTMAFGRSIVSDNEGNYKGPLMLSLKEGSLLVMRGNSADVARHVMSASPNKRVTITFFRVRPDYDQCQS
PTPQMSNAMSLWQAGVTGACTLPSGAPYGYEALEVMPKWGILRAPVVMLAPVRPMVMSPGGRPQRDGTGVFLPWAVTSRKPAKHLPPRARKGRYLTLSSPVETRQLPESS
PEPGISV