CuGenDBv2

MC02g1258 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC02g1258
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Unknown protein
Genome location: MC02:11646453..11673183
RNA-Seq Expression: MC02g1258
Synteny: MC02g1258
Gene Ontology terms: NA
InterPro domains: NA
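
The genome location field can be split into its chromosome and coordinates. The sketch below is illustrative only: `parse_location` is a hypothetical helper, not part of CuGenDB, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split 'CHROM:START..END' into (chrom, start, end, span).

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record), so span = end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("MC02:11646453..11673183")
print(chrom, start, end, span)
```

Under that assumption the locus spans 26,731 bp on MC02, considerably longer than the mRNA listed under Sequences.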


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0040738.1 uncharacterized protein E6C27_scaffold703G00010 [Cucumis melo var. makuwa] | e-value: 4.48e-288 | % identity: 82.36
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
        MENGFDGRSLAEKFS L V+A P EQ NSH   NH N++DSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKV E LA+RFHST+QWNE+
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN

Query:  DH-GSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPV
        DH GSNGGHQSDNSVDNER RFKN+IS VDSHGTLV+H+DVEQKDEVSMR+D ESR+ D KSD +VNALPGVQP VDNAG SQFSSPSTTSFSASR F +
Subjt:  DH-GSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPV

Query:  DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
        D EYDP+IKLSGHG+M KAE NNP SLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
Subjt:  DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII

Query:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS
        EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFA QSPFHS+GATLTAS
Subjt:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS

Query:  --------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS
                                              IGLGV V KNLEPDDLGRYS HASSE TNKQVTFREPVSNSE+DD DVVHQT+R+P+TNWSS
Subjt:  --------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS

Query:  RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
         +SPP AT DEPSSSHSPILPPVLEEPSPSFSE
Subjt:  RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE

XP_011649437.1 uncharacterized protein LOC101216431 isoform X1 [Cucumis sativus] | e-value: 3.36e-288 | % identity: 82.74
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
        MENGFDGRSLAEKFS L V+A   EQ NSH   NH N+NDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKV E LA+RFHST+QWNEN
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN

Query:  DH-GSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPV
        DH GSNGGHQSDNSVDNER RFKNNIS VDSHGTLV+H+DVEQKDEVSMR+D ESR+ D KSD +VNALPGVQP VDNAG SQFSSPSTTSFSASR F +
Subjt:  DH-GSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPV

Query:  DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
        D EYDP+IKLSGHG+M KAE NNP SLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
Subjt:  DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII

Query:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS
        EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFA QSPFHS+GATLTAS
Subjt:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS

Query:  --------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS
                                              IGLGV V KNLEPDDLGRYS HASSE TNKQVTFREPVSNSE+DD DVVHQT+R+P+TNWSS
Subjt:  --------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS

Query:  RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
         +SPP AT DEPSSSHSPILPPVLEEPSPSFSE
Subjt:  RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE

XP_022158445.1 uncharacterized protein LOC111024932 isoform X1 [Momordica charantia] | e-value: 0.0 | % identity: 92.5
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE
        MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY KVDESLARRFHSTDQWNE
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE

Query:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPV
        NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASR FPV
Subjt:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPV

Query:  DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
        DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
Subjt:  DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII

Query:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS
        EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS
Subjt:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS

Query:  --------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS
                                              IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS
Subjt:  --------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS

Query:  RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
Subjt:  RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE

XP_022158446.1 uncharacterized protein LOC111024932 isoform X2 [Momordica charantia] | e-value: 0.0 | % identity: 92.67
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
        MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN

Query:  DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPVD
        DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASR FPVD
Subjt:  DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPVD

Query:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
        GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
Subjt:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE

Query:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-
        ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS 
Subjt:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-

Query:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
                                             IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
Subjt:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR

Query:  ESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        ESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
Subjt:  ESPPVATLDEPSSSHSPILPPVLEEPSPSFSE

XP_022158447.1 uncharacterized protein LOC111024932 isoform X3 [Momordica charantia] | e-value: 0.0 | % identity: 92.5
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE
        MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY KVDESLARRFHSTDQWNE
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE

Query:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPV
        NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASR FPV
Subjt:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPV

Query:  DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
        DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
Subjt:  DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII

Query:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS
        EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS
Subjt:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS

Query:  --------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS
                                              IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS
Subjt:  --------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS

Query:  RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
Subjt:  RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3CDV6 uncharacterized protein LOC103499606 isoform X1 | e-value: 4.72e-285 | % identity: 82.26
Query:  GFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNENDH-
        GFDGRSLAEKFS L V+A P EQ NSH   NH N++DSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKV E LA+RFHST+QWNE+DH 
Subjt:  GFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNENDH-

Query:  GSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPVDGE
        GSNGGHQSDNSVDNER RFKN+IS VDSHGTLV+H+DVEQKDEVSMR+D ESR+ D KSD +VNALPGVQP VDNAG SQFSSPSTTSFSASR F +D E
Subjt:  GSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPVDGE

Query:  YDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEEN
        YDP+IKLSGHG+M KAE NNP SLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEEN
Subjt:  YDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEEN

Query:  IRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS---
        IRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFA QSPFHS+GATLTAS   
Subjt:  IRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS---

Query:  -----------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSRES
                                           IGLGV V KNLEPDDLGRYS HASSE TNKQVTFREPVSNSE+DD DVVHQT+R+P+TNWSS +S
Subjt:  -----------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSRES

Query:  PPVATLDEPSSSHSPILPPVLEEPSPSFSE
        PP AT DEPSSSHSPILPPVLEEPSPSFSE
Subjt:  PPVATLDEPSSSHSPILPPVLEEPSPSFSE

A0A5A7TCU1 Uncharacterized protein | e-value: 2.17e-288 | % identity: 82.36
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
        MENGFDGRSLAEKFS L V+A P EQ NSH   NH N++DSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKV E LA+RFHST+QWNE+
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN

Query:  DH-GSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPV
        DH GSNGGHQSDNSVDNER RFKN+IS VDSHGTLV+H+DVEQKDEVSMR+D ESR+ D KSD +VNALPGVQP VDNAG SQFSSPSTTSFSASR F +
Subjt:  DH-GSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPV

Query:  DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
        D EYDP+IKLSGHG+M KAE NNP SLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
Subjt:  DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII

Query:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS
        EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFA QSPFHS+GATLTAS
Subjt:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS

Query:  --------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS
                                              IGLGV V KNLEPDDLGRYS HASSE TNKQVTFREPVSNSE+DD DVVHQT+R+P+TNWSS
Subjt:  --------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS

Query:  RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
         +SPP AT DEPSSSHSPILPPVLEEPSPSFSE
Subjt:  RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE

A0A6J1DVU9 uncharacterized protein LOC111024932 isoform X1 | e-value: 0.0 | % identity: 92.5
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE
        MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY KVDESLARRFHSTDQWNE
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE

Query:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPV
        NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASR FPV
Subjt:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPV

Query:  DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
        DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
Subjt:  DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII

Query:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS
        EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS
Subjt:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS

Query:  --------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS
                                              IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS
Subjt:  --------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS

Query:  RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
Subjt:  RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE

A0A6J1DX87 uncharacterized protein LOC111024932 isoform X3 | e-value: 0.0 | % identity: 92.5
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE
        MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY KVDESLARRFHSTDQWNE
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE

Query:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPV
        NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASR FPV
Subjt:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPV

Query:  DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
        DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII
Subjt:  DGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDII

Query:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS
        EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS
Subjt:  EENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS

Query:  --------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS
                                              IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS
Subjt:  --------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSS

Query:  RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
Subjt:  RESPPVATLDEPSSSHSPILPPVLEEPSPSFSE

A0A6J1E0X2 uncharacterized protein LOC111024932 isoform X2 | e-value: 0.0 | % identity: 92.67
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
        MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN

Query:  DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPVD
        DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASR FPVD
Subjt:  DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPVD

Query:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
        GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
Subjt:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE

Query:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-
        ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS 
Subjt:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-

Query:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
                                             IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
Subjt:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR

Query:  ESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        ESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
Subjt:  ESPPVATLDEPSSSHSPILPPVLEEPSPSFSE

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT3G03560.1 unknown protein | e-value: 1.6e-17 | % identity: 36.6
Query:  KAEANNPKSLWKQD---LVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEENIRLTYALQEAQ
        K E  N K +   +   L  KV+  E+EI  L++ +A   +K+ Q+ NEKY LE++ A +R+A D++Q + V +A   L+ R+  +EEN++L + L+  +
Subjt:  KAEANNPKSLWKQD---LVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEENIRLTYALQEAQ

Query:  QERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKE
         ER  F++SLL LLAEY + P V +A +I S +K L   LQ K      +++E
Subjt:  QERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKE

AT5G08440.1 unknown protein | e-value: 3.2e-103 | % identity: 49.61
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSN--SNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWN
        M+NG + R LAE+FSG+ +           SS +H N   NDS LFQV+KAVEAAEATIKQQVEENN L+ ELQ++  EL KYK  ESL +     +  N
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSN--SNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWN

Query:  ENDHGSNGGHQSDNSVD-NERHRFKNNISAVDSHGTLVVHRDVEQK-DEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQF-SSPSTTSFSASR
            GS+  HQS   +   +R + K N SA    G LVVH+ V    +E ++    E  +++G    + N +  V+ +V   G SQ  SSPST S S  R
Subjt:  ENDHGSNGGHQSDNSVD-NERHRFKNNISAVDSHGTLVVHRDVEQK-DEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQF-SSPSTTSFSASR

Query:  QFPVDGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYR
           ++G++D  I  S H LM   E NN  + WKQ+L+ KVQE + EI++LRK+LADYS KE QIRNEKYVLEKRIA+MR AFDQQQQDLVDAASKALSYR
Subjt:  QFPVDGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYR

Query:  QDIIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGAT
        Q+IIEENIRLTYALQ A+QER+ FVS LLPLL+EYSL P + D+QSI+S+VK+LF+HLQEKL +TETKLKE++YQL PW+SD +HS+ +  SP+  VG  
Subjt:  QDIIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGAT

Query:  LTASIG--------LGVDVAKNLEPDDLGRYS-----------LHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSRESPPVATLDEPSSS
        L  S           G   A N   D     S               S   N +V FREP+SN+ MDD     Q D +     ++ E+     +D+PS S
Subjt:  LTASIG--------LGVDVAKNLEPDDLGRYS-----------LHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSRESPPVATLDEPSSS

Query:  HSPILPPVLEEPSPSFSE
        + PIL PVLEEPS SFSE
Subjt:  HSPILPPVLEEPSPSFSE

AT5G08440.2 unknown protein | e-value: 1.1e-90 | % identity: 44.33
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSN--SNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWN
        M+NG + R LAE+FSG+ +           SS +H N   NDS LFQV+KAVEAAEATIKQQVEENN L+ ELQ++  EL KYK  ESL +     +  N
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSN--SNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWN

Query:  ENDHGSNGGHQSDNSVD-NERHRFKNNISAVDSHGTLVVHRDVEQK-DEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQF-SSPSTTSFSASR
            GS+  HQS   +   +R + K N SA    G LVVH+ V    +E ++    E  +++G    + N +  V+ +V   G SQ  SSPST S S  R
Subjt:  ENDHGSNGGHQSDNSVD-NERHRFKNNISAVDSHGTLVVHRDVEQK-DEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQF-SSPSTTSFSASR

Query:  QFPVDGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYR
           ++G++D  I  S H LM   E NN  + WKQ+L+ KVQE + EI++LRK+LADYS KE QIRNEKYVLEKRIA+MR AFDQQQQDLVDAASKALSYR
Subjt:  QFPVDGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYR

Query:  QDIIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLT-----------------------------------
        Q+IIEENIRLTYALQ A+QER+ FVS LLPLL+EYSL P + D+QSI+S+VKI       KL                                      
Subjt:  QDIIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLT-----------------------------------

Query:  -----------ETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTASIG--------LGVDVAKNLEPDDLGRYS-----------LHASSEATNKQ
                    TKLKE++YQL PW+SD +HS+ +  SP+  VG  L  S           G   A N   D     S               S   N +
Subjt:  -----------ETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTASIG--------LGVDVAKNLEPDDLGRYS-----------LHASSEATNKQ

Query:  VTFREPVSNSEMDDPDVVHQTDRDPLTNWSSRESPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        V FREP+SN+ MDD     Q D +     ++ E+     +D+PS S+ PIL PVLEEPS SFSE
Subjt:  VTFREPVSNSEMDDPDVVHQTDRDPLTNWSSRESPPVATLDEPSSSHSPILPPVLEEPSPSFSE

AT5G23490.1 unknown protein | e-value: 6.3e-99 | % identity: 48.29
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
        MENG + R LAE+FSGL         F   S    +   + NLFQV+KAVEAAE TIK+QVEEN+RL+ ELQ+   EL KYK DESL +  +  D  N  
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN

Query:  DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPVD
           S   HQ    VD +    K   S  DS G LVVH  V    E        +R+     + I N    V+ ++D  G SQF S    S S  R   ++
Subjt:  DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPVD

Query:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
        GE+D     S HG M   E N+  + WKQDL+ KVQE E EI QLR++L D S+KEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDA+SKALSYRQ+IIE
Subjt:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE

Query:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTASI
        ENIRLTYALQ  QQER+TFVS LLPLL+EYSLQP V DAQSI+SNVK+LFKHLQEKLLLTETKLKES+YQL PW+SD +HS+ +  +P  S G  LT S 
Subjt:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTASI

Query:  ----------------------GLGVDVAKNLEPDDLGRYSLHASSEATNKQV------TFREPVSNSEMDDPDVVHQTDRDPL--TNWSSRESPPV-AT
                                G    +N   DD   +S   +S++   ++      +  E  ++ ++D+    H    +P+  T     ++P   + 
Subjt:  ----------------------GLGVDVAKNLEPDDLGRYSLHASSEATNKQV------TFREPVSNSEMDDPDVVHQTDRDPL--TNWSSRESPPV-AT

Query:  LDEPSSSHSPILPPVLEEPSPSFSEG
         D+PSSS+SP+L PV EEPS SFSEG
Subjt:  LDEPSSSHSPILPPVLEEPSPSFSEG


Sequences
CDS sequence
ATGGAGAATGGTTTCGACGGGAGATCGTTGGCTGAGAAGTTCTCCGGATTGGCCGTCACAGCTGCTCCACCGGAGCAATTTAACTCCCACTCGTCCAACAATCACAGTAA
CAGCAACGACAGCAACTTGTTTCAGGTCTTGAAAGCTGTTGAAGCAGCGGAGGCTACCATCAAGCAACAGGTGGAGGAAAATAATCGTCTGAGGATCGAACTTCAGAAAA
AGATCCAGGAACTGGAGAAATATAAGGTTGATGAATCTTTGGCTCGAAGGTTTCATTCCACAGACCAGTGGAATGAGAATGACCATGGATCAAATGGGGGTCATCAATCA
GATAATTCAGTCGATAATGAAAGGCATAGGTTTAAGAATAACATTTCTGCAGTTGATTCACATGGAACACTAGTTGTCCATCGAGATGTTGAGCAAAAAGATGAAGTTTC
CATGCGTATTGATAAAGAATCTCGTTATGCGGACGGCAAGTCTGACGAAATAGTGAATGCTCTTCCGGGTGTTCAGCCTTCAGTTGATAATGCTGGTTACTCGCAGTTCT
CTTCACCATCTACAACATCCTTCTCTGCTAGCAGGCAGTTTCCAGTGGATGGAGAATATGATCCACAGATTAAGTTGTCTGGACATGGCCTGATGTCAAAGGCTGAAGCA
AACAATCCCAAGAGTCTCTGGAAGCAGGATCTTGTTGTTAAAGTCCAGGAACATGAAGATGAAATTGTGCAGTTACGCAAGCATCTTGCTGATTATTCTATCAAGGAAGC
ACAAATACGAAATGAAAAATATGTTCTGGAAAAACGTATTGCCTATATGCGTTTGGCCTTTGATCAACAACAACAAGACCTTGTTGATGCTGCTTCTAAAGCTCTCTCAT
ACAGACAAGACATAATTGAGGAAAATATACGTCTTACATATGCATTGCAGGAAGCACAGCAAGAGAGAACTACATTTGTATCATCTTTGCTGCCTCTTCTTGCGGAATAT
TCTCTGCAGCCTCCTGTTCCTGATGCTCAGTCCATTATCAGCAATGTCAAGATTCTATTTAAGCACTTGCAGGAGAAACTCCTTCTCACTGAGACAAAATTGAAAGAGTC
GCAGTATCAATTAACACCTTGGCGCTCTGATGCAAGCCACTCGAGTTTTGCACAACAGTCACCTTTCCACTCAGTTGGTGCAACCTTAACTGCTTCAATTGGTTTAGGTG
TTGATGTTGCAAAAAACTTGGAACCAGATGATTTGGGGAGGTATTCACTTCATGCAAGCAGTGAAGCAACAAACAAACAGGTGACATTTCGTGAACCTGTAAGCAACAGT
GAGATGGATGACCCGGATGTTGTACACCAAACAGATAGAGATCCTCTCACCAACTGGAGTTCTCGGGAATCTCCCCCTGTTGCCACTCTCGACGAGCCAAGCTCATCTCA
TTCTCCAATTCTGCCTCCAGTCCTTGAGGAACCTTCACCTTCATTTTCTGAAGGCAAATATCTCCCTGTTTTTGTTGTTAGCATGTCAGTCTTA
mRNA sequence
ATAAATTTTTAGGTTCCTAAATAATAAAATAGTAAGTGAAGTTTAAAATGTGTCTAAAGAAAATCCTAGAAACTTTCCTCAAAAAAAAAAAAAAAAAGAAGAAGAAGAAG
AAGAAGAAAAATCCTAGAAACGAGGATGGGGGCATGAAATAATCTAATAAACAATGGCGACGAGCCGTCGTGTAATTCACTCACTCGGACTCTTGGAGATGGAGAAGTGA
AGCGCGAAGGAGCCGACTTTCATGCAAACAGAAGACGAAGGATTTGATTGAGATTGATTGATTGATTCATTGAGGGTGCAAAAAGAGTGGTTTCGTTCGAGCTAAAGTTG
AAGCTAAGTCGAGTTACAGTTCATCCTTTCTCTGTCATCTTCATTTCTTCCATTATTTATTGGGTAGAAAGAAATCATCAAACCCCCATCACCATCTGTGAATCATTGCG
GGACAGATAGAGAATTGAGATTTTTAAGTTATAGGTCCTCTTTTTGCTCTGCTCCCACTGGGTTAGGGTTTTTTTTTGTTGGGGTTCACTCGTTGTTTGTATGTCATTTA
TCTCACTATCTCCGAGTTGATTTAATGTGCGGAAGAGGGATGTTTCAGACCCACGACCCAATTCTGCTGCTGAATTTGACGCCAATTTGAAAGACATAGTCATTCTGGGT
CGGATCATTCGTCTATCGGCTGTTTTGAGCTGAATTTTGAGCTTAAAGTTGAAGATGGAGAATGGTTTCGACGGGAGATCGTTGGCTGAGAAGTTCTCCGGATTGGCCGT
CACAGCTGCTCCACCGGAGCAATTTAACTCCCACTCGTCCAACAATCACAGTAACAGCAACGACAGCAACTTGTTTCAGGTCTTGAAAGCTGTTGAAGCAGCGGAGGCTA
CCATCAAGCAACAGGTGGAGGAAAATAATCGTCTGAGGATCGAACTTCAGAAAAAGATCCAGGAACTGGAGAAATATAAGGTTGATGAATCTTTGGCTCGAAGGTTTCAT
TCCACAGACCAGTGGAATGAGAATGACCATGGATCAAATGGGGGTCATCAATCAGATAATTCAGTCGATAATGAAAGGCATAGGTTTAAGAATAACATTTCTGCAGTTGA
TTCACATGGAACACTAGTTGTCCATCGAGATGTTGAGCAAAAAGATGAAGTTTCCATGCGTATTGATAAAGAATCTCGTTATGCGGACGGCAAGTCTGACGAAATAGTGA
ATGCTCTTCCGGGTGTTCAGCCTTCAGTTGATAATGCTGGTTACTCGCAGTTCTCTTCACCATCTACAACATCCTTCTCTGCTAGCAGGCAGTTTCCAGTGGATGGAGAA
TATGATCCACAGATTAAGTTGTCTGGACATGGCCTGATGTCAAAGGCTGAAGCAAACAATCCCAAGAGTCTCTGGAAGCAGGATCTTGTTGTTAAAGTCCAGGAACATGA
AGATGAAATTGTGCAGTTACGCAAGCATCTTGCTGATTATTCTATCAAGGAAGCACAAATACGAAATGAAAAATATGTTCTGGAAAAACGTATTGCCTATATGCGTTTGG
CCTTTGATCAACAACAACAAGACCTTGTTGATGCTGCTTCTAAAGCTCTCTCATACAGACAAGACATAATTGAGGAAAATATACGTCTTACATATGCATTGCAGGAAGCA
CAGCAAGAGAGAACTACATTTGTATCATCTTTGCTGCCTCTTCTTGCGGAATATTCTCTGCAGCCTCCTGTTCCTGATGCTCAGTCCATTATCAGCAATGTCAAGATTCT
ATTTAAGCACTTGCAGGAGAAACTCCTTCTCACTGAGACAAAATTGAAAGAGTCGCAGTATCAATTAACACCTTGGCGCTCTGATGCAAGCCACTCGAGTTTTGCACAAC
AGTCACCTTTCCACTCAGTTGGTGCAACCTTAACTGCTTCAATTGGTTTAGGTGTTGATGTTGCAAAAAACTTGGAACCAGATGATTTGGGGAGGTATTCACTTCATGCA
AGCAGTGAAGCAACAAACAAACAGGTGACATTTCGTGAACCTGTAAGCAACAGTGAGATGGATGACCCGGATGTTGTACACCAAACAGATAGAGATCCTCTCACCAACTG
GAGTTCTCGGGAATCTCCCCCTGTTGCCACTCTCGACGAGCCAAGCTCATCTCATTCTCCAATTCTGCCTCCAGTCCTTGAGGAACCTTCACCTTCATTTTCTGAAGGCA
AATATCTCCCTGTTTTTGTTGTTAGCATGTCAGTCTTA
Protein sequence
MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNENDHGSNGGHQS
DNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASRQFPVDGEYDPQIKLSGHGLMSKAEA
NNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEENIRLTYALQEAQQERTTFVSSLLPLLAEY
SLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTASIGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNS
EMDDPDVVHQTDRDPLTNWSSRESPPVATLDEPSSSHSPILPPVLEEPSPSFSEGKYLPVFVVSMSVL
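
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not the database's own pipeline: `translate` and `CODON_TABLE` are hypothetical names introduced here, and the code assumes the standard nuclear code (NCBI translation table 1) with the CDS starting in frame at its first base.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string in frame 1; whitespace and case are ignored.

    Stop codons become '*', codons with non-ACGT characters become 'X',
    and a trailing partial codon is dropped.
    """
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First four and last three codons of the CDS listed above.
print(translate("ATGGAGAATGGT"))  # MENG
print(translate("TCAGTCTTA"))     # SVL
```

Pasting the full CDS (line breaks are ignored) and comparing the result with the listed protein sequence is a quick consistency check. The listed CDS ends in TTA (Leu) rather than a stop codon, so no trailing `*` is expected.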