; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS016342 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS016342
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold1038:268913..294948
RNA-Seq ExpressionMS016342
SyntenyMS016342
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040738.1 uncharacterized protein E6C27_scaffold703G00010 [Cucumis melo var. makuwa]2.1e-22982.52Show/hide
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
        MENGFDGRSLAEKFS L V+A P EQ NSH   NH N++DSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKV E LA+RFHST+QWNE+
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN

Query:  D-HGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVD
        D HGSNGGHQSDNSVDNER RFKN+IS VDSHGTLV+H+DVEQKDEVSMR+D ESR+ D KSD +VNALPGVQP VDNAG SQFSSPSTTSFSAS F +D
Subjt:  D-HGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVD

Query:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
         EYDP+IKLSGHG+M KAE NNP SLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
Subjt:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE

Query:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-
        ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFA QSPFHS+GATLTAS 
Subjt:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-

Query:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
                                             IGLGV V KNLEPDDLGRYS HASSE TNKQVTFREPVSNSE+DD DVVHQT+R+P+TNWSS 
Subjt:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR

Query:  QSPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        QSPP AT DEPSSSHSPILPPVLEEPSPSFSE
Subjt:  QSPPVATLDEPSSSHSPILPPVLEEPSPSFSE

XP_011649437.1 uncharacterized protein LOC101216431 isoform X1 [Cucumis sativus]3.2e-23082.89Show/hide
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
        MENGFDGRSLAEKFS L V+A   EQ NSH   NH N+NDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKV E LA+RFHST+QWNEN
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN

Query:  D-HGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVD
        D HGSNGGHQSDNSVDNER RFKNNIS VDSHGTLV+H+DVEQKDEVSMR+D ESR+ D KSD +VNALPGVQP VDNAG SQFSSPSTTSFSAS F +D
Subjt:  D-HGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVD

Query:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
         EYDP+IKLSGHG+M KAE NNP SLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
Subjt:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE

Query:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-
        ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFA QSPFHS+GATLTAS 
Subjt:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-

Query:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
                                             IGLGV V KNLEPDDLGRYS HASSE TNKQVTFREPVSNSE+DD DVVHQT+R+P+TNWSS 
Subjt:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR

Query:  QSPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        QSPP AT DEPSSSHSPILPPVLEEPSPSFSE
Subjt:  QSPPVATLDEPSSSHSPILPPVLEEPSPSFSE

XP_022158445.1 uncharacterized protein LOC111024932 isoform X1 [Momordica charantia]6.0e-26192.29Show/hide
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE
        MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY KVDESLARRFHSTDQWNE
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE

Query:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVD
        NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS FPVD
Subjt:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVD

Query:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
        GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
Subjt:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE

Query:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-
        ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS 
Subjt:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-

Query:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
                                             IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
Subjt:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR

Query:  QSPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        +SPPVATLDEPSSSHSPILPPVLEEPSPSFSE
Subjt:  QSPPVATLDEPSSSHSPILPPVLEEPSPSFSE

XP_022158446.1 uncharacterized protein LOC111024932 isoform X2 [Momordica charantia]2.5e-26292.47Show/hide
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
        MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN

Query:  DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVDG
        DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS FPVDG
Subjt:  DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVDG

Query:  EYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEE
        EYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEE
Subjt:  EYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEE

Query:  NIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS--
        NIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS  
Subjt:  NIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS--

Query:  ------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSRQ
                                            IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR+
Subjt:  ------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSRQ

Query:  SPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        SPPVATLDEPSSSHSPILPPVLEEPSPSFSE
Subjt:  SPPVATLDEPSSSHSPILPPVLEEPSPSFSE

XP_022158447.1 uncharacterized protein LOC111024932 isoform X3 [Momordica charantia]6.0e-26192.29Show/hide
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE
        MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY KVDESLARRFHSTDQWNE
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE

Query:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVD
        NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS FPVD
Subjt:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVD

Query:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
        GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
Subjt:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE

Query:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-
        ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS 
Subjt:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-

Query:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
                                             IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
Subjt:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR

Query:  QSPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        +SPPVATLDEPSSSHSPILPPVLEEPSPSFSE
Subjt:  QSPPVATLDEPSSSHSPILPPVLEEPSPSFSE

TrEMBL top hitse value%identityAlignment
A0A1S3CDV6 uncharacterized protein LOC103499606 isoform X19.5e-22882.42Show/hide
Query:  GFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEND-H
        GFDGRSLAEKFS L V+A P EQ NSH   NH N++DSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKV E LA+RFHST+QWNE+D H
Subjt:  GFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEND-H

Query:  GSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVDGEY
        GSNGGHQSDNSVDNER RFKN+IS VDSHGTLV+H+DVEQKDEVSMR+D ESR+ D KSD +VNALPGVQP VDNAG SQFSSPSTTSFSAS F +D EY
Subjt:  GSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVDGEY

Query:  DPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEENI
        DP+IKLSGHG+M KAE NNP SLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEENI
Subjt:  DPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEENI

Query:  RLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS----
        RLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFA QSPFHS+GATLTAS    
Subjt:  RLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS----

Query:  ----------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSRQSP
                                          IGLGV V KNLEPDDLGRYS HASSE TNKQVTFREPVSNSE+DD DVVHQT+R+P+TNWSS QSP
Subjt:  ----------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSRQSP

Query:  PVATLDEPSSSHSPILPPVLEEPSPSFSE
        P AT DEPSSSHSPILPPVLEEPSPSFSE
Subjt:  PVATLDEPSSSHSPILPPVLEEPSPSFSE

A0A5A7TCU1 Uncharacterized protein1.0e-22982.52Show/hide
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
        MENGFDGRSLAEKFS L V+A P EQ NSH   NH N++DSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKV E LA+RFHST+QWNE+
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN

Query:  D-HGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVD
        D HGSNGGHQSDNSVDNER RFKN+IS VDSHGTLV+H+DVEQKDEVSMR+D ESR+ D KSD +VNALPGVQP VDNAG SQFSSPSTTSFSAS F +D
Subjt:  D-HGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVD

Query:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
         EYDP+IKLSGHG+M KAE NNP SLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
Subjt:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE

Query:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-
        ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFA QSPFHS+GATLTAS 
Subjt:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-

Query:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
                                             IGLGV V KNLEPDDLGRYS HASSE TNKQVTFREPVSNSE+DD DVVHQT+R+P+TNWSS 
Subjt:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR

Query:  QSPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        QSPP AT DEPSSSHSPILPPVLEEPSPSFSE
Subjt:  QSPPVATLDEPSSSHSPILPPVLEEPSPSFSE

A0A6J1DVU9 uncharacterized protein LOC111024932 isoform X12.9e-26192.29Show/hide
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE
        MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY KVDESLARRFHSTDQWNE
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE

Query:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVD
        NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS FPVD
Subjt:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVD

Query:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
        GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
Subjt:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE

Query:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-
        ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS 
Subjt:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-

Query:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
                                             IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
Subjt:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR

Query:  QSPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        +SPPVATLDEPSSSHSPILPPVLEEPSPSFSE
Subjt:  QSPPVATLDEPSSSHSPILPPVLEEPSPSFSE

A0A6J1DX87 uncharacterized protein LOC111024932 isoform X32.9e-26192.29Show/hide
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE
        MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY KVDESLARRFHSTDQWNE
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKY-KVDESLARRFHSTDQWNE

Query:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVD
        NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS FPVD
Subjt:  NDHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVD

Query:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
        GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE
Subjt:  GEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIE

Query:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-
        ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS 
Subjt:  ENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS-

Query:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
                                             IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR
Subjt:  -------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR

Query:  QSPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        +SPPVATLDEPSSSHSPILPPVLEEPSPSFSE
Subjt:  QSPPVATLDEPSSSHSPILPPVLEEPSPSFSE

A0A6J1E0X2 uncharacterized protein LOC111024932 isoform X21.2e-26292.47Show/hide
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
        MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN

Query:  DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVDG
        DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS FPVDG
Subjt:  DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSAS-FPVDG

Query:  EYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEE
        EYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEE
Subjt:  EYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEE

Query:  NIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS--
        NIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS  
Subjt:  NIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTAS--

Query:  ------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSRQ
                                            IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSR+
Subjt:  ------------------------------------IGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSRQ

Query:  SPPVATLDEPSSSHSPILPPVLEEPSPSFSE
        SPPVATLDEPSSSHSPILPPVLEEPSPSFSE
Subjt:  SPPVATLDEPSSSHSPILPPVLEEPSPSFSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G03560.1 unknown protein1.6e-1736.6Show/hide
Query:  KAEANNPKSLWKQD---LVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEENIRLTYALQEAQ
        K E  N K +   +   L  KV+  E+EI  L++ +A   +K+ Q+ NEKY LE++ A +R+A D++Q + V +A   L+ R+  +EEN++L + L+  +
Subjt:  KAEANNPKSLWKQD---LVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEENIRLTYALQEAQ

Query:  QERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKE
         ER  F++SLL LLAEY + P V +A +I S +K L   LQ K      +++E
Subjt:  QERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKE

AT5G08440.1 unknown protein4.2e-10350.29Show/hide
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSN--SNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWN
        M+NG + R LAE+FSG+ +           SS +H N   NDS LFQV+KAVEAAEATIKQQVEENN L+ ELQ++  EL KYK  ESL +     +  N
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSN--SNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWN

Query:  ENDHGSNGGHQSDNSVD-NERHRFKNNISAVDSHGTLVVHRDVEQK-DEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQF-SSPSTTSFSASF
            GS+  HQS   +   +R + K N SA    G LVVH+ V    +E ++    E  +++G    + N +  V+ +V   G SQ  SSPST S S   
Subjt:  ENDHGSNGGHQSDNSVD-NERHRFKNNISAVDSHGTLVVHRDVEQK-DEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQF-SSPSTTSFSASF

Query:  P-VDGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQ
        P ++G++D  I  S H LM   E NN  + WKQ+L+ KVQE + EI++LRK+LADYS KE QIRNEKYVLEKRIA+MR AFDQQQQDLVDAASKALSYRQ
Subjt:  P-VDGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQ

Query:  DIIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATL
        +IIEENIRLTYALQ A+QER+ FVS LLPLL+EYSL P + D+QSI+S+VK+LF+HLQEKL +TETKLKE++YQL PW+SD +HS+ +  SP+  VG  L
Subjt:  DIIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATL

Query:  TASIG--------LGVDVAKNLEPDDLGRYS-----------LHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSRQSPPVATLDEPSSSH
          S           G   A N   D     S               S   N +V FREP+SN+ MDD     Q D    +N +   S  VA +D+PS S+
Subjt:  TASIG--------LGVDVAKNLEPDDLGRYS-----------LHASSEATNKQVTFREPVSNSEMDDPDVVHQTDRDPLTNWSSRQSPPVATLDEPSSSH

Query:  SPILPPVLEEPSPSFSE
         PIL PVLEEPS SFSE
Subjt:  SPILPPVLEEPSPSFSE

AT5G08440.2 unknown protein1.4e-9044.94Show/hide
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSN--SNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWN
        M+NG + R LAE+FSG+ +           SS +H N   NDS LFQV+KAVEAAEATIKQQVEENN L+ ELQ++  EL KYK  ESL +     +  N
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSN--SNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWN

Query:  ENDHGSNGGHQSDNSVD-NERHRFKNNISAVDSHGTLVVHRDVEQK-DEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQF-SSPSTTSFSASF
            GS+  HQS   +   +R + K N SA    G LVVH+ V    +E ++    E  +++G    + N +  V+ +V   G SQ  SSPST S S   
Subjt:  ENDHGSNGGHQSDNSVD-NERHRFKNNISAVDSHGTLVVHRDVEQK-DEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQF-SSPSTTSFSASF

Query:  P-VDGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQ
        P ++G++D  I  S H LM   E NN  + WKQ+L+ KVQE + EI++LRK+LADYS KE QIRNEKYVLEKRIA+MR AFDQQQQDLVDAASKALSYRQ
Subjt:  P-VDGEYDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQ

Query:  DIIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLT------------------------------------
        +IIEENIRLTYALQ A+QER+ FVS LLPLL+EYSL P + D+QSI+S+VKI       KL                                       
Subjt:  DIIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLT------------------------------------

Query:  ----------ETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTASIG--------LGVDVAKNLEPDDLGRYS-----------LHASSEATNKQV
                   TKLKE++YQL PW+SD +HS+ +  SP+  VG  L  S           G   A N   D     S               S   N +V
Subjt:  ----------ETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTASIG--------LGVDVAKNLEPDDLGRYS-----------LHASSEATNKQV

Query:  TFREPVSNSEMDDPDVVHQTDRDPLTNWSSRQSPPVATLDEPSSSHSPILPPVLEEPSPSFSE
         FREP+SN+ MDD     Q D    +N +   S  VA +D+PS S+ PIL PVLEEPS SFSE
Subjt:  TFREPVSNSEMDDPDVVHQTDRDPLTNWSSRQSPPVATLDEPSSSHSPILPPVLEEPSPSFSE

AT5G23490.1 unknown protein8.2e-9948.09Show/hide
Query:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN
        MENG + R LAE+FSGL         F   S    +   + NLFQV+KAVEAAE TIK+QVEEN+RL+ ELQ+   EL KYK DESL +  +  D  N  
Subjt:  MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNEN

Query:  DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASFPVDGE
           S   HQ    VD +    K   S  DS G LVVH  V    E        +R+     + I N    V+ ++D  G SQF S  +        ++GE
Subjt:  DHGSNGGHQSDNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASFPVDGE

Query:  YDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEEN
        +D     S HG M   E N+  + WKQDL+ KVQE E EI QLR++L D S+KEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDA+SKALSYRQ+IIEEN
Subjt:  YDPQIKLSGHGLMSKAEANNPKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEEN

Query:  IRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTASI--
        IRLTYALQ  QQER+TFVS LLPLL+EYSLQP V DAQSI+SNVK+LFKHLQEKLLLTETKLKES+YQL PW+SD +HS+ +  +P  S G  LT S   
Subjt:  IRLTYALQEAQQERTTFVSSLLPLLAEYSLQPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTASI--

Query:  --------------------GLGVDVAKNLEPDDLGRYSLHASSEATNKQV------TFREPVSNSEMDDPDVVHQTDRDPL--TNWSSRQSPPV-ATLD
                              G    +N   DD   +S   +S++   ++      +  E  ++ ++D+    H    +P+  T     Q+P   +  D
Subjt:  --------------------GLGVDVAKNLEPDDLGRYSLHASSEATNKQV------TFREPVSNSEMDDPDVVHQTDRDPL--TNWSSRQSPPV-ATLD

Query:  EPSSSHSPILPPVLEEPSPSFSEG
        +PSSS+SP+L PV EEPS SFSEG
Subjt:  EPSSSHSPILPPVLEEPSPSFSEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAATGGTTTCGACGGGAGATCGTTGGCTGAGAAGTTCTCCGGATTGGCCGTCACAGCTGCTCCACCGGAGCAATTTAACTCCCACTCGTCCAACAATCACAGTAA
CAGCAACGACAGCAACTTGTTTCAGGTCTTGAAAGCTGTTGAAGCAGCGGAGGCTACCATCAAGCAACAGGTGGAGGAAAATAATCGTCTGAGGATCGAACTTCAGAAAA
AGATCCAGGAACTGGAGAAATATAAGGTTGATGAATCTTTGGCTCGAAGGTTTCATTCCACAGACCAGTGGAATGAGAATGACCATGGATCAAATGGGGGTCATCAATCA
GATAATTCAGTCGATAATGAAAGGCATAGGTTTAAGAATAACATTTCTGCAGTTGATTCACATGGAACACTAGTTGTCCATCGAGATGTTGAGCAAAAAGATGAAGTTTC
CATGCGTATTGATAAAGAATCTCGTTATGCGGACGGCAAGTCTGACGAAATAGTGAATGCTCTTCCGGGTGTTCAGCCTTCAGTTGATAATGCTGGTTACTCACAGTTCT
CTTCACCATCTACAACATCCTTCTCTGCTAGCTTTCCAGTGGATGGAGAATATGATCCACAGATTAAGTTGTCTGGACATGGCCTGATGTCAAAGGCTGAAGCAAATAAT
CCCAAGAGTCTCTGGAAGCAGGATCTTGTTGTTAAAGTCCAGGAACATGAAGATGAAATTGTGCAGTTACGCAAGCATCTTGCTGATTATTCTATCAAGGAAGCACAAAT
ACGAAATGAAAAATATGTTCTGGAAAAACGTATTGCCTATATGCGTTTGGCCTTTGATCAACAACAACAAGACCTTGTTGATGCTGCTTCTAAAGCTCTCTCATACAGAC
AAGACATAATTGAGGAAAATATACGTCTTACATATGCATTGCAGGAAGCACAGCAAGAGAGAACTACATTTGTATCATCTTTGCTGCCTCTTCTTGCGGAATATTCTCTG
CAGCCTCCTGTTCCTGATGCTCAGTCCATTATCAGCAATGTCAAGATTCTATTTAAGCACTTGCAGGAGAAACTCCTTCTCACTGAGACAAAATTGAAAGAGTCACAGTA
TCAATTAACACCTTGGCGCTCTGATGCAAGCCACTCGAGTTTTGCACAACAGTCACCTTTCCACTCAGTTGGTGCAACCTTAACTGCTTCAATTGGTTTAGGTGTTGATG
TTGCAAAAAACTTGGAGCCAGATGATTTGGGGAGGTATTCACTTCATGCAAGCAGTGAAGCAACAAACAAACAGGTGACATTTCGTGAACCTGTAAGCAACAGTGAGATG
GATGACCCGGATGTTGTACACCAAACAGATAGAGATCCTCTCACCAACTGGAGTTCTCGGCAATCTCCCCCTGTTGCCACTCTCGACGAGCCAAGCTCATCTCATTCTCC
AATTCTGCCTCCAGTCCTTGAGGAACCTTCACCTTCATTTTCTGAAGGCAAATATCTCCCTGTTTTTGTTGTTAGCATGTCAGTCTTAAATAGTTCA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAATGGTTTCGACGGGAGATCGTTGGCTGAGAAGTTCTCCGGATTGGCCGTCACAGCTGCTCCACCGGAGCAATTTAACTCCCACTCGTCCAACAATCACAGTAA
CAGCAACGACAGCAACTTGTTTCAGGTCTTGAAAGCTGTTGAAGCAGCGGAGGCTACCATCAAGCAACAGGTGGAGGAAAATAATCGTCTGAGGATCGAACTTCAGAAAA
AGATCCAGGAACTGGAGAAATATAAGGTTGATGAATCTTTGGCTCGAAGGTTTCATTCCACAGACCAGTGGAATGAGAATGACCATGGATCAAATGGGGGTCATCAATCA
GATAATTCAGTCGATAATGAAAGGCATAGGTTTAAGAATAACATTTCTGCAGTTGATTCACATGGAACACTAGTTGTCCATCGAGATGTTGAGCAAAAAGATGAAGTTTC
CATGCGTATTGATAAAGAATCTCGTTATGCGGACGGCAAGTCTGACGAAATAGTGAATGCTCTTCCGGGTGTTCAGCCTTCAGTTGATAATGCTGGTTACTCACAGTTCT
CTTCACCATCTACAACATCCTTCTCTGCTAGCTTTCCAGTGGATGGAGAATATGATCCACAGATTAAGTTGTCTGGACATGGCCTGATGTCAAAGGCTGAAGCAAATAAT
CCCAAGAGTCTCTGGAAGCAGGATCTTGTTGTTAAAGTCCAGGAACATGAAGATGAAATTGTGCAGTTACGCAAGCATCTTGCTGATTATTCTATCAAGGAAGCACAAAT
ACGAAATGAAAAATATGTTCTGGAAAAACGTATTGCCTATATGCGTTTGGCCTTTGATCAACAACAACAAGACCTTGTTGATGCTGCTTCTAAAGCTCTCTCATACAGAC
AAGACATAATTGAGGAAAATATACGTCTTACATATGCATTGCAGGAAGCACAGCAAGAGAGAACTACATTTGTATCATCTTTGCTGCCTCTTCTTGCGGAATATTCTCTG
CAGCCTCCTGTTCCTGATGCTCAGTCCATTATCAGCAATGTCAAGATTCTATTTAAGCACTTGCAGGAGAAACTCCTTCTCACTGAGACAAAATTGAAAGAGTCACAGTA
TCAATTAACACCTTGGCGCTCTGATGCAAGCCACTCGAGTTTTGCACAACAGTCACCTTTCCACTCAGTTGGTGCAACCTTAACTGCTTCAATTGGTTTAGGTGTTGATG
TTGCAAAAAACTTGGAGCCAGATGATTTGGGGAGGTATTCACTTCATGCAAGCAGTGAAGCAACAAACAAACAGGTGACATTTCGTGAACCTGTAAGCAACAGTGAGATG
GATGACCCGGATGTTGTACACCAAACAGATAGAGATCCTCTCACCAACTGGAGTTCTCGGCAATCTCCCCCTGTTGCCACTCTCGACGAGCCAAGCTCATCTCATTCTCC
AATTCTGCCTCCAGTCCTTGAGGAACCTTCACCTTCATTTTCTGAAGGCAAATATCTCCCTGTTTTTGTTGTTAGCATGTCAGTCTTAAATAGTTCA
Protein sequenceShow/hide protein sequence
MENGFDGRSLAEKFSGLAVTAAPPEQFNSHSSNNHSNSNDSNLFQVLKAVEAAEATIKQQVEENNRLRIELQKKIQELEKYKVDESLARRFHSTDQWNENDHGSNGGHQS
DNSVDNERHRFKNNISAVDSHGTLVVHRDVEQKDEVSMRIDKESRYADGKSDEIVNALPGVQPSVDNAGYSQFSSPSTTSFSASFPVDGEYDPQIKLSGHGLMSKAEANN
PKSLWKQDLVVKVQEHEDEIVQLRKHLADYSIKEAQIRNEKYVLEKRIAYMRLAFDQQQQDLVDAASKALSYRQDIIEENIRLTYALQEAQQERTTFVSSLLPLLAEYSL
QPPVPDAQSIISNVKILFKHLQEKLLLTETKLKESQYQLTPWRSDASHSSFAQQSPFHSVGATLTASIGLGVDVAKNLEPDDLGRYSLHASSEATNKQVTFREPVSNSEM
DDPDVVHQTDRDPLTNWSSRQSPPVATLDEPSSSHSPILPPVLEEPSPSFSEGKYLPVFVVSMSVLNSS