; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg19643 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg19643
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProtein of unknown function (DUF789)
Genome locationCarg_Chr14:11928650..11932063
RNA-Seq ExpressionCarg19643
SyntenyCarg19643
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582192.1 hypothetical protein SDJN03_22194, partial [Cucurbita argyrosperma subsp. sororia]3.8e-22293.53Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
        MSVSGGVSIARIRGENRFYHPPAMRRRL     QQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD

Query:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
        STNLDRFLEHTTPLVSAHCIPK                       TRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
Subjt:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL

Query:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
        SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
Subjt:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF

Query:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
        CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
Subjt:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL

Query:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLR
        ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLR
Subjt:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLR

KAG7018589.1 hypothetical protein SDJN02_20459, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-255100Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD

Query:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
        STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
Subjt:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL

Query:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
        SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
Subjt:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF

Query:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
        CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
Subjt:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL

Query:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR
        ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR
Subjt:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR

XP_022955563.1 uncharacterized protein LOC111457542 isoform X1 [Cucurbita moschata]6.6e-23593.82Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
        MSVSGGVSIARIRGENRFYHPPAMRRRL     QQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD

Query:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
        STNLDRFLEHTTPLVSAHCIPK                       TRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
Subjt:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL

Query:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
        SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
Subjt:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF

Query:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
        CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
Subjt:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL

Query:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR
        ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR
Subjt:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR

XP_022979801.1 uncharacterized protein LOC111479394 isoform X1 [Cucurbita maxima]3.3e-22690.73Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ Q          QKQSALDSKD VAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD

Query:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
        STNLDRFLE TTPLV AHCIPK                       TRLRGWRTREV EA PYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
Subjt:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL

Query:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
        SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQD SISGSQRALQMNTPSAESSSDESDSCY HGQLVFEYMERDPPF
Subjt:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF

Query:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
        CREPLTDKI ILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
Subjt:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL

Query:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR
        ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR
Subjt:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR

XP_023526716.1 uncharacterized protein LOC111790123 isoform X1 [Cucurbita pepo subsp. pepo]8.7e-22791.39Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQ      QKQSALDSKD VAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD

Query:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
        STNLDRFLE+TTPLV AHCIPK                       TRL+GWRTREV EA P FVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
Subjt:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL

Query:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
        SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
Subjt:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF

Query:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
        CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA QGI SDGLQFQWPRVREVYTADCPLKLQLPIFGL
Subjt:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL

Query:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR
        ASYKFKLPFWNSTGTEECSKAQSLWQDAE WLRLLNVNHPDYRFFSSHSSFGR
Subjt:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR

TrEMBL top hitse value%identityAlignment
A0A0A0L5V4 Uncharacterized protein1.8e-20983.66Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ             KQSALDSKD VAAA++ IDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD

Query:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
        STNLDRFLEHTTPLV AHCIPK                       T LRGWR REV EA PYFVLGDLWES+KEWSAYGAGIPLLLNGSDSVVQYYVPYL
Subjt:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL

Query:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
        SGIQLY+ PSKSSALSRRRG DSDAESSKETSSDGSSN GAEKKTK  LQ+E IQD ++ GSQRALQMN PS+ESSSDESDSCY HGQLVFEY+ERDPPF
Subjt:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF

Query:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
        CREPLTDKIT+LASRF ELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGIS+DGLQF WPRVREVYTADCPLKLQLPIFGL
Subjt:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL

Query:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR
        ASYKFK+PFWNSTG EECSKA SLWQDA+ WLRLLNVNHPDYRFF+SH+SF R
Subjt:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR

A0A1S3AY60 uncharacterized protein LOC103483873 isoform X15.2e-20984.11Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ        KQSALDSKD VAAA++ IDDLEKRSEFDECRSWSTRSDCSVSDRGL D
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD

Query:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
        STNLDRFLEHTTPLV AHCIPK                       T LRGWR REV EA PYFVLGDLWES+KEWSAYGAGIPLLLNGSDSVVQYYVPYL
Subjt:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL

Query:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
        SGIQLY+ PSKS ALSRRRG DSDAESSKETSSDGSSN GAEKKTK  LQ+E IQD +  GSQRALQMN PS+ESSSDESDSCY HGQLVFEY+ERDPPF
Subjt:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF

Query:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
        CREPLTDKIT+LASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA QG S+DGLQF WPRVREVYTADCPLKLQLPIFGL
Subjt:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL

Query:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR
        ASYKFK+PFWNSTG EECSKA SLWQDA+ WLRLLNVNHPDYRFF+SH+SF R
Subjt:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR

A0A1S3AY77 uncharacterized protein LOC103483873 isoform X22.8e-20783.89Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ        KQSALDSKD VAAA++ IDDLEKRSEFDECRSWSTRSDCSVSDRGL D
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD

Query:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
        STNLDRFLEHTTPLV AHCIPK                       T LRGWR REV EA PYFVLGDLWES+KEWSAYGAGIPLLLNGSDSVVQYYVPYL
Subjt:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL

Query:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
        SGIQLY+ PSKS AL RRRG DSDAESSKETSSDGSSN GAEKKTK  LQ+E IQD +  GSQRALQMN PS+ESSSDESDSCY HGQLVFEY+ERDPPF
Subjt:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF

Query:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
        CREPLTDKIT+LASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA QG S+DGLQF WPRVREVYTADCPLKLQLPIFGL
Subjt:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL

Query:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR
        ASYKFK+PFWNSTG EECSKA SLWQDA+ WLRLLNVNHPDYRFF+SH+SF R
Subjt:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR

A0A6J1GUB7 uncharacterized protein LOC111457542 isoform X13.2e-23593.82Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
        MSVSGGVSIARIRGENRFYHPPAMRRRL     QQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD

Query:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
        STNLDRFLEHTTPLVSAHCIPK                       TRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
Subjt:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL

Query:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
        SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
Subjt:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF

Query:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
        CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
Subjt:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL

Query:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR
        ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR
Subjt:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR

A0A6J1IRT8 uncharacterized protein LOC111479394 isoform X11.6e-22690.73Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ Q          QKQSALDSKD VAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLAD

Query:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
        STNLDRFLE TTPLV AHCIPK                       TRLRGWRTREV EA PYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL
Subjt:  STNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYL

Query:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF
        SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQD SISGSQRALQMNTPSAESSSDESDSCY HGQLVFEYMERDPPF
Subjt:  SGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPF

Query:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
        CREPLTDKI ILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL
Subjt:  CREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGL

Query:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR
        ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR
Subjt:  ASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHSSFGR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)1.5e-7545.89Show/hide
Query:  ADSTNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGS-DSVVQYYV
        A S+N++RFL+  TP V AH + K +                      R RG    +V   +PYF+LGD+WES+ EWSAYG G+PL LN + D V QYYV
Subjt:  ADSTNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGS-DSVVQYYV

Query:  PYLSGIQLY--IYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAE-SSSDESDSCYHHGQLVFEYM
        P LSGIQ+Y  +    SS  +RR+G +S+++  +++SS+GSS+     +++  L   C     IS     L +     E SSSD+ +     G+L+FEY+
Subjt:  PYLSGIQLY--IYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAE-SSSDESDSCYHHGQLVFEYM

Query:  ERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ--GISSDGLQFQWPRVREVYTADCPLK
        ERD P+ REP  DK++ LASRFPELKT RSCDL PSSW SVAWYPIY+IPTGPTL+ LDACFLT+H+L T FQ  G+++  +    PR       +   K
Subjt:  ERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ--GISSDGLQFQWPRVREVYTADCPLK

Query:  LQLPIFGLASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFF
        ++LP+FGLASYK +   W S G      A SL+Q A+ WLRL  VNHPD+ FF
Subjt:  LQLPIFGLASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFF

AT2G01260.1 Protein of unknown function (DUF789)1.1e-7041.82Show/hide
Query:  ASARIDDLEK-RSEFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRT-REVPEALPYF
        A+ RID L + +S+     S +        +     S+NLDRFLE  TP V A  + K L                       LR  R   +  + +PYF
Subjt:  ASARIDDLEK-RSEFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRT-REVPEALPYF

Query:  VLGDLWESYKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYIYP-SKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISG
        VLGD+W+S+ EWSAYG G+PL+LN + D V+QYYVP LS IQ+Y +  +  S+L  RR  DS     +++SSD SS+  +E   + + + +CI       
Subjt:  VLGDLWESYKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYIYP-SKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISG

Query:  SQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHN
               +    +SSSD+ +     G+L+FEY+ERD P+ REP  DK+  LA++FPEL T RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+H+
Subjt:  SQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHN

Query:  LSTAFQGISSD-GLQFQWPRVREVYTADCPLKLQLPIFGLASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFF
        L T+F G  S+  +    PR  E        K+ LP+FGLASYKF+   W   G  E     SL+Q A+ WL   +V+HPD+ FF
Subjt:  LSTAFQGISSD-GLQFQWPRVREVYTADCPLKLQLPIFGLASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFF

AT2G01260.2 Protein of unknown function (DUF789)3.8e-5542.67Show/hide
Query:  ASARIDDLEK-RSEFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRT-REVPEALPYF
        A+ RID L + +S+     S +        +     S+NLDRFLE  TP V A  + K L                       LR  R   +  + +PYF
Subjt:  ASARIDDLEK-RSEFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRT-REVPEALPYF

Query:  VLGDLWESYKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYIYP-SKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISG
        VLGD+W+S+ EWSAYG G+PL+LN + D V+QYYVP LS IQ+Y +  +  S+L  RR  DS     +++SSD SS+  +E   + + + +CI       
Subjt:  VLGDLWESYKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYIYP-SKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISG

Query:  SQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHN
               +    +SSSD+ +     G+L+FEY+ERD P+ REP  DK+  LA++FPEL T RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+H+
Subjt:  SQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHN

Query:  LSTAFQG
        L T+F G
Subjt:  LSTAFQG

AT4G16100.1 Protein of unknown function (DUF789)2.1e-8546.21Show/hide
Query:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST-------N
        RIRGENRFY+PP MR+      QQ++++++ + ++ ++++ + +  LD K        ++++ E +   +EC    + SDCSV  R  + +T       N
Subjt:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST-------N

Query:  LDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGI
        L RFL+ TTP+VS   +P                        T  +GWRTRE PE  PYF+L DLW+S++EWSAYG G+PLLLNG DSVVQYYVPYLSGI
Subjt:  LDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGI

Query:  QLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESD-SCYHHGQLVFEYMERDPPFCR
        QLY  PS++    RR G +SD +S ++ SSDGS++C   ++   NL              RA     P   SSSDES+ S    G+LVFEY+E   PF R
Subjt:  QLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESD-SCYHHGQLVFEYMERDPPFCR

Query:  EPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGLAS
        EPLTDKI+ L+S+FP L+TYRSCDLSPSSW+SVAWYPIYRIP G +LQ+LDACFLTFH+LST  +G S++  Q     V          KL LP FGLAS
Subjt:  EPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGLAS

Query:  YKFKLPFWN-STGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHS
        YKFKL  W+  +  +E  +  +L + AE WLR L V  PD+R F SHS
Subjt:  YKFKLPFWN-STGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHS

AT5G49220.1 Protein of unknown function (DUF789)2.2e-7946.2Show/hide
Query:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEF-----DECRSWSTRSDCSV
        MS SGGVSIAR  IRGENRFY+PP MRR  Q+ Q QQQ +++Q++  + +    K+    +  A       +   E +S       + C   S  S  S 
Subjt:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEF-----DECRSWSTRSDCSV

Query:  SDRGLADSTNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGI-----PLLLNG
        S R L+D +NLDRFLEHTTP+V A                      LF P ++R    +TRE  +   YFVL DLWES+ EWSAYGAG+     PL ++G
Subjt:  SDRGLADSTNLDRFLEHTTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGI-----PLLLNG

Query:  SDSVVQYYVPYLSGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQ
        +DS VQYYVPYLSGIQLY+ P     L + R    D E S E SS  +S       +   L    ++D SI+GS             SS E++     G+
Subjt:  SDSVVQYYVPYLSGIQLYIYPSKSSALSRRRGTDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQ

Query:  LVFEYMERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTAD
        L+FEY+E +PPF REPL +KI+ LASR PEL TYRSCDL PSSW+SV+WYPIYRIP GPTLQ+LDACFLTFH+LSTA    S+ G     P         
Subjt:  LVFEYMERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTAD

Query:  CPLKLQLPIFGLASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHS
           KL LP FGLASYK K+  WN    +E  K  SL Q A+ WL+ L V+HPDYRFF+S+S
Subjt:  CPLKLQLPIFGLASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHPDYRFFSSHS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGTCTCCGGTGGGGTTTCCATTGCCCGAATCCGTGGCGAGAATCGTTTCTATCATCCCCCTGCGATGCGGCGTCGTTTGCAGCAGCAGCAGCAGCAGCAGCAGCA
GCAACAACAACAACAACAGCAGCAACAACAGCAGCAACAGCACCAGAAGCAGAGCGCTTTGGATTCTAAGGACGCTGTGGCGGCTGCTTCTGCTAGGATCGATGACTTGG
AGAAGAGGAGTGAGTTTGATGAGTGTCGTTCTTGGTCCACTCGCTCTGATTGCTCTGTTTCGGATCGTGGACTTGCTGATTCTACTAATTTGGATCGCTTCTTGGAGCAC
ACTACTCCTCTTGTTTCGGCTCATTGTATTCCGAAGAACCTGACCTTACTTCAATTTTTCAAATCATTAATTACTCCTGTAATTGGCCTTTTCGGCCCTCAACAGACGAG
GCTGAGGGGATGGAGAACTCGTGAAGTCCCAGAGGCACTTCCTTATTTTGTGCTCGGGGATCTTTGGGAATCTTACAAGGAATGGAGTGCGTACGGTGCTGGTATCCCTC
TATTGTTAAATGGTAGTGACTCAGTAGTACAGTACTATGTTCCATATCTGTCCGGCATTCAACTCTATATCTATCCATCAAAGTCCTCTGCCCTAAGTAGAAGGCGTGGT
ACAGATAGTGATGCTGAGTCTTCAAAGGAAACAAGTAGTGATGGAAGCAGTAATTGTGGGGCAGAAAAAAAAACCAAGGCCAATCTTCAGGATGAGTGTATTCAGGACTC
AAGTATTTCAGGGTCACAGAGAGCTCTTCAAATGAATACACCTTCTGCCGAGTCATCGAGTGATGAAAGTGACTCTTGCTACCATCATGGTCAGCTTGTATTTGAATACA
TGGAGCGTGATCCACCATTTTGTCGCGAACCATTAACCGATAAGATCACTATCCTTGCATCCCGTTTTCCTGAATTAAAGACATATAGGAGCTGTGATTTATCTCCTTCC
AGTTGGATTTCGGTGGCATGGTATCCCATTTATCGGATTCCCACGGGTCCAACTCTACAAAGTCTTGATGCTTGTTTCTTGACCTTCCACAATCTGTCAACAGCATTTCA
AGGCATCAGCTCTGATGGTTTGCAATTCCAATGGCCAAGAGTTAGAGAGGTGTACACTGCGGATTGCCCTCTCAAACTGCAGTTGCCAATATTTGGACTTGCTTCCTATA
AGTTCAAACTTCCTTTTTGGAATTCGACTGGTACCGAGGAATGTTCAAAGGCTCAATCTTTGTGGCAAGATGCTGAATACTGGCTTAGGTTATTAAACGTGAACCATCCT
GATTACAGATTTTTCTCATCTCATAGCTCATTCGGGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAGTCTCCGGTGGGGTTTCCATTGCCCGAATCCGTGGCGAGAATCGTTTCTATCATCCCCCTGCGATGCGGCGTCGTTTGCAGCAGCAGCAGCAGCAGCAGCAGCA
GCAACAACAACAACAACAGCAGCAACAACAGCAGCAACAGCACCAGAAGCAGAGCGCTTTGGATTCTAAGGACGCTGTGGCGGCTGCTTCTGCTAGGATCGATGACTTGG
AGAAGAGGAGTGAGTTTGATGAGTGTCGTTCTTGGTCCACTCGCTCTGATTGCTCTGTTTCGGATCGTGGACTTGCTGATTCTACTAATTTGGATCGCTTCTTGGAGCAC
ACTACTCCTCTTGTTTCGGCTCATTGTATTCCGAAGAACCTGACCTTACTTCAATTTTTCAAATCATTAATTACTCCTGTAATTGGCCTTTTCGGCCCTCAACAGACGAG
GCTGAGGGGATGGAGAACTCGTGAAGTCCCAGAGGCACTTCCTTATTTTGTGCTCGGGGATCTTTGGGAATCTTACAAGGAATGGAGTGCGTACGGTGCTGGTATCCCTC
TATTGTTAAATGGTAGTGACTCAGTAGTACAGTACTATGTTCCATATCTGTCCGGCATTCAACTCTATATCTATCCATCAAAGTCCTCTGCCCTAAGTAGAAGGCGTGGT
ACAGATAGTGATGCTGAGTCTTCAAAGGAAACAAGTAGTGATGGAAGCAGTAATTGTGGGGCAGAAAAAAAAACCAAGGCCAATCTTCAGGATGAGTGTATTCAGGACTC
AAGTATTTCAGGGTCACAGAGAGCTCTTCAAATGAATACACCTTCTGCCGAGTCATCGAGTGATGAAAGTGACTCTTGCTACCATCATGGTCAGCTTGTATTTGAATACA
TGGAGCGTGATCCACCATTTTGTCGCGAACCATTAACCGATAAGATCACTATCCTTGCATCCCGTTTTCCTGAATTAAAGACATATAGGAGCTGTGATTTATCTCCTTCC
AGTTGGATTTCGGTGGCATGGTATCCCATTTATCGGATTCCCACGGGTCCAACTCTACAAAGTCTTGATGCTTGTTTCTTGACCTTCCACAATCTGTCAACAGCATTTCA
AGGCATCAGCTCTGATGGTTTGCAATTCCAATGGCCAAGAGTTAGAGAGGTGTACACTGCGGATTGCCCTCTCAAACTGCAGTTGCCAATATTTGGACTTGCTTCCTATA
AGTTCAAACTTCCTTTTTGGAATTCGACTGGTACCGAGGAATGTTCAAAGGCTCAATCTTTGTGGCAAGATGCTGAATACTGGCTTAGGTTATTAAACGTGAACCATCCT
GATTACAGATTTTTCTCATCTCATAGCTCATTCGGGAGATGA
Protein sequenceShow/hide protein sequence
MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQHQKQSALDSKDAVAAASARIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRFLEH
TTPLVSAHCIPKNLTLLQFFKSLITPVIGLFGPQQTRLRGWRTREVPEALPYFVLGDLWESYKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIYPSKSSALSRRRG
TDSDAESSKETSSDGSSNCGAEKKTKANLQDECIQDSSISGSQRALQMNTPSAESSSDESDSCYHHGQLVFEYMERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS
SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISSDGLQFQWPRVREVYTADCPLKLQLPIFGLASYKFKLPFWNSTGTEECSKAQSLWQDAEYWLRLLNVNHP
DYRFFSSHSSFGR