; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G07970 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G07970
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function (DUF789)
Genome locationClcChr05:6003422..6007215
RNA-Seq ExpressionClc05G07970
SyntenyClc05G07970
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049523.1 uncharacterized protein E6C27_scaffold171G007780 [Cucumis melo var. makuwa]3.1e-21781.3Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQ-KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ+QQ KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGL DSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQ-KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNL

Query:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGY
        DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS A             
Subjt:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGY

Query:  ASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAI
                                                        L++ RGADSDAESSKE SSDGSSNSGAEKKTKT LQ+EWIQDF+ LGSQRA+
Subjt:  ASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAI

Query:  QMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAF
        QMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSC+LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAF
Subjt:  QMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAF

Query:  Q------------GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRENDVDSNITVEMG
        Q            G STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD+WLRENDVDSNITVEMG
Subjt:  Q------------GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRENDVDSNITVEMG

KAG6582192.1 hypothetical protein SDJN03_22194, partial [Cucurbita argyrosperma subsp. sororia]3.5e-21379.11Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ+ QKQSALDSKD VAA++A IDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLD

Query:  RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGYA
        RFLEHTTPLV AHCIPKT LRGWR REV EA PYFVLGDLWES+KEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLY+ PSKSSA              
Subjt:  RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGYA

Query:  SMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAIQ
                                                       L++ RG DSDAESSKE SSDGSSN GAEKKTK  LQDE IQD S+ GSQRA+Q
Subjt:  SMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAIQ

Query:  MNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ
        MN PS+ESSSDESDSCY HGQLVFEY+ERDPPFCREPLTDKITILASRFPELKTYRSC+LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ
Subjt:  MNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ

Query:  GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRENDVDSNITVEMGIGFRIVVILGASIP
        GIS+DGLQF WPRVREVYTADCPLKLQLPIFGLASYKFK+PFWNSTG EECSKA SLWQDA+ WLRENDVDSNI V+MGIGFRIVVILG SIP
Subjt:  GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRENDVDSNITVEMGIGFRIVVILGASIP

TYK16202.1 uncharacterized protein E5676_scaffold209G001310 [Cucumis melo var. makuwa]1.8e-21781.5Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQ-KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ+QQ KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGL DSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQ-KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNL

Query:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGY
        DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALR           
Subjt:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGY

Query:  ASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAI
                                                            RGADSDAESSKE SSDGSSNSGAEKKTKT LQ+EWIQDF+ LGSQRA+
Subjt:  ASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAI

Query:  QMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAF
        QMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSC+LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAF
Subjt:  QMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAF

Query:  Q------------GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRENDVDSNITVEMG
        Q            G STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD+WLRENDVDSNITVEMG
Subjt:  Q------------GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRENDVDSNITVEMG

XP_004134231.3 uncharacterized protein LOC101208769 isoform X1 [Cucumis sativus]3.0e-21282.94Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQ---KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ+QQ   KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQ---KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLI
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSA           
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLI

Query:  GYASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQR
                                                          L++ RGADSDAESSKE SSDGSSNSGAEKKTKT LQ+EWIQDF+V GSQR
Subjt:  GYASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQR

Query:  AIQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLST
        A+QMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRF ELKTYRSC+LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLST
Subjt:  AIQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLST

Query:  AFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLR
        AFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD+WLR
Subjt:  AFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLR

XP_011651067.2 uncharacterized protein LOC101208769 isoform X2 [Cucumis sativus]1.7e-21283.16Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQ---KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ+QQ   KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQ---KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLI
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALR         
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLI

Query:  GYASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQR
                                                              RGADSDAESSKE SSDGSSNSGAEKKTKT LQ+EWIQDF+V GSQR
Subjt:  GYASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQR

Query:  AIQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLST
        A+QMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRF ELKTYRSC+LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLST
Subjt:  AIQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLST

Query:  AFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLR
        AFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD+WLR
Subjt:  AFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLR

TrEMBL top hitse value%identityAlignment
A0A0A0L5V4 Uncharacterized protein8.7e-21081.97Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQ       Q KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLD

Query:  RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGYA
        RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSA              
Subjt:  RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGYA

Query:  SMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAIQ
                                                       L++ RGADSDAESSKE SSDGSSNSGAEKKTKT LQ+EWIQDF+V GSQRA+Q
Subjt:  SMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAIQ

Query:  MNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ
        MNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRF ELKTYRSC+LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ
Subjt:  MNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ

Query:  GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLR
        GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD+WLR
Subjt:  GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLR

A0A1S3AY60 uncharacterized protein LOC103483873 isoform X11.0e-21082.4Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ  Q KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGL DSTNLD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLD

Query:  RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGYA
        RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS A              
Subjt:  RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGYA

Query:  SMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAIQ
                                                       L++ RGADSDAESSKE SSDGSSNSGAEKKTKT LQ+EWIQDF+ LGSQRA+Q
Subjt:  SMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAIQ

Query:  MNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ
        MNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSC+LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA Q
Subjt:  MNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ

Query:  GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLR
        G STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD+WLR
Subjt:  GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLR

A0A1S3AY77 uncharacterized protein LOC103483873 isoform X26.0e-21182.62Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ  Q KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGL DSTNLD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLD

Query:  RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGYA
        RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALR            
Subjt:  RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGYA

Query:  SMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAIQ
                                                           RGADSDAESSKE SSDGSSNSGAEKKTKT LQ+EWIQDF+ LGSQRA+Q
Subjt:  SMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAIQ

Query:  MNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ
        MNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSC+LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA Q
Subjt:  MNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ

Query:  GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLR
        G STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD+WLR
Subjt:  GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLR

A0A5A7U113 Uncharacterized protein1.5e-21781.3Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQ-KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ+QQ KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGL DSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQ-KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNL

Query:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGY
        DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS A             
Subjt:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGY

Query:  ASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAI
                                                        L++ RGADSDAESSKE SSDGSSNSGAEKKTKT LQ+EWIQDF+ LGSQRA+
Subjt:  ASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAI

Query:  QMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAF
        QMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSC+LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAF
Subjt:  QMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAF

Query:  Q------------GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRENDVDSNITVEMG
        Q            G STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD+WLRENDVDSNITVEMG
Subjt:  Q------------GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRENDVDSNITVEMG

A0A5D3CXG0 Uncharacterized protein8.7e-21881.5Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQ-KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ+QQ KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGL DSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQ-KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNL

Query:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGY
        DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALR           
Subjt:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGY

Query:  ASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAI
                                                            RGADSDAESSKE SSDGSSNSGAEKKTKT LQ+EWIQDF+ LGSQRA+
Subjt:  ASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAI

Query:  QMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAF
        QMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSC+LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAF
Subjt:  QMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAF

Query:  Q------------GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRENDVDSNITVEMG
        Q            G STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD+WLRENDVDSNITVEMG
Subjt:  Q------------GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRENDVDSNITVEMG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)3.2e-7141.88Show/hide
Query:  ADSTNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYVDPSKSSALRLFFN
        A S+N++RFL+  TP VPAH + KT +R     +V    PYF+LGD+WESF EWSAYG G+PL LN + D V QYYVP LSGIQ+Y D            
Subjt:  ADSTNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYVDPSKSSALRLFFN

Query:  ELNLIGYASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDG---SSNSGAEKKTKTTLQDEWIQD
                                             L+S LQ   +GE     +       DS +E S   S  G   S    + +  K +L+ E  +D
Subjt:  ELNLIGYASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDG---SSNSGAEKKTKTTLQDEWIQD

Query:  FSVLGSQRAIQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACF
                          SSSD+ +     G+L+FEYLERD P+ REP  DK++ LASRFPELKT RSC+L PSSW SVAWYPIY+IPTGPTL+ LDACF
Subjt:  FSVLGSQRAIQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACF

Query:  LTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRENDVD
        LT+H+L T FQG        H  + RE        K++LP+FGLASYK +   W S G      A+SL+Q ADNWLR   V+
Subjt:  LTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRENDVD

AT2G01260.1 Protein of unknown function (DUF789)6.9e-6641.18Show/hide
Query:  STNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYVDPSKSSALRLFFNE
        S+NLDRFLE  TP VPA  + KT LR  R + + ++  PYFVLGD+W+SF EWSAYG G+PL+LN + D V+QYYVP LS IQ+Y               
Subjt:  STNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYVDPSKSSALRLFFNE

Query:  LNLIGYASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSD-AESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSV
                                            L S L+    G+S           +DSD  +SS + SSD  S   + +    +L+D+  +D   
Subjt:  LNLIGYASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSD-AESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSV

Query:  LGSQRAIQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTF
                       SSSD+ +     G+L+FEYLERD P+ REP  DK+  LA++FPEL T RSC+L  SSW SVAWYPIYRIPTGPTL+ LDACFLT+
Subjt:  LGSQRAIQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTF

Query:  HNLSTAFQGI-STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWL
        H+L T+F G  S   +    PR  E        K+ LP+FGLASYKF+   W   G  E    +SL+Q AD WL
Subjt:  HNLSTAFQGI-STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWL

AT2G01260.2 Protein of unknown function (DUF789)2.1e-5441.75Show/hide
Query:  STNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYVDPSKSSALRLFFNE
        S+NLDRFLE  TP VPA  + KT LR  R + + ++  PYFVLGD+W+SF EWSAYG G+PL+LN + D V+QYYVP LS IQ+Y               
Subjt:  STNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYVDPSKSSALRLFFNE

Query:  LNLIGYASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSD-AESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSV
                                            L S L+    G+S           +DSD  +SS + SSD  S   + +    +L+D+  +D   
Subjt:  LNLIGYASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSD-AESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSV

Query:  LGSQRAIQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTF
                       SSSD+ +     G+L+FEYLERD P+ REP  DK+  LA++FPEL T RSC+L  SSW SVAWYPIYRIPTGPTL+ LDACFLT+
Subjt:  LGSQRAIQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTF

Query:  HNLSTAFQG
        H+L T+F G
Subjt:  HNLSTAFQG

AT4G16100.1 Protein of unknown function (DUF789)7.8e-7841.94Show/hide
Query:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST-------NLDRFL
        RIRGENRFY+PP MR+  Q++++++ + ++ +++K++ +  LD K  V        + ++  + +EC    + SDCSV  R  + +T       NL RFL
Subjt:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST-------NLDRFL

Query:  EHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGYASMQ
        + TTP+V    +P TS +GWR RE  E  PYF+L DLW+SF+EWSAYG G+PLLLNG DSVVQYYVPYLSGIQLY DPS++   R               
Subjt:  EHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGYASMQ

Query:  FCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAIQMNV
          +R+                                            G +SD +S ++ SSDGS++             E  Q+       RA     
Subjt:  FCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAIQMNV

Query:  PSSESSSDESD-SCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGI
        P   SSSDES+ S    G+LVFEYLE   PF REPLTDKI+ L+S+FP L+TYRSC+LSPSSW+SVAWYPIYRIP G +LQ+LDACFLTFH+LST  +G 
Subjt:  PSSESSSDESD-SCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGI

Query:  STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWN-STGAEECSKAHSLWQDADNWLR
        S +  Q     V          KL LP FGLASYKFK+  W+  +  +E  +  +L + A+ WLR
Subjt:  STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWN-STGAEECSKAHSLWQDADNWLR

AT5G49220.1 Protein of unknown function (DUF789)6.0e-7842.97Show/hide
Query:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQKQSALDSKDVVAASTAT--------IDDLEKR---SEFDECRSWSTRSDCS
        MS SGGVSIAR  IRGENRFY+PP M RR+QQ+ Q QQQ +++Q++  + +  +D +   AA+ A         + + + R   S  + C   S  S  S
Subjt:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQKQSALDSKDVVAASTAT--------IDDLEKR---SEFDECRSWSTRSDCS

Query:  VSDRGLADSTNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGI-----PLLLNGSDSVVQYYVPYLSGIQLYVDPS
         S R L+D +NLDRFLEHTTP+VPA   P  S    + RE S+   YFVL DLWESF EWSAYGAG+     PL ++G+DS VQYYVPYLSGIQLYVDP 
Subjt:  VSDRGLADSTNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGI-----PLLLNGSDSVVQYYVPYLSGIQLYVDPS

Query:  KSSALRLFFNELNLIGYASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTT-
        K           N +G                                                                E SS+GSSNS       +  
Subjt:  KSSALRLFFNELNLIGYASMQFCKRMLASINLLFRIVQYYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTT-

Query:  -LQDEWIQDFSVLGSQRAIQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGP
         L    ++D S+ GS             SS E++     G+L+FEYLE +PPF REPL +KI+ LASR PEL TYRSC+L PSSW+SV+WYPIYRIP GP
Subjt:  -LQDEWIQDFSVLGSQRAIQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGP

Query:  TLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRENDVD
        TLQ+LDACFLTFH+LSTA    S  G     P            KL LP FGLASYK K+  WN    +E  K  SL Q AD WL+   VD
Subjt:  TLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRENDVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGTCTCCGGTGGGGTTTCGATTGCCCGAATCCGTGGCGAGAATCGCTTTTACCATCCACCTGCGATGCGGCGTCGTTTGCAGCAGCAACAACAACAGCAGCAGCA
GCAGCAGCAGCAGCAGCAGCAGAAGCAGCAGAAGCAAAGTGCCTTGGATTCTAAGGACGTTGTGGCGGCTTCTACTGCTACGATCGATGACTTGGAGAAGAGGAGTGAGT
TTGATGAGTGTCGTTCTTGGTCCACTCGCTCTGATTGCTCTGTTTCGGATCGTGGACTAGCTGATTCTACTAATTTGGATCGCTTCTTGGAACACACTACTCCCCTTGTT
CCGGCTCATTGTATTCCTAAGACGAGCCTGAGGGGATGGAGAAACCGTGAAGTCTCGGAGGCATCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTTCAAGGAATG
GAGTGCATATGGCGCGGGAATCCCTCTATTGTTAAATGGTAGTGACTCTGTAGTACAGTACTATGTTCCTTATCTGTCCGGCATTCAACTCTATGTTGATCCTTCGAAGT
CCTCTGCCCTAAGGCTGTTTTTCAATGAATTAAATTTGATAGGTTATGCATCTATGCAGTTTTGTAAAAGAATGCTGGCTTCAATAAATTTATTGTTCAGGATCGTGCAA
TATTACAAAAAATGCGGAGGTGGGGGATTGTCCTCGGATCTTCAATTGTGTTTAAAGGGGGAAAGTGGAGTACTGTTTTATCTTGCGCAATGGCGTGGCGCAGATAGTGA
TGCCGAGTCCTCGAAGGAAGCAAGCAGTGATGGAAGCAGTAATTCCGGGGCAGAAAAGAAAACGAAGACTACCCTTCAGGATGAGTGGATCCAGGACTTTAGTGTCCTGG
GGTCACAAAGAGCTATTCAAATGAATGTACCCTCTTCCGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCGTCATGGTCAGCTTGTGTTTGAATACTTGGAGCGCGAT
CCACCATTTTGTCGTGAACCATTAACTGATAAGATCACTATCCTTGCATCTCGTTTTCCTGAATTAAAGACATATAGGAGTTGTGAGCTATCTCCTTCCAGTTGGATTTC
TGTGGCATGGTATCCCATTTATCGGATTCCCACAGGGCCAACTTTACAAAGTCTAGATGCTTGCTTCTTGACCTTTCATAATCTGTCAACAGCATTTCAAGGCATCAGCA
CTGATGGGTTGCAATTCCATTGGCCAAGAGTTAGAGAGGTGTACACTGCGGATTGCCCTCTCAAACTGCAGTTGCCAATATTTGGACTTGCTTCCTATAAGTTCAAAATT
CCTTTTTGGAACTCGACTGGTGCAGAGGAATGTTCGAAGGCTCACTCTTTGTGGCAAGATGCTGACAACTGGCTCAGGGAGAATGATGTTGATAGTAACATTACGGTTGA
AATGGGGATTGGCTTTAGGATAGTTGTTATTCTGGGAGCGTCAATCCCCATGGTATTCTATTAG
mRNA sequenceShow/hide mRNA sequence
TTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTCACTGTATATATATACACAAACATCCACCCGAATTCCTACCCTGCCTCCGCCTTGTTCTTTGTTTCT
TGCAATGTCAGTCTCCGGTGGGGTTTCGATTGCCCGAATCCGTGGCGAGAATCGCTTTTACCATCCACCTGCGATGCGGCGTCGTTTGCAGCAGCAACAACAACAGCAGC
AGCAGCAGCAGCAGCAGCAGCAGCAGAAGCAGCAGAAGCAAAGTGCCTTGGATTCTAAGGACGTTGTGGCGGCTTCTACTGCTACGATCGATGACTTGGAGAAGAGGAGT
GAGTTTGATGAGTGTCGTTCTTGGTCCACTCGCTCTGATTGCTCTGTTTCGGATCGTGGACTAGCTGATTCTACTAATTTGGATCGCTTCTTGGAACACACTACTCCCCT
TGTTCCGGCTCATTGTATTCCTAAGACGAGCCTGAGGGGATGGAGAAACCGTGAAGTCTCGGAGGCATCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTTCAAGG
AATGGAGTGCATATGGCGCGGGAATCCCTCTATTGTTAAATGGTAGTGACTCTGTAGTACAGTACTATGTTCCTTATCTGTCCGGCATTCAACTCTATGTTGATCCTTCG
AAGTCCTCTGCCCTAAGGCTGTTTTTCAATGAATTAAATTTGATAGGTTATGCATCTATGCAGTTTTGTAAAAGAATGCTGGCTTCAATAAATTTATTGTTCAGGATCGT
GCAATATTACAAAAAATGCGGAGGTGGGGGATTGTCCTCGGATCTTCAATTGTGTTTAAAGGGGGAAAGTGGAGTACTGTTTTATCTTGCGCAATGGCGTGGCGCAGATA
GTGATGCCGAGTCCTCGAAGGAAGCAAGCAGTGATGGAAGCAGTAATTCCGGGGCAGAAAAGAAAACGAAGACTACCCTTCAGGATGAGTGGATCCAGGACTTTAGTGTC
CTGGGGTCACAAAGAGCTATTCAAATGAATGTACCCTCTTCCGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCGTCATGGTCAGCTTGTGTTTGAATACTTGGAGCG
CGATCCACCATTTTGTCGTGAACCATTAACTGATAAGATCACTATCCTTGCATCTCGTTTTCCTGAATTAAAGACATATAGGAGTTGTGAGCTATCTCCTTCCAGTTGGA
TTTCTGTGGCATGGTATCCCATTTATCGGATTCCCACAGGGCCAACTTTACAAAGTCTAGATGCTTGCTTCTTGACCTTTCATAATCTGTCAACAGCATTTCAAGGCATC
AGCACTGATGGGTTGCAATTCCATTGGCCAAGAGTTAGAGAGGTGTACACTGCGGATTGCCCTCTCAAACTGCAGTTGCCAATATTTGGACTTGCTTCCTATAAGTTCAA
AATTCCTTTTTGGAACTCGACTGGTGCAGAGGAATGTTCGAAGGCTCACTCTTTGTGGCAAGATGCTGACAACTGGCTCAGGGAGAATGATGTTGATAGTAACATTACGG
TTGAAATGGGGATTGGCTTTAGGATAGTTGTTATTCTGGGAGCGTCAATCCCCATGGTATTCTATTAG
Protein sequenceShow/hide protein sequence
MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQKQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLV
PAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRLFFNELNLIGYASMQFCKRMLASINLLFRIVQ
YYKKCGGGGLSSDLQLCLKGESGVLFYLAQWRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRAIQMNVPSSESSSDESDSCYRHGQLVFEYLERD
PPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKI
PFWNSTGAEECSKAHSLWQDADNWLRENDVDSNITVEMGIGFRIVVILGASIPMVFY