CuGenDBv2

Cucsat.G1295 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G1295
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Protein of unknown function (DUF789)
Genome location: ctg1:10999973..11001857
RNA-Seq Expression: Cucsat.G1295
Synteny: Cucsat.G1295
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR008507 - Protein of unknown function DUF789
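
The genome location field uses a `contig:start..end` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the string can be split into its parts and the genomic span computed. The names `Location` and `parse_location` are hypothetical helpers for illustration, not part of CuGenDBv2.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval parsed from a 'contig:start..end' string."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so add 1.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location such as 'ctg1:10999973..11001857'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(contig, start, end)


loc = parse_location("ctg1:10999973..11001857")
print(loc.contig, loc.start, loc.end, loc.span)
```

The span is the length of the gene region on the contig, which can differ from the length of the CDS listed under Sequences.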


Homology
GenBank top hits | e value | % identity | Alignment
XP_004134231.3 uncharacterized protein LOC101208769 isoform X1 [Cucumis sativus] | e value: 1.60e-310 | % identity: 100
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC
        ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC
Subjt:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW

Query:  QDADSWLRLLNVNHPDYRFFASHNSFWR
        QDADSWLRLLNVNHPDYRFFASHNSFWR
Subjt:  QDADSWLRLLNVNHPDYRFFASHNSFWR

XP_008438916.1 PREDICTED: uncharacterized protein LOC103483873 isoform X1 [Cucumis melo] | e value: 1.07e-300 | % identity: 97.2
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ     PKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGL DST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALSRRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC
        ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFN  GSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRF ELKTYRSC
Subjt:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA QG STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW

Query:  QDADSWLRLLNVNHPDYRFFASHNSFWR
        QDADSWLRLLNVNHPDYRFFASHNSFWR
Subjt:  QDADSWLRLLNVNHPDYRFFASHNSFWR

XP_008438917.1 PREDICTED: uncharacterized protein LOC103483873 isoform X2 [Cucumis melo] | e value: 1.99e-298 | % identity: 96.96
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ     PKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGL DST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS AL RRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC
        ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFN  GSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRF ELKTYRSC
Subjt:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA QG STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW

Query:  QDADSWLRLLNVNHPDYRFFASHNSFWR
        QDADSWLRLLNVNHPDYRFFASHNSFWR
Subjt:  QDADSWLRLLNVNHPDYRFFASHNSFWR

XP_011651067.2 uncharacterized protein LOC101208769 isoform X2 [Cucumis sativus] | e value: 2.97e-308 | % identity: 99.77
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSAL RRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC
        ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC
Subjt:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW

Query:  QDADSWLRLLNVNHPDYRFFASHNSFWR
        QDADSWLRLLNVNHPDYRFFASHNSFWR
Subjt:  QDADSWLRLLNVNHPDYRFFASHNSFWR

XP_038877692.1 uncharacterized protein LOC120069924 [Benincasa hispida] | e value: 4.66e-298 | % identity: 96.03
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQ   KQS LDSKDV+ A+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREV EASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSS+LSRRRG DSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC
         SSKETSSDGSSNSGAEKKTKTALQ+EWIQDF+VPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRF ELKTYRSC
Subjt:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW

Query:  QDADSWLRLLNVNHPDYRFFASHNSFWR
        QDAD+WLRLLNVNHPDYRFFASHNSFWR
Subjt:  QDADSWLRLLNVNHPDYRFFASHNSFWR

TrEMBL top hits | e value | % identity | Alignment
A0A0A0L5V4 Uncharacterized protein | e value: 9.39e-305 | % identity: 97.66
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ          PKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC
        ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC
Subjt:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW

Query:  QDADSWLRLLNVNHPDYRFFASHNSFWR
        QDADSWLRLLNVNHPDYRFFASHNSFWR
Subjt:  QDADSWLRLLNVNHPDYRFFASHNSFWR

A0A1S3AY60 uncharacterized protein LOC103483873 isoform X1 | e value: 5.17e-301 | % identity: 97.2
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ     PKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGL DST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALSRRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC
        ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFN  GSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRF ELKTYRSC
Subjt:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA QG STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW

Query:  QDADSWLRLLNVNHPDYRFFASHNSFWR
        QDADSWLRLLNVNHPDYRFFASHNSFWR
Subjt:  QDADSWLRLLNVNHPDYRFFASHNSFWR

A0A1S3AY77 uncharacterized protein LOC103483873 isoform X2 | e value: 9.61e-299 | % identity: 96.96
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ     PKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGL DST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS AL RRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC
        ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFN  GSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRF ELKTYRSC
Subjt:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA QG STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW

Query:  QDADSWLRLLNVNHPDYRFFASHNSFWR
        QDADSWLRLLNVNHPDYRFFASHNSFWR
Subjt:  QDADSWLRLLNVNHPDYRFFASHNSFWR

A0A5A7U113 Uncharacterized protein | e value: 2.32e-282 | % identity: 94.35
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQ  PKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGL DST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALSRRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC
        ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFN  GSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRF ELKTYRSC
Subjt:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ------------GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNST
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ            G STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNST
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ------------GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNST

Query:  GAEECSKAHSLWQDADSWLRLLNVN
        GAEECSKAHSLWQDADSWLR  +V+
Subjt:  GAEECSKAHSLWQDADSWLRLLNVN

A0A5D3CXG0 Uncharacterized protein | e value: 4.31e-280 | % identity: 94.12
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQ  PKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGL DST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS AL RRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC
        ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFN  GSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRF ELKTYRSC
Subjt:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ------------GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNST
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ            G STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNST
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ------------GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNST

Query:  GAEECSKAHSLWQDADSWLRLLNVN
        GAEECSKAHSLWQDADSWLR  +V+
Subjt:  GAEECSKAHSLWQDADSWLRLLNVN

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G15030.1 Protein of unknown function (DUF789) | e value: 1.0e-81 | % identity: 50.3
Query:  ADSTNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLY--VDPSKSSALSRR
        A S+N++RFL+  TP VPAH + KT +R     +V    PYF+LGD+WESF EWSAYG G+PL LN + D V QYYVP LSGIQ+Y  VD   SS  +RR
Subjt:  ADSTNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLY--VDPSKSSALSRR

Query:  RGADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNV---PSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASR
        +G +S+++  +++SS+GSS   +E +       E I       S R  ++++      +SSSD+ +     G+L+FEYLERD P+ REP  DK++ LASR
Subjt:  RGADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNV---PSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASR

Query:  FSELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGA
        F ELKT RSCDL PSSW SVAWYPIY+IPTGPTL+ LDACFLT+H+L T FQG        H  + RE        K++LP+FGLASYK +   W S G 
Subjt:  FSELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGA

Query:  EECSKAHSLWQDADSWLRLLNVNHPDYRFF
             A+SL+Q AD+WLRL  VNHPD+ FF
Subjt:  EECSKAHSLWQDADSWLRLLNVNHPDYRFF

AT2G01260.1 Protein of unknown function (DUF789) | e value: 3.7e-76 | % identity: 47.88
Query:  STNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRG
        S+NLDRFLE  TP VPA  + KT LR  R + + ++  PYFVLGD+W+SF EWSAYG G+PL+LN + D V+QYYVP LS IQ+Y    +  S+L  RR 
Subjt:  STNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRG

Query:  ADSDAESSKETSSDGSSNSGAEKKTK----TALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRF
         DS     +++SSD SS+S +E+ +      +L+++  +D                  SSSD+ +     G+L+FEYLERD P+ REP  DK+  LA++F
Subjt:  ADSDAESSKETSSDGSSNSGAEKKTK----TALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRF

Query:  SELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGI-STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGA
         EL T RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+H+L T+F G  S   +    PR  E        K+ LP+FGLASYKF+   W   G 
Subjt:  SELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGI-STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGA

Query:  EECSKAHSLWQDADSWLRLLNVNHPDYRFF
         E    +SL+Q AD WL   +V+HPD+ FF
Subjt:  EECSKAHSLWQDADSWLRLLNVNHPDYRFF

AT2G01260.2 Protein of unknown function (DUF789) | e value: 4.9e-60 | % identity: 50.4
Query:  STNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRG
        S+NLDRFLE  TP VPA  + KT LR  R + + ++  PYFVLGD+W+SF EWSAYG G+PL+LN + D V+QYYVP LS IQ+Y    +  S+L  RR 
Subjt:  STNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRG

Query:  ADSDAESSKETSSDGSSNSGAEKKTK----TALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRF
         DS     +++SSD SS+S +E+ +      +L+++  +D                  SSSD+ +     G+L+FEYLERD P+ REP  DK+  LA++F
Subjt:  ADSDAESSKETSSDGSSNSGAEKKTK----TALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRF

Query:  SELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQG
         EL T RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+H+L T+F G
Subjt:  SELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQG

AT4G16100.1 Protein of unknown function (DUF789) | e value: 3.2e-88 | % identity: 48.13
Query:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST-------NLD
        RIRGENRFY+PP MR+    QQ++++++ + ++ +++++  +  LD K  V        + ++  + +EC    + SDCSV  R  + +T       NL 
Subjt:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST-------NLD

Query:  RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESS
        RFL+ TTP+V    +P TS +GWR RE  E  PYF+L DLW+SF+EWSAYG G+PLLLNG DSVVQYYVPYLSGIQLY DPS++    RR G +SD +S 
Subjt:  RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESS

Query:  KETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESD-SCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSCDL
        ++ SSDGS++       +   QN +          RA     P   SSSDES+ S    G+LVFEYLE   PF REPLTDKI+ L+S+F  L+TYRSCDL
Subjt:  KETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESD-SCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSCDL

Query:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWN-STGAEECSKAHSLWQ
        SPSSW+SVAWYPIYRIP G +LQ+LDACFLTFH+LST  +G S +  Q     V          KL LP FGLASYKFK+  W+  +  +E  +  +L +
Subjt:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWN-STGAEECSKAHSLWQ

Query:  DADSWLRLLNVNHPDYRFFASHN-SFWR
         A+ WLR L V  PD+R F SH+ S WR
Subjt:  DADSWLRLLNVNHPDYRFFASHN-SFWR

AT5G49220.1 Protein of unknown function (DUF789) | e value: 4.2e-88 | % identity: 49.21
Query:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSV-------
        MS SGGVSIAR  IRGENRFY+PP MRR      QQ+ Q QQQ +++Q++  +   L  K+   AAT       K     E +S    S   V       
Subjt:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSV-------

Query:  ---SDRGLADSTNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGI-----PLLLNGSDSVVQYYVPYLSGIQLYVD
           S R L+D +NLDRFLEHTTP+VPA   P  S    + RE S+   YFVL DLWESF EWSAYGAG+     PL ++G+DS VQYYVPYLSGIQLYVD
Subjt:  ---SDRGLADSTNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGI-----PLLLNGSDSVVQYYVPYLSGIQLYVD

Query:  PSKSSALSRRRGADSDAESSKETSSDGSSNSGAEKKTKTA--LQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLT
        P     L + R    D     E SS+GSSNS       +   L    ++D ++ GS             SS E++     G+L+FEYLE +PPF REPL 
Subjt:  PSKSSALSRRRGADSDAESSKETSSDGSSNSGAEKKTKTA--LQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLT

Query:  DKITVLASRFSELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFK
        +KI+ LASR  EL TYRSCDL PSSW+SV+WYPIYRIP GPTLQ+LDACFLTFH+LSTA    S  G     P            KL LP FGLASYK K
Subjt:  DKITVLASRFSELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFK

Query:  IPFWNSTGAEECSKAHSLWQDADSWLRLLNVNHPDYRFFASHN
        +  WN    +E  K  SL Q AD WL+ L V+HPDYRFF S++
Subjt:  IPFWNSTGAEECSKAHSLWQDADSWLRLLNVNHPDYRFFASHN


Sequences
CDS sequence
ATGTCAGTCTCCGGTGGTGTTTCGATTGCCCGAATCCGTGGCGAGAATCGCTTCTACCATCCACCTGCGATGCGACGTCGTTTGCAGCAGCAGCAGCAGCAGCAGCAGCA
ACAGCAACAACAACAACAACAGCAGCAGCAACAGCAGCCGAAGCAAAGTGCTTTAGATTCTAAGGACGTTGTGGCGGCTGCTACTTCCACGATCGACGACTTGGAAAAGA
GGAGTGAGTTTGATGAGTGTCGTTCTTGGTCCACTCGCTCTGATTGTTCTGTTTCCGATCGGGGACTCGCTGATTCTACTAATTTGGATCGCTTTTTGGAGCATACTACT
CCTCTTGTTCCGGCTCATTGTATTCCTAAGACGAGTCTGAGGGGGTGGAGAAATCGTGAAGTTTCAGAGGCATCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTT
CAAGGAATGGAGTGCATATGGAGCCGGTATCCCTCTATTGTTAAATGGCAGTGACTCTGTAGTACAGTATTATGTTCCATATCTGTCCGGCATTCAACTCTATGTTGATC
CTTCAAAGTCCTCTGCACTAAGTAGAAGGCGTGGTGCAGATAGTGATGCTGAGTCATCAAAGGAAACAAGCAGTGATGGAAGCAGTAATTCTGGGGCGGAAAAGAAAACG
AAAACTGCCCTTCAGAATGAGTGGATACAGGACTTTAATGTTCCGGGGTCACAAAGAGCTCTTCAGATGAATGTACCTTCTTCCGAGTCATCGAGTGATGAAAGTGACTC
TTGCTACCGTCATGGTCAGCTTGTTTTTGAATACTTGGAACGTGATCCACCATTTTGTCGTGAACCATTAACTGATAAGATCACTGTCCTTGCATCACGTTTTTCTGAAT
TAAAGACATACAGGAGCTGTGACTTATCTCCTTCCAGTTGGATATCTGTGGCATGGTATCCAATTTATCGGATTCCGACGGGGCCGACTCTACAAAGTCTAGATGCTTGT
TTCTTGACCTTCCATAATCTGTCAACAGCATTTCAAGGCATCAGCACGGATGGTTTACAATTCCATTGGCCAAGAGTTAGAGAGGTGTACACTGCAGATTGCCCTCTCAA
ACTGCAGTTGCCAATATTTGGACTTGCTTCCTATAAGTTCAAAATTCCTTTTTGGAATTCGACTGGTGCAGAGGAATGTTCGAAGGCTCACTCTTTGTGGCAAGATGCCG
ACAGCTGGCTTAGGTTATTAAACGTAAACCATCCTGATTACAGATTTTTCGCATCTCATAACTCATTCTGGAGATGA
mRNA sequence
ATGTCAGTCTCCGGTGGTGTTTCGATTGCCCGAATCCGTGGCGAGAATCGCTTCTACCATCCACCTGCGATGCGACGTCGTTTGCAGCAGCAGCAGCAGCAGCAGCAGCA
ACAGCAACAACAACAACAACAGCAGCAGCAACAGCAGCCGAAGCAAAGTGCTTTAGATTCTAAGGACGTTGTGGCGGCTGCTACTTCCACGATCGACGACTTGGAAAAGA
GGAGTGAGTTTGATGAGTGTCGTTCTTGGTCCACTCGCTCTGATTGTTCTGTTTCCGATCGGGGACTCGCTGATTCTACTAATTTGGATCGCTTTTTGGAGCATACTACT
CCTCTTGTTCCGGCTCATTGTATTCCTAAGACGAGTCTGAGGGGGTGGAGAAATCGTGAAGTTTCAGAGGCATCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTT
CAAGGAATGGAGTGCATATGGAGCCGGTATCCCTCTATTGTTAAATGGCAGTGACTCTGTAGTACAGTATTATGTTCCATATCTGTCCGGCATTCAACTCTATGTTGATC
CTTCAAAGTCCTCTGCACTAAGTAGAAGGCGTGGTGCAGATAGTGATGCTGAGTCATCAAAGGAAACAAGCAGTGATGGAAGCAGTAATTCTGGGGCGGAAAAGAAAACG
AAAACTGCCCTTCAGAATGAGTGGATACAGGACTTTAATGTTCCGGGGTCACAAAGAGCTCTTCAGATGAATGTACCTTCTTCCGAGTCATCGAGTGATGAAAGTGACTC
TTGCTACCGTCATGGTCAGCTTGTTTTTGAATACTTGGAACGTGATCCACCATTTTGTCGTGAACCATTAACTGATAAGATCACTGTCCTTGCATCACGTTTTTCTGAAT
TAAAGACATACAGGAGCTGTGACTTATCTCCTTCCAGTTGGATATCTGTGGCATGGTATCCAATTTATCGGATTCCGACGGGGCCGACTCTACAAAGTCTAGATGCTTGT
TTCTTGACCTTCCATAATCTGTCAACAGCATTTCAAGGCATCAGCACGGATGGTTTACAATTCCATTGGCCAAGAGTTAGAGAGGTGTACACTGCAGATTGCCCTCTCAA
ACTGCAGTTGCCAATATTTGGACTTGCTTCCTATAAGTTCAAAATTCCTTTTTGGAATTCGACTGGTGCAGAGGAATGTTCGAAGGCTCACTCTTTGTGGCAAGATGCCG
ACAGCTGGCTTAGGTTATTAAACGTAAACCATCCTGATTACAGATTTTTCGCATCTCATAACTCATTCTGGAGATGA
Protein sequence
MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTT
PLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKETSSDGSSNSGAEKKT
KTALQNEWIQDFNVPGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDAC
FLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADSWLRLLNVNHPDYRFFASHNSFWR
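
The protein sequence above is expected to be the conceptual translation of the CDS sequence. The following is a minimal sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base. The function name `translate` and the variable `cds_start` are illustrative, not part of the record. Only the first 51 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a sequence pasted as
    several lines can be passed in directly. Codons containing anything
    other than A, C, G or T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 51 bases of the CDS sequence listed in this record.
cds_start = "ATGTCAGTCTCCGGTGGTGTTTCGATTGCCCGAATCCGTGGCGAGAATCGC"
print(translate(cds_start))
```

Passing the full CDS text and comparing the result with the listed protein sequence (after joining its lines) is one way to confirm that the two entries agree.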