; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC05G088570 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC05G088570
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionProtein of unknown function (DUF789)
Genome locationCicolChr05:6154694..6158214
RNA-Seq ExpressionCcUC05G088570
SyntenyCcUC05G088570
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134231.3 uncharacterized protein LOC101208769 isoform X1 [Cucumis sativus]2.0e-23395.57Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ-------KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ       KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ-------KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSC
        ESSKE SSDGSSNSGAEKKTKT LQ+EWIQDF+V GSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRF ELKTYRSC
Subjt:  ESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSC

Query:  ELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSL
        +LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSL
Subjt:  ELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSL

Query:  WQDADNWLRLLNVNHPDYRFFASHNSFWR
        WQDAD+WLRLLNVNHPDYRFFASHNSFWR
Subjt:  WQDADNWLRLLNVNHPDYRFFASHNSFWR

XP_008438916.1 PREDICTED: uncharacterized protein LOC103483873 isoform X1 [Cucumis melo]5.8e-23395.99Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ  KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGL DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRF

Query:  LEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
        LEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALSRRRGADSDAESSKE
Subjt:  LEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE

Query:  ASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPS
         SSDGSSNSGAEKKTKT LQ+EWIQDF+ LGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSC+LSPS
Subjt:  ASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
        SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA Q G STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD

Query:  NWLRLLNVNHPDYRFFASHNSFWR
        +WLRLLNVNHPDYRFFASHNSFWR
Subjt:  NWLRLLNVNHPDYRFFASHNSFWR

XP_008438917.1 PREDICTED: uncharacterized protein LOC103483873 isoform X2 [Cucumis melo]3.2e-23195.75Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ  KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGL DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRF

Query:  LEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
        LEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS AL RRRGADSDAESSKE
Subjt:  LEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE

Query:  ASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPS
         SSDGSSNSGAEKKTKT LQ+EWIQDF+ LGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSC+LSPS
Subjt:  ASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
        SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA Q G STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD

Query:  NWLRLLNVNHPDYRFFASHNSFWR
        +WLRLLNVNHPDYRFFASHNSFWR
Subjt:  NWLRLLNVNHPDYRFFASHNSFWR

XP_011651067.2 uncharacterized protein LOC101208769 isoform X2 [Cucumis sativus]1.1e-23195.34Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ-------KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ       KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ-------KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSAL RRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSC
        ESSKE SSDGSSNSGAEKKTKT LQ+EWIQDF+V GSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRF ELKTYRSC
Subjt:  ESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSC

Query:  ELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSL
        +LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSL
Subjt:  ELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSL

Query:  WQDADNWLRLLNVNHPDYRFFASHNSFWR
        WQDAD+WLRLLNVNHPDYRFFASHNSFWR
Subjt:  WQDADNWLRLLNVNHPDYRFFASHNSFWR

XP_038877692.1 uncharacterized protein LOC120069924 [Benincasa hispida]2.6e-23396.02Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ-----KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ     KQS LDSKDV+ ASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ-----KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNL

Query:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAES
        DRFLEHTTPLVPAHCIPKTSLRGWRNREV EASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSS+LSRRRG DSDA S
Subjt:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAES

Query:  SKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCEL
        SKE SSDGSSNSGAEKKTKT LQDEWIQDFSV GSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSC+L
Subjt:  SKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCEL

Query:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQ
        SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQ
Subjt:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQ

Query:  DADNWLRLLNVNHPDYRFFASHNSFWR
        DADNWLRLLNVNHPDYRFFASHNSFWR
Subjt:  DADNWLRLLNVNHPDYRFFASHNSFWR

TrEMBL top hitse value%identityAlignment
A0A0A0L5V4 Uncharacterized protein1.4e-23296.21Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRFLE
        MSVSGGVSIARIRGENRFYHPPAMRRRL   QQQQQQQQQQQ KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRFLE
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRFLE

Query:  HTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKEAS
        HTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE S
Subjt:  HTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKEAS

Query:  SDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSW
        SDGSSNSGAEKKTKT LQ+EWIQDF+V GSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRF ELKTYRSC+LSPSSW
Subjt:  SDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSW

Query:  ISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNW
        ISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD+W
Subjt:  ISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNW

Query:  LRLLNVNHPDYRFFASHNSFWR
        LRLLNVNHPDYRFFASHNSFWR
Subjt:  LRLLNVNHPDYRFFASHNSFWR

A0A1S3AY60 uncharacterized protein LOC103483873 isoform X12.8e-23395.99Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ  KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGL DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRF

Query:  LEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
        LEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALSRRRGADSDAESSKE
Subjt:  LEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE

Query:  ASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPS
         SSDGSSNSGAEKKTKT LQ+EWIQDF+ LGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSC+LSPS
Subjt:  ASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
        SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA Q G STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD

Query:  NWLRLLNVNHPDYRFFASHNSFWR
        +WLRLLNVNHPDYRFFASHNSFWR
Subjt:  NWLRLLNVNHPDYRFFASHNSFWR

A0A1S3AY77 uncharacterized protein LOC103483873 isoform X21.5e-23195.75Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ  KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGL DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ--KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRF

Query:  LEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
        LEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS AL RRRGADSDAESSKE
Subjt:  LEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE

Query:  ASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPS
         SSDGSSNSGAEKKTKT LQ+EWIQDF+ LGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSC+LSPS
Subjt:  ASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
        SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA Q G STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD

Query:  NWLRLLNVNHPDYRFFASHNSFWR
        +WLRLLNVNHPDYRFFASHNSFWR
Subjt:  NWLRLLNVNHPDYRFFASHNSFWR

A0A5A7U113 Uncharacterized protein2.3e-21993.06Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ-----KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ     KQSALDSKDVVAA+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGL DSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ-----KQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNL

Query:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAES
        DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALSRRRGADSDAES
Subjt:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAES

Query:  SKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCEL
        SKE SSDGSSNSGAEKKTKT LQ+EWIQDF+ LGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSC+L
Subjt:  SKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCEL

Query:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ-----------AGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGA
        SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ           AG STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGA
Subjt:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ-----------AGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGA

Query:  EECSKAHSLWQDADNWLR
        EECSKAHSLWQDAD+WLR
Subjt:  EECSKAHSLWQDADNWLR

A0A6J1GUB7 uncharacterized protein LOC111457542 isoform X17.4e-21890.38Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQ----QKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQ    QKQSALDSKD VAA++A IDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQ----QKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLD

Query:  RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESS
        RFLEHTTPLV AHCIPKT LRGWR REV EA PYFVLGDLWES+KEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLY+ PSKSSALSRRRG DSDAESS
Subjt:  RFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESS

Query:  KEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELS
        KE SSDGSSN GAEKKTK  LQDE IQD S+ GSQRALQMN PS+ESSSDESDSCY HGQLVFEY+ERDPPFCREPLTDKITILASRFPELKTYRSC+LS
Subjt:  KEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELS

Query:  PSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQD
        PSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ GIS+DGLQF WPRVREVYTADCPLKLQLPIFGLASYKFK+PFWNSTG EECSKA SLWQD
Subjt:  PSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQD

Query:  ADNWLRLLNVNHPDYRFFASHNSFWR
        A+ WLRLLNVNHPDYRFF+SH+SF R
Subjt:  ADNWLRLLNVNHPDYRFFASHNSFWR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)1.2e-8250.15Show/hide
Query:  ADSTNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLY--VDPSKSSALSRR
        A S+N++RFL+  TP VPAH + KT +R     +V    PYF+LGD+WESF EWSAYG G+PL LN + D V QYYVP LSGIQ+Y  VD   SS  +RR
Subjt:  ADSTNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLY--VDPSKSSALSRR

Query:  RGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPE
        +G +S+++  +++SS+GSS+        +  Q     D   L  +          +SSSD+ +     G+L+FEYLERD P+ REP  DK++ LASRFPE
Subjt:  RGADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPE

Query:  LKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQA-GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAE
        LKT RSC+L PSSW SVAWYPIY+IPTGPTL+ LDACFLT+H+L T FQ  G++T  +    PR       +   K++LP+FGLASYK +   W S G  
Subjt:  LKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQA-GISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAE

Query:  ECSKAHSLWQDADNWLRLLNVNHPDYRFF
            A+SL+Q ADNWLRL  VNHPD+ FF
Subjt:  ECSKAHSLWQDADNWLRLLNVNHPDYRFF

AT2G01260.1 Protein of unknown function (DUF789)2.5e-7748.47Show/hide
Query:  STNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRG
        S+NLDRFLE  TP VPA  + KT LR  R + + ++  PYFVLGD+W+SF EWSAYG G+PL+LN + D V+QYYVP LS IQ+Y    +  S+L  RR 
Subjt:  STNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRG

Query:  ADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELK
         DS     +++SSD SS+S +E+ +          D   L  Q          +SSSD+ +     G+L+FEYLERD P+ REP  DK+  LA++FPEL 
Subjt:  ADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELK

Query:  TYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECS
        T RSC+L  SSW SVAWYPIYRIPTGPTL+ LDACFLT+H+L T+F    S   +    PR  E        K+ LP+FGLASYKF+   W   G  E  
Subjt:  TYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECS

Query:  KAHSLWQDADNWLRLLNVNHPDYRFF
          +SL+Q AD WL   +V+HPD+ FF
Subjt:  KAHSLWQDADNWLRLLNVNHPDYRFF

AT2G01260.2 Protein of unknown function (DUF789)2.8e-6051.63Show/hide
Query:  STNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRG
        S+NLDRFLE  TP VPA  + KT LR  R + + ++  PYFVLGD+W+SF EWSAYG G+PL+LN + D V+QYYVP LS IQ+Y    +  S+L  RR 
Subjt:  STNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRG

Query:  ADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELK
         DS     +++SSD SS+S +E+ +          D   L  Q          +SSSD+ +     G+L+FEYLERD P+ REP  DK+  LA++FPEL 
Subjt:  ADSDAESSKEASSDGSSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELK

Query:  TYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAF
        T RSC+L  SSW SVAWYPIYRIPTGPTL+ LDACFLT+H+L T+F
Subjt:  TYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAF

AT4G16100.1 Protein of unknown function (DUF789)1.5e-9048.82Show/hide
Query:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST-------NLDRFLEHTT
        RIRGENRFY+PP M R+LQQ++++++ + ++ +K+    +K+++       +   K+ E  EC    + SDCSV  R  + +T       NL RFL+ TT
Subjt:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADST-------NLDRFLEHTT

Query:  PLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKEASSDG
        P+V    +P TS +GWR RE  E  PYF+L DLW+SF+EWSAYG G+PLLLNG DSVVQYYVPYLSGIQLY DPS++    RR G +SD +S ++ SSDG
Subjt:  PLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKEASSDG

Query:  SSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESD-SCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWIS
        S++             E  Q+       RA     P   SSSDES+ S    G+LVFEYLE   PF REPLTDKI+ L+S+FP L+TYRSC+LSPSSW+S
Subjt:  SSNSGAEKKTKTTLQDEWIQDFSVLGSQRALQMNVPSSESSSDESD-SCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWIS

Query:  VAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWN-STGAEECSKAHSLWQDADNWL
        VAWYPIYRIP G +LQ+LDACFLTFH+LST  +   + +G        + V +A    KL LP FGLASYKFK+  W+  +  +E  +  +L + A+ WL
Subjt:  VAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWN-STGAEECSKAHSLWQDADNWL

Query:  RLLNVNHPDYRFFASHN-SFWR
        R L V  PD+R F SH+ S WR
Subjt:  RLLNVNHPDYRFFASHN-SFWR

AT5G49220.1 Protein of unknown function (DUF789)6.9e-9149.43Show/hide
Query:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQ--QQQQQQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSV----------SD
        MS SGGVSIAR  IRGENRFY+PP MRR  Q+ Q QQQ  ++Q++  +   L  K+   A+T       K     E +S    S   V          S 
Subjt:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQ--QQQQQQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSV----------SD

Query:  RGLADSTNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGI-----PLLLNGSDSVVQYYVPYLSGIQLYVDPSKSS
        R L+D +NLDRFLEHTTP+VPA   P  S    + RE S+   YFVL DLWESF EWSAYGAG+     PL ++G+DS VQYYVPYLSGIQLYVDP    
Subjt:  RGLADSTNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGI-----PLLLNGSDSVVQYYVPYLSGIQLYVDPSKSS

Query:  ALSRRRGADSDAESSKEASSDGSSNSGAEKKTKTT--LQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITI
         L + R    D     E SS+GSSNS       +   L    ++D S+ GS             SS E++     G+L+FEYLE +PPF REPL +KI+ 
Subjt:  ALSRRRGADSDAESSKEASSDGSSNSGAEKKTKTT--LQDEWIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITI

Query:  LASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFW
        LASR PEL TYRSC+L PSSW+SV+WYPIYRIP GPTLQ+LDACFLTFH+LSTA     S  G     P            KL LP FGLASYK K+  W
Subjt:  LASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFW

Query:  NSTGAEECSKAHSLWQDADNWLRLLNVNHPDYRFFASHN
        N    +E  K  SL Q AD WL+ L V+HPDYRFF S++
Subjt:  NSTGAEECSKAHSLWQDADNWLRLLNVNHPDYRFFASHN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGTCTCCGGTGGGGTTTCGATTGCCCGAATCCGTGGCGAGAATCGCTTTTACCATCCACCTGCGATGCGGCGTCGTTTGCAGCAGCAGCAGCAGCAGCAACAACA
ACAGCAGCAGCAGCAGCAGAAGCAAAGTGCCTTGGATTCTAAGGACGTTGTGGCGGCTTCTACTGCTACGATCGATGACTTAGAGAAGAGGAGTGAGTTTGATGAGTGCC
GTTCTTGGTCCACTCGCTCTGATTGCTCTGTTTCGGATCGTGGACTAGCTGATTCTACTAATTTGGATCGCTTCTTGGAACACACTACTCCCCTTGTTCCGGCTCATTGT
ATTCCTAAGACGAGCCTGAGGGGATGGAGAAACCGTGAAGTCTCAGAAGCATCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTTCAAGGAATGGAGTGCATATGG
CGCGGGAATCCCTCTATTGTTAAATGGTAGTGACTCTGTTGTACAGTACTATGTTCCTTATCTGTCCGGCATTCAACTCTATGTTGATCCTTCGAAGTCCTCTGCCCTAA
GTAGAAGGCGTGGCGCAGATAGTGATGCCGAGTCCTCGAAGGAAGCAAGCAGTGATGGAAGCAGTAATTCCGGGGCAGAAAAGAAAACGAAGACTACCCTTCAGGATGAG
TGGATCCAGGACTTTAGTGTCCTGGGGTCACAAAGAGCTCTTCAAATGAATGTACCTTCTTCCGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCGTCATGGTCAGCT
TGTGTTTGAATACTTGGAGCGCGATCCACCATTTTGTCGTGAACCATTAACTGATAAGATCACTATCCTTGCATCTCGTTTTCCTGAATTAAAGACATATAGGAGTTGCG
AGCTATCTCCTTCCAGTTGGATTTCTGTGGCATGGTATCCCATTTATCGAATTCCCACGGGGCCAACTTTACAAAGTCTAGATGCTTGTTTCTTGACCTTTCATAATCTG
TCAACAGCATTTCAAGCAGGCATCAGCACTGATGGGTTGCAATTCCATTGGCCAAGAGTTAGAGAGGTGTACACTGCGGATTGCCCTCTCAAACTGCAGTTGCCAATATT
TGGACTTGCTTCCTATAAGTTCAAAATTCCTTTTTGGAATTCGACTGGTGCAGAGGAATGTTCGAAGGCTCACTCTTTGTGGCAAGATGCTGACAACTGGCTCAGGTTAT
TAAATGTAAACCATCCTGATTACAGATTTTTCGCATCTCATAACTCATTCTGGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAGTCTCCGGTGGGGTTTCGATTGCCCGAATCCGTGGCGAGAATCGCTTTTACCATCCACCTGCGATGCGGCGTCGTTTGCAGCAGCAGCAGCAGCAGCAACAACA
ACAGCAGCAGCAGCAGCAGAAGCAAAGTGCCTTGGATTCTAAGGACGTTGTGGCGGCTTCTACTGCTACGATCGATGACTTAGAGAAGAGGAGTGAGTTTGATGAGTGCC
GTTCTTGGTCCACTCGCTCTGATTGCTCTGTTTCGGATCGTGGACTAGCTGATTCTACTAATTTGGATCGCTTCTTGGAACACACTACTCCCCTTGTTCCGGCTCATTGT
ATTCCTAAGACGAGCCTGAGGGGATGGAGAAACCGTGAAGTCTCAGAAGCATCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTTCAAGGAATGGAGTGCATATGG
CGCGGGAATCCCTCTATTGTTAAATGGTAGTGACTCTGTTGTACAGTACTATGTTCCTTATCTGTCCGGCATTCAACTCTATGTTGATCCTTCGAAGTCCTCTGCCCTAA
GTAGAAGGCGTGGCGCAGATAGTGATGCCGAGTCCTCGAAGGAAGCAAGCAGTGATGGAAGCAGTAATTCCGGGGCAGAAAAGAAAACGAAGACTACCCTTCAGGATGAG
TGGATCCAGGACTTTAGTGTCCTGGGGTCACAAAGAGCTCTTCAAATGAATGTACCTTCTTCCGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCGTCATGGTCAGCT
TGTGTTTGAATACTTGGAGCGCGATCCACCATTTTGTCGTGAACCATTAACTGATAAGATCACTATCCTTGCATCTCGTTTTCCTGAATTAAAGACATATAGGAGTTGCG
AGCTATCTCCTTCCAGTTGGATTTCTGTGGCATGGTATCCCATTTATCGAATTCCCACGGGGCCAACTTTACAAAGTCTAGATGCTTGTTTCTTGACCTTTCATAATCTG
TCAACAGCATTTCAAGCAGGCATCAGCACTGATGGGTTGCAATTCCATTGGCCAAGAGTTAGAGAGGTGTACACTGCGGATTGCCCTCTCAAACTGCAGTTGCCAATATT
TGGACTTGCTTCCTATAAGTTCAAAATTCCTTTTTGGAATTCGACTGGTGCAGAGGAATGTTCGAAGGCTCACTCTTTGTGGCAAGATGCTGACAACTGGCTCAGGTTAT
TAAATGTAAACCATCCTGATTACAGATTTTTCGCATCTCATAACTCATTCTGGAGATGATAATGAAGGATATTATGCAATGAGGCATAAATGTGGGTTTACAGTTTTAAG
TCCAAAGAAACTTGCTTCTCCTGAATGTCGTGAAAGTTTTTATGGCATCTTTGGCTTTTTTTTTTTTTT
Protein sequenceShow/hide protein sequence
MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQKQSALDSKDVVAASTATIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVPAHC
IPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKEASSDGSSNSGAEKKTKTTLQDE
WIQDFSVLGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCELSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNL
STAFQAGISTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADNWLRLLNVNHPDYRFFASHNSFWR