; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G035280 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G035280
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionProtein of unknown function (DUF1068)
Genome locationCiama_Chr02:14438729..14442225
RNA-Seq ExpressionCaUC02G035280
SyntenyCaUC02G035280
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010471 - Protein of unknown function DUF1068


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8652279.1 hypothetical protein Csa_022101 [Cucumis sativus]8.9e-7480.31Show/hide
Query:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS
        MAVK  GSCSPGLTKVGL +M LCIAAYILGPPLYWHFMEGL AFSSSS STCPPCFCDCS  TD  FTEELENTTFR             EK+FAELLS
Subjt:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS

Query:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS
        EELKLREAEALE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETR  QRGW+D+IV SR    GT+Q S
Subjt:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS

XP_004151898.1 uncharacterized protein LOC101219040 [Cucumis sativus]8.9e-7480.31Show/hide
Query:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS
        MAVK  GSCSPGLTKVGL +M LCIAAYILGPPLYWHFMEGL AFSSSS STCPPCFCDCS  TD  FTEELENTTFR             EK+FAELLS
Subjt:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS

Query:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS
        EELKLREAEALE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETR  QRGW+D+IV SR    GT+Q S
Subjt:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS

XP_008455917.1 PREDICTED: uncharacterized protein LOC103495987 isoform X1 [Cucumis melo]3.4e-7379.79Show/hide
Query:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS
        MA K  GS SPGLTKVGLC M +CIAAYILGPPLYWHF EGLAAFSSSS STCPPCFCDCS  TD  FTEEL+NTTFR             EK+FAELLS
Subjt:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS

Query:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS
        EELKLREAEALE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLT LWETR  QRGW+DDIV SR    GTVQTS
Subjt:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS

XP_023534557.1 uncharacterized protein LOC111796098 isoform X1 [Cucurbita pepo subsp. pepo]2.7e-7077.2Show/hide
Query:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS
        MAVK  GSCSPGLTKVGL  + LCIAAYILGPPLYWHFMEGLA  SSSS STCPPCFCDCS  TD  FT+E ENTTFR             E+SFAELLS
Subjt:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS

Query:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS
        EELKLREAEA+E HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTALWE R  QRGW+DDIV S A     VQTS
Subjt:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS

XP_038900340.1 uncharacterized protein LOC120087594 [Benincasa hispida]8.3e-7283.8Show/hide
Query:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAF--SSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAEL
        MAVK  GS SPGLTKVGLC+M LC+AAYILGPPLYWHFMEGLAAF  SSSSFSTCPPCFCDCS HTD  FTEELE+TTFR             EKSFAEL
Subjt:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAF--SSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAEL

Query:  LSEELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKD
        LSEELKLREAEALESHR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETR  QRGW+D
Subjt:  LSEELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKD

TrEMBL top hitse value%identityAlignment
A0A0A0LQC2 Uncharacterized protein4.3e-7480.31Show/hide
Query:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS
        MAVK  GSCSPGLTKVGL +M LCIAAYILGPPLYWHFMEGL AFSSSS STCPPCFCDCS  TD  FTEELENTTFR             EK+FAELLS
Subjt:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS

Query:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS
        EELKLREAEALE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETR  QRGW+D+IV SR    GT+Q S
Subjt:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS

A0A1S3C1Z9 uncharacterized protein LOC103495987 isoform X11.6e-7379.79Show/hide
Query:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS
        MA K  GS SPGLTKVGLC M +CIAAYILGPPLYWHF EGLAAFSSSS STCPPCFCDCS  TD  FTEEL+NTTFR             EK+FAELLS
Subjt:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS

Query:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS
        EELKLREAEALE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLT LWETR  QRGW+DDIV SR    GTVQTS
Subjt:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS

A0A1S3C2Q4 uncharacterized protein LOC103495987 isoform X21.9e-6979.46Show/hide
Query:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELEN-----TTFREKSFAELLSEELKLREA
        MA K  GS SPGLTKVGLC M +CIAAYILGPPLYWHF EGLAAFSSSS STCPPCFCDCS  TD  FTE+            EK+FAELLSEELKLREA
Subjt:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELEN-----TTFREKSFAELLSEELKLREA

Query:  EALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS
        EALE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLT LWETR  QRGW+DDIV SR    GTVQTS
Subjt:  EALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS

A0A6J1CER0 uncharacterized protein LOC1110110151.1e-6975.65Show/hide
Query:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS
        MAVK  G CSPG TKVGL  MGL +AAYI+ PPLYWHF+E LAA SSSS STCPPCFCDCS +TD   +EELENTTFR             EKSF ELLS
Subjt:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS

Query:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS
        EELKLREAEALESHR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARE+AEA+L SQ+RLTALWETR  QRGW+ DIVRSRA   GTVQT+
Subjt:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS

A0A6J1G5W1 uncharacterized protein LOC111451120 isoform X16.4e-7076.17Show/hide
Query:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS
        MAVK  GSCSPGLTKVGL  + LCIAAYILGPPLYWHF+EGLA  SSSS STCPPCFCDCS  TD  FT+E ENTTFR             E++FAELLS
Subjt:  MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFR-------------EKSFAELLS

Query:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS
        EELKLREAEA+E HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTALWE R  QRGW+DDIV S A     VQTS
Subjt:  EELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05070.1 Protein of unknown function (DUF1068)9.8e-4754.19Show/hide
Query:  KVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTF-------------REKSFAELLSEELKLREAEALESH
        K+GL ++GL +A YILGPPLYWH  E LAA S+SS   CP C C+CS ++ +T  +EL N +F              EK++AELL+EELKLREAE+LE H
Subjt:  KVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTF-------------REKSFAELLSEELKLREAEALESH

Query:  RLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS
        + AD+ LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  LA QK+LT+ WE R  Q+GW++   +   ++   VQ +
Subjt:  RLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS

AT2G32580.1 Protein of unknown function (DUF1068)5.8e-3951.16Show/hide
Query:  KVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTF-------------REKSFAELLSEELKLREAEALESH
        KVGL ++ L +  YILGPPLYWH  E LA     S ++C  C CDCS    LT    L N +F              EK++AELL+EELK REA ++E H
Subjt:  KVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTF-------------REKSFAELLSEELKLREAEALESH

Query:  RLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQT
        +  D  LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  L  QK+LT++WE R  Q+G+KD   +S  ++
Subjt:  RLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQT

AT2G32580.2 Protein of unknown function (DUF1068)4.3e-2661.05Show/hide
Query:  EKSFAELLSEELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQT
        EK++AELL+EELK REA ++E H+  D  LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  L  QK+LT++WE R  Q+G+KD   +S  ++
Subjt:  EKSFAELLSEELKLREAEALESHRLADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQT

AT4G04360.1 Protein of unknown function (DUF1068)1.4e-3753.29Show/hide
Query:  KVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTF------------REKSFAELLSEELKLREAEALESHR
        KV   VMGLCI AYI GP LYWH  E +A    S  S+CPPC CDCS    L+  + L N +F             E SF E+++EELKLREA+A E   
Subjt:  KVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTF------------REKSFAELLSEELKLREAEALESHR

Query:  LADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRS
         AD  LL+AKK  SQYQKEADKC+ GMETCE ARE+AEA L  Q+RL+ +WE R  Q GWK+  V S
Subjt:  LADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRS

AT4G30996.1 Protein of unknown function (DUF1068)3.3e-2641.98Show/hide
Query:  LCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDL--------------TFTEELENTTFREKSFAELLSEELKLREAEALESHRL
        L +  +  A  + GP LYW F +G    S+ + S CPPC CDC     L                +++ E     EK F +LL+EELKL+EA A E  R 
Subjt:  LCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDL--------------TFTEELENTTFREKSFAELLSEELKLREAEALESHRL

Query:  ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDD
         +++L EAK++ SQYQKEA+KCN+  E CE+ARERAEA L  ++++T+LWE R  Q GW+ +
Subjt:  ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGTGAAGTCGACGGGTTCATGCTCTCCAGGGCTGACGAAGGTGGGATTGTGTGTTATGGGTCTTTGTATAGCGGCTTACATTCTGGGTCCGCCTCTCTACTGGCA
TTTCATGGAGGGCTTGGCCGCTTTCTCTTCTTCCTCTTTCTCAACTTGCCCACCTTGCTTTTGTGATTGCTCTTTTCACACTGACTTGACCTTTACTGAAGAGCTCGAAA
ACACCACTTTTAGAGAGAAGAGTTTTGCAGAGTTGTTGTCGGAGGAACTGAAACTGAGGGAAGCTGAAGCTTTGGAAAGTCATCGGCTCGCCGATATATCCCTGCTAGAA
GCAAAGAAGATGACATCTCAATATCAGAAAGAAGCAGATAAGTGCAATTCAGGCATGGAAACATGTGAAGCAGCAAGGGAAAGGGCTGAAGCTACATTAGCTTCACAAAA
GAGGCTAACAGCATTATGGGAGACTAGGCTTTGCCAAAGAGGATGGAAAGACGACATTGTTAGATCCCGTGCTCAGACCCATGGTACTGTCCAAACCTCATAA
mRNA sequenceShow/hide mRNA sequence
ATCCACCTTTCCCAACGAGTTCATTCCCGTTTTCCATTTTTGGACTACACGAAGGTCTTGAAAATGGCAGTGAAGTCGACGGGTTCATGCTCTCCAGGGCTGACGAAGGT
GGGATTGTGTGTTATGGGTCTTTGTATAGCGGCTTACATTCTGGGTCCGCCTCTCTACTGGCATTTCATGGAGGGCTTGGCCGCTTTCTCTTCTTCCTCTTTCTCAACTT
GCCCACCTTGCTTTTGTGATTGCTCTTTTCACACTGACTTGACCTTTACTGAAGAGCTCGAAAACACCACTTTTAGAGAGAAGAGTTTTGCAGAGTTGTTGTCGGAGGAA
CTGAAACTGAGGGAAGCTGAAGCTTTGGAAAGTCATCGGCTCGCCGATATATCCCTGCTAGAAGCAAAGAAGATGACATCTCAATATCAGAAAGAAGCAGATAAGTGCAA
TTCAGGCATGGAAACATGTGAAGCAGCAAGGGAAAGGGCTGAAGCTACATTAGCTTCACAAAAGAGGCTAACAGCATTATGGGAGACTAGGCTTTGCCAAAGAGGATGGA
AAGACGACATTGTTAGATCCCGTGCTCAGACCCATGGTACTGTCCAAACCTCATAA
Protein sequenceShow/hide protein sequence
MAVKSTGSCSPGLTKVGLCVMGLCIAAYILGPPLYWHFMEGLAAFSSSSFSTCPPCFCDCSFHTDLTFTEELENTTFREKSFAELLSEELKLREAEALESHRLADISLLE
AKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRLCQRGWKDDIVRSRAQTHGTVQTS