; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C04G074930 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C04G074930
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionUnknown protein
Genome locationCla97Chr04:22544991..22546796
RNA-Seq ExpressionCla97C04G074930
SyntenyCla97C04G074930
Gene Ontology termsNA
InterPro domainsIPR040344 - Uncharacterized protein At3g17950-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032638.1 uncharacterized protein E6C27_scaffold184G00230 [Cucumis melo var. makuwa]9.8e-9782.25Show/hide
Query:  ATPQMDVTDLLTCPPPRTRIDACPFLMKNYRDTRFAFDSPSSLILWQSTPPPLIFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRNFPGSIS
        ATPQ DVT+LLT PPPRTRIDACPFLMKN+RDTRFAF SPSSLILWQ+T   L FLSISNQPPQSQPVMAQQDDGWPLGLR+LNARVGLLENR+FPGSIS
Subjt:  ATPQMDVTDLLTCPPPRTRIDACPFLMKNYRDTRFAFDSPSSLILWQSTPPPLIFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRNFPGSIS

Query:  FNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRR----GNTEANLGGDRKINN-FRFKYKPWLFSLCCKLSTDVVSATRTHSL
        FNTLPTGSPISFTDSS LDSESSGSFFH+KS TLGSL+G STS+IMELSRR    G+TEA+LG DRKINN F+FK K WLFSLCCKLSTD V ATRTHSL
Subjt:  FNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRR----GNTEANLGGDRKINN-FRFKYKPWLFSLCCKLSTDVVSATRTHSL

Query:  AHFLEVERRRTAATAAARPLPIAGRANNLLT
        AHFLE+ER+RTA  AAA P PI GR+N+ LT
Subjt:  AHFLEVERRRTAATAAARPLPIAGRANNLLT

KAG6595318.1 hypothetical protein SDJN03_11871, partial [Cucurbita argyrosperma subsp. sororia]2.2e-5671.89Show/hide
Query:  LWQS---TPPPLIFLSISNQPPQSQP-------VMAQQDDGWPLGLRLLNARVGLLENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITL
        LW S    PP L   SISNQPP+SQP       +MAQQDDGWPLGLRLLNARVGLLENR+F GSISFNTLPTGSPISFTDSSDLDS+SSGSF HAKSI+ 
Subjt:  LWQS---TPPPLIFLSISNQPPQSQP-------VMAQQDDGWPLGLRLLNARVGLLENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITL

Query:  GSLIGDSTSSIMELSRR----GNTEANLGGDRKINNFRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAARPL
        GSLI  S S I+ELSRR    G+TE +LGG RK +   FK KPWLFSLCCKLSTD VS TRTHSLAHFLE ERRRTAA      L
Subjt:  GSLIGDSTSSIMELSRR----GNTEANLGGDRKINNFRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAARPL

XP_004142071.2 uncharacterized protein LOC101214483 [Cucumis sativus]2.2e-7278.84Show/hide
Query:  LILWQSTPPPLIFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDST
        ++   +T   L FLSISNQPPQSQPVMAQQDDGWPLGLR+LNARVGLLENR+FPGSISFNTLPTGSPISFTDSS LDSESSGSFFH+KSITLGSLIG ST
Subjt:  LILWQSTPPPLIFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDST

Query:  SSIMELSRR----GNTEANLGGDRKINN-FRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAARPLPIAGRANNLLT
        S+IMEL+RR    G+TEA+LG DRKINN F+ K KPWLFSLCCKLSTD V ATRTHSLAHFLE+ER+RTA  AAA P PI GR++N+LT
Subjt:  SSIMELSRR----GNTEANLGGDRKINN-FRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAARPLPIAGRANNLLT

XP_016900475.1 PREDICTED: uncharacterized protein LOC103490160 isoform X1 [Cucumis melo]4.1e-7182.12Show/hide
Query:  LIFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRR-
        L FLSISNQPPQSQPVMAQQDDGWPLGLR+LNARVGLLENR+FPGSISFNTLPTGSPISFTDSS LDSESSGSFFH+KS TLGSL+G STS+IMELSRR 
Subjt:  LIFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRR-

Query:  ---GNTEANLGGDRKINN-FRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAARPLPIAGRANNLLT
           G+TEA+LG DRKINN F+FK K WLFSLCCKLSTD V ATRTHSLAHFLE+ER+RTA  AAA P PI GR+N+ LT
Subjt:  ---GNTEANLGGDRKINN-FRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAARPLPIAGRANNLLT

XP_038877635.1 uncharacterized protein LOC120069886 [Benincasa hispida]1.1e-7485.16Show/hide
Query:  PPPLIFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELS
        PPPL FLSISNQPP+SQ VMAQQDDGWPLGLRLLNARVGLLENR+FPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIG ST  IMELS
Subjt:  PPPLIFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELS

Query:  RR----GNTEANLGGDRKINN-FRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAARPLPIAGRANNLLT
        RR    G+TEA+LG DRKINN F+FK KPWLFSLCCKLSTD V ATRT SLAHFLEVERRRTAA AA+ P PIA R NN+LT
Subjt:  RR----GNTEANLGGDRKINN-FRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAARPLPIAGRANNLLT

TrEMBL top hitse value%identityAlignment
A0A0A0KX74 Uncharacterized protein2.4e-6481.6Show/hide
Query:  MAQQDDGWPLGLRLLNARVGLLENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRR----GNTEANLGGDRKI
        MAQQDDGWPLGLR+LNARVGLLENR+FPGSISFNTLPTGSPISFTDSS LDSESSGSFFH+KSITLGSLIG STS+IMEL+RR    G+TEA+LG DRKI
Subjt:  MAQQDDGWPLGLRLLNARVGLLENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRR----GNTEANLGGDRKI

Query:  NN-FRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAARPLPIAGRANNLLT
        NN F+ K KPWLFSLCCKLSTD V ATRTHSLAHFLE+ER+RTA  AAA P PI GR++N+LT
Subjt:  NN-FRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAARPLPIAGRANNLLT

A0A1S4DWX0 uncharacterized protein LOC103490160 isoform X12.0e-7182.12Show/hide
Query:  LIFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRR-
        L FLSISNQPPQSQPVMAQQDDGWPLGLR+LNARVGLLENR+FPGSISFNTLPTGSPISFTDSS LDSESSGSFFH+KS TLGSL+G STS+IMELSRR 
Subjt:  LIFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRR-

Query:  ---GNTEANLGGDRKINN-FRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAARPLPIAGRANNLLT
           G+TEA+LG DRKINN F+FK K WLFSLCCKLSTD V ATRTHSLAHFLE+ER+RTA  AAA P PI GR+N+ LT
Subjt:  ---GNTEANLGGDRKINN-FRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAARPLPIAGRANNLLT

A0A5D3DJ71 Uncharacterized protein4.7e-9782.25Show/hide
Query:  ATPQMDVTDLLTCPPPRTRIDACPFLMKNYRDTRFAFDSPSSLILWQSTPPPLIFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRNFPGSIS
        ATPQ DVT+LLT PPPRTRIDACPFLMKN+RDTRFAF SPSSLILWQ+T   L FLSISNQPPQSQPVMAQQDDGWPLGLR+LNARVGLLENR+FPGSIS
Subjt:  ATPQMDVTDLLTCPPPRTRIDACPFLMKNYRDTRFAFDSPSSLILWQSTPPPLIFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRNFPGSIS

Query:  FNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRR----GNTEANLGGDRKINN-FRFKYKPWLFSLCCKLSTDVVSATRTHSL
        FNTLPTGSPISFTDSS LDSESSGSFFH+KS TLGSL+G STS+IMELSRR    G+TEA+LG DRKINN F+FK K WLFSLCCKLSTD V ATRTHSL
Subjt:  FNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRR----GNTEANLGGDRKINN-FRFKYKPWLFSLCCKLSTDVVSATRTHSL

Query:  AHFLEVERRRTAATAAARPLPIAGRANNLLT
        AHFLE+ER+RTA  AAA P PI GR+N+ LT
Subjt:  AHFLEVERRRTAATAAARPLPIAGRANNLLT

A0A6J1CR51 uncharacterized protein LOC1110135623.4e-5569.47Show/hide
Query:  PPPLIFLSISNQPPQSQP-------VMAQQDDGWPLGLRLLNARVGLLENRNF-PGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDS
        PP L   SI NQPPQSQP       VMAQQDDGWPLGLRLLNARVGLL +R+F  GSISFNTLPTGS  SFTDSSDLDSES+GSFFH KSITLGSL+ DS
Subjt:  PPPLIFLSISNQPPQSQP-------VMAQQDDGWPLGLRLLNARVGLLENRNF-PGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDS

Query:  TSSIMELSR---RGNTEANLGGDRKINNFRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAAR--PLPIAGRANNLLT
        +SSI+ELSR   RG TE +LGG+    N   K K WLFSLCCKLSTD VSATRT SLAHFLE ERR++AA A  R   L I+   NNL+T
Subjt:  TSSIMELSR---RGNTEANLGGDRKINNFRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAAR--PLPIAGRANNLLT

A0A6J1I9M4 uncharacterized protein LOC1114712926.9e-5670.16Show/hide
Query:  PPPLIFLSISNQPPQSQP-------VMAQQDDGWPLGLRLLNARVGLLENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDST
        PPPL   SISN PP+SQP       +MAQQDDGWPLGLRLLNARVGLLENR+F GSISFNTLPTGSPISFTDSSDLDS+SSGSFFHAKSI+  SLI  S 
Subjt:  PPPLIFLSISNQPPQSQP-------VMAQQDDGWPLGLRLLNARVGLLENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDST

Query:  SSIMELSRR----GNTEANLGGDRKINNFRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAARPLPI----AGRANNLL
        S I+ELS R    G+TE +LGG RK +   FK KPWLFSLCCKLSTD VS TRTHSLAHFLE ERRRTAA      L        R NNLL
Subjt:  SSIMELSRR----GNTEANLGGDRKINNFRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEVERRRTAATAAARPLPI----AGRANNLL

SwissProt top hitse value%identityAlignment
Q6DR24 Uncharacterized protein At3g179501.6e-0445.45Show/hide
Query:  LPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRRGNTEANL
        +P+   IS   SSDLD+ES+GSFFH +SITLG+L+G S ++ M +  R ++  ++
Subjt:  LPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRRGNTEANL

Arabidopsis top hitse value%identityAlignment
AT3G17950.1 unknown protein1.1e-0545.45Show/hide
Query:  LPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRRGNTEANL
        +P+   IS   SSDLD+ES+GSFFH +SITLG+L+G S ++ M +  R ++  ++
Subjt:  LPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRRGNTEANL

AT5G02440.1 unknown protein1.4e-2445.14Show/hide
Query:  MAQQDDGWPLGLRLLNARVGLL---------ENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRRGNTEAN--
        MA Q++GWPLGLR +NAR+G L           +   GSISF++L + SP S   SSDLDS+S GSFF  +S TLG+LIG   SS +ELSRR N   N  
Subjt:  MAQQDDGWPLGLRLLNARVGLL---------ENRNFPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRRGNTEAN--

Query:  LGGDRK---INNFRFKYKPWLFSLCCKLSTD--VVSATR-----------THSLAHFLEVERRRTAATAAARPLP
         G  R      N +  YKPW+FS+C KLST+  V+S  R             SL HFL +ERR   +T  + P P
Subjt:  LGGDRK---INNFRFKYKPWLFSLCCKLSTD--VVSATR-----------THSLAHFLEVERRRTAATAAARPLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGCTGCTTGCAGCCTGCAGGTTGCCACCCCACAAATGGATGTTACTGACCTTTTAACTTGCCCACCACCACGGACGAGGATTGATGCCTGCCCATTCCTG
ATGAAGAATTATCGGGATACGAGATTTGCATTTGATTCTCCATCTTCCTTAATACTTTGGCAGAGTACTCCACCTCCTTTGATATTTCTTAGTATATCAAATCAA
CCACCACAGTCTCAGCCAGTGATGGCTCAACAGGACGATGGATGGCCTTTGGGATTAAGACTGCTAAATGCTAGAGTTGGGTTGCTGGAAAATCGAAACTTTCCT
GGATCAATCTCCTTCAACACTTTGCCTACTGGATCTCCCATCTCCTTCACAGACTCTTCAGATCTTGATTCTGAGTCAAGTGGGTCGTTCTTCCATGCTAAAAGC
ATCACTCTGGGTAGTCTAATTGGTGATTCTACTTCTAGTATCATGGAACTCTCGAGAAGGGGAAACACAGAAGCCAACCTAGGAGGGGACAGGAAGATCAATAAC
TTCAGGTTCAAGTACAAGCCATGGTTGTTTTCACTCTGCTGCAAACTGAGCACCGACGTCGTCAGCGCCACCAGAACTCACTCCCTGGCTCACTTTCTAGAAGTA
GAGAGGAGGAGAACTGCCGCCACTGCCGCTGCCCGCCCCCTGCCAATTGCCGGAAGAGCCAATAATTTGTTAACCGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGCTGCTTGCAGCCTGCAGGTTGCCACCCCACAAATGGATGTTACTGACCTTTTAACTTGCCCACCACCACGGACGAGGATTGATGCCTGCCCATTCCTG
ATGAAGAATTATCGGGATACGAGATTTGCATTTGATTCTCCATCTTCCTTAATACTTTGGCAGAGTACTCCACCTCCTTTGATATTTCTTAGTATATCAAATCAA
CCACCACAGTCTCAGCCAGTGATGGCTCAACAGGACGATGGATGGCCTTTGGGATTAAGACTGCTAAATGCTAGAGTTGGGTTGCTGGAAAATCGAAACTTTCCT
GGATCAATCTCCTTCAACACTTTGCCTACTGGATCTCCCATCTCCTTCACAGACTCTTCAGATCTTGATTCTGAGTCAAGTGGGTCGTTCTTCCATGCTAAAAGC
ATCACTCTGGGTAGTCTAATTGGTGATTCTACTTCTAGTATCATGGAACTCTCGAGAAGGGGAAACACAGAAGCCAACCTAGGAGGGGACAGGAAGATCAATAAC
TTCAGGTTCAAGTACAAGCCATGGTTGTTTTCACTCTGCTGCAAACTGAGCACCGACGTCGTCAGCGCCACCAGAACTCACTCCCTGGCTCACTTTCTAGAAGTA
GAGAGGAGGAGAACTGCCGCCACTGCCGCTGCCCGCCCCCTGCCAATTGCCGGAAGAGCCAATAATTTGTTAACCGGCTGAAACTCTGTTTTCATTAGCCAATAG
CAAAGTTGGTCGTCAGCAAATAGGAAGAGGCCAAGGAGAGTGGTTAATTGATGCCAGATGGGAATGGGAATGGGGTTCTTGGTGTTGTTTTCATCCTACCTTCAG
ACAAATTATTAGGCTGCCACTCATAATATCGTTAGATGTGCCGGTAATGAGATGCCTTTCTGGATCTTCATCACAACTGTTTTGAGATTTTACTCTTATCTTAAG
TCAGGTACTCGAGAACCCTTTACTTTAAATTGTTTGTTACCTCTCTTTTTCTTGCATATTCGAATGGTGAATATTACTTTC
Protein sequenceShow/hide protein sequence
MSAACSLQVATPQMDVTDLLTCPPPRTRIDACPFLMKNYRDTRFAFDSPSSLILWQSTPPPLIFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRNFP
GSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGDSTSSIMELSRRGNTEANLGGDRKINNFRFKYKPWLFSLCCKLSTDVVSATRTHSLAHFLEV
ERRRTAATAAARPLPIAGRANNLLTG