; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc03G07080 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc03G07080
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description101 kDa malaria antigen-like
Genome locationClcChr03:7202934..7204305
RNA-Seq ExpressionClc03G07080
SyntenyClc03G07080
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444445.1 PREDICTED: 101 kDa malaria antigen-like [Cucumis melo]5.9e-6466.52Show/hide
Query:  MRGFSTAKSLVMAINLPKKFLHLLAKGALFLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVV
        MR FS  KS+VM I LPKKFLHL AK ALFL+ FF FF     SLS+D  NHTNFWFFLSNTLIFIIA DSGAFS P +F  AAK   + PQHN N T+V
Subjt:  MRGFSTAKSLVMAINLPKKFLHLLAKGALFLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVV

Query:  LNEPPN-----------EEEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTK--------KEEEENEFAEM
        +NE PN           EEEEEEI IPLTTEISIPP+FNNP + YQRSKSEK+IKR+  KA+KI M+RSK MIR D TSTK        +EEE+NEF +M
Subjt:  LNEPPN-----------EEEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTK--------KEEEENEFAEM

Query:  TNEELNRRVEEFIERFNRQIRLQKTNE
        T+EELNRRVEEFIERFNRQIRLQ+ NE
Subjt:  TNEELNRRVEEFIERFNRQIRLQKTNE

XP_011657186.2 uncharacterized protein LOC105435817 [Cucumis sativus]2.7e-6166.51Show/hide
Query:  FSTAKSLVMAINLPKKFLHLLAKGALFLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVVLNE
        FS  KS+VM I LPKKFL+L AKGALFL+ FF  +     SLS+DL NHTNFWFFLSNTLIF+IA DSGAFS P +F  AAK   S PQHN N T+V+N+
Subjt:  FSTAKSLVMAINLPKKFLHLLAKGALFLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVVLNE

Query:  PPN-----EEEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTKKEEEEN-EFAEMTNEELNRRVEEFIERF
         PN     + EEEE IIPLTTEIS P +FNNP + YQRSKSEK+IKR+ EKA+K+ MRRSK MI+ + TSTK++EEEN EF +MT+EELNRRVEEFIERF
Subjt:  PPN-----EEEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTKKEEEEN-EFAEMTNEELNRRVEEFIERF

Query:  NRQIRLQKTNEDGNENRE
        NRQIRLQ+ NED NE  +
Subjt:  NRQIRLQKTNEDGNENRE

XP_022927241.1 uncharacterized protein LOC111434146 [Cucurbita moschata]6.6e-3960.7Show/hide
Query:  MAINLPKKFLHLLAKGAL----FLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKT--ISSPPQHNLNKTVVLNEPP
        MA+   +KFLHL A+ AL    FLL  FI+FS F LSLST L  HT FWF LSNTLIFIIAA S AFS PPTF AAA +  I +PP +N N     N   
Subjt:  MAINLPKKFLHLLAKGAL----FLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKT--ISSPPQHNLNKTVVLNEPP

Query:  NEEEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTKKEEEE--NEFAEMTNEELNRRVEEFIERFNRQIRL
        N ++++   IPL TEISI    NNPR+SY RSKSEK I+RV  K  KI MRRSK M R+D TST+ E+EE  NE AEM+NEELN++VEEFIERFNRQ+RL
Subjt:  NEEEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTKKEEEE--NEFAEMTNEELNRRVEEFIERFNRQIRL

Query:  Q
        Q
Subjt:  Q

XP_023520396.1 uncharacterized protein LOC111783708 isoform X2 [Cucurbita pepo subsp. pepo]6.2e-3757.07Show/hide
Query:  MAINLPKKFLHLLAKGAL----FLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVVLNEPPNE
        MA+   +KFLHL A+ AL    FLL  FI+FS F LSLST L  HT FWF LSNTLIFIIAA S AFSPPPTF +AA  I +PPQ+N N     N   N 
Subjt:  MAINLPKKFLHLLAKGAL----FLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVVLNEPPNE

Query:  EEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTKKEEEENEFAEMTNEELNRRVEEFIERFNRQIRLQKTN
        ++++   IPL TEISI    NN R+SY  SKSEK ++RV  K  K+ MRRSK M R++     KEE +NE AEM+NEELN+RVEEFIERFNRQ+RLQ   
Subjt:  EEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTKKEEEENEFAEMTNEELNRRVEEFIERFNRQIRLQKTN

Query:  EDGNE
        +   E
Subjt:  EDGNE

XP_038895571.1 uncharacterized protein LOC120083777 [Benincasa hispida]1.3e-7174.34Show/hide
Query:  MRGFSTAKSLVMAINLPKKFLHLLAKGALFLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVV
        MRGFS AKSLVM I LPKKFLHL AK ALFLL FFI+F     SLS++L NHTNFWFFL+NTLIFIIAADSGAFSPP +F AA  T+ +PP+ + NK VV
Subjt:  MRGFSTAKSLVMAINLPKKFLHLLAKGALFLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVV

Query:  LNEPPN------EEEEEEIIIPLTTE-ISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTST-KKEEEENEFAEMTNEELNRRVEE
        + EPPN       EEEEEIII LTTE  SIPP+FNN R+SYQRSKSEKEIKR+ EKA KI M+RSK MIRHD T+T KKEEE++EFAEMTNEELNRRVEE
Subjt:  LNEPPN------EEEEEEIIIPLTTE-ISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTST-KKEEEENEFAEMTNEELNRRVEE

Query:  FIERFNRQIRLQKTNEDGNENRELLI
        FIERFNRQIRLQ+ NE GNENRELLI
Subjt:  FIERFNRQIRLQKTNEDGNENRELLI

TrEMBL top hitse value%identityAlignment
A0A0A0LUF0 Uncharacterized protein1.3e-6166.51Show/hide
Query:  FSTAKSLVMAINLPKKFLHLLAKGALFLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVVLNE
        FS  KS+VM I LPKKFL+L AKGALFL+ FF  +     SLS+DL NHTNFWFFLSNTLIF+IA DSGAFS P +F  AAK   S PQHN N T+V+N+
Subjt:  FSTAKSLVMAINLPKKFLHLLAKGALFLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVVLNE

Query:  PPN-----EEEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTKKEEEEN-EFAEMTNEELNRRVEEFIERF
         PN     + EEEE IIPLTTEIS P +FNNP + YQRSKSEK+IKR+ EKA+K+ MRRSK MI+ + TSTK++EEEN EF +MT+EELNRRVEEFIERF
Subjt:  PPN-----EEEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTKKEEEEN-EFAEMTNEELNRRVEEFIERF

Query:  NRQIRLQKTNEDGNENRE
        NRQIRLQ+ NED NE  +
Subjt:  NRQIRLQKTNEDGNENRE

A0A1S3B9V3 101 kDa malaria antigen-like2.9e-6466.52Show/hide
Query:  MRGFSTAKSLVMAINLPKKFLHLLAKGALFLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVV
        MR FS  KS+VM I LPKKFLHL AK ALFL+ FF FF     SLS+D  NHTNFWFFLSNTLIFIIA DSGAFS P +F  AAK   + PQHN N T+V
Subjt:  MRGFSTAKSLVMAINLPKKFLHLLAKGALFLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVV

Query:  LNEPPN-----------EEEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTK--------KEEEENEFAEM
        +NE PN           EEEEEEI IPLTTEISIPP+FNNP + YQRSKSEK+IKR+  KA+KI M+RSK MIR D TSTK        +EEE+NEF +M
Subjt:  LNEPPN-----------EEEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTK--------KEEEENEFAEM

Query:  TNEELNRRVEEFIERFNRQIRLQKTNE
        T+EELNRRVEEFIERFNRQIRLQ+ NE
Subjt:  TNEELNRRVEEFIERFNRQIRLQKTNE

A0A6J1CEX1 uncharacterized protein LOC1110108731.9e-2045.64Show/hide
Query:  LPKKFLHLLAKG-----ALFLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVVLNEPPNEEEE
        L +++L+L+A+G     A FLL  F + SV  LS    L +   FWF +SNTLIFIIA D GAFSPPP      + I S    N +   ++ E PN+   
Subjt:  LPKKFLHLLAKG-----ALFLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVVLNEPPNEEEE

Query:  EEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSK-MMIRHDMTSTKKEEEENEFAEMTNEELNRRVEEFIERFNRQIRLQ
        E                 N  ++Y+RSKSEK +    EK RKI MRRSK M I+      +  +E+NEF EMT+EELNRRVEEFIERFNR+IRLQ
Subjt:  EEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSK-MMIRHDMTSTKKEEEENEFAEMTNEELNRRVEEFIERFNRQIRLQ

A0A6J1EH50 uncharacterized protein LOC1114341463.2e-3960.7Show/hide
Query:  MAINLPKKFLHLLAKGAL----FLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKT--ISSPPQHNLNKTVVLNEPP
        MA+   +KFLHL A+ AL    FLL  FI+FS F LSLST L  HT FWF LSNTLIFIIAA S AFS PPTF AAA +  I +PP +N N     N   
Subjt:  MAINLPKKFLHLLAKGAL----FLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKT--ISSPPQHNLNKTVVLNEPP

Query:  NEEEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTKKEEEE--NEFAEMTNEELNRRVEEFIERFNRQIRL
        N ++++   IPL TEISI    NNPR+SY RSKSEK I+RV  K  KI MRRSK M R+D TST+ E+EE  NE AEM+NEELN++VEEFIERFNRQ+RL
Subjt:  NEEEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTKKEEEE--NEFAEMTNEELNRRVEEFIERFNRQIRL

Query:  Q
        Q
Subjt:  Q

A0A6J1KIQ0 uncharacterized protein LOC1114955972.1e-3557.07Show/hide
Query:  MAINLPKKFLHLLAKGAL----FLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVVLNEPPNE
        MA+    K LHL A+ AL    FLL  FI+FS F LSLST L  H NFWF LSNTL+ IIAA S AFSPPPTF A    I +PP +N +     +  P E
Subjt:  MAINLPKKFLHLLAKGAL----FLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVVLNEPPNE

Query:  EEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTKKEEEENEFAEMTNEELNRRVEEFIERFNRQIRLQKTN
         +E++  IPL TEI IP E NN R+SY RSKSEK ++RV  K  KI MRRSK M R++     +EE +NE AEM+NEELN+RVEEFIERFNRQIRLQ   
Subjt:  EEEEEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTKKEEEENEFAEMTNEELNRRVEEFIERFNRQIRLQKTN

Query:  EDGNE
         D NE
Subjt:  EDGNE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30190.1 unknown protein9.2e-0732.38Show/hide
Query:  LFLLIF--FIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPT------FTAAAKTISSPP-----------QHNLNKTVVLN----EPP
        +FL IF   + F VF++SLS+ +   T   FF+SNTLI IIAAD G+FS   +      +T AA T+ +             + N     + N    E  
Subjt:  LFLLIF--FIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPT------FTAAAKTISSPP-----------QHNLNKTVVLN----EPP

Query:  NEEEEEEIII-------------------------------PLTTEISIPPE------FNNPRRSYQRSKSEK-EIKRVG---EKARKIMMRRSKMMIRH
        N EEE+E ++                               P+T +     E        NP + Y RSKS+K   KR+    E  ++    R K     
Subjt:  NEEEEEEIII-------------------------------PLTTEISIPPE------FNNPRRSYQRSKSEK-EIKRVG---EKARKIMMRRSKMMIRH

Query:  DMTSTKK----EEEENEFAEMTNEELNRRVEEFIERFNRQIRLQ
         M   +K    +EE  EF++++NEELN+RVEEFI+RFNRQIR Q
Subjt:  DMTSTKK----EEEENEFAEMTNEELNRRVEEFIERFNRQIRLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGGATTTTCAACAGCAAAATCCTTAGTAATGGCCATAAACTTACCCAAAAAGTTCCTCCATTTACTTGCAAAGGGAGCTCTGTTTCTTCTCATCTTCTTCATTTT
CTTCTCTGTTTTCAAACTTTCTCTTTCAACTGATCTCTCCAACCACACCAACTTCTGGTTTTTCCTTTCCAACACTCTCATTTTCATCATTGCCGCCGATTCCGGCGCTT
TCTCTCCGCCCCCCACCTTCACCGCCGCAGCCAAAACCATCTCATCTCCTCCACAACATAACCTTAATAAGACTGTCGTCCTTAATGAACCTCCTAATGAAGAAGAAGAA
GAAGAAATAATAATCCCACTCACCACAGAAATTTCTATTCCTCCTGAGTTTAATAATCCAAGAAGATCCTACCAACGAAGCAAGTCAGAGAAAGAGATAAAAAGAGTAGG
AGAGAAAGCAAGAAAAATTATGATGAGGAGATCCAAGATGATGATACGACACGATATGACATCGACAAAAAAGGAGGAAGAAGAAAATGAGTTTGCTGAGATGACAAATG
AAGAATTGAACAGAAGAGTTGAAGAGTTTATTGAAAGATTCAACAGACAAATAAGACTCCAAAAAACTAATGAAGATGGCAATGAGAATAGAGAGCTTTTAATTTAA
mRNA sequenceShow/hide mRNA sequence
CCTTGTGTTTGTTTGTTGATATATATTTGATGAGGGGATTTTCAACAGCAAAATCCTTAGTAATGGCCATAAACTTACCCAAAAAGTTCCTCCATTTACTTGCAAAGGGA
GCTCTGTTTCTTCTCATCTTCTTCATTTTCTTCTCTGTTTTCAAACTTTCTCTTTCAACTGATCTCTCCAACCACACCAACTTCTGGTTTTTCCTTTCCAACACTCTCAT
TTTCATCATTGCCGCCGATTCCGGCGCTTTCTCTCCGCCCCCCACCTTCACCGCCGCAGCCAAAACCATCTCATCTCCTCCACAACATAACCTTAATAAGACTGTCGTCC
TTAATGAACCTCCTAATGAAGAAGAAGAAGAAGAAATAATAATCCCACTCACCACAGAAATTTCTATTCCTCCTGAGTTTAATAATCCAAGAAGATCCTACCAACGAAGC
AAGTCAGAGAAAGAGATAAAAAGAGTAGGAGAGAAAGCAAGAAAAATTATGATGAGGAGATCCAAGATGATGATACGACACGATATGACATCGACAAAAAAGGAGGAAGA
AGAAAATGAGTTTGCTGAGATGACAAATGAAGAATTGAACAGAAGAGTTGAAGAGTTTATTGAAAGATTCAACAGACAAATAAGACTCCAAAAAACTAATGAAGATGGCA
ATGAGAATAGAGAGCTTTTAATTTAATTTGTTGTCTATATGTGTAATATGCATTAAGTATTTTTGTTTTGTTTTTAGTTAGTGTGTGTTGTCTTCTTCCTCACTTGTCTC
TTTCCAAATAGCTCTAGTTTTTATTTTCTTCTTTAGTGGTGATAACTGATAATGCTTTTTCATGTATGGACCTGTCTTTATCCACAATCCACATATATGAATCATCTTCC
TTCTC
Protein sequenceShow/hide protein sequence
MRGFSTAKSLVMAINLPKKFLHLLAKGALFLLIFFIFFSVFKLSLSTDLSNHTNFWFFLSNTLIFIIAADSGAFSPPPTFTAAAKTISSPPQHNLNKTVVLNEPPNEEEE
EEIIIPLTTEISIPPEFNNPRRSYQRSKSEKEIKRVGEKARKIMMRRSKMMIRHDMTSTKKEEEENEFAEMTNEELNRRVEEFIERFNRQIRLQKTNEDGNENRELLI