; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019433 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019433
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description101 kDa malaria antigen-like
Genome locationtig00153347:578259..578960
RNA-Seq ExpressionSgr019433
SyntenySgr019433
Gene Ontology termsNA
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583890.1 hypothetical protein SDJN03_19822, partial [Cucurbita argyrosperma subsp. sororia]3.5e-3047.87Show/hide
Query:  LHLFAEGGPSFFAFILSIFIYVSVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENEEVIVQERQNRED
        LHL+AE    F AF+LS+FIY S FNLSL TLF  T FWFC+SNTL+FIIAA   AFSPP H +          N+A     P Q N             
Subjt:  LHLFAEGGPSFFAFILSIFIYVSVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENEEVIVQERQNRED

Query:  KLQIIIAGPANSGKPSEDIPQ-----TRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTMIRS------HEEKENRDQFAEMTDEELNRRVEEFIE
                  NSG P+E+  Q     T  S   P K+Y RSKSEKA R V KE  I MR SKTM R+      +E++E + + AEM++EELN+RVEEFIE
Subjt:  KLQIIIAGPANSGKPSEDIPQ-----TRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTMIRS------HEEKENRDQFAEMTDEELNRRVEEFIE

Query:  RFNRQIRLQGI
        RFNRQ++LQ I
Subjt:  RFNRQIRLQGI

KAG7019508.1 hypothetical protein SDJN02_18469, partial [Cucurbita argyrosperma subsp. argyrosperma]1.6e-3048.34Show/hide
Query:  LHLFAEGGPSFFAFILSIFIYVSVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENEEVIVQERQNRED
        LHL+AE    F AF+LS+FIY S FNLSL TLF  T FWFC+SNTL+FIIAA   AFSPP H +          N+A     P Q N             
Subjt:  LHLFAEGGPSFFAFILSIFIYVSVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENEEVIVQERQNRED

Query:  KLQIIIAGPANSGKPSEDIPQ-----TRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTMIRS------HEEKENRDQFAEMTDEELNRRVEEFIE
                  NSG P+E+  Q     T  S   P K+Y RSKSEKA R V KE  I MR SKTM R+      +E++E + + AEM++EELN+RVEEFIE
Subjt:  KLQIIIAGPANSGKPSEDIPQ-----TRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTMIRS------HEEKENRDQFAEMTDEELNRRVEEFIE

Query:  RFNRQIRLQGI
        RFNRQ+RLQ I
Subjt:  RFNRQIRLQGI

XP_022140144.1 uncharacterized protein LOC111010873 [Momordica charantia]3.9e-3753.88Show/hide
Query:  MANK--AKDYLHLFAEGGPSFFA-FILSIFIYVSVFNLS-LPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENE
        MAN   A++YL+L AEG PSFFA F+LSIF YVSV NLS LPTLF+D KFWFCISNTL+FIIA D GAFSPP  ++ +        N+A K         
Subjt:  MANK--AKDYLHLFAEGGPSFFA-FILSIFIYVSVFNLS-LPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENE

Query:  EVIVQERQNREDKLQIIIAGPANSGKPSEDIPQTRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTM-IRSHEEKENR----DQFAEMTDEELNRR
          IV+ER                   P++   +TR      AKTYRRSKSEKA   VEKER I MR SKTM I+ H+E+E      ++F EMTDEELNRR
Subjt:  EVIVQERQNREDKLQIIIAGPANSGKPSEDIPQTRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTM-IRSHEEKENR----DQFAEMTDEELNRR

Query:  VEEFIERFNRQIRLQGIEH
        VEEFIERFNR+IRLQ +E+
Subjt:  VEEFIERFNRQIRLQGIEH

XP_022927241.1 uncharacterized protein LOC111434146 [Cucurbita moschata]2.0e-3046.58Show/hide
Query:  KDYLHLFAEGGPSFFAFILSIFIYVSVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENEEVIVQERQN
        + +LHL+AE    FFAF+LS+FIY S FNLSL TLF  T FWFC+SNTL+FIIAA   AFS P               NAA +S                
Subjt:  KDYLHLFAEGGPSFFAFILSIFIYVSVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENEEVIVQERQN

Query:  REDKLQIIIAGPA----NSGKPSEDIPQ------TRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTMIRS------HEEKENRDQFAEMTDEELN
              +II  P     NSG P+E+  Q      T  S   P K+Y RSKSEKA R V KE  I MR SKTM R+      +E++E +++ AEM++EELN
Subjt:  REDKLQIIIAGPA----NSGKPSEDIPQ------TRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTMIRS------HEEKENRDQFAEMTDEELN

Query:  RRVEEFIERFNRQIRLQGI
        ++VEEFIERFNRQ+RLQ I
Subjt:  RRVEEFIERFNRQIRLQGI

XP_023520396.1 uncharacterized protein LOC111783708 isoform X2 [Cucurbita pepo subsp. pepo]1.5e-2846.89Show/hide
Query:  KDYLHLFAEGGPSFFAFILSIFIYVSVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENEEVIVQERQN
        + +LHL+AE    F AF+LS+FIY S FNLSL TLF  T FWFC+SNTL+FIIAA   AFSPP    P  +        A     P Q N          
Subjt:  KDYLHLFAEGGPSFFAFILSIFIYVSVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENEEVIVQERQN

Query:  REDKLQIIIAGPANSGKPSEDIPQ------TRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTMIRSHEEKENRDQFAEMTDEELNRRVEEFIERF
                     NSG P+E+  Q      T  S     K+Y  SKSEKA R V KE  + MR SKTM R +E++E +++ AEM++EELN+RVEEFIERF
Subjt:  REDKLQIIIAGPANSGKPSEDIPQ------TRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTMIRSHEEKENRDQFAEMTDEELNRRVEEFIERF

Query:  NRQIRLQGI
        NRQ+RLQ I
Subjt:  NRQIRLQGI

TrEMBL top hitse value%identityAlignment
A0A6J1CEX1 uncharacterized protein LOC1110108731.9e-3753.88Show/hide
Query:  MANK--AKDYLHLFAEGGPSFFA-FILSIFIYVSVFNLS-LPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENE
        MAN   A++YL+L AEG PSFFA F+LSIF YVSV NLS LPTLF+D KFWFCISNTL+FIIA D GAFSPP  ++ +        N+A K         
Subjt:  MANK--AKDYLHLFAEGGPSFFA-FILSIFIYVSVFNLS-LPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENE

Query:  EVIVQERQNREDKLQIIIAGPANSGKPSEDIPQTRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTM-IRSHEEKENR----DQFAEMTDEELNRR
          IV+ER                   P++   +TR      AKTYRRSKSEKA   VEKER I MR SKTM I+ H+E+E      ++F EMTDEELNRR
Subjt:  EVIVQERQNREDKLQIIIAGPANSGKPSEDIPQTRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTM-IRSHEEKENR----DQFAEMTDEELNRR

Query:  VEEFIERFNRQIRLQGIEH
        VEEFIERFNR+IRLQ +E+
Subjt:  VEEFIERFNRQIRLQGIEH

A0A6J1EH50 uncharacterized protein LOC1114341469.9e-3146.58Show/hide
Query:  KDYLHLFAEGGPSFFAFILSIFIYVSVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENEEVIVQERQN
        + +LHL+AE    FFAF+LS+FIY S FNLSL TLF  T FWFC+SNTL+FIIAA   AFS P               NAA +S                
Subjt:  KDYLHLFAEGGPSFFAFILSIFIYVSVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENEEVIVQERQN

Query:  REDKLQIIIAGPA----NSGKPSEDIPQ------TRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTMIRS------HEEKENRDQFAEMTDEELN
              +II  P     NSG P+E+  Q      T  S   P K+Y RSKSEKA R V KE  I MR SKTM R+      +E++E +++ AEM++EELN
Subjt:  REDKLQIIIAGPA----NSGKPSEDIPQ------TRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTMIRS------HEEKENRDQFAEMTDEELN

Query:  RRVEEFIERFNRQIRLQGI
        ++VEEFIERFNRQ+RLQ I
Subjt:  RRVEEFIERFNRQIRLQGI

A0A6J1KIQ0 uncharacterized protein LOC1114955973.0e-2748.76Show/hide
Query:  LHLFAEGGPSFFAFILSIFIYVSVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENEEVIVQERQNRED
        LHL+AE    F AF+LS+FIY S FNLSL TLF    FWFC+SNTLV IIAA   AFSPP               NA     P   N        +N+E 
Subjt:  LHLFAEGGPSFFAFILSIFIYVSVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENEEVIVQERQNRED

Query:  KLQIIIAGPANSGKPSED-IPQTRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTMIRSHEEKENRDQFAEMTDEELNRRVEEFIERFNRQIRLQG
        ++ +          P+E  IP    +++   K+Y RSKSEKA R V KE  I MR SKTM R +EE+E +++ AEM++EELN+RVEEFIERFNRQIRLQ 
Subjt:  KLQIIIAGPANSGKPSED-IPQTRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTMIRSHEEKENRDQFAEMTDEELNRRVEEFIERFNRQIRLQG

Query:  I
        I
Subjt:  I

A0A6P3ZKV5 uncharacterized protein LOC1074125093.1e-2442.13Show/hide
Query:  GPSFFAFILSIFIYVS---VFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKT----SNP-----------------
        G SFFAFI SIFIY+S   +FNLS  T+FN+TKFWF ISNTL+ IIA D G+FS  K K  + +  Y R + A  T    S+P                 
Subjt:  GPSFFAFILSIFIYVS---VFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKT----SNP-----------------

Query:  ----KQENE---EVI----VQERQNREDKLQII-------IAGPANSGKPSEDIPQTR----------------GSTKIPAKTYRRSKSEKAKRGVEKER
            KQE E   +VI      + +N ++KLQI+       +    +  KPSEDI + R                G     AKTYRRSKSEKAKR V  ER
Subjt:  ----KQENE---EVI----VQERQNREDKLQII-------IAGPANSGKPSEDIPQTR----------------GSTKIPAKTYRRSKSEKAKRGVEKER

Query:  M-ITMRSSKTMIRSHEEKEN--RDQFAEMTDEELNRRVEEFIERFNRQIRLQGI
          I +R S+T      E ++   ++F+ M++EELNRRVEEFI++FNRQIRLQ +
Subjt:  M-ITMRSSKTMIRSHEEKEN--RDQFAEMTDEELNRRVEEFIERFNRQIRLQGI

A0A7N2LQF9 Uncharacterized protein6.7e-2748.13Show/hide
Query:  GPSFFAFILSIFIYVSV---FNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHK------------LPNSHGGYGRRNNAAKTSNPK------QE
        G SF+A + SIFIY+SV   FNLS   LF +TKFWF +SNTL+ IIA D GA+S    K            + N      +     K S PK      QE
Subjt:  GPSFFAFILSIFIYVSV---FNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHK------------LPNSHGGYGRRNNAAKTSNPK------QE

Query:  NEEVIVQERQ-NREDKLQIIIAGPANSGKPSEDIPQTRGSTKIPAKTYRRSKSEKAKRGV-EKERMITMRSSKTMIRSHEEKENRDQFAEMTDEELNRRV
          EVIVQE Q   E  LQ++I   ++S KPSED+ +     KI AKTYRRSKSE+AKR V ++ + I  RSS+T      EK   ++F+ M+DEELNRRV
Subjt:  NEEVIVQERQ-NREDKLQIIIAGPANSGKPSEDIPQTRGSTKIPAKTYRRSKSEKAKRGV-EKERMITMRSSKTMIRSHEEKENRDQFAEMTDEELNRRV

Query:  EEFIERFNRQIRLQ
        EEFI+RFNR+IRLQ
Subjt:  EEFIERFNRQIRLQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30190.1 unknown protein1.0e-1130.65Show/hide
Query:  SFFAFILSIFIYV---SVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHK----------------------LP------NSHGGYGRRNNAA
        S     L IF Y+    VF +SL ++F DTK  F ISNTL+ IIAAD G+FS  + +                      +P      N+  G  +     
Subjt:  SFFAFILSIFIYV---SVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHK----------------------LP------NSHGGYGRRNNAA

Query:  KTSNPKQENEEV---------------IVQERQNREDKLQIIIAGPANSGKPSEDIPQTRGSTKIPAKTYRRSKSEKAKR-GVEKERMITMRSSKTMIRS
        +  NP++E+E +               +V E++ R+D               SE+   TR     P K Y RSKS+K +R  +  +   T R S    +S
Subjt:  KTSNPKQENEEV---------------IVQERQNREDKLQIIIAGPANSGKPSEDIPQTRGSTKIPAKTYRRSKSEKAKR-GVEKERMITMRSSKTMIRS

Query:  ------------HEEKENRDQFAEMTDEELNRRVEEFIERFNRQIRLQ
                       KE  ++F+++++EELN+RVEEFI+RFNRQIR Q
Subjt:  ------------HEEKENRDQFAEMTDEELNRRVEEFIERFNRQIRLQ

AT2G34610.1 unknown protein3.3e-1031.44Show/hide
Query:  SFFAFILSIFIYVSVF---NLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENEEV---IVQERQNREDKLQ
        S    ILSIF Y+ +F   ++S  ++FNDTK  F ISN L+ IIAAD GAF+  ++   + +G Y          NP+ E       + +E +NRE +  
Subjt:  SFFAFILSIFIYVSVF---NLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQENEEV---IVQERQNREDKLQ

Query:  IIIAG-------PANSGKPSEDIPQT--------------------------------RGSTKIPAKTYRRSKSEKAKRGV-EKERM---ITMR------
         + A        P    K +E I Q                                   S  + +K Y RSKS+KA+  V  KER    I  R      
Subjt:  IIIAG-------PANSGKPSEDIPQT--------------------------------RGSTKIPAKTYRRSKSEKAKRGV-EKERM---ITMR------

Query:  ----SSKTMI-----RSHEE-----------KENRDQFAEMTDEELNRRVEEFIERFNRQIRLQ
            SSK M+     ++ EE           KE  ++F++M++EELNRRVE+FI+RFNR I+ Q
Subjt:  ----SSKTMI-----RSHEE-----------KENRDQFAEMTDEELNRRVEEFIERFNRQIRLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCCTTCTTCACCACCAAATCCAGTAGCTGCAGCTGAATCATTAATGGCGAACAAAGCCAAAGACTACCTCCATTTATTTGCAGAGGGAGGTCCTTCCTTCTTCGC
CTTCATTCTCTCCATTTTCATTTACGTCTCTGTCTTCAACCTCTCTCTTCCAACTCTCTTCAACGACACCAAGTTCTGGTTTTGCATCTCCAACACCCTCGTTTTCATCA
TTGCCGCCGATTGCGGAGCTTTCTCTCCACCCAAACACAAACTACCGAACTCCCATGGCGGATATGGCCGGAGAAACAACGCCGCCAAAACGAGCAACCCCAAGCAAGAA
AACGAAGAAGTGATCGTTCAAGAGAGACAAAACCGAGAAGACAAATTGCAAATCATCATCGCAGGACCTGCTAATTCGGGCAAACCAAGTGAAGATATCCCACAGACGAG
AGGTTCGACGAAGATTCCGGCGAAAACTTACCGGCGAAGCAAGTCGGAGAAAGCGAAAAGAGGGGTGGAGAAGGAGAGGATGATCACCATGAGGAGCTCAAAGACGATGA
TAAGATCACACGAAGAAAAAGAGAATCGTGATCAGTTTGCAGAGATGACAGATGAAGAACTGAACAGAAGAGTTGAAGAGTTTATTGAAAGATTCAACAGACAGATTCGA
CTTCAAGGTATCGAACACATGGCGATGGGGCGTAGATTATAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCCCTTCTTCACCACCAAATCCAGTAGCTGCAGCTGAATCATTAATGGCGAACAAAGCCAAAGACTACCTCCATTTATTTGCAGAGGGAGGTCCTTCCTTCTTCGC
CTTCATTCTCTCCATTTTCATTTACGTCTCTGTCTTCAACCTCTCTCTTCCAACTCTCTTCAACGACACCAAGTTCTGGTTTTGCATCTCCAACACCCTCGTTTTCATCA
TTGCCGCCGATTGCGGAGCTTTCTCTCCACCCAAACACAAACTACCGAACTCCCATGGCGGATATGGCCGGAGAAACAACGCCGCCAAAACGAGCAACCCCAAGCAAGAA
AACGAAGAAGTGATCGTTCAAGAGAGACAAAACCGAGAAGACAAATTGCAAATCATCATCGCAGGACCTGCTAATTCGGGCAAACCAAGTGAAGATATCCCACAGACGAG
AGGTTCGACGAAGATTCCGGCGAAAACTTACCGGCGAAGCAAGTCGGAGAAAGCGAAAAGAGGGGTGGAGAAGGAGAGGATGATCACCATGAGGAGCTCAAAGACGATGA
TAAGATCACACGAAGAAAAAGAGAATCGTGATCAGTTTGCAGAGATGACAGATGAAGAACTGAACAGAAGAGTTGAAGAGTTTATTGAAAGATTCAACAGACAGATTCGA
CTTCAAGGTATCGAACACATGGCGATGGGGCGTAGATTATAA
Protein sequenceShow/hide protein sequence
MSPSSPPNPVAAAESLMANKAKDYLHLFAEGGPSFFAFILSIFIYVSVFNLSLPTLFNDTKFWFCISNTLVFIIAADCGAFSPPKHKLPNSHGGYGRRNNAAKTSNPKQE
NEEVIVQERQNREDKLQIIIAGPANSGKPSEDIPQTRGSTKIPAKTYRRSKSEKAKRGVEKERMITMRSSKTMIRSHEEKENRDQFAEMTDEELNRRVEEFIERFNRQIR
LQGIEHMAMGRRL