; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017796 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017796
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTPD1 protein-like 1
Genome locationtig00153056:123098..133075
RNA-Seq ExpressionSgr017796
SyntenySgr017796
Gene Ontology termsGO:0001709 - cell fate determination (biological process)
InterPro domainsIPR040361 - Tapetum determinant 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8721733.1 hypothetical protein F3Y22_tig00015331pilonHSYRG00009 [Hibiscus syriacus]6.2e-3742.11Show/hide
Query:  LVNNGNCQSCALIDIIINQIPTGGYVGGKQEWRVTITNRCICSQYNVKLDCNGFQSSEAIDPSILAITDSVCLLNSGHPISRKDLITFTYAFAWDKSFP-
        LV+ G C  C+L DI+I  + TG  + GK EW+VT+TN   C+Q+ +++ C GFQ+ EA+DPSI       CL+N G+ ++    ++F+Y F     FP 
Subjt:  LVNNGNCQSCALIDIIINQIPTGGYVGGKQEWRVTITNRCICSQYNVKLDCNGFQSSEAIDPSILAITDSVCLLNSGHPISRKDLITFTYAFAWDKSFP-

Query:  FEVISTQISCNCQCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGSVCLVNNGLPIFQSNPISFTYAWDNPF
          V+    S    C+L DI +    TG++ QG  EWK TI+NNC CSQ+ LKL CK F++VE +DPSI    G  CLV  G  +     +SF YAWD PF
Subjt:  FEVISTQISCNCQCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGSVCLVNNGLPIFQSNPISFTYAWDNPF

Query:  PFKPLFSQI
           P  S +
Subjt:  PFKPLFSQI

KGN60075.1 hypothetical protein Csa_001952 [Cucumis sativus]8.0e-3770.59Show/hide
Query:  NCQCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGSVCLVNNGLPIFQSNPISFTYAWDNPFPFKPLFSQIA
        NC+C++ DIEISQ+TTG +  GK EW+ATI N C+CSQYS+K DC  F TVE +D SIL +AGSVCLVNNG PIF S+PISFTYAWDN FPF PLFSQ+A
Subjt:  NCQCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGSVCLVNNGLPIFQSNPISFTYAWDNPFPFKPLFSQIA

Query:  CS
        CS
Subjt:  CS

RDX72768.1 TPD1 protein-like 1, partial [Mucuna pruriens]8.8e-5239.72Show/hide
Query:  KLFCALFFFCLLCKGN-CQCSVDQITVSQTKTGAHVLGKPEWRATITNHCACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSGNCQCSLDD-----
        KL   + F  L+ +G+   C ++ I++ Q+KTG    G PEW+ +I++ CAC+   + L+C G+QT E VDP+IL++SG +CL+ +G    S        
Subjt:  KLFCALFFFCLLCKGN-CQCSVDQITVSQTKTGAHVLGKPEWRATITNHCACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSGNCQCSLDD-----

Query:  ---------------LAISQTTTGSEVQGKEEWRATITNNCVCSQYSVKFDCNGFETVESVDPSILMVAGSVCLVNNGNCQSCALIDIIINQIPTGGYVG
                       + + Q  TG+   G  EW+  +T+NC+C+   VK +CN F+T  +VDPSIL V+       +  C      ++II Q  TG    
Subjt:  ---------------LAISQTTTGSEVQGKEEWRATITNNCVCSQYSVKFDCNGFETVESVDPSILMVAGSVCLVNNGNCQSCALIDIIINQIPTGGYVG

Query:  GKQEWRVTITNRCICSQYNVKLDCNGFQSSEAIDPSILAITDSVCLLNSGHPISRKDLITFTYAFAWDKSFPFEVISTQISC
        GK EW+V+ITN C CSQ  VKL C+GFQ++EA+DPSIL I+D +CLLN G PIS    + F Y  AWD  FPF + S+Q+ C
Subjt:  GKQEWRVTITNRCICSQYNVKLDCNGFQSSEAIDPSILAITDSVCLLNSGHPISRKDLITFTYAFAWDKSFPFEVISTQISC

XP_038881121.1 uncharacterized protein At1g05835-like [Benincasa hispida]1.2e-2961.82Show/hide
Query:  VISTQISCNCQCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGSVCLVNNGLPIFQSNPISFTYAWDNPFPF
        V+S   + NCQC L +I ISQ  TG Q  G  EWK TISNNC CSQ  +K+DCK F+T E+IDPSILA+  + CLVNNGLPIF SNPI+F+YA D  F F
Subjt:  VISTQISCNCQCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGSVCLVNNGLPIFQSNPISFTYAWDNPFPF

Query:  KPLFSQIACS
        KP+ SQI+CS
Subjt:  KPLFSQIACS

XP_038896903.1 TPD1 protein homolog 1-like [Benincasa hispida]1.3e-3469Show/hide
Query:  QCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGSVCLVNNGLPIFQSNPISFTYAWDNPFPFKPLFSQIACS
        QC++ DI ISQ+TTG +  GK EW+ATI+NNC+CSQYS+K DC  F +VE +D SIL +AGS CLVNNG P+F+SNPISFTYAWDN F FKPLFSQ+ACS
Subjt:  QCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGSVCLVNNGLPIFQSNPISFTYAWDNPFPFKPLFSQIACS

TrEMBL top hitse value%identityAlignment
A0A0A0LDN0 Uncharacterized protein3.9e-3770.59Show/hide
Query:  NCQCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGSVCLVNNGLPIFQSNPISFTYAWDNPFPFKPLFSQIA
        NC+C++ DIEISQ+TTG +  GK EW+ATI N C+CSQYS+K DC  F TVE +D SIL +AGSVCLVNNG PIF S+PISFTYAWDN FPF PLFSQ+A
Subjt:  NCQCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGSVCLVNNGLPIFQSNPISFTYAWDNPFPFKPLFSQIA

Query:  CS
        CS
Subjt:  CS

A0A0A0LEA1 Uncharacterized protein4.8e-3573.53Show/hide
Query:  NCQCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGSVCLVNNGLPIFQSNPISFTYAWDNPFPFKPLFSQIA
        NC+C L DI ISQ TTGS  QGK  WKATI+NNCIC Q SLKLDC  F TV+ +DPSILA++GSVCLVN G PIFQS PISFTYA DN FPFKPL SQI+
Subjt:  NCQCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGSVCLVNNGLPIFQSNPISFTYAWDNPFPFKPLFSQIA

Query:  CS
        CS
Subjt:  CS

A0A371F3A1 TPD1 protein-like 1 (Fragment)4.3e-5239.72Show/hide
Query:  KLFCALFFFCLLCKGN-CQCSVDQITVSQTKTGAHVLGKPEWRATITNHCACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSGNCQCSLDD-----
        KL   + F  L+ +G+   C ++ I++ Q+KTG    G PEW+ +I++ CAC+   + L+C G+QT E VDP+IL++SG +CL+ +G    S        
Subjt:  KLFCALFFFCLLCKGN-CQCSVDQITVSQTKTGAHVLGKPEWRATITNHCACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSGNCQCSLDD-----

Query:  ---------------LAISQTTTGSEVQGKEEWRATITNNCVCSQYSVKFDCNGFETVESVDPSILMVAGSVCLVNNGNCQSCALIDIIINQIPTGGYVG
                       + + Q  TG+   G  EW+  +T+NC+C+   VK +CN F+T  +VDPSIL V+       +  C      ++II Q  TG    
Subjt:  ---------------LAISQTTTGSEVQGKEEWRATITNNCVCSQYSVKFDCNGFETVESVDPSILMVAGSVCLVNNGNCQSCALIDIIINQIPTGGYVG

Query:  GKQEWRVTITNRCICSQYNVKLDCNGFQSSEAIDPSILAITDSVCLLNSGHPISRKDLITFTYAFAWDKSFPFEVISTQISC
        GK EW+V+ITN C CSQ  VKL C+GFQ++EA+DPSIL I+D +CLLN G PIS    + F Y  AWD  FPF + S+Q+ C
Subjt:  GKQEWRVTITNRCICSQYNVKLDCNGFQSSEAIDPSILAITDSVCLLNSGHPISRKDLITFTYAFAWDKSFPFEVISTQISC

A0A6A3BYG4 Uncharacterized protein3.0e-3742.11Show/hide
Query:  LVNNGNCQSCALIDIIINQIPTGGYVGGKQEWRVTITNRCICSQYNVKLDCNGFQSSEAIDPSILAITDSVCLLNSGHPISRKDLITFTYAFAWDKSFP-
        LV+ G C  C+L DI+I  + TG  + GK EW+VT+TN   C+Q+ +++ C GFQ+ EA+DPSI       CL+N G+ ++    ++F+Y F     FP 
Subjt:  LVNNGNCQSCALIDIIINQIPTGGYVGGKQEWRVTITNRCICSQYNVKLDCNGFQSSEAIDPSILAITDSVCLLNSGHPISRKDLITFTYAFAWDKSFP-

Query:  FEVISTQISCNCQCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGSVCLVNNGLPIFQSNPISFTYAWDNPF
          V+    S    C+L DI +    TG++ QG  EWK TI+NNC CSQ+ LKL CK F++VE +DPSI    G  CLV  G  +     +SF YAWD PF
Subjt:  FEVISTQISCNCQCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGSVCLVNNGLPIFQSNPISFTYAWDNPF

Query:  PFKPLFSQI
           P  S +
Subjt:  PFKPLFSQI

A0A6J1C8A1 uncharacterized protein LOC111008881 isoform X21.8e-2971.43Show/hide
Query:  MAVPMKLFCALFFFCLLCKGNCQCSVDQITVSQTKTGAHVLGKPEWRATITNHCACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSG
        MAVPMKLFC L  F LL KGNCQCSV  I VSQ+ TG+ VLGK +WR TI+N+C CSQLS+LLDC+GYQTVE+VDPAIL  + NVCL NSG
Subjt:  MAVPMKLFCALFFFCLLCKGNCQCSVDQITVSQTKTGAHVLGKPEWRATITNHCACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSG

SwissProt top hitse value%identityAlignment
Q1G3T1 TPD1 protein homolog 19.3e-0426.73Show/hide
Query:  CTLEDIEISQNTTGSQAQGKLEWKATISNNCI--CSQYSLKLDCKTFKTVENIDPSIL-AIAGSVCLVNNGLPIFQSNPISFTYAWDNPFPFKPLFSQIA
        C+ +DI + Q +T     G   +   I N+C+  C+   + + C  F +V  ++P +   +    CLVN+G P+     +SF YA  N F +    + ++
Subjt:  CTLEDIEISQNTTGSQAQGKLEWKATISNNCI--CSQYSLKLDCKTFKTVENIDPSIL-AIAGSVCLVNNGLPIFQSNPISFTYAWDNPFPFKPLFSQIA

Query:  C
        C
Subjt:  C

Arabidopsis top hitse value%identityAlignment
AT1G32583.1 FUNCTIONS IN: molecular_function unknown6.6e-0526.73Show/hide
Query:  CTLEDIEISQNTTGSQAQGKLEWKATISNNCI--CSQYSLKLDCKTFKTVENIDPSIL-AIAGSVCLVNNGLPIFQSNPISFTYAWDNPFPFKPLFSQIA
        C+ +DI + Q +T     G   +   I N+C+  C+   + + C  F +V  ++P +   +    CLVN+G P+     +SF YA  N F +    + ++
Subjt:  CTLEDIEISQNTTGSQAQGKLEWKATISNNCI--CSQYSLKLDCKTFKTVENIDPSIL-AIAGSVCLVNNGLPIFQSNPISFTYAWDNPFPFKPLFSQIA

Query:  C
        C
Subjt:  C

AT4G32090.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein3.3e-1234.07Show/hide
Query:  MAVPMKLFCALFFFCLLCKGNCQCSVDQITVSQTKTGAHVLGKPEWRATITNHCACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSG
        MA  +K F  +    ++  G C C+  +I +   +TG  + G+PEW+ T+ N C C Q  V L C G+   + V P +L   GN CL+  G
Subjt:  MAVPMKLFCALFFFCLLCKGNCQCSVDQITVSQTKTGAHVLGKPEWRATITNHCACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSG

AT4G32100.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein9.9e-0931.46Show/hide
Query:  KLFCALFFFCLLCK--GNCQCSVDQITVSQTKTGAHVLGKPEWRATITNHCACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSGN
        K  C +  F  + +  G+   S++ ++V Q+KTG  V  KPEW   + N   C      L C+ +++V  +D  +L+ SG+ CLL +G+
Subjt:  KLFCALFFFCLLCK--GNCQCSVDQITVSQTKTGAHVLGKPEWRATITNHCACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSGN

AT4G32105.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein5.1e-1338.64Show/hide
Query:  KLFCALFFFCLLCKGNCQCSVDQITVSQTKTGAHVLGKPEWRATITNHC-ACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSGN
        K+ C + F   + +G   C ++ ++V Q+KTG  V  KPEW   +TN C  C      L C+G+Q+V  V  ++L+ SG++CLLN+GN
Subjt:  KLFCALFFFCLLCKGNCQCSVDQITVSQTKTGAHVLGKPEWRATITNHC-ACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSGN

AT4G32110.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein1.9e-1237.93Show/hide
Query:  KLFCALFFFCLLCKGNCQCSVDQITVSQTKTGAHVLGKPEWRATITNHC-ACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSG
        KL C + F   + +G   CS++ ++V Q+KTG  V  KPEW   +TN C  C   +  L C+G+ +V  +D ++L  SG+ CL+N+G
Subjt:  KLFCALFFFCLLCKGNCQCSVDQITVSQTKTGAHVLGKPEWRATITNHC-ACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGTTCCCATGAAGCTTTTCTGTGCACTTTTCTTCTTTTGTCTCCTCTGCAAAGGGAATTGCCAGTGCTCTGTGGATCAAATTACAGTCAGCCAAACTAAAACTGG
AGCTCATGTGCTGGGCAAGCCAGAATGGAGAGCTACAATCACAAACCATTGTGCTTGTTCCCAATTAAGTGTGCTGTTGGATTGCATTGGATATCAGACAGTAGAGGATG
TTGATCCAGCCATCTTGGCCATTTCAGGGAATGTGTGCTTGTTAAATAGTGGCAATTGCCAATGCTCCTTGGATGACCTAGCAATCAGCCAAACTACAACTGGATCTGAA
GTGCAGGGCAAGGAAGAATGGAGAGCTACAATCACAAACAACTGTGTTTGTTCCCAATACAGTGTGAAGTTTGATTGCAATGGATTTGAGACAGTGGAGAGTGTTGATCC
ATCCATTTTAATGGTTGCAGGCTCTGTGTGTTTGGTTAATAATGGGAATTGCCAATCTTGTGCTTTGATTGACATTATAATAAACCAAATTCCAACTGGGGGTTATGTGG
GGGGGAAGCAAGAATGGAGAGTTACAATCACCAACAGATGCATATGTTCTCAGTACAATGTGAAATTGGATTGCAATGGGTTTCAGAGTTCAGAGGCCATTGACCCATCC
ATCTTAGCCATTACAGACTCTGTTTGTTTGCTCAACAGTGGTCACCCAATCTCTAGAAAAGACCTCATCACCTTCACTTATGCATTTGCTTGGGACAAATCCTTCCCTTT
TGAGGTCATCTCCACCCAAATTTCATGCAATTGCCAATGCACATTGGAAGACATTGAAATCAGCCAAAACACAACTGGGTCTCAAGCACAAGGCAAGCTTGAATGGAAAG
CTACAATCTCCAACAACTGCATTTGTTCTCAATACAGCCTCAAGTTGGACTGCAAAACATTTAAAACAGTGGAGAATATTGATCCATCCATCTTAGCCATTGCAGGCTCT
GTTTGTTTGGTCAATAATGGTCTCCCCATCTTTCAATCCAACCCCATTTCTTTCACCTATGCTTGGGACAACCCCTTCCCTTTTAAGCCCCTCTTCTCCCAAATTGCCTG
CTCCCTATCTCAATCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGTTCCCATGAAGCTTTTCTGTGCACTTTTCTTCTTTTGTCTCCTCTGCAAAGGGAATTGCCAGTGCTCTGTGGATCAAATTACAGTCAGCCAAACTAAAACTGG
AGCTCATGTGCTGGGCAAGCCAGAATGGAGAGCTACAATCACAAACCATTGTGCTTGTTCCCAATTAAGTGTGCTGTTGGATTGCATTGGATATCAGACAGTAGAGGATG
TTGATCCAGCCATCTTGGCCATTTCAGGGAATGTGTGCTTGTTAAATAGTGGCAATTGCCAATGCTCCTTGGATGACCTAGCAATCAGCCAAACTACAACTGGATCTGAA
GTGCAGGGCAAGGAAGAATGGAGAGCTACAATCACAAACAACTGTGTTTGTTCCCAATACAGTGTGAAGTTTGATTGCAATGGATTTGAGACAGTGGAGAGTGTTGATCC
ATCCATTTTAATGGTTGCAGGCTCTGTGTGTTTGGTTAATAATGGGAATTGCCAATCTTGTGCTTTGATTGACATTATAATAAACCAAATTCCAACTGGGGGTTATGTGG
GGGGGAAGCAAGAATGGAGAGTTACAATCACCAACAGATGCATATGTTCTCAGTACAATGTGAAATTGGATTGCAATGGGTTTCAGAGTTCAGAGGCCATTGACCCATCC
ATCTTAGCCATTACAGACTCTGTTTGTTTGCTCAACAGTGGTCACCCAATCTCTAGAAAAGACCTCATCACCTTCACTTATGCATTTGCTTGGGACAAATCCTTCCCTTT
TGAGGTCATCTCCACCCAAATTTCATGCAATTGCCAATGCACATTGGAAGACATTGAAATCAGCCAAAACACAACTGGGTCTCAAGCACAAGGCAAGCTTGAATGGAAAG
CTACAATCTCCAACAACTGCATTTGTTCTCAATACAGCCTCAAGTTGGACTGCAAAACATTTAAAACAGTGGAGAATATTGATCCATCCATCTTAGCCATTGCAGGCTCT
GTTTGTTTGGTCAATAATGGTCTCCCCATCTTTCAATCCAACCCCATTTCTTTCACCTATGCTTGGGACAACCCCTTCCCTTTTAAGCCCCTCTTCTCCCAAATTGCCTG
CTCCCTATCTCAATCCTAA
Protein sequenceShow/hide protein sequence
MAVPMKLFCALFFFCLLCKGNCQCSVDQITVSQTKTGAHVLGKPEWRATITNHCACSQLSVLLDCIGYQTVEDVDPAILAISGNVCLLNSGNCQCSLDDLAISQTTTGSE
VQGKEEWRATITNNCVCSQYSVKFDCNGFETVESVDPSILMVAGSVCLVNNGNCQSCALIDIIINQIPTGGYVGGKQEWRVTITNRCICSQYNVKLDCNGFQSSEAIDPS
ILAITDSVCLLNSGHPISRKDLITFTYAFAWDKSFPFEVISTQISCNCQCTLEDIEISQNTTGSQAQGKLEWKATISNNCICSQYSLKLDCKTFKTVENIDPSILAIAGS
VCLVNNGLPIFQSNPISFTYAWDNPFPFKPLFSQIACSLSQS