CuGenDBv2

Sgr015638 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr015638
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: thionin-like protein 2
Genome location: tig00004836:497751..508881
RNA-Seq Expression: Sgr015638
Synteny: Sgr015638
Gene Ontology terms: NA
InterPro domains: IPR038975 - Thionin-like protein
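
The genome location above follows a `contig:start..end` pattern. A minimal sketch of how such a string could be split into its parts is shown below. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return contig, start, end


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00004836:497751..508881")
print(contig, start, end)        # tig00004836 497751 508881
print(span_length(start, end))   # 11131
```

Under that assumption the gene model spans 11,131 bp on the contig, which is much longer than the 579-nt CDS listed further down.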


Homology
GenBank top hits (e value, % identity, alignment)
EOY26692.1 To encode a PR protein, Belongs to the plant thionin family with the following members:, putative [Theobroma cacao] (e value: 3.3e-34; identity: 45.31%)
Query:  MKSVVVVCLALILSLFAGRSTAGSFGDCFDRCYVSCFNGAAPWELTLCPAKCVAECALRYVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMST
        + +V++VC  L+L    G+STA     C+  C++ C   +       C AKC+ +C L             KDT+YFC  GCAT+LCT  STK+DPG ST
Subjt:  MKSVVVVCLALILSLFAGRSTAGSFGDCFDRCYVSCFNGAAPWELTLCPAKCVAECALRYVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMST

Query:  A-GPFGNCFARCFGPCCITPMDSIF-CTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCA
        A G    C+A CF PC   P  + F CT  C+  C LP        +   KDT+YFC LGCAT+LCT  STK+DP EKKV SCVD+CS TCA
Subjt:  A-GPFGNCFARCFGPCCITPMDSIF-CTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCA

KAG6575552.1 Thionin-like protein 2, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.6e-34; identity: 48.24%)
Query:  SFGDCFDRCYVSCFNGAAPWELTLCPAKCVAECALRYVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMSTAGPFGNCFARCFGPCCITPMDSI
        S G C+  C ++C +G  P E  LC   C+          SPMD NH  +  YFC  GCATS CT+  +    G STA  F  C+A+CF  C ITP  ++
Subjt:  SFGDCFDRCYVSCFNGAAPWELTLCPAKCVAECALRYVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMSTAGPFGNCFARCFGPCCITPMDSI

Query:  -FCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCANA
          C   C+ +C   ++ S+PMD NH  DT YFC LGCATS+CTKFSTK DP+EKKVE CVDSC+ TC  A
Subjt:  -FCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCANA

KAG6575557.1 Thionin-like protein 2, partial [Cucurbita argyrosperma subsp. sororia] (e value: 8.1e-25; identity: 42.33%)
Query:  SFGDCFDRCYVSCFNGAAPWELTLCPAKCVAECALRYVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMSTAGPFGNCFARCFGPCCITPMDSI
        S G C+  C ++C +G  P E  LC   C+          SPMD NH  +  YFC  GCATS CT+  +    G++     G C A C   C        
Subjt:  SFGDCFDRCYVSCFNGAAPWELTLCPAKCVAECALRYVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMSTAGPFGNCFARCFGPCCITPMDSI

Query:  FCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCS
                     ++ S+PMD NH  DT YFC LGCATS+CTKFSTK DP+EKKVE CVDSC+
Subjt:  FCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCS

KAG6575558.1 hypothetical protein SDJN03_26197, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.4e-24; identity: 38.02%)
Query:  LILSLFAGRSTAGSFGDCFDRCYVSC--FNGAAPWELTLCPAKCVAECALR----YVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMS-----
        ++ SL    ST  SF +C+  C+V C    G A    + CP +C+  C +     +  ++  DF+ ++  ++FC  GCA S CTKFST ++PG+S     
Subjt:  LILSLFAGRSTAGSFGDCFDRCYVSC--FNGAAPWELTLCPAKCVAECALR----YVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMS-----

Query:  TAGPFGNCFARCFGPCCITPMDSIFCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCAN
          GP  NC+  C   C     +S F   +C+G+C +  V S+PM  NH  D RYFC LGC+TS C K      P EK+++ CV+SCS TC N
Subjt:  TAGPFGNCFARCFGPCCITPMDSIFCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCAN

XP_038899787.1 thionin-like protein 2 [Benincasa hispida] (e value: 4.5e-23; identity: 62.89%)
Query:  GMSTAGPFGNCFARCFGPCCITPMDSI-FCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCANA
        G STA  FG C+A+CF  C ITP   I  C   C+G C   ++ SSP+D NH  DT YFC LGCATSLCTKFSTKKDPAEKKVESCV+SC +TC  A
Subjt:  GMSTAGPFGNCFARCFGPCCITPMDSI-FCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCANA

TrEMBL top hits (e value, % identity, alignment)
A0A061GC54 To encode a PR protein, Belongs to the plant thionin family with the following members:, putative (e value: 1.6e-34; identity: 45.31%)
Query:  MKSVVVVCLALILSLFAGRSTAGSFGDCFDRCYVSCFNGAAPWELTLCPAKCVAECALRYVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMST
        + +V++VC  L+L    G+STA     C+  C++ C   +       C AKC+ +C L             KDT+YFC  GCAT+LCT  STK+DPG ST
Subjt:  MKSVVVVCLALILSLFAGRSTAGSFGDCFDRCYVSCFNGAAPWELTLCPAKCVAECALRYVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMST

Query:  A-GPFGNCFARCFGPCCITPMDSIF-CTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCA
        A G    C+A CF PC   P  + F CT  C+  C LP        +   KDT+YFC LGCAT+LCT  STK+DP EKKV SCVD+CS TCA
Subjt:  A-GPFGNCFARCFGPCCITPMDSIF-CTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCA

A0A0A0KC18 Uncharacterized protein (e value: 2.7e-18; identity: 57.14%)
Query:  MKSVVVVCLALILSLFAGRSTAGSFGDCFDRCYVSC-FNGAAPWELTLCPAKCVAECALRYVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPG
        MKSVV++C   ILSL AGRSTA SFG C+ +C++ C      P  +  C AKC+A+C   ++ SSPMD N+  DT YFC  GCATS CTKFSTKKDPG
Subjt:  MKSVVVVCLALILSLFAGRSTAGSFGDCFDRCYVSC-FNGAAPWELTLCPAKCVAECALRYVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPG

A0A2U1N652 Uncharacterized protein (e value: 2.0e-16; identity: 38.52%)
Query:  FCNFGCATSLCTKFSTKKDPGMSTAG--------PFGNCFARCFGPCCITPMDSIFCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFS
        FC  GCA SLC    T+++PG    G        PF +C+ RCF  C I P ++  CT  C+ +C L     + M L+       FC LGCA SLC    
Subjt:  FCNFGCATSLCTKFSTKKDPGMSTAG--------PFGNCFARCFGPCCITPMDSIFCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFS

Query:  TKKDPAEKKVESCVDSCSRTCA
        T+++P E  +  CVDSCS  C+
Subjt:  TKKDPAEKKVESCVDSCSRTCA

A0A6J1DDL8 thionin-like protein 2 (e value: 2.0e-21; identity: 60.82%)
Query:  GMSTAGPFGNCFARCFGPCCITPMDSI-FCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCANA
        G STA  FG C+A+CF  C ITP   +  C   C+  C L    S+  D N + DTRYFC LGCATSLCTKFSTKKDPAEKKV SCVDSCS+ C NA
Subjt:  GMSTAGPFGNCFARCFGPCCITPMDSI-FCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCANA

A0A6J1KDK0 thionin-like protein 2 (e value: 4.8e-23; identity: 63.92%)
Query:  GMSTAGPFGNCFARCFGPCCITPMDSI-FCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCANA
        G STA  FG C+A+CF  C ITP   I  C G C+  C   ++ S+P DLNH  DT YFC LGCATSLCTKFSTK DPAEKKVESCV+SCSRTC  A
Subjt:  GMSTAGPFGNCFARCFGPCCITPMDSI-FCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCANA

SwissProt top hits (e value, % identity, alignment)
A8MRP4 Thionin-like protein 2 (e value: 3.6e-07; identity: 34.41%)
Query:  PFGNCFARCFGPCCITPMDSIF------CTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTC
        PF  C+  C   C        +      CT  C+ Q + P V S+ +D     ++ YFC LGCAT  C   S+ ++P  ++V +CVDSCS  C
Subjt:  PFGNCFARCFGPCCITPMDSIF------CTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTC

Arabidopsis top hits (e value, % identity, alignment)
AT1G12660.1 Predicted to encode a PR (pathogenesis-related) protein. Belongs to the plant thionin (PR-13) family with the following members: At1g66100, At5g36910, At1g72260, At2g15010, At1g12663, At1g12660. (e value: 3.8e-04; identity: 30.99%)
Query:  FGNCFARCFGPCCI--TPMDSIFCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKD
        F  C+  C   C +   P+  +FC  +C+  C    + S   +LN    T  +C LGCAT  C   S+  D
Subjt:  FGNCFARCFGPCCI--TPMDSIFCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKD

AT1G12663.1 Predicted to encode a PR (pathogenesis-related) protein. Belongs to the plant thionin (PR-13) family with the following members: At1g66100, At5g36910, At1g72260, At2g15010, At1g12663, At1g12660. (e value: 2.5e-08; identity: 34.41%)
Query:  PFGNCFARCFGPCCITPMDSIF------CTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTC
        PF  C+  C   C        +      CT  C+ Q + P V S+ +D     ++ YFC LGCAT  C   S+ ++P  ++V +CVDSCS  C
Subjt:  PFGNCFARCFGPCCITPMDSIF------CTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTC

AT1G12672.1 unknown protein (e value: 9.0e-06; identity: 43.14%)
Query:  SPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCAN
        S  ++NH     Y+C LGC+T  C   S+ ++P   KV  CVDSCS  C+N
Subjt:  SPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCAN

AT1G12672.2 unknown protein (e value: 1.3e-07; identity: 41.89%)
Query:  PMDSIFCTGMCMGQCALPYVGSSPMD-LNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCAN
        P  S F T  C+  C  P   SS MD  N      Y+C LGC+T  C   S+ ++P   KV  CVDSCS  C+N
Subjt:  PMDSIFCTGMCMGQCALPYVGSSPMD-LNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCAN


Sequences
CDS sequence
ATGAAGTCAGTTGTTGTAGTTTGCTTGGCTCTGATTCTGAGCTTATTTGCAGGCAGGTCCACAGCTGGCTCCTTCGGGGATTGCTTTGACCGTTGCTACGTCTCCTGCTT
TAATGGAGCCGCACCCTGGGAATTAACCCTTTGCCCTGCAAAGTGCGTGGCAGAATGCGCCCTGCGTTATGTGGGATCCTCCCCAATGGACTTCAACCACAAGAAGGACA
CTCGTTACTTCTGCAACTTTGGCTGTGCCACTTCTCTCTGCACCAAATTCAGCACCAAGAAAGACCCAGGCATGTCCACAGCCGGCCCCTTCGGGAATTGCTTTGCCCGT
TGCTTCGGTCCCTGCTGTATAACACCCATGGATTCAATCTTCTGCACTGGAATGTGCATGGGACAATGCGCCCTGCCTTATGTGGGATCCTCCCCAATGGACTTGAACCA
CAAGAAGGACACTCGTTACTTCTGCAACCTTGGCTGTGCCACTTCTCTCTGCACCAAATTTAGCACCAAGAAAGACCCAGCTGAGAAGAAAGTGGAAAGCTGTGTGGACT
CGTGCTCTCGAACATGCGCAAACGCTTAA
mRNA sequence
ATGAAGTCAGTTGTTGTAGTTTGCTTGGCTCTGATTCTGAGCTTATTTGCAGGCAGGTCCACAGCTGGCTCCTTCGGGGATTGCTTTGACCGTTGCTACGTCTCCTGCTT
TAATGGAGCCGCACCCTGGGAATTAACCCTTTGCCCTGCAAAGTGCGTGGCAGAATGCGCCCTGCGTTATGTGGGATCCTCCCCAATGGACTTCAACCACAAGAAGGACA
CTCGTTACTTCTGCAACTTTGGCTGTGCCACTTCTCTCTGCACCAAATTCAGCACCAAGAAAGACCCAGGCATGTCCACAGCCGGCCCCTTCGGGAATTGCTTTGCCCGT
TGCTTCGGTCCCTGCTGTATAACACCCATGGATTCAATCTTCTGCACTGGAATGTGCATGGGACAATGCGCCCTGCCTTATGTGGGATCCTCCCCAATGGACTTGAACCA
CAAGAAGGACACTCGTTACTTCTGCAACCTTGGCTGTGCCACTTCTCTCTGCACCAAATTTAGCACCAAGAAAGACCCAGCTGAGAAGAAAGTGGAAAGCTGTGTGGACT
CGTGCTCTCGAACATGCGCAAACGCTTAA
Protein sequence
MKSVVVVCLALILSLFAGRSTAGSFGDCFDRCYVSCFNGAAPWELTLCPAKCVAECALRYVGSSPMDFNHKKDTRYFCNFGCATSLCTKFSTKKDPGMSTAGPFGNCFAR
CFGPCCITPMDSIFCTGMCMGQCALPYVGSSPMDLNHKKDTRYFCNLGCATSLCTKFSTKKDPAEKKVESCVDSCSRTCANA
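
The protein sequence corresponds to the CDS read three nucleotides at a time. As an illustration, the sketch below translates the first and last few codons of the CDS with the standard genetic code. It assumes the standard nuclear code applies to this gene, and `translate` is a hypothetical helper, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGAAGTCAGTTGTTGTAGTTTGCTTGGCT"))  # first 30 nt  -> MKSVVVVCLA
print(translate("GCAAACGCTTAA"))                    # last 12 nt   -> ANA
```

Both fragments agree with the start (`MKSVVVVCLA`) and end (`ANA`) of the protein sequence above. Passing the full CDS, with its line breaks, to the same function should yield the whole protein.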