CuGenDBv2

Sgr023394 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023394
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Genome location: tig00000892:2932686..2939857
RNA-Seq Expression: Sgr023394
Synteny: Sgr023394
Gene Ontology terms: NA
InterPro domains: NA
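The genome location above follows a `contig:start..end` pattern. The following is a minimal sketch of how such a string could be parsed, assuming the coordinates are 1-based and inclusive (the page does not state its convention). The function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

contig, start, end = parse_location("tig00000892:2932686..2939857")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the locus spans 7172 positions on contig tig00000892.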


Homology
GenBank top hits
KAG6605498.1 hypothetical protein SDJN03_02815, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.3e-56; % identity: 82.24)
Query:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRR---KKKD
        MASVCISNCINDAR PVRPTYINLYKWPESDAEFIRSVSSK+NR +RVVDSISCRQMYLRSYTFSRE++NVPEKTNKCFGKVG MR  VSRRR   KKK 
Subjt:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRR---KKKD

Query:  DAAGVEK------RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE
         A   E+      RK STLKKAKELSCAA TSVFRRLLSCTAKVDVADN RE
Subjt:  DAAGVEK------RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE

XP_022146233.1 uncharacterized protein LOC111015497 [Momordica charantia] (e value: 4.8e-55; % identity: 81.21)
Query:  MASVCISNCINDAR--APVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRRKKK--
        MASVCISNCINDAR   PVRPTY NLY WPESDAEFIRSVSSKVNRA+RVVDSI CRQMYLRSYTFSR+D++VPEKTNKCF K+G+ +  +SRRRKKK  
Subjt:  MASVCISNCINDAR--APVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRRKKK--

Query:  DDAAGVEK-RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVREF
        +D    EK RK S LKKAKELSCAAFTSVFRRLLSCTAKVDVADNVREF
Subjt:  DDAAGVEK-RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVREF

XP_022946966.1 uncharacterized protein LOC111450987 [Cucurbita moschata] (e value: 7.5e-56; % identity: 82.12)
Query:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRR--KKKDD
        MASVCISNCINDAR PVRPTYINLYKWPESDAEFIRSVSSK+NR +RVVDSISCRQMYLRSYTFSRE++N+PEKTNKCFGKVG MR  VSRRR  KKK  
Subjt:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRR--KKKDD

Query:  AAGVEK------RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE
         A  E+      RK STLKKAKELSCAA TSVFRRLLSCTAKVDVADN RE
Subjt:  AAGVEK------RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE

XP_023006939.1 uncharacterized protein LOC111499579 [Cucurbita maxima] (e value: 8.2e-55; % identity: 79.22)
Query:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRRKKK----
        MASVCISNCINDAR PVRPTYINLYKWPESDAEFIRSVSSK+NR +RVVDSISCRQMYLRSYTFSRE++NV EKTNKCFGKVG MR  VS RR KK    
Subjt:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRRKKK----

Query:  -------DDAAGVEKRKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE
               DD+  +  RK STLKKAKELSCAA TSVFRRLLSCTAKVDVADN RE
Subjt:  -------DDAAGVEKRKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE

XP_023532318.1 uncharacterized protein LOC111794509 [Cucurbita pepo subsp. pepo] (e value: 2.2e-55; % identity: 82.12)
Query:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRR--KKKDD
        MASVCISNCINDAR PVRPTYINLYKWPESDAEFIRSVSSK+NR +RVVDSISCRQMYLRSYTFSRE++NV EKTNKCFGKVG MR  VSRRR  KKK  
Subjt:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRR--KKKDD

Query:  AAGVEK------RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE
        A   E+      RK STLKKAKELSCAA TSVFRRLLSCTAKVDVADN RE
Subjt:  AAGVEK------RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE

TrEMBL top hits
A0A0A0KET2 Uncharacterized protein (e value: 7.5e-54; % identity: 81.21)
Query:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPE-KTNKCFGKVGIMRHTVSRRRKKKDDA
        MASVCISNCINDAR PVRPTYINLYKWPESDAEFIRSVSSK+NR +RVVDSISCRQMYLRSYTFSRE+E+ PE KTNKCF KVG MR  +SRR+KKK  A
Subjt:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPE-KTNKCFGKVGIMRHTVSRRRKKKDDA

Query:  A----GVEK-RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE
        +    G EK RKSS LKKAKE+SCAA TSVFRRLLSCTAKVDVAD +RE
Subjt:  A----GVEK-RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE

A0A5A7SXD3 Uncharacterized protein (e value: 4.9e-53; % identity: 78.15)
Query:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPE-KTNKCFGKVGIMRHTVSRRRKKKDDA
        MASVCISNCINDA  PVRPTYINLYKWPESDAEFIRSVSSK+NR +RVVDSISCRQMYLRSYTFSRE+E+ PE KTNKCF KVG MR  +SRR+KKK  A
Subjt:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPE-KTNKCFGKVGIMRHTVSRRRKKKDDA

Query:  AGVEK-------RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE
        +  E        RKSS LKKAKE+SCAA TSVFRRLLSCTAKVDVAD +RE
Subjt:  AGVEK-------RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE

A0A6J1CY16 uncharacterized protein LOC111015497 (e value: 2.3e-55; % identity: 81.21)
Query:  MASVCISNCINDAR--APVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRRKKK--
        MASVCISNCINDAR   PVRPTY NLY WPESDAEFIRSVSSKVNRA+RVVDSI CRQMYLRSYTFSR+D++VPEKTNKCF K+G+ +  +SRRRKKK  
Subjt:  MASVCISNCINDAR--APVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRRKKK--

Query:  DDAAGVEK-RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVREF
        +D    EK RK S LKKAKELSCAAFTSVFRRLLSCTAKVDVADNVREF
Subjt:  DDAAGVEK-RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVREF

A0A6J1G522 uncharacterized protein LOC111450987 (e value: 3.6e-56; % identity: 82.12)
Query:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRR--KKKDD
        MASVCISNCINDAR PVRPTYINLYKWPESDAEFIRSVSSK+NR +RVVDSISCRQMYLRSYTFSRE++N+PEKTNKCFGKVG MR  VSRRR  KKK  
Subjt:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRR--KKKDD

Query:  AAGVEK------RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE
         A  E+      RK STLKKAKELSCAA TSVFRRLLSCTAKVDVADN RE
Subjt:  AAGVEK------RKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE

A0A6J1L3K0 uncharacterized protein LOC111499579 (e value: 4.0e-55; % identity: 79.22)
Query:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRRKKK----
        MASVCISNCINDAR PVRPTYINLYKWPESDAEFIRSVSSK+NR +RVVDSISCRQMYLRSYTFSRE++NV EKTNKCFGKVG MR  VS RR KK    
Subjt:  MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRRKKK----

Query:  -------DDAAGVEKRKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE
               DD+  +  RK STLKKAKELSCAA TSVFRRLLSCTAKVDVADN RE
Subjt:  -------DDAAGVEKRKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVRE

SwissProt top hits
No hits found

Arabidopsis top hits
AT3G46300.1 unknown protein (e value: 2.5e-17; % identity: 42.07)
Query:  MASVCISNCINDARAP--VRP--TYINLYKWPESDAEFIRSVSSKVNR-ATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRRKK
        MAS CI +C+N  R    VRP  TY NLYKWP ++AEF+RS++   ++  T VVDSISCRQMYLRSYTFS E+    E  +   G+    RH  S  R  
Subjt:  MASVCISNCINDARAP--VRP--TYINLYKWPESDAEFIRSVSSKVNR-ATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRRKK

Query:  KDDAAGVEKRKSSTLKKAKELSCAAFT-SVFRRLLSCTAKVDVAD
             G +K   + ++  K  SC  F   + R+ LSC +   V +
Subjt:  KDDAAGVEKRKSSTLKKAKELSCAAFT-SVFRRLLSCTAKVDVAD

AT3G46310.1 unknown protein (e value: 2.9e-13; % identity: 50.67)
Query:  MASVCISNCIN--DARAPVRP--TYINLYKWPESDAEFIRSVSSKVN-RATRVVDSISCRQMYLRSYTFSREDEN
        MAS CI +C+N  +    VRP  T+  ++KWP ++ EF++S+S   + R T  V+S+SCRQMYLRSYTFSR++EN
Subjt:  MASVCISNCIN--DARAPVRP--TYINLYKWPESDAEFIRSVSSKVN-RATRVVDSISCRQMYLRSYTFSREDEN

AT5G02640.1 unknown protein (e value: 1.3e-29; % identity: 52.76)
Query:  MASVCISNCINDA---RAPVRP---TYINLYKWPESDAEFIRSVSSKVN-RATRVVDSISCRQMYLRSYTFSRE-DENVPEKTN-----KCFGKVGIMRH
        M SVCIS+CINDA   R PVRP   +Y+NLYKWPESDAEF+RSV       A RVVDSISCRQMYLRSYTFSRE DE+  EK +      C G+V   + 
Subjt:  MASVCISNCINDA---RAPVRP---TYINLYKWPESDAEFIRSVSSKVN-RATRVVDSISCRQMYLRSYTFSRE-DENVPEKTN-----KCFGKVGIMRH

Query:  TVSRRRKKKDDAAGV-----------EKRKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVAD
        T S RRK K++   +           EKR+ S  K  +E +C+    +FRRLLSC A VDV D
Subjt:  TVSRRRKKKDDAAGV-----------EKRKSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVAD


Sequences
CDS sequence
ATGGCCTCTGTTTGCATATCCAACTGCATCAACGACGCCCGCGCCCCCGTCCGTCCCACCTACATAAACCTCTACAAGTGGCCGGAATCCGACGCCGAGTTCATC
CGATCCGTCAGCTCCAAAGTCAACAGAGCAACCCGCGTCGTCGACAGCATCTCTTGCCGCCAGATGTACCTGAGGAGCTACACCTTCTCCAGGGAGGACGAGAAC
GTCCCGGAGAAGACGAACAAATGCTTCGGCAAAGTCGGGATCATGAGACACACAGTTTCTCGCCGGAGAAAGAAGAAGGACGACGCCGCCGGCGTTGAGAAGCGG
AAGAGTTCCACGCTCAAGAAGGCCAAGGAGTTGTCGTGCGCCGCCTTCACCTCCGTCTTCCGCCGGTTGCTATCCTGCACCGCCAAAGTCGACGTGGCCGATAAT
GTGCGGGAGTTCACTTTTTCGCTTCTCAATGGTCGTCAATGGGATCCGATTCAGCGAAGAAGCGACGGTGATGGTGTACGGAGAAATATGTCACCTACCCTCGGA
TTTCACCAGCTGAGAGAACACCCTACTCATGGAGACTCCAGAAAGATCATCAACACCCGGAATGTCAAACCGTGTGGCAGAAACACCCGCCTCGATGTCATTGAA
ATTGTTAATTTGTGGAGAGAGCTCGCCAATCATGTTTGTACTGTTATCTTCGGGATCAACTTCAATGACAGCTTCTTAGTAGAATCAATGATTTGGTCCAACTGC
CCTCTCTTCTGCCATTTCATTGCCCATTCAGCTAAGTTTACCATTTCTCTAGGAAGGGTTGGATCTATGACAGGATCAAGGTACCCAAAACTCCCTTTCACAGCT
GTACTGACATGGGTTTGGTCAATTTCAGGCCCTGTCTTTGAAAGCCCAAACATAACCGGTGTGAAGGTAATGAAGTCCTCTAGCGGCCCCAATGCAGACCTCCAG
TCTCTCCTTCCAGCTCAAACTAGGGAAACCAGAACCATAGAGATGACCTTTCAGGCAGTGGATATTGAAATCCATGTCTTCGATGGAGCTGGACGCATAGCATTT
CTTTTTCGTCTGTGCAACATAAACAGAATGCCAACCAATATGGCTGCAACAGATGCCCCAACACTCACGCCCACAATGACGCCAACATGCTTATTTGAAGAAGTC
GAGTCTGGAAAATTAATTACAGAGTCTTCCCCACTAAGACTACCAACAGAGTTGTTCATTTTCAGGATTTCCAGACCATTTAGAATGGCATTTGGATACACATTA
GCTAAAGTTGACGGGCCAATACTTACAAGAACGATAAGAATGTCAGGAACAGAAGCAACTTCAATGGCATTCAGATATGCTAAGGAGCCATTGGCAGGAGCAAAA
ATAACTTCAAGAGTATCTGATGTCACTGAAGATTCTAGCCGTTTCGAACAGCGACGAATCATTAGAAGTGGCGGGTATCGAAGTCGTATTAGCCAGAATGACTTG
CGGGAAAAGCAAGGCAGCAACGAAATTAACAAAACCCAGAAGAGTAATCCAAATCTTCTTCTACAATCCATCGCTACAGTCAGAACCCAGGAAAACGACCGAGAA
AGAACACCCTTATCATCAGAGATCCAGACAGTGATAGACCCTCTATATCCCTCGGGAGGAGGCAAATCTGAGGCCAACAAGGAAGCAACAGGGAAGCAAGGCCAG
CAAGAAAACGCCAGAAAGAACAGAAGCTATCGCCGAGAAATGAAAGGAGCAACGGAGAAAAGGGAGGTTAAGCCGTCCACCACGAGGCAGTCATCAGTGAGCACA
TGGATTTCGGGTTTTCGGGTCCGAGTCCTCTACGGTCTTGGATCTAAAAATTAG
mRNA sequence
ATGGCCTCTGTTTGCATATCCAACTGCATCAACGACGCCCGCGCCCCCGTCCGTCCCACCTACATAAACCTCTACAAGTGGCCGGAATCCGACGCCGAGTTCATC
CGATCCGTCAGCTCCAAAGTCAACAGAGCAACCCGCGTCGTCGACAGCATCTCTTGCCGCCAGATGTACCTGAGGAGCTACACCTTCTCCAGGGAGGACGAGAAC
GTCCCGGAGAAGACGAACAAATGCTTCGGCAAAGTCGGGATCATGAGACACACAGTTTCTCGCCGGAGAAAGAAGAAGGACGACGCCGCCGGCGTTGAGAAGCGG
AAGAGTTCCACGCTCAAGAAGGCCAAGGAGTTGTCGTGCGCCGCCTTCACCTCCGTCTTCCGCCGGTTGCTATCCTGCACCGCCAAAGTCGACGTGGCCGATAAT
GTGCGGGAGTTCACTTTTTCGCTTCTCAATGGTCGTCAATGGGATCCGATTCAGCGAAGAAGCGACGGTGATGGTGTACGGAGAAATATGTCACCTACCCTCGGA
TTTCACCAGCTGAGAGAACACCCTACTCATGGAGACTCCAGAAAGATCATCAACACCCGGAATGTCAAACCGTGTGGCAGAAACACCCGCCTCGATGTCATTGAA
ATTGTTAATTTGTGGAGAGAGCTCGCCAATCATGTTTGTACTGTTATCTTCGGGATCAACTTCAATGACAGCTTCTTAGTAGAATCAATGATTTGGTCCAACTGC
CCTCTCTTCTGCCATTTCATTGCCCATTCAGCTAAGTTTACCATTTCTCTAGGAAGGGTTGGATCTATGACAGGATCAAGGTACCCAAAACTCCCTTTCACAGCT
GTACTGACATGGGTTTGGTCAATTTCAGGCCCTGTCTTTGAAAGCCCAAACATAACCGGTGTGAAGGTAATGAAGTCCTCTAGCGGCCCCAATGCAGACCTCCAG
TCTCTCCTTCCAGCTCAAACTAGGGAAACCAGAACCATAGAGATGACCTTTCAGGCAGTGGATATTGAAATCCATGTCTTCGATGGAGCTGGACGCATAGCATTT
CTTTTTCGTCTGTGCAACATAAACAGAATGCCAACCAATATGGCTGCAACAGATGCCCCAACACTCACGCCCACAATGACGCCAACATGCTTATTTGAAGAAGTC
GAGTCTGGAAAATTAATTACAGAGTCTTCCCCACTAAGACTACCAACAGAGTTGTTCATTTTCAGGATTTCCAGACCATTTAGAATGGCATTTGGATACACATTA
GCTAAAGTTGACGGGCCAATACTTACAAGAACGATAAGAATGTCAGGAACAGAAGCAACTTCAATGGCATTCAGATATGCTAAGGAGCCATTGGCAGGAGCAAAA
ATAACTTCAAGAGTATCTGATGTCACTGAAGATTCTAGCCGTTTCGAACAGCGACGAATCATTAGAAGTGGCGGGTATCGAAGTCGTATTAGCCAGAATGACTTG
CGGGAAAAGCAAGGCAGCAACGAAATTAACAAAACCCAGAAGAGTAATCCAAATCTTCTTCTACAATCCATCGCTACAGTCAGAACCCAGGAAAACGACCGAGAA
AGAACACCCTTATCATCAGAGATCCAGACAGTGATAGACCCTCTATATCCCTCGGGAGGAGGCAAATCTGAGGCCAACAAGGAAGCAACAGGGAAGCAAGGCCAG
CAAGAAAACGCCAGAAAGAACAGAAGCTATCGCCGAGAAATGAAAGGAGCAACGGAGAAAAGGGAGGTTAAGCCGTCCACCACGAGGCAGTCATCAGTGAGCACA
TGGATTTCGGGTTTTCGGGTCCGAGTCCTCTACGGTCTTGGATCTAAAAATTAG
Protein sequence
MASVCISNCINDARAPVRPTYINLYKWPESDAEFIRSVSSKVNRATRVVDSISCRQMYLRSYTFSREDENVPEKTNKCFGKVGIMRHTVSRRRKKKDDAAGVEKR
KSSTLKKAKELSCAAFTSVFRRLLSCTAKVDVADNVREFTFSLLNGRQWDPIQRRSDGDGVRRNMSPTLGFHQLREHPTHGDSRKIINTRNVKPCGRNTRLDVIE
IVNLWRELANHVCTVIFGINFNDSFLVESMIWSNCPLFCHFIAHSAKFTISLGRVGSMTGSRYPKLPFTAVLTWVWSISGPVFESPNITGVKVMKSSSGPNADLQ
SLLPAQTRETRTIEMTFQAVDIEIHVFDGAGRIAFLFRLCNINRMPTNMAATDAPTLTPTMTPTCLFEEVESGKLITESSPLRLPTELFIFRISRPFRMAFGYTL
AKVDGPILTRTIRMSGTEATSMAFRYAKEPLAGAKITSRVSDVTEDSSRFEQRRIIRSGGYRSRISQNDLREKQGSNEINKTQKSNPNLLLQSIATVRTQENDRE
RTPLSSEIQTVIDPLYPSGGGKSEANKEATGKQGQQENARKNRSYRREMKGATEKREVKPSTTRQSSVSTWISGFRVRVLYGLGSKN
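The protein sequence corresponds to the CDS read codon by codon. The sketch below illustrates that relationship, assuming the standard genetic code (the page does not name the translation table). The names `CODON_TABLE` and `translate` are illustrative only, and the inputs are short fragments copied from the start and end of the CDS listed above.

```python
BASES = "TCAG"
# Standard genetic code, with codons ordered by first, second, third base in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGGCCTCTGTTTGCATATCCAACTGC"))  # first 27 nt of the CDS
print(translate("GGATCTAAAAATTAG"))              # last 15 nt of the CDS
```

The two fragments translate to `MASVCISNC` and `GSKN`, which match the first and last residues of the protein sequence listed above.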