CuGenDBv2

Sgr029903 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029903
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: thymocyte nuclear protein 1
Genome location: tig00153554:1000361..1003286
RNA-Seq Expression: Sgr029903
Synteny: Sgr029903
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
InterPro domains: IPR002740 - EVE domain
                  IPR015947 - PUA-like superfamily
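The genome location field follows a `contig:start..end` pattern. As a minimal sketch, the Python below splits such a string into its parts and reports the span length. The names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tooling, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'contig:start..end' location (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'tig00153554:1000361..1003286'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Location(contig, start, end)


loc = parse_location("tig00153554:1000361..1003286")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2926 positions on contig tig00153554.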


Homology
GenBank top hits (e value, % identity, alignment)
KAG6600275.1 Thymocyte nuclear protein 1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.1e-63; % identity: 86.33)
Query:  ASAGNKQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDL
        A+   KQ+WLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQK+LKSMKLGDLCFFYHSGAKARR+VGVVSVA+EWYSE DG + VVDVEAVGEMRE VDL
Subjt:  ASAGNKQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDL

Query:  KEMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG
        KEMKK  EGMK+FALFRQPRLSVVPV+KE WEKICE+GG
Subjt:  KEMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG

KAG6761277.1 hypothetical protein POTOM_034485 [Populus tomentosa] (e value: 9.6e-65; % identity: 55.21)
Query:  KQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDLKEMKK
        KQYWLLKTEP EWSW DQA+NGG + WDGVKNKQAQK+LK+MKL DLCFFYHSG+ ARRVVGVV+V +EWY E  G E VVDV+AVGEMR P+DLKE+K 
Subjt:  KQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDLKEMKK

Query:  EAEGMKNFALFRQPRLSVVPVSKETWEKICEMGGQEEESKYVFVLCEEDDPENEGSAF------------------------------------------
        + EG+K F LFRQPRLSVVPVSKE    I     Q E+S    V  +  D E+    F                                          
Subjt:  EAEGMKNFALFRQPRLSVVPVSKETWEKICEMGGQEEESKYVFVLCEEDDPENEGSAF------------------------------------------

Query:  -----------------------SMANQLGNLVESIKSKVKALKKSKKPYIKMDKSSSVKVEIRSRKARQLIDKTLKVADRPGKRTMS
                                MA QL NLV+SIKSKV+ALKKSKKPYIKMDKS+SVKVEIRSRKAR+LIDKTL+VADRPGKRT+S
Subjt:  -----------------------SMANQLGNLVESIKSKVKALKKSKKPYIKMDKSSSVKVEIRSRKARQLIDKTLKVADRPGKRTMS

KAG7030934.1 Thymocyte nuclear protein 1, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 4.0e-63; % identity: 85.61)
Query:  ASAGNKQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDL
        A+   KQ+WLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQK+LKSMKLGDLCFFYHSGAKARR+VGV+SVA+EWYSE DG + VVDVEAVGEMRE VDL
Subjt:  ASAGNKQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDL

Query:  KEMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG
        KEMKK  EGMK+FALFRQPRLSVVPV+KE WEKICE+GG
Subjt:  KEMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG

XP_023530564.1 thymocyte nuclear protein 1 [Cucurbita pepo subsp. pepo] (e value: 1.6e-64; % identity: 87.77)
Query:  ASAGNKQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDL
        A    KQ+WLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARR+VGVVSVA+EWYSE DG + VVDVEAVGEMREPVDL
Subjt:  ASAGNKQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDL

Query:  KEMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG
        KEMKK  EGMK+FALFRQPRLSVVPV+KE WEKICE+GG
Subjt:  KEMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG

XP_038905606.1 thymocyte nuclear protein 1 [Benincasa hispida] (e value: 5.2e-63; % identity: 91.04)
Query:  KQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDLKEMKK
        KQYWLLKTEPAEWSWADQAAN GRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVV+VA+EWYS AD D+VVVDVEAVGEMREPVDLKEMKK
Subjt:  KQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDLKEMKK

Query:  EAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG
          EGMKNFALFRQPRLSVVPV+KE W+KICE+GG
Subjt:  EAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KWC0 EVE domain-containing protein (e value: 2.8e-62; % identity: 87.86)
Query:  ASAGNK-QYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVD
        A AG+K QYWLLKTEPAEWSWADQAAN GRT WDGVKNKQAQKHLKSMKLGD CFFYHSGAKARRVVGVV+VA+EWYS  D DEVVVDVEAVGEMREPVD
Subjt:  ASAGNK-QYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVD

Query:  LKEMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG
        LKEMKK  EGMKNFALFRQPRLSVVPV+KE W+KICE+GG
Subjt:  LKEMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG

A0A5D3E131 Thymocyte nuclear protein 1 (e value: 1.4e-61; % identity: 87.68)
Query:  AGNK-QYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDLK
        AG+K QYWLLKTEPAEWSWADQAAN GR+KWDGVKNKQAQKHLKSMKLGD CFFYHSGAKARRVVGVV+VA+EWYS  + DEVVVDVEAVGEMREPVDLK
Subjt:  AGNK-QYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDLK

Query:  EMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG
        EMKK  EGMKNFALFRQPRLSVVPV+KE W+KICE+GG
Subjt:  EMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG

A0A6J1FML1 thymocyte nuclear protein 1 (e value: 7.4e-63; % identity: 85.61)
Query:  ASAGNKQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDL
        A+   KQ+WLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQK+LKSMKLGDLCFFYHSGAKARR+VGVVSVA+EWY E DG + VVDVEAVGEMRE VDL
Subjt:  ASAGNKQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDL

Query:  KEMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG
        KEMKK  EGMK+FALFRQPRLSVVPV+KE WEKICE+GG
Subjt:  KEMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG

A0A6J1JS35 thymocyte nuclear protein 1 (e value: 4.3e-63; % identity: 86.33)
Query:  ASAGNKQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDL
        A    KQ+WLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGD CFFYHSGAKARR+VGVVSVA+EWYSE DG + VVDVE VGEMREPVDL
Subjt:  ASAGNKQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDL

Query:  KEMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG
        KEMKK  EGMK+FALFRQPRLSVVPV+KE WEKICE+GG
Subjt:  KEMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG

E5GBH2 EVE domain-containing protein (e value: 1.4e-61; % identity: 87.68)
Query:  AGNK-QYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDLK
        AG+K QYWLLKTEPAEWSWADQAAN GR+KWDGVKNKQAQKHLKSMKLGD CFFYHSGAKARRVVGVV+VA+EWYS  + DEVVVDVEAVGEMREPVDLK
Subjt:  AGNK-QYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDLK

Query:  EMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG
        EMKK  EGMKNFALFRQPRLSVVPV+KE W+KICE+GG
Subjt:  EMKKEAEGMKNFALFRQPRLSVVPVSKETWEKICEMGG

SwissProt top hits (e value, % identity, alignment)
Q6P3E0 Thymocyte nuclear protein 1 (e value: 4.8e-11; % identity: 33.54)
Query:  YWLLKTEP---------AEWSWAD-QAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWY----------------SEADG
        YWL+K+EP          ++S  D QA     T WDGV+N QA   L++MKL D  FFYHS  K   +VG++ + +E Y                S+ D 
Subjt:  YWLLKTEP---------AEWSWAD-QAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWY----------------SEADG

Query:  DE-VVVDVEAVGEMREPVDLKEMK-----KEAEG--MKNFALFRQPRLSVVPVSKETWEKICEM
         +  +VDV+ V  M+  + L E+K      +A G  +K+  LF + RLSV P+++E ++ I  +
Subjt:  DE-VVVDVEAVGEMREPVDLKEMK-----KEAEG--MKNFALFRQPRLSVVPVSKETWEKICEM

Q6PFL8 Thymocyte nuclear protein 1 (e value: 2.7e-09; % identity: 28.74)
Query:  WLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWY----------------SEADGD
        WL+K+EP          ++   D  A   +T  WDGV+N QA+  ++ MK+G   FFYHS  K   + G++ + +E Y                S+AD  
Subjt:  WLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWY----------------SEADGD

Query:  E-VVVDVEAVGEMREPVDLKEMKK-------EAEGMKNFALFRQPRLSVVPVSKETWEKICEMGGQE
        +  +VDV+    ++  + L E+KK       +   +K+ ALF + RLSV P++ E +E +  +  ++
Subjt:  E-VVVDVEAVGEMREPVDLKEMKK-------EAEGMKNFALFRQPRLSVVPVSKETWEKICEMGGQE

Q90679 Thymocyte nuclear protein 1 (e value: 6.3e-11; % identity: 30.64)
Query:  YWLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSE-----------------AD
        +WLLK+EP          ++S  D  A   +T  W+GV+N QA+  L++MKLG   FFYHS  K   +VG+V + +E Y +                  +
Subjt:  YWLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSE-----------------AD

Query:  GDEVVVDVEAVGEMREPVDLKEMK-----KEAEG--MKNFALFRQPRLSVVPVSKETWEKICEMGGQEEESKY
            +VDV+ V   +  + L E+K      +A+G  +KN  LF + RLS+ P+++E ++ +  +   EEE  +
Subjt:  GDEVVVDVEAVGEMREPVDLKEMK-----KEAEG--MKNFALFRQPRLSVVPVSKETWEKICEMGGQEEESKY

Q91YJ3 Thymocyte nuclear protein 1 (e value: 7.5e-12; % identity: 32.93)
Query:  YWLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWY-----------------SEAD
        YWL+K+EP          ++S  D  A   +T  WDGV+N QA+  L++MKL D  FFYHS  K   +VG++ + +E Y                  E D
Subjt:  YWLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWY-----------------SEAD

Query:  GDEVVVDVEAVGEMREPVDLKEMK-----KEAEG--MKNFALFRQPRLSVVPVSKETWEKICEM
            +VDV+ V  M+  + L+E+K      +A G  +K+  LF + RLSV P+++E ++ I  +
Subjt:  GDEVVVDVEAVGEMREPVDLKEMK-----KEAEG--MKNFALFRQPRLSVVPVSKETWEKICEM

Q9P016 Thymocyte nuclear protein 1 (e value: 2.8e-11; % identity: 30.95)
Query:  YWLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWY----------------SEADG
        +WL+K+EP          ++S  D  A   +T  WDGV+N QA+  L++MKLG+  FFYHS  K   + G++ + +E Y                S+ D 
Subjt:  YWLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWY----------------SEADG

Query:  DE-VVVDVEAVGEMREPVDLKEMKK-----EAEG--MKNFALFRQPRLSVVPVSKETWEKICEMGGQE
         +  +VDV+ V  M+  + L E+K      +A G  +KN  LF + RLS+ P+++E ++ +  +  +E
Subjt:  DE-VVVDVEAVGEMREPVDLKEMKK-----EAEG--MKNFALFRQPRLSVVPVSKETWEKICEMGGQE

Arabidopsis top hits (e value, % identity, alignment)
AT2G14660.1 unknown protein (e value: 5.6e-47; % identity: 62.11)
Query:  KQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYS-EADG--DEVVVDVEAVGEMREPVDLKE
        K+YWLLKTEP EWSW+DQ +NGG +KWDGVKNKQAQK+LKSM LGDLCFFYHSG K+R VVGVV V++EWY+ +A+G   E  VDV+A+GEMR+ VDLKE
Subjt:  KQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYS-EADG--DEVVVDVEAVGEMREPVDLKE

Query:  MKKEAEGM--KNFALFRQPRLSVVPVSKETWEKICEMG----GQEEESKYVFVLCEEDDPE
        MK + +G+  K F LFRQPRLSVVPV ++ W  ICE+G    G  +E       CE  D E
Subjt:  MKKEAEGM--KNFALFRQPRLSVVPVSKETWEKICEMG----GQEEESKYVFVLCEEDDPE

AT3G19660.1 unknown protein (e value: 1.1e-15; % identity: 71.43)
Query:  MANQLGNLVESIKSKVKALKKSKKPYIKMDKSSSVKVEIRSRKARQLIDKTLKVADRPGKRTM
        M  QL +L+E+IKSKV  L+K KK YIKMDKSSSV+V IR +K R LIDKTLKVADRPGKRT+
Subjt:  MANQLGNLVESIKSKVKALKKSKKPYIKMDKSSSVKVEIRSRKARQLIDKTLKVADRPGKRTM
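The % identity reported for each hit can be cross-checked from the alignment's middle line, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below is one way to do that, assuming the reported figure is identical columns divided by total alignment columns. The function name `percent_identity` is a hypothetical helper. The query and middle-line strings are copied from the AT3G19660.1 alignment above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character marks an identity.

    Assumes BLAST-style midlines: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    if not query:
        raise ValueError("empty alignment")
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query row")
    # Trailing spaces are often stripped from saved text, so pad the midline back out.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Query and midline rows of the AT3G19660.1 alignment.
QUERY = "MANQLGNLVESIKSKVKALKKSKKPYIKMDKSSSVKVEIRSRKARQLIDKTLKVADRPGKRTM"
MIDLINE = "M  QL +L+E+IKSKV  L+K KK YIKMDKSSSV+V IR +K R LIDKTLKVADRPGKRT+"

print(f"{percent_identity(QUERY, MIDLINE):.2f}")  # prints 71.43
```

For this hit the middle line carries 45 identities across 63 columns, which agrees with the listed 71.43. For alignments split across several blocks, the rows of each block would need to be concatenated before counting.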


Sequences
CDS sequence
ATGGCCAGCGCCGGAAACAAACAGTACTGGCTTTTGAAGACAGAGCCGGCGGAGTGGTCGTGGGCGGATCAAGCTGCCAATGGCGGACGGACAAAGTGGGACGGCGTCAA
GAACAAGCAAGCTCAGAAGCATCTCAAGTCCATGAAACTCGGCGACCTCTGCTTCTTCTACCACTCCGGCGCTAAAGCCCGCCGTGTGGTCGGCGTGGTCTCTGTCGCAC
AGGAATGGTATTCGGAGGCCGACGGCGATGAGGTCGTCGTAGATGTCGAGGCGGTTGGGGAGATGAGGGAGCCGGTGGACCTGAAGGAGATGAAGAAGGAGGCCGAGGGG
ATGAAGAATTTCGCTCTCTTTCGGCAACCGCGGCTCTCGGTTGTGCCTGTTTCGAAGGAGACATGGGAGAAGATCTGCGAAATGGGAGGCCAGGAAGAAGAAAGCAAATA
TGTTTTTGTTCTGTGCGAAGAAGACGATCCAGAAAATGAAGGCTCAGCGTTTTCTATGGCGAATCAGCTCGGCAATTTGGTGGAATCCATAAAATCCAAGGTGAAAGCAT
TGAAGAAGTCCAAAAAGCCATATATAAAGATGGACAAAAGCTCCAGCGTGAAGGTTGAGATCCGTAGCCGTAAAGCTCGACAGTTGATCGATAAGACCTTGAAGGTCGCC
GATCGCCCTGGCAAGCGTACCATGTCCTAG
mRNA sequence
ATGGCCAGCGCCGGAAACAAACAGTACTGGCTTTTGAAGACAGAGCCGGCGGAGTGGTCGTGGGCGGATCAAGCTGCCAATGGCGGACGGACAAAGTGGGACGGCGTCAA
GAACAAGCAAGCTCAGAAGCATCTCAAGTCCATGAAACTCGGCGACCTCTGCTTCTTCTACCACTCCGGCGCTAAAGCCCGCCGTGTGGTCGGCGTGGTCTCTGTCGCAC
AGGAATGGTATTCGGAGGCCGACGGCGATGAGGTCGTCGTAGATGTCGAGGCGGTTGGGGAGATGAGGGAGCCGGTGGACCTGAAGGAGATGAAGAAGGAGGCCGAGGGG
ATGAAGAATTTCGCTCTCTTTCGGCAACCGCGGCTCTCGGTTGTGCCTGTTTCGAAGGAGACATGGGAGAAGATCTGCGAAATGGGAGGCCAGGAAGAAGAAAGCAAATA
TGTTTTTGTTCTGTGCGAAGAAGACGATCCAGAAAATGAAGGCTCAGCGTTTTCTATGGCGAATCAGCTCGGCAATTTGGTGGAATCCATAAAATCCAAGGTGAAAGCAT
TGAAGAAGTCCAAAAAGCCATATATAAAGATGGACAAAAGCTCCAGCGTGAAGGTTGAGATCCGTAGCCGTAAAGCTCGACAGTTGATCGATAAGACCTTGAAGGTCGCC
GATCGCCCTGGCAAGCGTACCATGTCCTAG
Protein sequence
MASAGNKQYWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVSVAQEWYSEADGDEVVVDVEAVGEMREPVDLKEMKKEAEG
MKNFALFRQPRLSVVPVSKETWEKICEMGGQEEESKYVFVLCEEDDPENEGSAFSMANQLGNLVESIKSKVKALKKSKKPYIKMDKSSSVKVEIRSRKARQLIDKTLKVA
DRPGKRTMS
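The protein sequence should correspond to a codon-by-codon translation of the CDS. The sketch below shows that check on the first 60 and last 12 nucleotides of the CDS listed above, assuming the standard genetic code (NCBI translation table 1), which is the usual choice for a plant nuclear gene but is not stated in the record. The `translate` function and the `CODON_TABLE` construction are illustrative, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... so they line up with the amino-acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence, optionally stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt and last 12 nt of the Sgr029903 CDS.
cds_start = "ATGGCCAGCGCCGGAAACAAACAGTACTGGCTTTTGAAGACAGAGCCGGCGGAGTGGTCG"
cds_end = "ACCATGTCCTAG"

print(translate(cds_start))  # prints MASAGNKQYWLLKTEPAEWS
print(translate(cds_end))    # prints TMS
```

Both fragments agree with the start (`MASAGNKQYWLLKTEPAEWS...`) and end (`...TMS`) of the listed protein sequence. Passing the full CDS, with its line breaks, to `translate` would extend the same check to the whole record.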