CuGenDBv2

Sgr016412 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr016412
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: tig00152909:673316..676500
RNA-Seq Expression: Sgr016412
Synteny: Sgr016412
Gene Ontology terms: NA
InterPro domains: NA
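
The genome location above follows a `contig:start..end` pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The names `Location` and `parse_location` are hypothetical helpers introduced here, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00152909:673316..676500")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the span is 3185 bp, which is longer than the 576 nt CDS listed under Sequences.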


Homology
GenBank top hits | e value | % identity | Alignment
KAG8472030.1 hypothetical protein CXB51_036099 [Gossypium anomalum] | 9.5e-42 | 82.35
Query:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV
        YMAI EACKEAIWL+GLF EL+EDLQI+TVFCDSQSAIFLTKDQMFHERTKHID+RYHF+ +IIACGDIVVSKIST +NP DM+ KSLP+ KF HCL LV
Subjt:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV

Query:  GV
        GV
Subjt:  GV

KAG8481665.1 hypothetical protein CXB51_026496 [Gossypium anomalum] | 6.2e-41 | 82.35
Query:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV
        YMAIIEACKEAIWL+GLF EL+EDLQI+TVFCDSQSAIFLTKDQMFHERTKHID+RYHF+ +IIA GDIVVSKIST +NP DM+ KSLP+ KF HCL LV
Subjt:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV

Query:  GV
        GV
Subjt:  GV

KAG8485521.1 hypothetical protein CXB51_019057 [Gossypium anomalum] | 6.2e-41 | 71.31
Query:  RHVEADLV-------AYALNYMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNP
        R+V+AD         +    YMAI EACKEAIWL+GLF EL+EDLQI+TVFCDSQSAIFLTKDQMFHERTKHID+RYHF+ +IIA GDIVVSKIST +NP
Subjt:  RHVEADLV-------AYALNYMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNP

Query:  VDMIAKSLPLNKFNHCLGLVGV
         DM+ KSLP+ KF HCL LVGV
Subjt:  VDMIAKSLPLNKFNHCLGLVGV

KAG8489021.1 hypothetical protein CXB51_017120 [Gossypium anomalum] | 6.2e-41 | 80.39
Query:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV
        YMAI EACKEAIWL+GLFGEL++DLQI+TVFCDSQS IFLTKDQMFHERTKHID+RYHF+ +IIACGDIVVSKIS  +NP DM+ KSLP+ KF HCL LV
Subjt:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV

Query:  GV
        GV
Subjt:  GV

KAG8496686.1 hypothetical protein CXB51_007936 [Gossypium anomalum] | 9.5e-42 | 82.35
Query:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV
        YMAI EACKEAIWL+GLF EL+EDLQI+TVFCDSQSAIFLTKDQMFHERTKHID+RYHF+ +IIACGDIVVSKIST +NP DM+ KSLP+ KF HCL LV
Subjt:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV

Query:  GV
        GV
Subjt:  GV

TrEMBL top hits | e value | % identity | Alignment
A0A2G2W3Z5 NB-ARC domain-containing protein | 3.8e-36 | 72.22
Query:  DLVAYALNYMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNK
        D   +   YMAI +  KEAIWL+ LFGELS+DLQITTVFCDSQS IFLTKDQMFHERTKHID+RYHF+ EIIA GDIVVSKIST DNP DM+ K+LP +K
Subjt:  DLVAYALNYMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNK

Query:  FNHCLGLV
          HCL L+
Subjt:  FNHCLGLV

A0A2G2WD02 Uncharacterized protein | 2.9e-36 | 75.49
Query:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV
        YMAI EACKEAIWL+GLFGELS+DLQI T+FC+SQSA FLTKDQMF+ER KHID+RY F+ EIIA GDIVVSKIST DNP +M+ K+LP  KF HCL L+
Subjt:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV

Query:  GV
        GV
Subjt:  GV

A0A2G2ZIX5 Uncharacterized protein | 1.1e-35 | 75.49
Query:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV
        YMAI EA KEAIWL+GLFGELS+DLQITTVFCDSQS IFLTKD MFHERTK+ID+RYHF+ EI+A GDI+VSKIST DNP DM+ K L   KF HCL LV
Subjt:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV

Query:  GV
         +
Subjt:  GV

A0A2G2ZTG6 F-box domain-containing protein | 9.0e-38 | 73.87
Query:  HVEADLVAYALNYMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSL
        +V++D       YMAI EA KEAIWL+GLFGELS+DLQITTVFCDSQSAIFLTKDQMFHERT HID+RYHF+ EIIA GDIVV KIST DNP DM+ K L
Subjt:  HVEADLVAYALNYMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSL

Query:  PLNKFNHCLGL
        P  KF HCL L
Subjt:  PLNKFNHCLGL

A0A2G3B6L9 NB-ARC domain-containing protein | 4.0e-38 | 81
Query:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV
        YMAI EA KEAIWL+GLFGELS+DLQ TTVFCDSQSAIFLTKDQMFHERTKHID+RYHF+ EIIA GDIVVSKIST +NPVDM+ K+LP  KF HCL L+
Subjt:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | 1.4e-08 | 28.38
Query:  LDQDGASSKIPHVEDTTSSSRSPSQYTIAKDKPRREIRLPQRHVEADLVAYALNYMAIIEACKEAIWLRGLFGELSEDLQ-ITTVFCDSQSAIFLTKDQM
        +D D A S+I   + TT        + +     +R+  +     EA+       YMA+ EA +EA+WL+ L   ++  L+    ++ D+Q  I +  +  
Subjt:  LDQDGASSKIPHVEDTTSSSRSPSQYTIAKDKPRREIRLPQRHVEADLVAYALNYMAIIEACKEAIWLRGLFGELSEDLQ-ITTVFCDSQSAIFLTKDQM

Query:  FHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKF
         H+R KHID++YHF  E +    I +  I T +   D+  K LP  +F
Subjt:  FHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.3e-21 | 50
Query:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV
        Y+A  E  KE IWL+    EL    +   V+CDSQSAI L+K+ M+H RTKHID+RYH+I E++    + V KIST++NP DM+ K +P NKF  C  LV
Subjt:  YMAIIEACKEAIWLRGLFGELSEDLQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLV

Query:  GV
        G+
Subjt:  GV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 4.2e-08 | 31.07
Query:  YMAIIEACKEAIWLRGLFGELSEDL-QITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGL
        Y ++     E  W+  L  EL   L +   ++CD+  A +L  + +FH R KHI + YHFI   +  G + V  +ST D   D + K L    F +    
Subjt:  YMAIIEACKEAIWLRGLFGELSEDL-QITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGL

Query:  VGV
        +GV
Subjt:  VGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 9.3e-08 | 32.04
Query:  YMAIIEACKEAIWLRGLFGELSEDL-QITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGL
        Y ++     E  W+  L  EL   L     ++CD+  A +L  + +FH R KHI L YHFI   +  G + V  +ST D   D + K L    F +    
Subjt:  YMAIIEACKEAIWLRGLFGELSEDL-QITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGL

Query:  VGV
        +GV
Subjt:  VGV

Arabidopsis top hits | e value | % identity | Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 4.0e-06 | 41.27
Query:  YMAIIEACKEAIWLRGLFGELSEDL-QITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILE
        Y A+  A  E +WL   F EL   L + T +FCD+ +AI +  + +FHERTKHI+   H + E
Subjt:  YMAIIEACKEAIWLRGLFGELSEDL-QITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILE
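
In the alignment blocks above, the middle line shows a letter where query and subject are identical, a `+` for a conservative substitution, and a space otherwise. As a rough sketch of how a % identity value relates to that line, the function below counts the letters and divides by the alignment length. The function name `percent_identity` is hypothetical, and the assumption that the listed % identity is identities over alignment columns (gaps included) follows the usual BLAST convention rather than anything the page states.

```python
from typing import Optional


def percent_identity(midline: str, alignment_length: Optional[int] = None) -> float:
    """Estimate % identity from a BLAST-style match line.

    Letters mark identical columns; '+' and spaces mark non-identical ones.
    Pass alignment_length explicitly if trailing spaces may have been
    stripped from the match line (common in scraped text).
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    length = len(midline) if alignment_length is None else alignment_length
    if length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * identical / length, 2)


# Small synthetic example: 3 identical columns out of 5.
print(percent_identity("AC+ E"))
```

For an alignment that wraps over several blocks, the match lines would need to be concatenated first. By this convention, 84 identical columns over a 102-column alignment gives 82.35, consistent with the value listed for the first GenBank hit.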


Sequences
CDS sequence
ATGTTGAATAAGCAATCTACTAAAGAAGAGTCTTTAGATAAGATTAAACAAAATTTGCACGTGCAAGTGGAGTTTAGGTTAGAGCCATCTTCCAGGTTAGATCAAGATGG
TGCTTCTTCAAAGATCCCTCATGTTGAGGATACTACTTCATCTTCACGATCACCATCACAATATACTATTGCTAAAGACAAACCTAGAAGAGAAATTAGACTTCCACAGA
GACATGTAGAAGCTGATCTAGTTGCTTATGCTTTAAATTATATGGCAATTATAGAGGCTTGTAAAGAAGCTATATGGTTGAGAGGATTGTTTGGTGAACTTAGTGAAGAT
TTACAGATTACTACAGTCTTTTGTGACAGTCAAAGTGCTATCTTCCTTACGAAAGATCAAATGTTTCATGAGAGGACAAAGCACATTGATCTTCGATACCATTTTATACT
TGAAATCATTGCTTGTGGTGATATTGTTGTGAGCAAGATTAGCACAAGTGATAATCCAGTTGATATGATTGCTAAATCTCTTCCATTGAACAAATTCAACCATTGCTTGG
GTTTGGTTGGTGTTTGCAATTTATGA
mRNA sequence
ATGTTGAATAAGCAATCTACTAAAGAAGAGTCTTTAGATAAGATTAAACAAAATTTGCACGTGCAAGTGGAGTTTAGGTTAGAGCCATCTTCCAGGTTAGATCAAGATGG
TGCTTCTTCAAAGATCCCTCATGTTGAGGATACTACTTCATCTTCACGATCACCATCACAATATACTATTGCTAAAGACAAACCTAGAAGAGAAATTAGACTTCCACAGA
GACATGTAGAAGCTGATCTAGTTGCTTATGCTTTAAATTATATGGCAATTATAGAGGCTTGTAAAGAAGCTATATGGTTGAGAGGATTGTTTGGTGAACTTAGTGAAGAT
TTACAGATTACTACAGTCTTTTGTGACAGTCAAAGTGCTATCTTCCTTACGAAAGATCAAATGTTTCATGAGAGGACAAAGCACATTGATCTTCGATACCATTTTATACT
TGAAATCATTGCTTGTGGTGATATTGTTGTGAGCAAGATTAGCACAAGTGATAATCCAGTTGATATGATTGCTAAATCTCTTCCATTGAACAAATTCAACCATTGCTTGG
GTTTGGTTGGTGTTTGCAATTTATGA
Protein sequence
MLNKQSTKEESLDKIKQNLHVQVEFRLEPSSRLDQDGASSKIPHVEDTTSSSRSPSQYTIAKDKPRREIRLPQRHVEADLVAYALNYMAIIEACKEAIWLRGLFGELSED
LQITTVFCDSQSAIFLTKDQMFHERTKHIDLRYHFILEIIACGDIVVSKISTSDNPVDMIAKSLPLNKFNHCLGLVGVCNL
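
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code applies. `CODON_TABLE` and `translate` are names introduced here for illustration, and the example input is only the first ten codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (A/C/G/T only) up to the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS above.
print(translate("ATGTTGAATAAGCAATCTACTAAAGAAGAG"))  # MLNKQSTKEE
```

Passing the full CDS (line breaks are ignored) should yield a string that can be compared with the protein sequence listed above.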