CuGenDBv2

Sgr015419 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr015419
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: tig00003469:1691646..1692122
RNA-Seq Expression: Sgr015419
Synteny: Sgr015419
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAB5529228.1 hypothetical protein DKX38_019309 [Salix brachista]; e value 9.7e-16; identity 52.87%
Query:  QLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLDDSTLAS
        QLQS+KKG+ SIHDY+LKM+++S++ + AGQ+I D+E   YILG L   ++ VVVNLTSR D  + QEVQY+ QSQ++ L+   L++
Subjt:  QLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLDDSTLAS

KAB5551985.1 hypothetical protein DKX38_009296 [Salix brachista]; e value 3.1e-14; identity 52.81%
Query:  QLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD--DSTLAS
        QLQ++KKG+ SIHDY+L+MK+++++   AGQ I D+E   YILG L   ++ VVVNLTSR D  + QEVQY+ QSQ++ L+  +S LAS
Subjt:  QLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD--DSTLAS

PON45747.1 hypothetical protein TorRG33x02_328130 [Trema orientale]; e value 6.3e-15; identity 59.26%
Query:  QLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD
        QLQSIKKGS SIHDYILK K+++D  S AGQ+I DE    YILG LS  ++ VVVN  SR D  S QEVQ++ QSQ++ L+
Subjt:  QLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD

PON91412.1 Zinc finger, CCHC-type, partial [Trema orientale]; e value 1.1e-14; identity 42.55%
Query:  NPEYTFEEKPSSMVMSWLRT------------LNLRRDIWS---RCSGMKNSQ--------LQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFR
        NP YT   +    +MSWL +                 +IWS   R    K+          LQ+IKKGS+SI +YILKM+ ++DS   AGQ I DEE   
Subjt:  NPEYTFEEKPSSMVMSWLRT------------LNLRRDIWS---RCSGMKNSQ--------LQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFR

Query:  YILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD
        YILG L   +E VVVNLTSR+D  S QEVQ+L Q+Q++ L+
Subjt:  YILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD

XP_030509070.1 uncharacterized protein LOC115723734 [Cannabis sativa]; e value 1.9e-16; identity 37.06%
Query:  QLNPEYTFEEKPSSMVMSWLRT------------LNLRRDIWSRCSGMKNS-----------QLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEES
        Q+NP++T   +    ++SWL                +  ++WS    +  +           QLQS+KKGS +IHDYILKMK+I+D  + AGQ   D++ 
Subjt:  QLNPEYTFEEKPSSMVMSWLRT------------LNLRRDIWSRCSGMKNS-----------QLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEES

Query:  FRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD
          YILG L   ++ VV+NLTSR D  +  EVQ+L QSQ++ +D
Subjt:  FRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD

TrEMBL top hits (e value, % identity, alignment)
A0A2P5BAE7 Uncharacterized protein; e value 3.0e-15; identity 59.26%
Query:  QLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD
        QLQSIKKGS SIHDYILK K+++D  S AGQ+I DE    YILG LS  ++ VVVN  SR D  S QEVQ++ QSQ++ L+
Subjt:  QLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD

A0A2P5F0U9 Zinc finger, CCHC-type (Fragment); e value 5.2e-15; identity 42.55%
Query:  NPEYTFEEKPSSMVMSWLRT------------LNLRRDIWS---RCSGMKNSQ--------LQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFR
        NP YT   +    +MSWL +                 +IWS   R    K+          LQ+IKKGS+SI +YILKM+ ++DS   AGQ I DEE   
Subjt:  NPEYTFEEKPSSMVMSWLRT------------LNLRRDIWS---RCSGMKNSQ--------LQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFR

Query:  YILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD
        YILG L   +E VVVNLTSR+D  S QEVQ+L Q+Q++ L+
Subjt:  YILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD

A0A5N5KFY9 WD_REPEATS_REGION domain-containing protein; e value 4.7e-16; identity 52.87%
Query:  QLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLDDSTLAS
        QLQS+KKG+ SIHDY+LKM+++S++ + AGQ+I D+E   YILG L   ++ VVVNLTSR D  + QEVQY+ QSQ++ L+   L++
Subjt:  QLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLDDSTLAS

A0A5N5MA44 Uncharacterized protein; e value 1.5e-14; identity 52.81%
Query:  QLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD--DSTLAS
        QLQ++KKG+ SIHDY+L+MK+++++   AGQ I D+E   YILG L   ++ VVVNLTSR D  + QEVQY+ QSQ++ L+  +S LAS
Subjt:  QLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD--DSTLAS

A0A6N2LP94 Integrase catalytic domain-containing protein; e value 2.7e-16; identity 37.58%
Query:  AVQAATQLNPEYTFEEKPSSMVMSWLRT------------LNLRRDIWSRCSGMKNS-----------QLQSIKKGSTSIHDYILKMKTISDSFSIAGQI
        AV++  Q+NP ++   +    +MSWL               +  R++W     +  +           QLQS+KKG  SIHDY+LKMK++ ++   AG  
Subjt:  AVQAATQLNPEYTFEEKPSSMVMSWLRT------------LNLRRDIWSRCSGMKNS-----------QLQSIKKGSTSIHDYILKMKTISDSFSIAGQI

Query:  IGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD
        I D+E   YILG L   ++ VVVNLTSR D  + QEVQY+ QSQ+I L+
Subjt:  IGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLD

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); e value 9.0e-04; identity 22.22%
Query:  WLRTLNLRRDIWSRCSGMKNSQLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIG
        WL   NL RD     +    ++L++      S+H+Y  K+K++SD  +     I D     ++L  L+  +++++  +  +  F SF E + +   ++  
Subjt:  WLRTLNLRRDIWSRCSGMKNSQLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFRYILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIG

Query:  LDDSTLAS
        L + + +S
Subjt:  LDDSTLAS


Sequences
CDS sequence
ATGATCTTGAAGATCTCTTGCTTAATACAACATAGAAGCCACCAAATTCTATCTCTACATAACCTGCTGAAATTAGCAGTACAAGCTGCAACTCAATTGAATCCAGAGTA
CACATTTGAAGAAAAACCAAGCAGTATGGTGATGAGTTGGCTGCGGACCCTCAATCTCCGAAGAGATATTTGGTCACGTTGCTCAGGTATGAAAAATTCGCAATTACAGT
CAATTAAGAAAGGCTCGACGAGCATTCACGATTATATACTGAAGATGAAAACTATATCTGATAGTTTTTCAATCGCCGGACAAATTATTGGTGATGAAGAGTCATTCAGG
TACATTTTGGGTAGTTTAAGTCTTGCTTTTGAATTTGTGGTCGTTAATCTCACATCTCGAAAAGATTTTGCTTCTTTTCAAGAAGTTCAGTATTTGTTCCAGAGCCAAGA
CATTGGACTGGATGATTCAACTCTAGCGTCTTGGTAA
mRNA sequence
ATGATCTTGAAGATCTCTTGCTTAATACAACATAGAAGCCACCAAATTCTATCTCTACATAACCTGCTGAAATTAGCAGTACAAGCTGCAACTCAATTGAATCCAGAGTA
CACATTTGAAGAAAAACCAAGCAGTATGGTGATGAGTTGGCTGCGGACCCTCAATCTCCGAAGAGATATTTGGTCACGTTGCTCAGGTATGAAAAATTCGCAATTACAGT
CAATTAAGAAAGGCTCGACGAGCATTCACGATTATATACTGAAGATGAAAACTATATCTGATAGTTTTTCAATCGCCGGACAAATTATTGGTGATGAAGAGTCATTCAGG
TACATTTTGGGTAGTTTAAGTCTTGCTTTTGAATTTGTGGTCGTTAATCTCACATCTCGAAAAGATTTTGCTTCTTTTCAAGAAGTTCAGTATTTGTTCCAGAGCCAAGA
CATTGGACTGGATGATTCAACTCTAGCGTCTTGGTAA
Protein sequence
MILKISCLIQHRSHQILSLHNLLKLAVQAATQLNPEYTFEEKPSSMVMSWLRTLNLRRDIWSRCSGMKNSQLQSIKKGSTSIHDYILKMKTISDSFSIAGQIIGDEESFR
YILGSLSLAFEFVVVNLTSRKDFASFQEVQYLFQSQDIGLDDSTLASW
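
Consistency check (illustrative)
The genome location and the protein sequence in this record can be cross-checked against each other. The following is a minimal Python sketch of that check. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


def span_length(location: str) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


def expected_cds_length(protein_length: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_length + 1)


# Values taken from the record; the protein sequence has 158 residues.
LOCATION = "tig00003469:1691646..1692122"
PROTEIN_LENGTH = 158

if __name__ == "__main__":
    print(span_length(LOCATION), expected_cds_length(PROTEIN_LENGTH))
```

If the two numbers agree, the genomic span is exactly as long as the coding sequence, which would be consistent with the CDS and mRNA sequences listed here being identical (a single-exon model with no annotated UTRs).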