; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g11140 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g11140
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE2
Genome locationchr9:9462833..9465907
RNA-Seq ExpressionMoc09g11140
SyntenyMoc09g11140
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_021717245.1 uncharacterized protein LOC110685089 [Chenopodium quinoa]3.0e-3831.94Show/hide
Query:  TPLGVVTKIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSP---------------------------------
        TP     KI+T+SPFYLGPQDRPGDFIT    KLD+++EWSHAI ++LSS RKFGFL+ TIT    P                                 
Subjt:  TPLGVVTKIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSP---------------------------------

Query:  ----------------------------------------------NVL---------------------------------------------------
                                                      +VL                                                   
Subjt:  ----------------------------------------------NVL---------------------------------------------------

Query:  ------DPLPSLNWAFQQVAQDEWVRGITRPRSNNDEKPEVVGFAVCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSSDNK
              DPLPSLN A+Q +AQ+E VRGI    S+ +E PE V FAV  +N  K +L RAE   L  THCHK GH ++ CF  HG P  Y+E YGS  +N+
Subjt:  ------DPLPSLNWAFQQVAQDEWVRGITRPRSNNDEKPEVVGFAVCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSSDNK

Query:  GNFRGKATFHKASNNCGRSTARANVVAVDGPHSTSPVVLAPLEFTTEQWAALTVEFG--TPSSNRLHGKYKMTDWIIDTGCS
        G  RG+            +  +AN        ST+P  +A    T EQW AL    G  +P +NR++GK     WI+DTGCS
Subjt:  GNFRGKATFHKASNNCGRSTARANVVAVDGPHSTSPVVLAPLEFTTEQWAALTVEFG--TPSSNRLHGKYKMTDWIIDTGCS

XP_021746636.1 uncharacterized protein LOC110712479 [Chenopodium quinoa]1.4e-3233.73Show/hide
Query:  KIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSPNVLDPLPSL-----NWAFQQV-------------------
        KI+ +SPFYL  QD+ G++IT V  KL++FD W+H I V+LSS RKFGFL+ TI  VV P   D   ++     +W    +                   
Subjt:  KIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSPNVLDPLPSL-----NWAFQQV-------------------

Query:  --------------------------------------------------------------------------AQDEWVRGITRPRSNNDEKPEVVGFA
                                                                                  +QDE V  I++ R   +EKPE+ GFA
Subjt:  --------------------------------------------------------------------------AQDEWVRGITRPRSNNDEKPEVVGFA

Query:  VCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSS-DNKGNFRGKATFHKASNNCGRSTARANVVAVDGPHSTSPVVLAPLEF
        V  + R K +L RAE   LT THCHKSGH ++ CF LHG P  Y E YGSS+ D+KG  +G++    A    GR    AN  A   P   S   L+   F
Subjt:  VCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSS-DNKGNFRGKATFHKASNNCGRSTARANVVAVDGPHSTSPVVLAPLEF

Query:  TTEQWAALTVEFGT--PSSNRLHGKYKMTDWIIDTGCS
        + EQW +L   FGT  P SNRL+G     +WIIDTGCS
Subjt:  TTEQWAALTVEFGT--PSSNRLHGKYKMTDWIIDTGCS

XP_021746757.1 uncharacterized protein LOC110712595 [Chenopodium quinoa]2.9e-2528.61Show/hide
Query:  TKIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSP---------------------------------------
        TKI+ +SP+YLGPQDRPGDFIT    KLD+F+EWS  I ++LSS R+FGFL+ TIT    P                                       
Subjt:  TKIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSP---------------------------------------

Query:  ----------------------------------------NVL---------------------------------------------------------
                                                +VL                                                         
Subjt:  ----------------------------------------NVL---------------------------------------------------------

Query:  DPLPSLNWAFQQVAQDEWVRGITRPRSNNDEKPEVVGFAVCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSSDNKGNFRGK
        DPLP+LN AFQ +AQ++ VRGI   +   +E PEV GFAV  +   K +L RAE   L  T+C+K  H S+ CF  HG P  Y+E YGS  +NKG   G+
Subjt:  DPLPSLNWAFQQVAQDEWVRGITRPRSNNDEKPEVVGFAVCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSSDNKGNFRGK

Query:  ATFHKASNNCGRSTARANVVAVDGPHSTSPVVLAPLEFTTEQWAALTVEFGTPSSNRLHG
        +               A V A   P S  P   +    T EQW  + +    P SNR++G
Subjt:  ATFHKASNNCGRSTARANVVAVDGPHSTSPVVLAPLEFTTEQWAALTVEFGTPSSNRLHG

XP_021756883.1 uncharacterized protein LOC110721955 [Chenopodium quinoa]2.3e-3029.92Show/hide
Query:  LAKTPLGVVTKIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSP------------------------------
        +A TP     KI+++SP+YLGP+DR GDFIT    KLD+++EWSHAI ++LSS RKFGFL+ TIT    P                              
Subjt:  LAKTPLGVVTKIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSP------------------------------

Query:  -------------------------------------------------NVL------------------------------------------------
                                                         +VL                                                
Subjt:  -------------------------------------------------NVL------------------------------------------------

Query:  ---------DPLPSLNWAFQQVAQDEWVRGITRPRSNNDEKPEVVGFAVCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSS
                 DPLPSLN AFQ +AQ+E VRGI    ++ +E PEVVGFA   +N+ K +L RAE   L  T+C + GH +S CF  HG P+ Y E YG   
Subjt:  ---------DPLPSLNWAFQQVAQDEWVRGITRPRSNNDEKPEVVGFAVCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSS

Query:  DNKGNFRGKATFHKASNNCGRSTARANVVAVDGPHSTSPVV------LAPLEFTTEQWAALTVEFGT--PSSNRLHGKYKMTDWIIDTGCS
         NK            SNN         + A   P +T+PV       + PL  + EQW A+    G   P  +R++GK     WIIDTGCS
Subjt:  DNKGNFRGKATFHKASNNCGRSTARANVVAVDGPHSTSPVV------LAPLEFTTEQWAALTVEFGT--PSSNRLHGKYKMTDWIIDTGCS

XP_021757931.1 uncharacterized protein LOC110722967 [Chenopodium quinoa]9.9e-2642.61Show/hide
Query:  DPLPSLNWAFQQVAQDEWVRGITRPRSNNDEKPEVVGFAVCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSSDNKGNFRGK
        DPLPSLN AFQ +AQ+E VRGI   +   +E PEV GFAV  +N  K +L RAE   L   HCHK GH ++ CF  HG P  Y+E YGS  D++G  RG+
Subjt:  DPLPSLNWAFQQVAQDEWVRGITRPRSNNDEKPEVVGFAVCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSSDNKGNFRGK

Query:  ATFHKASNNCGRSTARANVVAVDGPHSTSPVVLAPLEFTTEQWAALTVEFGTP--SSNRLHGKYKMTDWIIDTGCS
            KA+         A   A   P + S         T E+W A+    G     S+R++GK     WIIDTGCS
Subjt:  ATFHKASNNCGRSTARANVVAVDGPHSTSPVVLAPLEFTTEQWAALTVEFGTP--SSNRLHGKYKMTDWIIDTGCS

TrEMBL top hitse value%identityAlignment
A0A3Q7IBW4 Uncharacterized protein1.1e-1726.33Show/hide
Query:  TKIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSP---------------------------------------
        +KI+  +PFYLG  DRPGDFITP+  KLD+FD WSHA++V+LSS RKFGFL+ TI   VSP                                       
Subjt:  TKIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSP---------------------------------------

Query:  -----------------------------------------------------------------------------------------------NVL--
                                                                                                       N+L  
Subjt:  -----------------------------------------------------------------------------------------------NVL--

Query:  DPLPSLNWAFQQVAQDEWVRGITRPRSNNDEKPEVVGFA--------------------VCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSP
        DPLPSLN A+QQV+Q+E+VRG+ R +   D+    VGFA                    VC+  ++   L      +   THC K GH+ S C+ L+G P
Subjt:  DPLPSLNWAFQQVAQDEWVRGITRPRSNNDEKPEVVGFA--------------------VCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSP

Query:  DCYLEMYGSSSDNKGNFRGKATFHKASNNCGRSTARANVVAVDGP----------HSTSPVVLAPLEFTTEQWAAL
        +      G +  ++GN +   T H      GR  ARAN  A   P           ST+P       F+ EQW A+
Subjt:  DCYLEMYGSSSDNKGNFRGKATFHKASNNCGRSTARANVVAVDGP----------HSTSPVVLAPLEFTTEQWAAL

A0A438GHZ2 Uncharacterized protein3.1e-1731.84Show/hide
Query:  MTNDNKTPLAKTPLGVVTKIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIR------VSLSSWRKFG-FLNATITKVV-SPNVLDPLPSLNWAFQ
        M  D++ P    P     K + +SPF+LG  DRPGDFITP   + D++D+W+  I        S + W+      N  I  ++ +P++  PLPSL+ A+Q
Subjt:  MTNDNKTPLAKTPLGVVTKIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIR------VSLSSWRKFG-FLNATITKVV-SPNVLDPLPSLNWAFQ

Query:  QVAQDEWVRGITRPRSNNDEKP-EVVGFAVCTD-NRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSSDNKGNFRGKATFHKASNN
         V QDE VR     ++  ++KP EV+GFAV T   R + + +R        +HC K+GH +S C                     G   G A  + AS+ 
Subjt:  QVAQDEWVRGITRPRSNNDEKP-EVVGFAVCTD-NRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSSDNKGNFRGKATFHKASNN

Query:  CGRSTARANVVAVDGPHSTSPVVLAPLEFTTEQWAALTVEFGTPS--SNRLHGKYKMTDWIIDTGCS
         G S+ ++         ST  +      FT EQW AL    G      +RL+ K+    WIIDTG +
Subjt:  CGRSTARANVVAVDGPHSTSPVVLAPLEFTTEQWAALTVEFGTPS--SNRLHGKYKMTDWIIDTGCS

A0A438K0Z3 Retrovirus-related Pol polyprotein from transposon RE24.3e-1928.73Show/hide
Query:  MTNDNKTPLAKTPLGVVTKIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSP----------------------
        M  D++ P    P     K + +SPF+LG  DRPGDFITP   + D++D+W+  I+++L + RKF FL  TIT    P                      
Subjt:  MTNDNKTPLAKTPLGVVTKIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSP----------------------

Query:  ------------------------------------------------------------------NVL--DPLPSLNWAFQQVAQDEWVRGITRPRSNN
                                                                          N+L  DPLPSL+ A+Q V QDE VR     ++  
Subjt:  ------------------------------------------------------------------NVL--DPLPSLNWAFQQVAQDEWVRGITRPRSNN

Query:  DEKP-EVVGFAVCTD-NRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSSDN----KGNFRGKATFHKASNNCGRS---------T
        ++KP EV+GFAV T   R + +++R        +HC K+GH +S C+ L   P C+   +G   +N     G   G    +KA    GRS         +
Subjt:  DEKP-EVVGFAVCTD-NRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSSDN----KGNFRGKATFHKASNNCGRS---------T

Query:  ARANVVAVDGPHSTSPVVLAPLEFTTEQWAALTVEFGTP--SSNRLHGKYKMTDWIIDTGCS
        ARAN  +     S++      L FT EQW AL    G    S +RL+GK+    WIIDTG +
Subjt:  ARANVVAVDGPHSTSPVVLAPLEFTTEQWAALTVEFGTP--SSNRLHGKYKMTDWIIDTGCS

A0A443PJ17 Copia protein gag-int-pol protein2.5e-1935.86Show/hide
Query:  DPLPSLNWAFQQVAQDEWVRGITRPRSNNDEKPEVVGFAVCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLE---MYGSSSDNKGNF
        +PLPSL+ A+QQ+ Q+E VRGIT  R   +  PEVVGFAV  + R +++ D+ + + L  +HCH+SGH    CF L G P+ + +     G++       
Subjt:  DPLPSLNWAFQQVAQDEWVRGITRPRSNNDEKPEVVGFAVCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLE---MYGSSSDNKGNF

Query:  RGKATFHKASNNCGRSTARANVVAVDGPH------------------STSPVVLAPLEFTTEQWAALTVEFGT--PSSNRLHGKYKMTDWIIDTGCSS
         G+ T   A+     +T RA  VAVDG +                  STS    +    + EQW  +   FG    S++RLHG++  T WIIDTG S+
Subjt:  RGKATFHKASNNCGRSTARANVVAVDGPH------------------STSPVVLAPLEFTTEQWAALTVEFGT--PSSNRLHGKYKMTDWIIDTGCSS

A0A803LXS4 Uncharacterized protein5.9e-1635.71Show/hide
Query:  KIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSPNVLDPLPSLNW------------AFQQ------------V
        KI+ SSP+YLG  D PG+ IT V  K D++  WS AI +SL S RKF F+N TITK      L     L+W             F+Q            V
Subjt:  KIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSPNVLDPLPSLNW------------AFQQ------------V

Query:  AQDEWVRGITRPRSNNDEKPEVVGFAVCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPD
           E  RGI R ++  D+  +       ++ R K  +D  + + L+ +HC K+GH+   CF+LHG PD
Subjt:  AQDEWVRGITRPRSNNDEKPEVVGFAVCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAATGACAACAAAACTCCACTGGCAAAAACTCCATTAGGAGTTGTCACAAAAATTGATACAAGTTCTCCGTTTTACCTAGGGCCACAGGATCGTCCTGGT
GATTTCATTACTCCGGTTTGTTGGAAGCTCGATCATTTTGACGAATGGTCTCATGCGATTCGAGTTTCTCTTTCTTCTTGGAGAAAATTTGGTTTTCTTAACGCA
ACTATCACTAAGGTCGTTTCGCCAAATGTGTTGGACCCTCTTCCATCCTTGAATTGGGCATTTCAGCAAGTGGCTCAAGATGAGTGGGTTAGAGGAATTACTCGA
CCTAGATCAAATAATGATGAGAAACCTGAGGTGGTTGGTTTTGCTGTGTGTACGGATAATAGACAAAAGTCACAATTGGATCGAGCAGAAAATACAGTATTGACC
TATACTCATTGTCACAAATCAGGACATTCTAGTAGTATATGTTTTGTTTTACATGGCAGTCCAGATTGTTATTTGGAGATGTATGGGTCCTCATCTGATAATAAA
GGAAATTTTAGAGGTAAAGCCACGTTCCACAAAGCTTCAAACAATTGTGGGCGTTCCACAGCACGTGCAAATGTCGTTGCCGTTGATGGCCCTCACAGTACATCA
CCCGTCGTGTTAGCCCCTCTGGAATTTACAACCGAACAGTGGGCGGCCTTAACAGTTGAGTTTGGTACTCCCTCTTCCAACCGGCTACACGGTAAGTATAAAATG
ACTGATTGGATTATTGATACCGGATGCTCATCATGTTACAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACAAATGACAACAAAACTCCACTGGCAAAAACTCCATTAGGAGTTGTCACAAAAATTGATACAAGTTCTCCGTTTTACCTAGGGCCACAGGATCGTCCTGGT
GATTTCATTACTCCGGTTTGTTGGAAGCTCGATCATTTTGACGAATGGTCTCATGCGATTCGAGTTTCTCTTTCTTCTTGGAGAAAATTTGGTTTTCTTAACGCA
ACTATCACTAAGGTCGTTTCGCCAAATGTGTTGGACCCTCTTCCATCCTTGAATTGGGCATTTCAGCAAGTGGCTCAAGATGAGTGGGTTAGAGGAATTACTCGA
CCTAGATCAAATAATGATGAGAAACCTGAGGTGGTTGGTTTTGCTGTGTGTACGGATAATAGACAAAAGTCACAATTGGATCGAGCAGAAAATACAGTATTGACC
TATACTCATTGTCACAAATCAGGACATTCTAGTAGTATATGTTTTGTTTTACATGGCAGTCCAGATTGTTATTTGGAGATGTATGGGTCCTCATCTGATAATAAA
GGAAATTTTAGAGGTAAAGCCACGTTCCACAAAGCTTCAAACAATTGTGGGCGTTCCACAGCACGTGCAAATGTCGTTGCCGTTGATGGCCCTCACAGTACATCA
CCCGTCGTGTTAGCCCCTCTGGAATTTACAACCGAACAGTGGGCGGCCTTAACAGTTGAGTTTGGTACTCCCTCTTCCAACCGGCTACACGGTAAGTATAAAATG
ACTGATTGGATTATTGATACCGGATGCTCATCATGTTACAGGTAA
Protein sequenceShow/hide protein sequence
MTNDNKTPLAKTPLGVVTKIDTSSPFYLGPQDRPGDFITPVCWKLDHFDEWSHAIRVSLSSWRKFGFLNATITKVVSPNVLDPLPSLNWAFQQVAQDEWVRGITR
PRSNNDEKPEVVGFAVCTDNRQKSQLDRAENTVLTYTHCHKSGHSSSICFVLHGSPDCYLEMYGSSSDNKGNFRGKATFHKASNNCGRSTARANVVAVDGPHSTS
PVVLAPLEFTTEQWAALTVEFGTPSSNRLHGKYKMTDWIIDTGCSSCYR