CuGenDBv2

Moc08g30600 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g30600
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr8:21936112..21937492
RNA-Seq Expression: Moc08g30600
Synteny: Moc08g30600
Gene Ontology terms: NA
InterPro domains: IPR025724 - GAG-pre-integrase domain
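
The genome location field can be split into chromosome, start, and end to get the span of the gene model. The sketch below is only an illustration: `parse_location` is a hypothetical helper, not part of CuGenDB, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string and return (chrom, start, end, span)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    # Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr8:21936112..21937492")
print(chrom, start, end, span)
```

Under that assumption the gene model spans 1381 bp on chr8.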


Homology
GenBank top hits (e value, %identity, alignment)
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]; e value 6.5e-21; %identity 43.57
Query:  VSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSK
        VSKLA DN++ +EF  N C VKD  +GKV+LKG LKDGLYQ                    L+  K + SA           FV  K +WHRRLGHP++K
Subjt:  VSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSK

Query:  VLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS
        VLD ++++C + V  +D F+FCE+CQ+GK H++ F +S+S
Subjt:  VLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]; e value 1.2e-19; %identity 40.43
Query:  VSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSK
        +SKL  DND+++EFH   C VKD  +G+++L+G +KDGLYQL  GG+ + +   H                           F   K TWHR+LGHP+SK
Subjt:  VSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSK

Query:  VLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNSC
        VL+ ++K C +     + F FCE+CQFGK+H + F NS SC
Subjt:  VLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNSC

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]; e value 2.1e-19; %identity 35.71
Query:  QFAGSNLNQGKTGNPNGSYAQNGRNNPSRAPHAF-LTTENTNHFVANPETVLDPNCVSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDT
        +F G N + GK    N     NG      A  +  L T N +  +  P+   +   VSKL  DN++F+EF AN C VKD  +G+ +LKG LKDGLYQL  
Subjt:  QFAGSNLNQGKTGNPNGSYAQNGRNNPSRAPHAF-LTTENTNHFVANPETVLDPNCVSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDT

Query:  GGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS
              S  S+   C+                      ++  K +WHR+LGHP++KVL+ ++K+C + +  +D F+FCE+CQFGK H++ F +S+S
Subjt:  GGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS

RZB67542.1 Retrovirus-related Pol polyprotein from transposon RE1 [Glycine soja]; e value 1.2e-19; %identity 37.27
Query:  LTTENTNHFVANPETVLDPNCVSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFAL
        L   N ++ +  PE   +   VSKL  DN+  +EF AN C VKD  +GK +LKG L+DGLYQL +                     +V+           
Subjt:  LTTENTNHFVANPETVLDPNCVSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFAL

Query:  SVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS
           ++  K  WHR+LGHP++KVL+ ++KNC +    ND F+FCE+CQFGK H++ F  S+S
Subjt:  SVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS

TXG52359.1 hypothetical protein EZV62_021528 [Acer yangbiense]; e value 1.1e-20; %identity 40.74
Query:  NHFVANPETVLDPNCVSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGA-------VTGSASSHTTGCLELANDKVSHSAYILSNFA
        N+ + +P+   +   +S+L   N+VF+EF++  CLVKD   G V+LKG LKD LY LD           VT S  S T+    L N +   S  I   + 
Subjt:  NHFVANPETVLDPNCVSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGA-------VTGSASSHTTGCLELANDKVSHSAYILSNFA

Query:  LSVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS
           + V SK TWHR L HPSSKVL  ++ NC   +K+N  FTFC++ Q+GKSH++ +  SNS
Subjt:  LSVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS

TrEMBL top hits (e value, %identity, alignment)
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 5.9e-20; %identity 40.43
Query:  VSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSK
        +SKL  DND+++EFH   C VKD  +G+++L+G +KDGLYQL  GG+ + +   H                           F   K TWHR+LGHP+SK
Subjt:  VSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSK

Query:  VLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNSC
        VL+ ++K C +     + F FCE+CQFGK+H + F NS SC
Subjt:  VLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNSC

A0A2Z6MBG6 Integrase catalytic domain-containing protein; e value 3.1e-21; %identity 43.57
Query:  VSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSK
        VSKLA DN++ +EF  N C VKD  +GKV+LKG LKDGLYQ                    L+  K + SA           FV  K +WHRRLGHP++K
Subjt:  VSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSK

Query:  VLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS
        VLD ++++C + V  +D F+FCE+CQ+GK H++ F +S+S
Subjt:  VLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS

A0A445H1W7 Retrovirus-related Pol polyprotein from transposon RE1; e value 5.9e-20; %identity 37.27
Query:  LTTENTNHFVANPETVLDPNCVSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFAL
        L   N ++ +  PE   +   VSKL  DN+  +EF AN C VKD  +GK +LKG L+DGLYQL +                     +V+           
Subjt:  LTTENTNHFVANPETVLDPNCVSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFAL

Query:  SVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS
           ++  K  WHR+LGHP++KVL+ ++KNC +    ND F+FCE+CQFGK H++ F  S+S
Subjt:  SVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS

A0A5C7H6K5 Integrase catalytic domain-containing protein; e value 5.4e-21; %identity 40.74
Query:  NHFVANPETVLDPNCVSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGA-------VTGSASSHTTGCLELANDKVSHSAYILSNFA
        N+ + +P+   +   +S+L   N+VF+EF++  CLVKD   G V+LKG LKD LY LD           VT S  S T+    L N +   S  I   + 
Subjt:  NHFVANPETVLDPNCVSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGA-------VTGSASSHTTGCLELANDKVSHSAYILSNFA

Query:  LSVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS
           + V SK TWHR L HPSSKVL  ++ NC   +K+N  FTFC++ Q+GKSH++ +  SNS
Subjt:  LSVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS

A0A803P5A9 Uncharacterized protein; e value 8.3e-22; %identity 40.27
Query:  PETVLDPNCVSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWH
        PE       +SKL  DND+ +EF ++FC VKD  + KV+L G LKDGLYQL        ++      C    +   +H     ++     NF+  K  WH
Subjt:  PETVLDPNCVSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWH

Query:  RRLGHPSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS
        RRLGHPSSK+L L++ +  +PV  N+  +FC++CQ+GKSH + F  SNS
Subjt:  RRLGHPSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSNS

SwissProt top hits (e value, %identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 5.2e-05; %identity 27.97
Query:  NCVSKLARDNDVFLEFHAN--FCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGH
        N +S +A D D +  + AN  + L K      V+ KG  +  LY+ +         +    G L  A D++S                     WH+R+GH
Subjt:  NCVSKLARDNDVFLEFHAN--FCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGH

Query:  PSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSN
         S K L +L K  L+          C+ C FGK H VSF  S+
Subjt:  PSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFANSN

P93293 Uncharacterized mitochondrial protein AtMg00300; e value 6.8e-05; %identity 28.21
Query:  LVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIF
        ++K +   + +LKG+  D LY L       GS  +  +   E A D+                       WH RL H S + ++LLVK   L        
Subjt:  LVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIF

Query:  TFCESCQFGKSHVVSFA
         FCE C +GK+H V+F+
Subjt:  TFCESCQFGKSHVVSFA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value 7.7e-09; %identity 31.65
Query:  VSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSK
        V +L   N V +EF      VKD+++G  +L+G  KD LY+          ASS           K +HS                  +WH RLGHPS  
Subjt:  VSKLARDNDVFLEFHANFCLVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSK

Query:  VLDLLVKNCLLPV-KVNDIFTFCESCQFGKSHVVSFANS
        +L+ ++ N  LPV   +     C  C   KSH V F+NS
Subjt:  VLDLLVKNCLLPV-KVNDIFTFCESCQFGKSHVVSFANS

Arabidopsis top hits (e value, %identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein; e value 4.8e-06; %identity 28.21
Query:  LVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIF
        ++K +   + +LKG+  D LY L       GS  +  +   E A D+                       WH RL H S + ++LLVK   L        
Subjt:  LVKDIHSGKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIF

Query:  TFCESCQFGKSHVVSFA
         FCE C +GK+H V+F+
Subjt:  TFCESCQFGKSHVVSFA


Sequences
CDS sequence
ATGGGATACGAGAATGCTAGTAATCTCTGGACAGCCATTCAAGAGCTATTTGGAGTTCAATCTCGAGCAGAGGAAGATCATCTTCGTCAGTTTGCAGGGTCTAATTTAAA
TCAAGGAAAAACTGGAAATCCCAATGGTTCATATGCACAAAATGGGAGAAATAATCCAAGTCGAGCTCCCCATGCCTTTCTCACAACAGAGAACACCAATCATTTTGTGG
CAAATCCTGAAACTGTCCTAGATCCAAATTGTGTGTCAAAACTTGCTCGAGATAATGATGTGTTTTTGGAATTTCATGCTAATTTCTGTCTTGTAAAGGACATTCATTCG
GGCAAGGTAGTGCTGAAGGGGTCTCTTAAAGATGGACTTTACCAACTTGACACTGGAGGTGCAGTCACTGGTAGTGCTTCAAGTCACACTACGGGTTGCTTGGAGTTGGC
TAATGATAAAGTTTCTCACTCTGCTTATATTCTTTCTAATTTTGCTCTTAGTGTAAATTTTGTGGTGTCTAAAGTTACGTGGCATCGAAGACTTGGACACCCATCTTCCA
AAGTTCTTGATTTATTAGTTAAAAATTGTCTTCTACCGGTAAAAGTGAATGATATTTTCACATTTTGTGAATCATGTCAATTTGGTAAATCTCATGTTGTTTCGTTCGCC
AATTCAAATTCTTGCTACTGCTCCATTTGA
mRNA sequence
ATGGGATACGAGAATGCTAGTAATCTCTGGACAGCCATTCAAGAGCTATTTGGAGTTCAATCTCGAGCAGAGGAAGATCATCTTCGTCAGTTTGCAGGGTCTAATTTAAA
TCAAGGAAAAACTGGAAATCCCAATGGTTCATATGCACAAAATGGGAGAAATAATCCAAGTCGAGCTCCCCATGCCTTTCTCACAACAGAGAACACCAATCATTTTGTGG
CAAATCCTGAAACTGTCCTAGATCCAAATTGTGTGTCAAAACTTGCTCGAGATAATGATGTGTTTTTGGAATTTCATGCTAATTTCTGTCTTGTAAAGGACATTCATTCG
GGCAAGGTAGTGCTGAAGGGGTCTCTTAAAGATGGACTTTACCAACTTGACACTGGAGGTGCAGTCACTGGTAGTGCTTCAAGTCACACTACGGGTTGCTTGGAGTTGGC
TAATGATAAAGTTTCTCACTCTGCTTATATTCTTTCTAATTTTGCTCTTAGTGTAAATTTTGTGGTGTCTAAAGTTACGTGGCATCGAAGACTTGGACACCCATCTTCCA
AAGTTCTTGATTTATTAGTTAAAAATTGTCTTCTACCGGTAAAAGTGAATGATATTTTCACATTTTGTGAATCATGTCAATTTGGTAAATCTCATGTTGTTTCGTTCGCC
AATTCAAATTCTTGCTACTGCTCCATTTGA
Protein sequence
MGYENASNLWTAIQELFGVQSRAEEDHLRQFAGSNLNQGKTGNPNGSYAQNGRNNPSRAPHAFLTTENTNHFVANPETVLDPNCVSKLARDNDVFLEFHANFCLVKDIHS
GKVVLKGSLKDGLYQLDTGGAVTGSASSHTTGCLELANDKVSHSAYILSNFALSVNFVVSKVTWHRRLGHPSSKVLDLLVKNCLLPVKVNDIFTFCESCQFGKSHVVSFA
NSNSCYCSI
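
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; `translate` is a hypothetical helper written for this illustration and is not a CuGenDB tool. Only the first and last 30 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last 30 nt of the CDS listed above.
first = translate("ATGGGATACGAGAATGCTAGTAATCTCTGG")
last = translate("AATTCAAATTCTTGCTACTGCTCCATTTGA")
print(first, last)
```

The two fragments translate to MGYENASNLW and NSNSCYCSI, which match the start and end of the protein sequence listed above. Passing the full CDS to `translate` would allow the same comparison over the whole sequence.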