CuGenDBv2

Moc07g02610 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g02610
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr7:2238805..2241223
RNA-Seq Expression: Moc07g02610
Synteny: Moc07g02610
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
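The genome location string above can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); the function name parse_location is hypothetical and not part of any CuGenDB tooling.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr7:2238805..2241223")
print(chrom, start, end, span)
```

Under that assumption the locus spans 2419 bp on chr7.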


Homology
GenBank top hits
XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia]; e value: 2.9e-18; % identity: 26.69
Query:  MLLMKPPPTINRAFTLVSQEVQQRAISSISSPFVPSSVPSAAFLKTSVKATEPPSKPTPNAISDSL---------------------------------S
        +LLM PPP++N+A +LV Q+ QQR+I   +S  +P++ PS A L     A++PPSK T   ++  L                                 S
Subjt:  MLLMKPPPTINRAFTLVSQEVQQRAISSISSPFVPSSVPSAAFLKTSVKATEPPSKPTPNAISDSL---------------------------------S

Query:  HISA---DQCQGLLNLLQSHLA--KVKTESEPSTSHVVGTCFYALQNSGISTDQWVIDSGASTHICYSRDLFLGLRSVSGVHISLPDNSRVNVETS----
        H ++      Q L  LLQS L+  K   +++ +TS+ V T           T   ++D GAS HIC  R LF  +  +S VH++LP+  R  VE S    
Subjt:  HISA---DQCQGLLNLLQSHLA--KVKTESEPSTSHVVGTCFYALQNSGISTDQWVIDSGASTHICYSRDLFLGLRSVSGVHISLPDNSRVNVETS----

Query:  -------------------------------LLIVQFSGDSCLIQDKFSLKTIGKDFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMA
                                        L V+F+ D+C+IQDK   KTI K  +    +  + + S  SS      ++   A  +   + W + + 
Subjt:  -------------------------------LLIVQFSGDSCLIQDKFSLKTIGKDFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMA

Query:  SPIHNATTA--------------YVVNNSNFP------------SFTAVILHD----------DHV----VDVPFAAIVENS----NVPSAVIENSVVPS
         P      A              ++ ++ +FP             F  ++L +          DH+     D+   A+V  +     VP      + VPS
Subjt:  SPIHNATTA--------------YVVNNSNFP------------SFTAVILHD----------DHV----VDVPFAAIVENS----NVPSAVIENSVVPS

Query:  AVDIKTSVVPSVVMP------VDPWIQQSISVIPSTSV-RRSQRDSRPPSYLKDYHCNLLASAALPPFQSRYPLQKVLSYDRLSPSY
        A    ++ V S  MP        P      S++P   V RRS R S+ PSYL+D+HC+LL ++   P  +R+PLQ+ LSY RLS ++
Subjt:  AVDIKTSVVPSVVMP------VDPWIQQSISVIPSTSV-RRSQRDSRPPSYLKDYHCNLLASAALPPFQSRYPLQKVLSYDRLSPSY

XP_022150388.1 uncharacterized protein LOC111018564 isoform X1 [Momordica charantia]; e value: 2.7e-88; % identity: 98.28
Query:  DFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMASPIHNATTAYVVNNSNFPSFTAVILHDDHVVDVPFAAIVENSNVPSAVIENSVVP
        DFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMASPIHNATTAYVVNNSNFPSFTAVILHDDHVVDVPFAAIVENSNVPSAVIENSVVP
Subjt:  DFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMASPIHNATTAYVVNNSNFPSFTAVILHDDHVVDVPFAAIVENSNVPSAVIENSVVP

Query:  SAVDIKTSVVPSVVMPVDPWIQQSISVIPSTSVRRSQRDSRPPSYLKDYHCNLLASAALPPFQSRYPLQKVLSY
        SAVDIKTSVVPSVVMPVDPWIQQSISVIPSTSVRRSQRDSRPPSYLKDYHCNLLASAALPPFQSRYPLQKV+ +
Subjt:  SAVDIKTSVVPSVVMPVDPWIQQSISVIPSTSVRRSQRDSRPPSYLKDYHCNLLASAALPPFQSRYPLQKVLSY

XP_022150390.1 uncharacterized protein LOC111018564 isoform X2 [Momordica charantia]; e value: 3.3e-22; % identity: 100
Query:  DFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMASPIHNATTAYV
        DFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMASPIHNATTAYV
Subjt:  DFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMASPIHNATTAYV

XP_022150391.1 uncharacterized protein LOC111018564 isoform X3 [Momordica charantia]; e value: 4.3e-62; % identity: 97.71
Query:  MASPIHNATTAYVVNNSNFPSFTAVILHDDHVVDVPFAAIVENSNVPSAVIENSVVPSAVDIKTSVVPSVVMPVDPWIQQSISVIPSTSVRRSQRDSRPP
        MASPIHNATTAYVVNNSNFPSFTAVILHDDHVVDVPFAAIVENSNVPSAVIENSVVPSAVDIKTSVVPSVVMPVDPWIQQSISVIPSTSVRRSQRDSRPP
Subjt:  MASPIHNATTAYVVNNSNFPSFTAVILHDDHVVDVPFAAIVENSNVPSAVIENSVVPSAVDIKTSVVPSVVMPVDPWIQQSISVIPSTSVRRSQRDSRPP

Query:  SYLKDYHCNLLASAALPPFQSRYPLQKVLSY
        SYLKDYHCNLLASAALPPFQSRYPLQKV+ +
Subjt:  SYLKDYHCNLLASAALPPFQSRYPLQKVLSY

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]; e value: 1.3e-18; % identity: 27.53
Query:  MLLMKPPPTINRAFTLVSQEVQQRAIS--SISSPFVPS------------SVPSAAFLK-----------------------------------------
        +LLM+P PTINRAF LV+QE+QQR+IS  S++SP   +            +  SA+ +K                                         
Subjt:  MLLMKPPPTINRAFTLVSQEVQQRAIS--SISSPFVPS------------SVPSAAFLK-----------------------------------------

Query:  -TSVKATEPPSK---PTPNAISDSLSHISADQCQGLLNLLQSHLAKVKT--ESEPSTSHVVGTCFYALQNSGISTDQWVIDSG---ASTHICYSRDLF--
         TS ++ E PSK    TP+ IS+SL+ ++ADQCQ LL LLQSHL   KT  +++  TSHV  T F      G +  ++  D     +     +S+ +   
Subjt:  -TSVKATEPPSK---PTPNAISDSLSHISADQCQGLLNLLQSHLAKVKT--ESEPSTSHVVGTCFYALQNSGISTDQWVIDSG---ASTHICYSRDLF--

Query:  ---LGLRSVSGV----HISLPDNSRVNVETSLLIVQFSGDSCLI--------------------------QDKFSLKTIG-----------KDFVLPKAF
           +G    + V    H  L + +R     S +   F G+  L                            D  SLK  G           +    P+A 
Subjt:  ---LGLRSVSGV----HISLPDNSRVNVETSLLIVQFSGDSCLI--------------------------QDKFSLKTIG-----------KDFVLPKAF

Query:  D--FVSSPSGV-----------------------SSLPHNASNLHSPAVD-----VTPTNAWTHDMASPI----HNATTAYVVNNSNFPSFTAVILHDDH
           FV  P G+                       S  P +  +++SP VD     V P +    D +S      HN  T   V+ ++      V++ D  
Subjt:  D--FVSSPSGV-----------------------SSLPHNASNLHSPAVD-----VTPTNAWTHDMASPI----HNATTAYVVNNSNFPSFTAVILHDDH

Query:  VVDVPFAA--------IVENSNVPSAVIENSVV-PSAVDIKTSVVPSVVMPV-DPWIQQSISVIPSTSVRRSQRDSRPPSYLKDYHCNLLASAALPPFQS
         + V   +        +   SN+   ++ENS++ P   D+  +V  +VV+P  DP          S ++RRS R ++ PSYL+DYHC L+ +        
Subjt:  VVDVPFAA--------IVENSNVPSAVIENSVV-PSAVDIKTSVVPSVVMPV-DPWIQQSISVIPSTSVRRSQRDSRPPSYLKDYHCNLLASAALPPFQS

Query:  RYPLQKVLSYDRLSPSYKNFILNISTDFERQFYH
         YPLQK L Y+ LS SYK F+L++S D+E QFYH
Subjt:  RYPLQKVLSYDRLSPSYKNFILNISTDFERQFYH

TrEMBL top hits
A0A6J1CR17 uncharacterized protein LOC111013441; e value: 1.4e-18; % identity: 26.69
Query:  MLLMKPPPTINRAFTLVSQEVQQRAISSISSPFVPSSVPSAAFLKTSVKATEPPSKPTPNAISDSL---------------------------------S
        +LLM PPP++N+A +LV Q+ QQR+I   +S  +P++ PS A L     A++PPSK T   ++  L                                 S
Subjt:  MLLMKPPPTINRAFTLVSQEVQQRAISSISSPFVPSSVPSAAFLKTSVKATEPPSKPTPNAISDSL---------------------------------S

Query:  HISA---DQCQGLLNLLQSHLA--KVKTESEPSTSHVVGTCFYALQNSGISTDQWVIDSGASTHICYSRDLFLGLRSVSGVHISLPDNSRVNVETS----
        H ++      Q L  LLQS L+  K   +++ +TS+ V T           T   ++D GAS HIC  R LF  +  +S VH++LP+  R  VE S    
Subjt:  HISA---DQCQGLLNLLQSHLA--KVKTESEPSTSHVVGTCFYALQNSGISTDQWVIDSGASTHICYSRDLFLGLRSVSGVHISLPDNSRVNVETS----

Query:  -------------------------------LLIVQFSGDSCLIQDKFSLKTIGKDFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMA
                                        L V+F+ D+C+IQDK   KTI K  +    +  + + S  SS      ++   A  +   + W + + 
Subjt:  -------------------------------LLIVQFSGDSCLIQDKFSLKTIGKDFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMA

Query:  SPIHNATTA--------------YVVNNSNFP------------SFTAVILHD----------DHV----VDVPFAAIVENS----NVPSAVIENSVVPS
         P      A              ++ ++ +FP             F  ++L +          DH+     D+   A+V  +     VP      + VPS
Subjt:  SPIHNATTA--------------YVVNNSNFP------------SFTAVILHD----------DHV----VDVPFAAIVENS----NVPSAVIENSVVPS

Query:  AVDIKTSVVPSVVMP------VDPWIQQSISVIPSTSV-RRSQRDSRPPSYLKDYHCNLLASAALPPFQSRYPLQKVLSYDRLSPSY
        A    ++ V S  MP        P      S++P   V RRS R S+ PSYL+D+HC+LL ++   P  +R+PLQ+ LSY RLS ++
Subjt:  AVDIKTSVVPSVVMP------VDPWIQQSISVIPSTSV-RRSQRDSRPPSYLKDYHCNLLASAALPPFQSRYPLQKVLSYDRLSPSY

A0A6J1D8C2 uncharacterized protein LOC111018564 isoform X2; e value: 1.6e-22; % identity: 100
Query:  DFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMASPIHNATTAYV
        DFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMASPIHNATTAYV
Subjt:  DFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMASPIHNATTAYV

A0A6J1D9X8 uncharacterized protein LOC111018564 isoform X1; e value: 1.3e-88; % identity: 98.28
Query:  DFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMASPIHNATTAYVVNNSNFPSFTAVILHDDHVVDVPFAAIVENSNVPSAVIENSVVP
        DFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMASPIHNATTAYVVNNSNFPSFTAVILHDDHVVDVPFAAIVENSNVPSAVIENSVVP
Subjt:  DFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTNAWTHDMASPIHNATTAYVVNNSNFPSFTAVILHDDHVVDVPFAAIVENSNVPSAVIENSVVP

Query:  SAVDIKTSVVPSVVMPVDPWIQQSISVIPSTSVRRSQRDSRPPSYLKDYHCNLLASAALPPFQSRYPLQKVLSY
        SAVDIKTSVVPSVVMPVDPWIQQSISVIPSTSVRRSQRDSRPPSYLKDYHCNLLASAALPPFQSRYPLQKV+ +
Subjt:  SAVDIKTSVVPSVVMPVDPWIQQSISVIPSTSVRRSQRDSRPPSYLKDYHCNLLASAALPPFQSRYPLQKVLSY

A0A6J1DBD0 uncharacterized protein LOC111018564 isoform X3; e value: 2.1e-62; % identity: 97.71
Query:  MASPIHNATTAYVVNNSNFPSFTAVILHDDHVVDVPFAAIVENSNVPSAVIENSVVPSAVDIKTSVVPSVVMPVDPWIQQSISVIPSTSVRRSQRDSRPP
        MASPIHNATTAYVVNNSNFPSFTAVILHDDHVVDVPFAAIVENSNVPSAVIENSVVPSAVDIKTSVVPSVVMPVDPWIQQSISVIPSTSVRRSQRDSRPP
Subjt:  MASPIHNATTAYVVNNSNFPSFTAVILHDDHVVDVPFAAIVENSNVPSAVIENSVVPSAVDIKTSVVPSVVMPVDPWIQQSISVIPSTSVRRSQRDSRPP

Query:  SYLKDYHCNLLASAALPPFQSRYPLQKVLSY
        SYLKDYHCNLLASAALPPFQSRYPLQKV+ +
Subjt:  SYLKDYHCNLLASAALPPFQSRYPLQKVLSY

A0A6J1DNP7 uncharacterized protein LOC111022065; e value: 6.4e-19; % identity: 27.53
Query:  MLLMKPPPTINRAFTLVSQEVQQRAIS--SISSPFVPS------------SVPSAAFLK-----------------------------------------
        +LLM+P PTINRAF LV+QE+QQR+IS  S++SP   +            +  SA+ +K                                         
Subjt:  MLLMKPPPTINRAFTLVSQEVQQRAIS--SISSPFVPS------------SVPSAAFLK-----------------------------------------

Query:  -TSVKATEPPSK---PTPNAISDSLSHISADQCQGLLNLLQSHLAKVKT--ESEPSTSHVVGTCFYALQNSGISTDQWVIDSG---ASTHICYSRDLF--
         TS ++ E PSK    TP+ IS+SL+ ++ADQCQ LL LLQSHL   KT  +++  TSHV  T F      G +  ++  D     +     +S+ +   
Subjt:  -TSVKATEPPSK---PTPNAISDSLSHISADQCQGLLNLLQSHLAKVKT--ESEPSTSHVVGTCFYALQNSGISTDQWVIDSG---ASTHICYSRDLF--

Query:  ---LGLRSVSGV----HISLPDNSRVNVETSLLIVQFSGDSCLI--------------------------QDKFSLKTIG-----------KDFVLPKAF
           +G    + V    H  L + +R     S +   F G+  L                            D  SLK  G           +    P+A 
Subjt:  ---LGLRSVSGV----HISLPDNSRVNVETSLLIVQFSGDSCLI--------------------------QDKFSLKTIG-----------KDFVLPKAF

Query:  D--FVSSPSGV-----------------------SSLPHNASNLHSPAVD-----VTPTNAWTHDMASPI----HNATTAYVVNNSNFPSFTAVILHDDH
           FV  P G+                       S  P +  +++SP VD     V P +    D +S      HN  T   V+ ++      V++ D  
Subjt:  D--FVSSPSGV-----------------------SSLPHNASNLHSPAVD-----VTPTNAWTHDMASPI----HNATTAYVVNNSNFPSFTAVILHDDH

Query:  VVDVPFAA--------IVENSNVPSAVIENSVV-PSAVDIKTSVVPSVVMPV-DPWIQQSISVIPSTSVRRSQRDSRPPSYLKDYHCNLLASAALPPFQS
         + V   +        +   SN+   ++ENS++ P   D+  +V  +VV+P  DP          S ++RRS R ++ PSYL+DYHC L+ +        
Subjt:  VVDVPFAA--------IVENSNVPSAVIENSVV-PSAVDIKTSVVPSVVMPV-DPWIQQSISVIPSTSVRRSQRDSRPPSYLKDYHCNLLASAALPPFQS

Query:  RYPLQKVLSYDRLSPSYKNFILNISTDFERQFYH
         YPLQK L Y+ LS SYK F+L++S D+E QFYH
Subjt:  RYPLQKVLSYDRLSPSYKNFILNISTDFERQFYH

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGTTGTTGATGAAGCCACCTCCGACGATCAATCGGGCTTTCACTCTAGTTTCACAAGAAGTTCAACAGCGTGCTATCTCATCAATTTCTTCTCCCTTTGTGCCTTCGTC
TGTACCGTCAGCTGCTTTTTTGAAGACTTCTGTTAAGGCTACTGAGCCTCCATCCAAGCCTACTCCAAATGCTATTTCAGATTCTCTTTCGCATATTAGTGCTGATCAAT
GCCAAGGATTGTTAAATCTTCTTCAATCTCATTTGGCGAAAGTGAAAACTGAATCTGAACCAAGTACTTCACATGTTGTAGGTACATGTTTTTATGCTTTACAAAATTCT
GGAATTTCTACTGATCAGTGGGTGATAGATTCTGGCGCTTCTACACACATTTGTTACTCTCGTGATTTATTTCTTGGTCTGAGATCAGTTTCTGGGGTTCATATTTCACT
GCCTGATAATTCTCGAGTCAATGTTGAGACTTCTTTACTAATTGTTCAATTTTCTGGTGATTCTTGTCTAATCCAGGATAAGTTTTCTTTGAAGACGATTGGCAAGGATT
TTGTTTTGCCGAAAGCTTTTGATTTTGTTAGCAGTCCTTCTGGTGTATCTTCTTTGCCTCATAATGCTTCCAACTTGCATAGTCCTGCTGTGGATGTCACACCAACTAAT
GCATGGACTCATGATATGGCTTCTCCTATACATAATGCTACCACTGCTTATGTGGTAAATAATTCTAATTTTCCATCTTTCACTGCTGTTATTTTGCATGATGATCATGT
TGTTGATGTTCCATTTGCTGCTATTGTTGAGAATTCTAATGTCCCATCTGCAGTCATTGAGAATTCTGTTGTACCATCTGCTGTTGATATTAAGACTTCTGTTGTGCCGT
CTGTTGTTATGCCTGTTGATCCCTGGATTCAACAGTCAATTTCTGTTATTCCTTCAACATCTGTTCGCAGGTCACAGAGGGATTCTCGGCCCCCATCTTATCTTAAGGAT
TATCACTGTAATTTGCTTGCTTCTGCTGCTTTGCCCCCTTTTCAGTCTCGGTATCCTTTGCAGAAGGTTTTATCTTATGACAGATTATCGCCATCCTACAAGAATTTTAT
TCTGAATATATCCACTGATTTTGAGCGGCAATTCTATCATTAG
mRNA sequence
ATGTTGTTGATGAAGCCACCTCCGACGATCAATCGGGCTTTCACTCTAGTTTCACAAGAAGTTCAACAGCGTGCTATCTCATCAATTTCTTCTCCCTTTGTGCCTTCGTC
TGTACCGTCAGCTGCTTTTTTGAAGACTTCTGTTAAGGCTACTGAGCCTCCATCCAAGCCTACTCCAAATGCTATTTCAGATTCTCTTTCGCATATTAGTGCTGATCAAT
GCCAAGGATTGTTAAATCTTCTTCAATCTCATTTGGCGAAAGTGAAAACTGAATCTGAACCAAGTACTTCACATGTTGTAGGTACATGTTTTTATGCTTTACAAAATTCT
GGAATTTCTACTGATCAGTGGGTGATAGATTCTGGCGCTTCTACACACATTTGTTACTCTCGTGATTTATTTCTTGGTCTGAGATCAGTTTCTGGGGTTCATATTTCACT
GCCTGATAATTCTCGAGTCAATGTTGAGACTTCTTTACTAATTGTTCAATTTTCTGGTGATTCTTGTCTAATCCAGGATAAGTTTTCTTTGAAGACGATTGGCAAGGATT
TTGTTTTGCCGAAAGCTTTTGATTTTGTTAGCAGTCCTTCTGGTGTATCTTCTTTGCCTCATAATGCTTCCAACTTGCATAGTCCTGCTGTGGATGTCACACCAACTAAT
GCATGGACTCATGATATGGCTTCTCCTATACATAATGCTACCACTGCTTATGTGGTAAATAATTCTAATTTTCCATCTTTCACTGCTGTTATTTTGCATGATGATCATGT
TGTTGATGTTCCATTTGCTGCTATTGTTGAGAATTCTAATGTCCCATCTGCAGTCATTGAGAATTCTGTTGTACCATCTGCTGTTGATATTAAGACTTCTGTTGTGCCGT
CTGTTGTTATGCCTGTTGATCCCTGGATTCAACAGTCAATTTCTGTTATTCCTTCAACATCTGTTCGCAGGTCACAGAGGGATTCTCGGCCCCCATCTTATCTTAAGGAT
TATCACTGTAATTTGCTTGCTTCTGCTGCTTTGCCCCCTTTTCAGTCTCGGTATCCTTTGCAGAAGGTTTTATCTTATGACAGATTATCGCCATCCTACAAGAATTTTAT
TCTGAATATATCCACTGATTTTGAGCGGCAATTCTATCATTAG
Protein sequence
MLLMKPPPTINRAFTLVSQEVQQRAISSISSPFVPSSVPSAAFLKTSVKATEPPSKPTPNAISDSLSHISADQCQGLLNLLQSHLAKVKTESEPSTSHVVGTCFYALQNS
GISTDQWVIDSGASTHICYSRDLFLGLRSVSGVHISLPDNSRVNVETSLLIVQFSGDSCLIQDKFSLKTIGKDFVLPKAFDFVSSPSGVSSLPHNASNLHSPAVDVTPTN
AWTHDMASPIHNATTAYVVNNSNFPSFTAVILHDDHVVDVPFAAIVENSNVPSAVIENSVVPSAVDIKTSVVPSVVMPVDPWIQQSISVIPSTSVRRSQRDSRPPSYLKD
YHCNLLASAALPPFQSRYPLQKVLSYDRLSPSYKNFILNISTDFERQFYH
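
The protein sequence above is the conceptual translation of the CDS. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code; the translate function and the variable names are hypothetical, and only the first 36 and last 12 nucleotides of the CDS are used so the example stays short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGTTGTTGATGAAGCCACCTCCGACGATCAATCGG"  # first 36 nt of the CDS
cds_end = "TTCTATCATTAG"                            # last 12 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first twelve residues (MLLMKPPPTINR) and the last three residues (FYH) of the protein sequence listed above. The full CDS is 1143 nt, that is 380 codons plus the TAG stop codon, which matches the 380-residue protein.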