; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g00220 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g00220
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr7:117424..117873
RNA-Seq ExpressionMoc07g00220
SyntenyMoc07g00220
Gene Ontology termsGO:0006897 - endocytosis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0005543 - phospholipid binding (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAV70211.1 hypothetical protein CFOL_v3_13709, partial [Cephalotus follicularis]1.0e-4664.83Show/hide
Query:  IDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDR
        I+ N +L  ++E P V++E YQRLVGKLIYL+HTRPDIAY V ++SQFMH P+E +LQA Y++LHYLK S G+ + FK+ +KL LE YTD  YAGSI DR
Subjt:  IDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDR

Query:  KFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL
        + TSGYCTF GGNLVTWR KKQ VVARSS+K+EFR M   IC+LL
Subjt:  KFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL

KAG8474986.1 hypothetical protein CXB51_031725 [Gossypium anomalum]1.4e-4867.11Show/hide
Query:  MEMSIDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGS
        +E  I+ NHRL  + E   V+K +YQRLVGKLIYLSHTRPDIAY VG+VSQFMH PKE +L+A+YQIL YLK + G+G+ FKK E L LE YTD  YAGS
Subjt:  MEMSIDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGS

Query:  IDDRKFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL
        + DR+ TSGYCTF G NLVTWR KKQNVVARSS++A FRAM L +C+LL
Subjt:  IDDRKFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL

KAG8487880.1 hypothetical protein CXB51_018329 [Gossypium anomalum]2.4e-4867.11Show/hide
Query:  MEMSIDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGS
        +E  I+ NHRL  + E   V+K +YQRLVGKLIYLSHTRPDIAY VG+VSQFMH PKE +L A+YQIL YLK + G+ + FKK E L LE YTD  YAGS
Subjt:  MEMSIDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGS

Query:  IDDRKFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL
        + DR+ TSGYCTF GGNLVTWR KKQNVVARSS++AEFRA+ L +C+LL
Subjt:  IDDRKFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL

XP_017615187.1 PREDICTED: uncharacterized mitochondrial protein AtMg00810-like [Gossypium arboreum]3.3e-5071.03Show/hide
Query:  IDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDR
        I+ NHRL  + E   V+K +YQRLVGKLIYLSHTRPDIAY VG+VSQFMH PKE +L+AMYQIL YLK + G+G+ FKK E L LE YTD  YAGS+ DR
Subjt:  IDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDR

Query:  KFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL
        + TSGYCTF GGNLVTWR KKQNVVARSS++AEFRAM L IC+LL
Subjt:  KFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL

XP_017639732.1 PREDICTED: uncharacterized mitochondrial protein AtMg00810-like [Gossypium arboreum]3.4e-4766.22Show/hide
Query:  EMSIDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSI
        E  I+ NH+L  + E   V+K +YQRLVGKLIYLSHTRPDIAY VG+VSQFMH PKE +L+A+YQIL YL   +G+G+ FKK + L LE YTD  YAGS+
Subjt:  EMSIDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSI

Query:  DDRKFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL
         DR+ TSGYCTF GGNLVTWR KKQNVVARSS++AEFRAM L + +LL
Subjt:  DDRKFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL

TrEMBL top hitse value%identityAlignment
A0A151RHV5 Copia protein8.2e-4766.21Show/hide
Query:  IDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDR
        I+QNHR+   EESP V+K  YQRLVGKLIYLSHTRPDIAY V +VSQFMH PKE +LQA+ +I+ YLK S G+GL FKK   L++EV+ D  YAGS+ DR
Subjt:  IDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDR

Query:  KFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL
        + T+GYC F GGNLVTWR KKQNVVARSS++AEFRAM   +C++L
Subjt:  KFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL

A0A151SDK3 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-4665.52Show/hide
Query:  IDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDR
        ID N +L  +EE   V+KE YQRLVG+LIYLSHTRPD+A+ V LVSQFMH+PKEV+LQA  +I+ YLK + GRG+ FK+   +NLE YTD  YAGSI DR
Subjt:  IDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDR

Query:  KFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL
        + T+GYCTF GGNLVTW+ KKQ+VVARSS++AEFRAM   IC+LL
Subjt:  KFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL

A0A1Q3BQB2 Uncharacterized protein (Fragment)4.8e-4764.83Show/hide
Query:  IDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDR
        I+ N +L  ++E P V++E YQRLVGKLIYL+HTRPDIAY V ++SQFMH P+E +LQA Y++LHYLK S G+ + FK+ +KL LE YTD  YAGSI DR
Subjt:  IDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDR

Query:  KFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL
        + TSGYCTF GGNLVTWR KKQ VVARSS+K+EFR M   IC+LL
Subjt:  KFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL

A0A6P4MWD8 uncharacterized mitochondrial protein AtMg00810-like1.6e-5071.03Show/hide
Query:  IDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDR
        I+ NHRL  + E   V+K +YQRLVGKLIYLSHTRPDIAY VG+VSQFMH PKE +L+AMYQIL YLK + G+G+ FKK E L LE YTD  YAGS+ DR
Subjt:  IDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDR

Query:  KFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL
        + TSGYCTF GGNLVTWR KKQNVVARSS++AEFRAM L IC+LL
Subjt:  KFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL

A0A6P4PTY4 uncharacterized mitochondrial protein AtMg00810-like1.7e-4766.22Show/hide
Query:  EMSIDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSI
        E  I+ NH+L  + E   V+K +YQRLVGKLIYLSHTRPDIAY VG+VSQFMH PKE +L+A+YQIL YL   +G+G+ FKK + L LE YTD  YAGS+
Subjt:  EMSIDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSI

Query:  DDRKFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL
         DR+ TSGYCTF GGNLVTWR KKQNVVARSS++AEFRAM L + +LL
Subjt:  DDRKFTSGYCTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL

SwissProt top hitse value%identityAlignment
P0CV72 Secreted RxLR effector protein 1611.9e-1639.5Show/hide
Query:  YQRLVGKLIYLS-HTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYCTFFGGNLVTWRI
        Y   VG ++YL   TRPD+A  VG++SQF   P   + QA+ ++L YL+++   GL F +     L  Y+D  +AG ++ R+ TSGY     G  V+WR 
Subjt:  YQRLVGKLIYLS-HTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYCTFFGGNLVTWRI

Query:  KKQNVVARSSSKAEFRAMT
        KKQ  VA SS++ E+ A++
Subjt:  KKQNVVARSSSKAEFRAMT

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-1740.46Show/hide
Query:  TSEESPPVNKETYQRLVGKLIY-LSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYC
        T EE   + K  Y   VG L+Y +  TRPDIA+ VG+VS+F+  P + + +A+  IL YL+ + G  L F   + + L+ YTD   AG ID+RK ++GY 
Subjt:  TSEESPPVNKETYQRLVGKLIY-LSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYC

Query:  TFFGGNLVTWRIKKQNVVARSSSKAEFRAMT
          F G  ++W+ K Q  VA S+++AE+ A T
Subjt:  TFFGGNLVTWRIKKQNVVARSSSKAEFRAMT

P92519 Uncharacterized mitochondrial protein AtMg008103.6e-2340.32Show/hide
Query:  YQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYCTFFGGNLVTWRIK
        ++ +VG L YL+ TRPDI+Y V +V Q MH P       + ++L Y+K +I  GL+  K  KLN++ + D  +AG    R+ T+G+CTF G N+++W  K
Subjt:  YQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYCTFFGGNLVTWRIK

Query:  KQNVVARSSSKAEFRAMTLDICKL
        +Q  V+RSS++ E+RA+ L   +L
Subjt:  KQNVVARSSSKAEFRAMTLDICKL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.4e-2446.15Show/hide
Query:  YQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYCTFFGGNLVTWRIK
        Y+ +VG L YL+ TRPDI+Y V  +SQFMH P E +LQA+ +IL YL  +   G+F KK   L+L  Y+D  +AG  DD   T+GY  + G + ++W  K
Subjt:  YQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYCTFFGGNLVTWRIK

Query:  KQNVVARSSSKAEFRAM
        KQ  V RSS++AE+R++
Subjt:  KQNVVARSSSKAEFRAM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-2141.03Show/hide
Query:  YQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYCTFFGGNLVTWRIK
        Y+ +VG L YL+ TRPD++Y V  +SQ+MH P + +  A+ ++L YL  +   G+F KK   L+L  Y+D  +AG  DD   T+GY  + G + ++W  K
Subjt:  YQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYCTFFGGNLVTWRIK

Query:  KQNVVARSSSKAEFRAM
        KQ  V RSS++AE+R++
Subjt:  KQNVVARSSSKAEFRAM

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.5e-2842.28Show/hide
Query:  VNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYCTFFGGNLV
        V+ + Y+RL+G+L+YL  TR DI++ V  +SQF   P+  + QA+ +ILHY+K ++G+GLF+    ++ L+V++D  +    D R+ T+GYC F G +L+
Subjt:  VNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYCTFFGGNLV

Query:  TWRIKKQNVVARSSSKAEFRAMT
        +W+ KKQ VV++SS++AE+RA++
Subjt:  TWRIKKQNVVARSSSKAEFRAMT

ATMG00240.1 Gag-Pol-related retrotransposon family protein5.9e-1336.25Show/hide
Query:  IYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYCT
        +YL+ TRPD+ + V  +SQF    +   +QA+Y++LHY+K ++G+GLF+     L L+ + D  +A   D R+  +G+C+
Subjt:  IYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYCT

ATMG00810.1 DNA/RNA polymerases superfamily protein2.6e-2440.32Show/hide
Query:  YQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYCTFFGGNLVTWRIK
        ++ +VG L YL+ TRPDI+Y V +V Q MH P       + ++L Y+K +I  GL+  K  KLN++ + D  +AG    R+ T+G+CTF G N+++W  K
Subjt:  YQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGYCTFFGGNLVTWRIK

Query:  KQNVVARSSSKAEFRAMTLDICKL
        +Q  V+RSS++ E+RA+ L   +L
Subjt:  KQNVVARSSSKAEFRAMTLDICKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAATGTCAATTGATCAAAACCATAGATTATGTACATCCGAAGAAAGTCCTCCAGTAAATAAGGAAACTTACCAGAGGTTAGTTGGAAAATTAATATATCTTTCTCA
TACGAGACCAGATATTGCGTATGTTGTAGGACTGGTAAGTCAATTTATGCATCGTCCAAAAGAAGTTTATCTCCAAGCAATGTACCAAATACTTCACTATTTGAAAAACT
CCATTGGGAGAGGATTGTTCTTTAAGAAATGTGAGAAACTCAATCTAGAAGTTTATACAGATAGATATTATGCAGGATCGATAGACGATAGAAAATTCACTTCTGGCTAT
TGTACTTTCTTTGGTGGAAACTTAGTGACATGGAGAATTAAAAAACAAAATGTAGTTGCAAGATCGAGCTCAAAAGCAGAATTTAGAGCAATGACATTGGACATCTGCAA
ATTATTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAATGTCAATTGATCAAAACCATAGATTATGTACATCCGAAGAAAGTCCTCCAGTAAATAAGGAAACTTACCAGAGGTTAGTTGGAAAATTAATATATCTTTCTCA
TACGAGACCAGATATTGCGTATGTTGTAGGACTGGTAAGTCAATTTATGCATCGTCCAAAAGAAGTTTATCTCCAAGCAATGTACCAAATACTTCACTATTTGAAAAACT
CCATTGGGAGAGGATTGTTCTTTAAGAAATGTGAGAAACTCAATCTAGAAGTTTATACAGATAGATATTATGCAGGATCGATAGACGATAGAAAATTCACTTCTGGCTAT
TGTACTTTCTTTGGTGGAAACTTAGTGACATGGAGAATTAAAAAACAAAATGTAGTTGCAAGATCGAGCTCAAAAGCAGAATTTAGAGCAATGACATTGGACATCTGCAA
ATTATTATAG
Protein sequenceShow/hide protein sequence
MEMSIDQNHRLCTSEESPPVNKETYQRLVGKLIYLSHTRPDIAYVVGLVSQFMHRPKEVYLQAMYQILHYLKNSIGRGLFFKKCEKLNLEVYTDRYYAGSIDDRKFTSGY
CTFFGGNLVTWRIKKQNVVARSSSKAEFRAMTLDICKLL