CuGenDBv2

Moc04g24870 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g24870
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr4:18010256..18013513
RNA-Seq Expression: Moc04g24870
Synteny: Moc04g24870
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domains: NA
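
The genome location above can be turned into a locus span. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the variable names are illustrative only.

```python
import re

# Location string copied from the "Genome location" field above.
location = "chr4:18010256..18013513"

# Parse "<chromosome>:<start>..<end>".
match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
if match is None:
    raise ValueError(f"unexpected location format: {location!r}")
chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # chr4 18010256 18013513 3258
```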


Homology
GenBank top hits (e value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 2.2e-11 | % identity: 46.67
Query:  APEGNGLESYIDEDAEIPPKFLTVTEGT--AITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAK
        A E   LE++++ ++E P K+L  TE +  + T   N  Y  WKRQD LI+S LL SMSE+IL   LHC++AKEIW  L+ +F+S+ LA+
Subjt:  APEGNGLESYIDEDAEIPPKFLTVTEGT--AITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAK

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa] | e value: 9.7e-12 | % identity: 46.67
Query:  APEGNGLESYIDEDAEIPPKFL--TVTEGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAK
        A E   LE++++ ++E P K+L  T +   + T+  N TY  WKRQD LI+S LL SMSE+IL   LHC++AKEIW  L+ +F+S+ LA+
Subjt:  APEGNGLESYIDEDAEIPPKFL--TVTEGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAK

TYK18917.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa] | e value: 9.7e-12 | % identity: 46.67
Query:  APEGNGLESYIDEDAEIPPKFL--TVTEGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAK
        A E   LE++++ ++E P K+L  T +   + T+  N TY  WKRQD LI+S LL SMSE+IL   LHC++AKEIW  L+ +F+S+ LA+
Subjt:  APEGNGLESYIDEDAEIPPKFL--TVTEGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAK

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] | e value: 6.7e-13 | % identity: 47.25
Query:  APEGNGLESYIDEDAEIPPKFLTVT--EGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAKL
        A +GNGLESYID + + P +F+  T  E ++ + + N  Y  W +QD LI++ LL SM+EDIL   L C++A+EIW+ LE MFAS+ LA++
Subjt:  APEGNGLESYIDEDAEIPPKFLTVT--EGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAKL

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia] | e value: 2.3e-13 | % identity: 42.02
Query:  NRPVLVEMSVEAPEGNGLESYIDEDAEIPPKFLTVTEG--TAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLA
        +RP      + A +G+GLE YID D E P +F+   +G  ++ TQ+ N  Y HW +QD LI+  LL SMSE+IL   L C+  KEIW+ LE  FAS+NLA
Subjt:  NRPVLVEMSVEAPEGNGLESYIDEDAEIPPKFLTVTEG--TAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLA

Query:  KLSICISSITYPQTSPTNL
        ++    S +   +    NL
Subjt:  KLSICISSITYPQTSPTNL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.0e-11 | % identity: 46.67
Query:  APEGNGLESYIDEDAEIPPKFLTVTEGT--AITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAK
        A E   LE++++ ++E P K+L  TE +  + T   N  Y  WKRQD LI+S LL SMSE+IL   LHC++AKEIW  L+ +F+S+ LA+
Subjt:  APEGNGLESYIDEDAEIPPKFLTVTEGT--AITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAK

A0A5A7VGJ8 Keratin, type II cytoskeletal 1-like | e value: 4.7e-12 | % identity: 46.67
Query:  APEGNGLESYIDEDAEIPPKFL--TVTEGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAK
        A E   LE++++ ++E P K+L  T +   + T+  N TY  WKRQD LI+S LL SMSE+IL   LHC++AKEIW  L+ +F+S+ LA+
Subjt:  APEGNGLESYIDEDAEIPPKFL--TVTEGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAK

A0A5D3D5T2 Keratin, type II cytoskeletal 1-like | e value: 4.7e-12 | % identity: 46.67
Query:  APEGNGLESYIDEDAEIPPKFL--TVTEGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAK
        A E   LE++++ ++E P K+L  T +   + T+  N TY  WKRQD LI+S LL SMSE+IL   LHC++AKEIW  L+ +F+S+ LA+
Subjt:  APEGNGLESYIDEDAEIPPKFL--TVTEGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAK

A0A6J1DLT9 uncharacterized protein LOC111021757 | e value: 3.2e-13 | % identity: 47.25
Query:  APEGNGLESYIDEDAEIPPKFLTVT--EGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAKL
        A +GNGLESYID + + P +F+  T  E ++ + + N  Y  W +QD LI++ LL SM+EDIL   L C++A+EIW+ LE MFAS+ LA++
Subjt:  APEGNGLESYIDEDAEIPPKFLTVT--EGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAKL

A0A6J1DSS1 uncharacterized protein LOC111023586 | e value: 1.1e-13 | % identity: 42.02
Query:  NRPVLVEMSVEAPEGNGLESYIDEDAEIPPKFLTVTEG--TAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLA
        +RP      + A +G+GLE YID D E P +F+   +G  ++ TQ+ N  Y HW +QD LI+  LL SMSE+IL   L C+  KEIW+ LE  FAS+NLA
Subjt:  NRPVLVEMSVEAPEGNGLESYIDEDAEIPPKFLTVTEG--TAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLA

Query:  KLSICISSITYPQTSPTNL
        ++    S +   +    NL
Subjt:  KLSICISSITYPQTSPTNL

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 2.0e-04 | % identity: 32.1
Query:  EGNGLESYIDEDAEIPPKFLTVTEGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFAS
        +G  L  ++D    +PP     T GT    + N  Y  WKRQD LI S +L ++S  +        TA +IW  L +++A+
Subjt:  EGNGLESYIDEDAEIPPKFLTVTEGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFAS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 3.5e-04 | % identity: 30.86
Query:  EGNGLESYIDEDAEIPPKFLTVTEGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFAS
        +G  L  ++D    +PP     T GT    + N  Y  W+RQD LI S +L ++S  +        TA +IW  L +++A+
Subjt:  EGNGLESYIDEDAEIPPKFLTVTEGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFAS
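
The % identity values reported for these hits can be cross-checked from the alignment rows. The sketch below assumes the usual BLAST-style convention that the middle row shows a letter for each identical column and `+` for a conservative substitution, and that percent identity is identical columns divided by alignment length (gap columns included). It uses the Query and middle rows of the Q94HW2 hit; `percent_identity` is an illustrative helper, not part of any database tool.

```python
# Query and match (middle) rows copied from the Q94HW2 alignment above.
query = "EGNGLESYIDEDAEIPPKFLTVTEGTAITQKSNATYHHWKRQDCLITSLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFAS"
midline = "+G  L  ++D    +PP     T GT    + N  Y  WKRQD LI S +L ++S  +        TA +IW  L +++A+"


def percent_identity(query_row: str, midline_row: str) -> float:
    """Identical columns (letters on the middle row) as a percentage of alignment length."""
    # Letters mark identities; '+' and spaces are not counted.
    identical = sum(ch.isalpha() for ch in midline_row)
    # Alignment length is the full width of the aligned query row, gaps included.
    return 100.0 * identical / len(query_row)


identity = percent_identity(query, midline)
print(round(identity, 2))
```

For this hit the count is 26 identical columns over an alignment length of 81, which rounds to the listed 32.1.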

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCTATGATTCAAATGTGTGGCTTATCGTGTAATGAAAAAGGGGATATACAAGTTCATAGCCTCGTGGGCAATGGTCGAGGCGGTTACGTCACTACTGGGTGT
CGAGGCTTCGAGAGCAAAAGGCCGGGGGTCAACAGACCAGTCTTGGTAGAGATGAGTGTCGAGGCTCCGGAAGGAAATGGTCTAGAATCATATATTGATGAAGAT
GCAGAGATACCACCAAAGTTCTTAACCGTTACAGAAGGCACCGCCATTACACAAAAATCAAATGCTACATATCATCACTGGAAACGGCAAGATTGCCTAATTACA
TCATTGCTTTTAAGCTCAATGTCTGAGGACATTCTTGTAGACTTTCTTCACTGTCAAACTGCCAAAGAAATCTGGAGTAATCTTGAACAAATGTTCGCATCCAAA
AATCTTGCCAAGCTTTCCATCTGTATCTCTTCTATTACCTACCCTCAAACTTCTCCAACAAACTTATACTATACTGATACTCCACATTCAATATCTCCTATCTTA
CCATCCCAATCAAACATCCTCCCAAACCAACCCTCAGGTTCTTCGGTTGAAACTGCTTCTCAAATAGGTACCTCTGCTGCTCCTACACAAATTGCTCTTACTTAT
CCTACATCAAGTACCACTACTACTCCTACAATAGATACTTTGGTTGAGTCTTCATCTTCTGAACAGCAGGATGTTCCTTAG
mRNA sequence
ATGGCTATGATTCAAATGTGTGGCTTATCGTGTAATGAAAAAGGGGATATACAAGTTCATAGCCTCGTGGGCAATGGTCGAGGCGGTTACGTCACTACTGGGTGT
CGAGGCTTCGAGAGCAAAAGGCCGGGGGTCAACAGACCAGTCTTGGTAGAGATGAGTGTCGAGGCTCCGGAAGGAAATGGTCTAGAATCATATATTGATGAAGAT
GCAGAGATACCACCAAAGTTCTTAACCGTTACAGAAGGCACCGCCATTACACAAAAATCAAATGCTACATATCATCACTGGAAACGGCAAGATTGCCTAATTACA
TCATTGCTTTTAAGCTCAATGTCTGAGGACATTCTTGTAGACTTTCTTCACTGTCAAACTGCCAAAGAAATCTGGAGTAATCTTGAACAAATGTTCGCATCCAAA
AATCTTGCCAAGCTTTCCATCTGTATCTCTTCTATTACCTACCCTCAAACTTCTCCAACAAACTTATACTATACTGATACTCCACATTCAATATCTCCTATCTTA
CCATCCCAATCAAACATCCTCCCAAACCAACCCTCAGGTTCTTCGGTTGAAACTGCTTCTCAAATAGGTACCTCTGCTGCTCCTACACAAATTGCTCTTACTTAT
CCTACATCAAGTACCACTACTACTCCTACAATAGATACTTTGGTTGAGTCTTCATCTTCTGAACAGCAGGATGTTCCTTAG
Protein sequence
MAMIQMCGLSCNEKGDIQVHSLVGNGRGGYVTTGCRGFESKRPGVNRPVLVEMSVEAPEGNGLESYIDEDAEIPPKFLTVTEGTAITQKSNATYHHWKRQDCLIT
SLLLSSMSEDILVDFLHCQTAKEIWSNLEQMFASKNLAKLSICISSITYPQTSPTNLYYTDTPHSISPILPSQSNILPNQPSGSSVETASQIGTSAAPTQIALTY
PTSSTTTTPTIDTLVESSSSEQQDVP
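
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); `translate` is an illustrative helper. It uses only the first and last lines of the CDS as listed (each line starts on a codon boundary), not the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the CDS sequence listed above.
cds_first_line = (
    "ATGGCTATGATTCAAATGTGTGGCTTATCGTGTAATGAAAAAGGGGATATACAAGTTCATAGCCTCGTGGGC"
    "AATGGTCGAGGCGGTTACGTCACTACTGGGTGT"
)
cds_last_line = (
    "CCTACATCAAGTACCACTACTACTCCTACAATAGATACTTTGGTTGAGTCTTCATCTTCTGAACAGCAGGAT"
    "GTTCCTTAG"
)

print(translate(cds_first_line))  # start of the protein sequence
print(translate(cds_last_line))   # end of the protein sequence, before the TAG stop
```

The two translated fragments correspond to the first 35 and last 26 residues of the protein sequence listed above.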