; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g25010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g25010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:18166017..18172594
RNA-Seq ExpressionMoc04g25010
SyntenyMoc04g25010
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060586.1 putative gag protein [Cucumis melo var. makuwa]8.8e-1146.99Show/hide
Query:  KAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRF--VNMSSGRS
        KAI+DP++LP+++ KE+ E ME   YGTI++ LSD+VL ++ID +T YEI  K++ ++L +DL ++AYL ++F   NM S  S
Subjt:  KAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRF--VNMSSGRS

XP_038880370.1 uncharacterized protein LOC120072018 [Benincasa hispida]6.1e-1241.75Show/hide
Query:  RYVMDRSPPIDDLRLQLLASKAIL----------DPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLR
        RY +++     D  LQ    KA+L          DP K P+++++ +KE +E  AYGTII+N++DS+L+Q++D  TAY + NK++ I+L KDLP+KA+ R
Subjt:  RYVMDRSPPIDDLRLQLLASKAIL----------DPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLR

Query:  DRF
        +RF
Subjt:  DRF

XP_038885928.1 uncharacterized protein LOC120076236 [Benincasa hispida]4.6e-1255.56Show/hide
Query:  AILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRF
        AI DP K P+ + K EKE +E  AYGTI++N+ DSVL+Q++D  TAY + NK++ I+L KDLP+KA+LR+RF
Subjt:  AILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRF

XP_038887098.1 uncharacterized protein LOC120077280 [Benincasa hispida]2.3e-1150Show/hide
Query:  QLLASKAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRF
        Q  A  AI DP K P+++++ EKE +E  AYGTII+N++DSVL+Q++D  T Y + NK++ I+L KD P+K +LR+RF
Subjt:  QLLASKAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRF

XP_038902421.1 uncharacterized protein LOC120089066 [Benincasa hispida]6.1e-1252.78Show/hide
Query:  AILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRF
        AI DP K P+++   EK I+E  AYGTI++N++DSVL+Q++D +TA++ CNK++ I+L KDLP+KA+LR+ F
Subjt:  AILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRF

TrEMBL top hitse value%identityAlignment
A0A5A7TAZ3 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-1042.17Show/hide
Query:  ASKAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRFVNMSSGRS
        A K I DP+ LP +MN+ +K+ MEE  Y  +I+N++D+VL+Q+I+  T YEIC K+  ++  KDLP K Y+R++  +    +S
Subjt:  ASKAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRFVNMSSGRS

A0A5A7U6R2 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-0935.05Show/hide
Query:  ASKAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRFVNMSSGRSPQHSDEECRHMRKV
        A KA+ DP++LP ++ K E+E +EE+AY T+IMN++D+VL+Q+I+  TA+    K+ +++  KDLP+K +++++  +    ++ ++ DE     +K+
Subjt:  ASKAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRFVNMSSGRSPQHSDEECRHMRKV

A0A5A7V269 Putative gag protein4.2e-1146.99Show/hide
Query:  KAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRF--VNMSSGRS
        KAI+DP++LP+++ KE+ E ME   YGTI++ LSD+VL ++ID +T YEI  K++ ++L +DL ++AYL ++F   NM S  S
Subjt:  KAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRF--VNMSSGRS

A0A5C7HB65 gag_pre-integrs domain-containing protein1.8e-0950Show/hide
Query:  KAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDR
        KAI  PEKLP S+  E+K+ M E+A GTII+NLSD+VL+++ D KTA ++  K+++++L K L +K YL++R
Subjt:  KAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDR

A0A5D3BSZ6 Putative gag protein4.2e-1146.99Show/hide
Query:  KAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRF--VNMSSGRS
        KAI+DP++LP+++ KE+ E ME   YGTI++ LSD+VL ++ID +T YEI  K++ ++L +DL ++AYL ++F   NM S  S
Subjt:  KAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKIDTIFLVKDLPDKAYLRDRF--VNMSSGRS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTGGGCCGTGGGCCGAGCAAGGGTCGGGACTGAGAAGTGGTCGGGACCGAGAAGGGGTCGGGACCGAGCAGGGGTCGGGATCGAGCCTTCTCGGCCCGGATTTAC
TCTCGGGTTTCTAACCCACTTGGCTTTTGCCAATGGCTTTTATACTCTGAGTTTAGTCGGCATCCTTAGGTCCGTCCCAATTTTGAGTTTCTTGATTGGTCGTAGATATG
TAGATATTCAGAGGGACAAATTCGACCTCCTCAGTCGGTTCTCCCCTTTCAAGGGTGGTCGAATTAAGTCGATGACCAAGGCTCGGCCTTGCCTCTCCCGAGAGGCTCTT
CTTTTCGAAGCACCGATGGCTCTCATCAAGATTCATGTCGCTTTTGGACCAAGTCGACGATCAAGGCTCGGTCCTACCTTGCCTGAATGGTCGGCCGCTGCACTCTACAT
GCCTCGGTCAAAATTTCACCTCAGATCGGGATTTCGCTTCGATGAAGCTCGCTTTACTTTATCATCCCCACAGACGACGACAATTGTTGATGCGTGGATTTTGACCCTCC
GTCTCTTCTTCTCGCACACACCAAACTCGATGATAAGGGAATCGATACCTGCAAAATCAAACCGATTTAAACCGGGGTCGGGACCGAGAAGTGGTCGGGACCGAGAAGTG
GTCGGGACCAAGAAGGGGTCGGGACCGAGCCCTCTCGGCCCGGTTAGTTCTCCGCCGGTCGCTTTCCTCGTTCTCTGCCCCGGCGCTAGGTCGTTCTTTGAGCCGGCCCG
TTACGTCATGGACCGATCTCCCCCAATCGATGATCTCAGGCTTCAACTCCTTGCATCAAAAGCCATTCTTGATCCAGAAAAATTACCACAGTCGATGAACAAGGAAGAGA
AAGAAATCATGGAGGAAATTGCATACGGAACAATCATTATGAACCTGAGTGATAGTGTTCTGCAGCAGCTAATTGATCTAAAAACTGCGTATGAGATATGTAATAAGATA
GACACCATTTTTCTTGTAAAAGATCTTCCAGACAAAGCCTATCTAAGAGACAGATTTGTCAACATGAGTTCGGGAAGAAGTCCACAACATTCTGATGAAGAGTGCCGCCA
CATGAGGAAAGTTGATAGGGGCACACCTCCGCCCGAGGCAGTCGAAAGAGAGATTGACCGACCAGTTCTCAGAGATACCGACCGACTAGATGATGGACAGAAGGCTACTA
GTCGACCAGTTCTCGGACAGAAAGACAAACCGGTCGAAGGTCCGGATCAACAAGGTGACAAGATGGCCGCCTTAGAAGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTGGGCCGTGGGCCGAGCAAGGGTCGGGACTGAGAAGTGGTCGGGACCGAGAAGGGGTCGGGACCGAGCAGGGGTCGGGATCGAGCCTTCTCGGCCCGGATTTAC
TCTCGGGTTTCTAACCCACTTGGCTTTTGCCAATGGCTTTTATACTCTGAGTTTAGTCGGCATCCTTAGGTCCGTCCCAATTTTGAGTTTCTTGATTGGTCGTAGATATG
TAGATATTCAGAGGGACAAATTCGACCTCCTCAGTCGGTTCTCCCCTTTCAAGGGTGGTCGAATTAAGTCGATGACCAAGGCTCGGCCTTGCCTCTCCCGAGAGGCTCTT
CTTTTCGAAGCACCGATGGCTCTCATCAAGATTCATGTCGCTTTTGGACCAAGTCGACGATCAAGGCTCGGTCCTACCTTGCCTGAATGGTCGGCCGCTGCACTCTACAT
GCCTCGGTCAAAATTTCACCTCAGATCGGGATTTCGCTTCGATGAAGCTCGCTTTACTTTATCATCCCCACAGACGACGACAATTGTTGATGCGTGGATTTTGACCCTCC
GTCTCTTCTTCTCGCACACACCAAACTCGATGATAAGGGAATCGATACCTGCAAAATCAAACCGATTTAAACCGGGGTCGGGACCGAGAAGTGGTCGGGACCGAGAAGTG
GTCGGGACCAAGAAGGGGTCGGGACCGAGCCCTCTCGGCCCGGTTAGTTCTCCGCCGGTCGCTTTCCTCGTTCTCTGCCCCGGCGCTAGGTCGTTCTTTGAGCCGGCCCG
TTACGTCATGGACCGATCTCCCCCAATCGATGATCTCAGGCTTCAACTCCTTGCATCAAAAGCCATTCTTGATCCAGAAAAATTACCACAGTCGATGAACAAGGAAGAGA
AAGAAATCATGGAGGAAATTGCATACGGAACAATCATTATGAACCTGAGTGATAGTGTTCTGCAGCAGCTAATTGATCTAAAAACTGCGTATGAGATATGTAATAAGATA
GACACCATTTTTCTTGTAAAAGATCTTCCAGACAAAGCCTATCTAAGAGACAGATTTGTCAACATGAGTTCGGGAAGAAGTCCACAACATTCTGATGAAGAGTGCCGCCA
CATGAGGAAAGTTGATAGGGGCACACCTCCGCCCGAGGCAGTCGAAAGAGAGATTGACCGACCAGTTCTCAGAGATACCGACCGACTAGATGATGGACAGAAGGCTACTA
GTCGACCAGTTCTCGGACAGAAAGACAAACCGGTCGAAGGTCCGGATCAACAAGGTGACAAGATGGCCGCCTTAGAAGAGTAG
Protein sequenceShow/hide protein sequence
MGWAVGRARVGTEKWSGPRRGRDRAGVGIEPSRPGFTLGFLTHLAFANGFYTLSLVGILRSVPILSFLIGRRYVDIQRDKFDLLSRFSPFKGGRIKSMTKARPCLSREAL
LFEAPMALIKIHVAFGPSRRSRLGPTLPEWSAAALYMPRSKFHLRSGFRFDEARFTLSSPQTTTIVDAWILTLRLFFSHTPNSMIRESIPAKSNRFKPGSGPRSGRDREV
VGTKKGSGPSPLGPVSSPPVAFLVLCPGARSFFEPARYVMDRSPPIDDLRLQLLASKAILDPEKLPQSMNKEEKEIMEEIAYGTIIMNLSDSVLQQLIDLKTAYEICNKI
DTIFLVKDLPDKAYLRDRFVNMSSGRSPQHSDEECRHMRKVDRGTPPPEAVEREIDRPVLRDTDRLDDGQKATSRPVLGQKDKPVEGPDQQGDKMAALEE