; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g02420 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g02420
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr7:1950811..1955032
RNA-Seq ExpressionMoc07g02420
SyntenyMoc07g02420
Gene Ontology termsGO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154608.1 uncharacterized protein LOC111021831 [Momordica charantia]2.4e-3154.17Show/hide
Query:  DSPEVSVTMPQSASKAATNPYYLHHIDNTRLIL--------------------------MSFIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISAS
        DS   + +   S   +  NPYYLHH DNT L+L                          + FIDGSI RP  +LL  WIHN HVVIAWILNSVSK+IS+S
Subjt:  DSPEVSVTMPQSASKAATNPYYLHHIDNTRLIL--------------------------MSFIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISAS

Query:  ILFSKSTRNIWLDLKERFQKSNGPRIFQLKHDLATVTQEQQSLS
        ILFS+S R+IW+DLKERF+KSNGPRIFQLK DLA + Q QQS+S
Subjt:  ILFSKSTRNIWLDLKERFQKSNGPRIFQLKHDLATVTQEQQSLS

XP_022156861.1 uncharacterized protein LOC111023702 [Momordica charantia]1.6e-2752.45Show/hide
Query:  SPEVSVTMPQSASKAATNPYYLHHIDNTRLIL--------------------------MSFIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASI
        SP  S T+   AS    NPYYLHH DNT L+                           + FIDG I RP+ DLL  WI N H+VIAWILNSVSK+ISASI
Subjt:  SPEVSVTMPQSASKAATNPYYLHHIDNTRLIL--------------------------MSFIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASI

Query:  LFSKSTRNIWLDLKERFQKSNGPRIFQLKHDLATVTQEQQSLS
        LFS+S R+IW+DL ERF+KSN P I+QLK  LAT+ Q+QQS+S
Subjt:  LFSKSTRNIWLDLKERFQKSNGPRIFQLKHDLATVTQEQQSLS

XP_031736904.1 uncharacterized protein LOC105434586 isoform X1 [Cucumis sativus]3.1e-2651.54Show/hide
Query:  NPYYLHHIDNTRLILMS--------------------------FIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASILFSKSTRNIWLDLKERF
        NPY+LHH DNT L+L++                          F+DG+I RPT DLL VWI N ++VI+WILNSVSK ISA+ILFS   R IW++LKERF
Subjt:  NPYYLHHIDNTRLILMS--------------------------FIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASILFSKSTRNIWLDLKERF

Query:  QKSNGPRIFQLKHDLATVTQEQQSLSTGKS
        QK N PRIFQLK  LAT++Q Q S+ T  +
Subjt:  QKSNGPRIFQLKHDLATVTQEQQSLSTGKS

XP_031736905.1 uncharacterized protein LOC105434586 isoform X2 [Cucumis sativus]3.1e-2651.54Show/hide
Query:  NPYYLHHIDNTRLILMS--------------------------FIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASILFSKSTRNIWLDLKERF
        NPY+LHH DNT L+L++                          F+DG+I RPT DLL VWI N ++VI+WILNSVSK ISA+ILFS   R IW++LKERF
Subjt:  NPYYLHHIDNTRLILMS--------------------------FIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASILFSKSTRNIWLDLKERF

Query:  QKSNGPRIFQLKHDLATVTQEQQSLSTGKS
        QK N PRIFQLK  LAT++Q Q S+ T  +
Subjt:  QKSNGPRIFQLKHDLATVTQEQQSLSTGKS

XP_031736906.1 uncharacterized protein LOC105434586 isoform X3 [Cucumis sativus]3.1e-2651.54Show/hide
Query:  NPYYLHHIDNTRLILMS--------------------------FIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASILFSKSTRNIWLDLKERF
        NPY+LHH DNT L+L++                          F+DG+I RPT DLL VWI N ++VI+WILNSVSK ISA+ILFS   R IW++LKERF
Subjt:  NPYYLHHIDNTRLILMS--------------------------FIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASILFSKSTRNIWLDLKERF

Query:  QKSNGPRIFQLKHDLATVTQEQQSLSTGKS
        QK N PRIFQLK  LAT++Q Q S+ T  +
Subjt:  QKSNGPRIFQLKHDLATVTQEQQSLSTGKS

TrEMBL top hitse value%identityAlignment
A0A6J1CR17 uncharacterized protein LOC1110134413.1e-2451.59Show/hide
Query:  TAGNDISVDPFLDLVLPNAVDFQTTDSLTHSATTDRTNIHTDMPTVHVDMTNANTDIPVVITPT----DVPIESCAPTVAPIGSSATSSSDGPSSMPVVL
        ++GND+SV+PF DLVLPN +DFQ                  DMP  H+DMTNA+ DIP V+  T     VPIE CAP   P       S+DG SS PVV 
Subjt:  TAGNDISVDPFLDLVLPNAVDFQTTDSLTHSATTDRTNIHTDMPTVHVDMTNANTDIPVVITPT----DVPIESCAPTVAPIGSSATSSSDGPSSMPVVL

Query:  EPMPNTGPSVG-----------PLDIVPIAPRRSTRPSKMLSYLLDFHCSLLGQYSP
        EPMPNT PSV            PLDIV   PRRSTRPSKM SYL DFHCSLL    P
Subjt:  EPMPNTGPSVG-----------PLDIVPIAPRRSTRPSKMLSYLLDFHCSLLGQYSP

A0A6J1DIP8 uncharacterized protein LOC1110203995.7e-2651.85Show/hide
Query:  TNPYYLHHIDNTRLILMS--------------------------FIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASILFSKSTRNIWLDLKER
        TNPY+LHH DNT L+L+S                          F+DGSI RPT DLL  WI   +VVI+WILNS+SK+ISASILFS S R IWLDLKER
Subjt:  TNPYYLHHIDNTRLILMS--------------------------FIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASILFSKSTRNIWLDLKER

Query:  FQKSNGPRIFQLKHDLATVTQEQQSLSTGKSVVDT
        F+K N PRIFQL+ DL+ + Q+Q S+S   +++ T
Subjt:  FQKSNGPRIFQLKHDLATVTQEQQSLSTGKSVVDT

A0A6J1DKR8 uncharacterized protein LOC1110218311.2e-3154.17Show/hide
Query:  DSPEVSVTMPQSASKAATNPYYLHHIDNTRLIL--------------------------MSFIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISAS
        DS   + +   S   +  NPYYLHH DNT L+L                          + FIDGSI RP  +LL  WIHN HVVIAWILNSVSK+IS+S
Subjt:  DSPEVSVTMPQSASKAATNPYYLHHIDNTRLIL--------------------------MSFIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISAS

Query:  ILFSKSTRNIWLDLKERFQKSNGPRIFQLKHDLATVTQEQQSLS
        ILFS+S R+IW+DLKERF+KSNGPRIFQLK DLA + Q QQS+S
Subjt:  ILFSKSTRNIWLDLKERFQKSNGPRIFQLKHDLATVTQEQQSLS

A0A6J1DLQ9 uncharacterized protein LOC1110221172.0e-2355.45Show/hide
Query:  SKAATNPYYLHHIDNTRLIL-----MSFIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASILFSKSTRNIWLDLKERFQKSNGPRIFQLKHDLA
        SK  TN  Y+    +  + L     + FI+GS+ +P  DLL VWI N HVVIAW LNSVSK ISAS++F+ ST  IWLDLK+RFQ  NGP+IFQL+ DLA
Subjt:  SKAATNPYYLHHIDNTRLIL-----MSFIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASILFSKSTRNIWLDLKERFQKSNGPRIFQLKHDLA

Query:  TVTQEQQSLS
        T+TQ+Q S++
Subjt:  TVTQEQQSLS

A0A6J1DW89 uncharacterized protein LOC1110237027.9e-2852.45Show/hide
Query:  SPEVSVTMPQSASKAATNPYYLHHIDNTRLIL--------------------------MSFIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASI
        SP  S T+   AS    NPYYLHH DNT L+                           + FIDG I RP+ DLL  WI N H+VIAWILNSVSK+ISASI
Subjt:  SPEVSVTMPQSASKAATNPYYLHHIDNTRLIL--------------------------MSFIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASI

Query:  LFSKSTRNIWLDLKERFQKSNGPRIFQLKHDLATVTQEQQSLS
        LFS+S R+IW+DL ERF+KSN P I+QLK  LAT+ Q+QQS+S
Subjt:  LFSKSTRNIWLDLKERFQKSNGPRIFQLKHDLATVTQEQQSLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.5e-0528.57Show/hide
Query:  FIDGSIERPT--DDLLLVWIHNYHVVIAWILNSVSKDISASILFSKSTRNIWLDLKERFQKSNGPRIFQLKHDLATVTQEQQSL
        FIDG++ +P     L   W     +V+ W++NS++  +  S++++++   +W DL+  F      +I+QL+  LAT+ Q   S+
Subjt:  FIDGSIERPT--DDLLLVWIHNYHVVIAWILNSVSKDISASILFSKSTRNIWLDLKERFQKSNGPRIFQLKHDLATVTQEQQSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTTCGACGTTCCTATCGATTCACCGGAAGTTTCTGTCACTATGCCTCAATCTGCATCTAAGGCTGCTACAAATCCATATTATCTCCATCATATAGATAAT
ACTAGATTGATTCTCATGAGTTTTATTGACGGATCAATTGAGCGTCCCACCGACGATCTTTTGCTTGTGTGGATTCACAATTACCATGTGGTTATTGCATGGATC
TTGAATTCCGTTTCCAAGGATATTTCTGCAAGCATCCTGTTTTCTAAATCTACCAGGAATATTTGGCTCGATCTTAAGGAGAGATTCCAGAAGAGCAATGGTCCA
CGCATCTTTCAACTCAAACATGATCTGGCTACTGTAACTCAGGAGCAACAATCGCTTTCGACTGGAAAATCTGTTGTTGACACAGATACTATTGCATCCTACACA
GCAGGTAATGATATATCAGTTGACCCTTTTCTAGACTTGGTTTTGCCAAATGCAGTTGATTTTCAAACTACTGATAGTCTTACTCATTCTGCTACTACTGATAGG
ACTAATATTCATACTGATATGCCTACTGTTCATGTTGATATGACTAATGCCAATACTGATATACCTGTTGTCATTACCCCTACTGATGTACCCATTGAGTCGTGT
GCTCCTACTGTTGCACCTATTGGTTCTTCTGCCACTTCTAGTTCTGATGGGCCTTCATCCATGCCGGTTGTGCTTGAACCTATGCCTAACACAGGACCTTCGGTT
GGACCTTTGGATATTGTCCCTATTGCCCCTCGACGTTCCACTAGGCCTTCCAAAATGCTTTCATATTTACTGGATTTTCATTGCAGCCTTTTGGGTCAATATTCA
CCACCTCTACTTCTACAAGGCATCCTTTACGACAGTATTTATCCTATTCACAACTATCGTCTGCTCATCGACATTATGTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTTCGACGTTCCTATCGATTCACCGGAAGTTTCTGTCACTATGCCTCAATCTGCATCTAAGGCTGCTACAAATCCATATTATCTCCATCATATAGATAAT
ACTAGATTGATTCTCATGAGTTTTATTGACGGATCAATTGAGCGTCCCACCGACGATCTTTTGCTTGTGTGGATTCACAATTACCATGTGGTTATTGCATGGATC
TTGAATTCCGTTTCCAAGGATATTTCTGCAAGCATCCTGTTTTCTAAATCTACCAGGAATATTTGGCTCGATCTTAAGGAGAGATTCCAGAAGAGCAATGGTCCA
CGCATCTTTCAACTCAAACATGATCTGGCTACTGTAACTCAGGAGCAACAATCGCTTTCGACTGGAAAATCTGTTGTTGACACAGATACTATTGCATCCTACACA
GCAGGTAATGATATATCAGTTGACCCTTTTCTAGACTTGGTTTTGCCAAATGCAGTTGATTTTCAAACTACTGATAGTCTTACTCATTCTGCTACTACTGATAGG
ACTAATATTCATACTGATATGCCTACTGTTCATGTTGATATGACTAATGCCAATACTGATATACCTGTTGTCATTACCCCTACTGATGTACCCATTGAGTCGTGT
GCTCCTACTGTTGCACCTATTGGTTCTTCTGCCACTTCTAGTTCTGATGGGCCTTCATCCATGCCGGTTGTGCTTGAACCTATGCCTAACACAGGACCTTCGGTT
GGACCTTTGGATATTGTCCCTATTGCCCCTCGACGTTCCACTAGGCCTTCCAAAATGCTTTCATATTTACTGGATTTTCATTGCAGCCTTTTGGGTCAATATTCA
CCACCTCTACTTCTACAAGGCATCCTTTACGACAGTATTTATCCTATTCACAACTATCGTCTGCTCATCGACATTATGTTCTAA
Protein sequenceShow/hide protein sequence
MAFDVPIDSPEVSVTMPQSASKAATNPYYLHHIDNTRLILMSFIDGSIERPTDDLLLVWIHNYHVVIAWILNSVSKDISASILFSKSTRNIWLDLKERFQKSNGP
RIFQLKHDLATVTQEQQSLSTGKSVVDTDTIASYTAGNDISVDPFLDLVLPNAVDFQTTDSLTHSATTDRTNIHTDMPTVHVDMTNANTDIPVVITPTDVPIESC
APTVAPIGSSATSSSDGPSSMPVVLEPMPNTGPSVGPLDIVPIAPRRSTRPSKMLSYLLDFHCSLLGQYSPPLLLQGILYDSIYPIHNYRLLIDIMF