CuGenDBv2

Moc04g00410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g00410
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr4:280870..293717
RNA-Seq Expression: Moc04g00410
Synteny: Moc04g00410
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: NA
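The genome location field can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, not part of the database itself; the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr4:280870..293717")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene model spans 12848 bp on chr4, which is longer than the CDS listed under Sequences.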


Homology
GenBank top hits (e-value, % identity, alignment)
RVW48032.1 hypothetical protein CK203_089187 [Vitis vinifera] (e-value 1.2e-05, identity 45.16%)
Query:  TSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISK
        TSDH+PI++ET  FKW P   +FKNMWL H S  +    WW+E    G  G    R L   K
Subjt:  TSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISK

RVW70180.1 hypothetical protein CK203_057138 [Vitis vinifera] (e-value 2.1e-05, identity 36.71%)
Query:  GGLISEGKILDITLSMPTSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISK
        G L  +G + ++ L   TSDH+PI+++T  FKW P   +F+NMWL H S  ++   WW+     G  G    R L   K
Subjt:  GGLISEGKILDITLSMPTSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISK

RVW90389.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e-value 1.6e-05, identity 44.44%)
Query:  TSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISKR
        TSDH+PI++ET  FKW P   +F+NMWL H S  ++   WW+E    G  G    R L   KR
Subjt:  TSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISKR

RVW93496.1 hypothetical protein CK203_035058 [Vitis vinifera] (e-value 1.6e-05, identity 40.74%)
Query:  LKGGLISEGKILDITLSMPTSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISK
        L GG +    IL+  L   TSDH PI +ET S KW P   +F+NMWLLH    +    WW+E    G  G    R L   K
Subjt:  LKGGLISEGKILDITLSMPTSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISK

XP_022147852.1 uncharacterized protein LOC111016689 [Momordica charantia] (e-value 6.4e-31, identity 34.93%)
Query:  EERKCWQKCKVRWLIEGDVRTPSSSIVCST--QEERGNRDPWFNRGTLWVTLTLSLGHFPGLWARIHIKITFQVLEQVEIPDSFSFVLCICNFFNSNFLG
        EERKCWQKCKVRWLIEGD  T     V ++  + +R NRDPWFNRG LWVTLTLSLG       +I +K T  +L       +   ++         FL 
Subjt:  EERKCWQKCKVRWLIEGDVRTPSSSIVCST--QEERGNRDPWFNRGTLWVTLTLSLGHFPGLWARIHIKITFQVLEQVEIPDSFSFVLCICNFFNSNFLG

Query:  LQRGIQSPTLAREGIDPWYIRDENYLHWLFLESWIRARSRYAREWTDGHFPSLRCDQNLNLDHLSSTLTSGVLQLVFFCLVAFVICHFLCLRVSHDHLSS
          RG  S +L             NY   L    W R   +Y  +W  G FP L         HLS+                                  
Subjt:  LQRGIQSPTLAREGIDPWYIRDENYLHWLFLESWIRARSRYAREWTDGHFPSLRCDQNLNLDHLSSTLTSGVLQLVFFCLVAFVICHFLCLRVSHDHLSS

Query:  TPTSGDLQLIFLCLVVRSKSKSRSLIKYKWRSPIHFPVLLSLSMGEDTYQVLQQVERSNSTSFVLWSLSLSMGEVKVQ---IMITYQELEQVEISNL-FS
           SGD             S S S + +           LSLSM ED YQ+LQQVE            SLS  E+K Q   I ITYQ L QV+++ L + 
Subjt:  TPTSGDLQLIFLCLVVRSKSKSRSLIKYKWRSPIHFPVLLSLSMGEDTYQVLQQVERSNSTSFVLWSLSLSMGEVKVQ---IMITYQELEQVEISNL-FS

Query:  FVI--WPPSLSTSQNPNQDHLSSTSGD-LQFIFLCPVVNFLSMGNDTYLDHLSSTPTSGDLQLIFLCLVVTFLVYGSLSLPMGEDTYQDHLSSHFPCSRV
        F +  +P S   S++  + +LSST+ + L+ IFLC  V  +   +  +   LS +     +Q I  C  +      S SL M ED YQDHLSSH  C +V
Subjt:  FVI--WPPSLSTSQNPNQDHLSSTSGD-LQFIFLCPVVNFLSMGNDTYLDHLSSTPTSGDLQLIFLCLVVTFLVYGSLSLPMGEDTYQDHLSSHFPCSRV

Query:  RSKSKIKTNLSS-TTSEG
        RSK K + +LSS TTSEG
Subjt:  RSKSKIKTNLSS-TTSEG
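The reported % identity can be cross-checked against the alignment midline. The sketch below assumes the usual BLAST pairwise convention, in which a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical; the query and midline strings are copied from the first GenBank hit (RVW48032.1).

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical positions (letters on the midline) over alignment length."""
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Query and midline of the RVW48032.1 alignment, leading labels removed.
query = "TSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISK"
midline = "TSDH+PI++ET  FKW P   +FKNMWL H S  +    WW+E    G  G    R L   K"

print(round(percent_identity(query, midline), 2))
```

Counting letters rather than comparing columns keeps the check robust to any whitespace lost when the page was extracted, since only the number of identities and the query length are used.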

TrEMBL top hits (e-value, % identity, alignment)
A0A438EK20 Uncharacterized protein (e-value 5.9e-06, identity 45.16%)
Query:  TSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISK
        TSDH+PI++ET  FKW P   +FKNMWL H S  +    WW+E    G  G    R L   K
Subjt:  TSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISK

A0A438EU27 Transposon TX1 uncharacterized 149 kDa protein (e-value 1.0e-05, identity 38.27%)
Query:  LKGGLISEGKILDITLSMPTSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISK
        L G L  +G  L   L   TSDH+PI+++T  FKW P + +F+NMWL H S  ++   WW+     G  G    R L   K
Subjt:  LKGGLISEGKILDITLSMPTSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISK

A0A438I0Z7 LINE-1 retrotransposable element ORF2 protein (e-value 7.6e-06, identity 44.44%)
Query:  TSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISKR
        TSDH+PI++ET  FKW P   +F+NMWL H S  ++   WW+E    G  G    R L   KR
Subjt:  TSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISKR

A0A438I9V4 Uncharacterized protein (e-value 7.6e-06, identity 40.74%)
Query:  LKGGLISEGKILDITLSMPTSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISK
        L GG +    IL+  L   TSDH PI +ET S KW P   +F+NMWLLH    +    WW+E    G  G    R L   K
Subjt:  LKGGLISEGKILDITLSMPTSDHFPILMETGSFKWVPISSKFKNMWLLHKSSLKDLEGWWKEINPWGCLGLPLSRILNISK

A0A6J1D3H5 uncharacterized protein LOC111016689 (e-value 3.1e-31, identity 34.93%)
Query:  EERKCWQKCKVRWLIEGDVRTPSSSIVCST--QEERGNRDPWFNRGTLWVTLTLSLGHFPGLWARIHIKITFQVLEQVEIPDSFSFVLCICNFFNSNFLG
        EERKCWQKCKVRWLIEGD  T     V ++  + +R NRDPWFNRG LWVTLTLSLG       +I +K T  +L       +   ++         FL 
Subjt:  EERKCWQKCKVRWLIEGDVRTPSSSIVCST--QEERGNRDPWFNRGTLWVTLTLSLGHFPGLWARIHIKITFQVLEQVEIPDSFSFVLCICNFFNSNFLG

Query:  LQRGIQSPTLAREGIDPWYIRDENYLHWLFLESWIRARSRYAREWTDGHFPSLRCDQNLNLDHLSSTLTSGVLQLVFFCLVAFVICHFLCLRVSHDHLSS
          RG  S +L             NY   L    W R   +Y  +W  G FP L         HLS+                                  
Subjt:  LQRGIQSPTLAREGIDPWYIRDENYLHWLFLESWIRARSRYAREWTDGHFPSLRCDQNLNLDHLSSTLTSGVLQLVFFCLVAFVICHFLCLRVSHDHLSS

Query:  TPTSGDLQLIFLCLVVRSKSKSRSLIKYKWRSPIHFPVLLSLSMGEDTYQVLQQVERSNSTSFVLWSLSLSMGEVKVQ---IMITYQELEQVEISNL-FS
           SGD             S S S + +           LSLSM ED YQ+LQQVE            SLS  E+K Q   I ITYQ L QV+++ L + 
Subjt:  TPTSGDLQLIFLCLVVRSKSKSRSLIKYKWRSPIHFPVLLSLSMGEDTYQVLQQVERSNSTSFVLWSLSLSMGEVKVQ---IMITYQELEQVEISNL-FS

Query:  FVI--WPPSLSTSQNPNQDHLSSTSGD-LQFIFLCPVVNFLSMGNDTYLDHLSSTPTSGDLQLIFLCLVVTFLVYGSLSLPMGEDTYQDHLSSHFPCSRV
        F +  +P S   S++  + +LSST+ + L+ IFLC  V  +   +  +   LS +     +Q I  C  +      S SL M ED YQDHLSSH  C +V
Subjt:  FVI--WPPSLSTSQNPNQDHLSSTSGD-LQFIFLCPVVNFLSMGNDTYLDHLSSTPTSGDLQLIFLCLVVTFLVYGSLSLPMGEDTYQDHLSSHFPCSRV

Query:  RSKSKIKTNLSS-TTSEG
        RSK K + +LSS TTSEG
Subjt:  RSKSKIKTNLSS-TTSEG

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found
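To compare hits across the tables above, it can help to hold them as records and sort by e-value. This is an illustrative sketch only; the `Hit` type and `rank_hits` function are hypothetical names, and the values are the GenBank e-values and % identities listed in this record.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    accession: str
    evalue: float
    identity: float  # percent identity


genbank_hits = [
    Hit("RVW48032.1", 1.2e-05, 45.16),
    Hit("RVW70180.1", 2.1e-05, 36.71),
    Hit("RVW90389.1", 1.6e-05, 44.44),
    Hit("RVW93496.1", 1.6e-05, 40.74),
    Hit("XP_022147852.1", 6.4e-31, 34.93),
]


def rank_hits(hits: list[Hit]) -> list[Hit]:
    """Sort by ascending e-value, breaking ties by higher identity."""
    return sorted(hits, key=lambda h: (h.evalue, -h.identity))


for hit in rank_hits(genbank_hits):
    print(f"{hit.accession}\t{hit.evalue:.1e}\t{hit.identity:.2f}")
```

Sorting this way puts the Momordica charantia hit first despite its lower % identity, because its alignment is much longer and its e-value correspondingly smaller than those of the Vitis vinifera hits.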

Sequences
CDS sequence
ATGGTCCTATATGAAGAAAGGAAGTGTTGGCAGAAATGTAAAGTTAGGTGGCTTATAGAAGGGGATGTGAGAACACCAAGCTCTTCCATAGTGTGCTCAACTCAAGAAGA
AAGAGGCAATAGGGATCCATGGTTTAACAGGGGAACTCTATGGGTCACTTTGACCTTGTCGCTAGGTCACTTTCCTGGTTTATGGGCGAGGATACATATCAAGATCACTT
TTCAAGTACTAGAACAAGTGGAGATCCCCGACTCATTTTCATTTGTTTTGTGTATTTGTAATTTTTTCAATTCCAATTTCTTGGGTTTGCAAAGAGGAATACAAAGTCCC
ACATTGGCTAGGGAAGGGATTGATCCATGGTATATAAGAGATGAGAATTATCTCCATTGGTTGTTCTTGGAGTCTTGGATAAGAGCACGCTCTCGTTATGCCCGAGAGTG
GACAGATGGTCACTTTCCTAGTCTACGGTGTGATCAAAATCTGAATCTAGATCACTTATCAAGTACTCTAACAAGTGGAGTTCTCCAACTGGTTTTCTTTTGTCTCGTGG
CATTTGTCATTTGTCACTTTCTTTGTCTAAGAGTTAGTCATGATCACTTATCAAGTACTCCAACAAGTGGAGATCTCCAACTCATTTTTCTCTGTCTTGTGGTGAGGTCA
AAATCCAAATCTAGATCACTTATCAAGTACAAGTGGAGATCTCCGATTCATTTTCCTGTGTTACTTTCCTTGTCTATGGGGGAGGATACATATCAAGTACTTCAACAAGT
GGAGAGGTCCAACTCGACTTCCTTTGTCTTGTGGTCACTTTCCTTGTCTATGGGTGAGGTCAAAGTCCAAATCATGATCACTTATCAAGAACTAGAGCAAGTGGAGATCT
CCAACTTATTTTCTTTCGTCATATGGCCACCTTCCTTGTCTACGAGTCAAAATCCAAATCAAGATCACTTATCATCTACAAGTGGAGATCTCCAATTCATTTTCCTTTGT
CCTGTGGTCAATTTTCTTTCCATGGGTAACGATACATATCTAGATCACTTATCAAGTACTCCAACAAGTGGAGATCTCCAACTCATTTTTCTCTGTCTTGTGGTCACTTT
CCTTGTCTATGGGTCACTTTCCTTACCTATGGGTGAGGATACATATCAAGATCACTTATCAAGTCACTTTCCTTGTTCACGGGTGAGATCAAAATCCAAAATCAAGACCA
ACTTATCAAGTACTACAAGTGAAGGCCTCCGACTCATTTTCCTTTGTCTTGCGGTCACTTCCCTTTTTATGGGTGAGGATACATATCAAGATCTTATCAAAATTCTTACA
ATAAGTGGAGATCTCAGACTCTTTTTCCTTTGTCTTGTGGTCACCTTTCTTGTCTACAGGTCACTTTCCTTATCTATGAGTAAGGCCAAAATTCAAATCAAGGTCACCTA
TCAAGTACTCCAGCAAGTAGAGATCCGCAACTCTTTTTGCCAGTACAAGTCATTTGTTCCGTCACTTTCCTTATCTATGGTTAGCATATCTTTGATCTTTCCACCTCCTC
TTCGTAGGTTACTTTTCTTGTCTATGGGTAAGGTCAATCCAAATCAAGATCACTTATCAAGTCACGTTCCTTGTCTATGGGTGAGACCAAAATCCAAATCAAGATCACTT
ATCAAGTCATTTTCTTTGTCTATGGGTGAGACCAAAATCGAAATCAAGATCGCTTATCAAGTACTACTACATGTGGAGATCTCCAACACATTTTCCTTTGTCTCATGGTC
ACTTTCCTTTTCTATGGGTGAGGATACATATCAAGATAACTTATCAAAAGTCCTACAACAAGTGAAGGTCACTTACCTTGTTTATGGGTTACTTTCCTTGTCAATAGAGA
GGCCAAAATCCAAATCAAGATTACTTATCAAGTACTACAGTCTTCAACTCATTTTCCTTTATCCTGTGGTCACTTTCCTTGTCTATAGGTCACTTTTCTTGTTTATTGGT
GAGGGTACATATCAAGATCACTTATCAAGTACTACAGCAAGTGGAGATCTCAAACTCAATTTCCTTTGTCTTGTGGTGAGGTCAAAATCCAAATCAAGATCACTTATAAG
CATATGGAGATTCCATATCATTTTCCTTTGTCCTGTGGTTACTTTCCATGTCCATGCTAGCTGGTTAAAAGGTGGATTGATTTCAGAAGGCAAGATCTTGGATATAACAC
TATCAATGCCTACCTCTGATCATTTCCCCATTCTTATGGAAACGGGAAGCTTCAAGTGGGTTCCTATATCTTCTAAGTTCAAAAACATGTGGCTCCTTCACAAGTCTTCC
CTGAAAGATCTCGAAGGCTGGTGGAAAGAAATCAACCCTTGGGGTTGCCTGGGTTTGCCCTTATCCAGGATCTTAAATATATCAAAGCGAAGTTGA
mRNA sequence
ATGGTCCTATATGAAGAAAGGAAGTGTTGGCAGAAATGTAAAGTTAGGTGGCTTATAGAAGGGGATGTGAGAACACCAAGCTCTTCCATAGTGTGCTCAACTCAAGAAGA
AAGAGGCAATAGGGATCCATGGTTTAACAGGGGAACTCTATGGGTCACTTTGACCTTGTCGCTAGGTCACTTTCCTGGTTTATGGGCGAGGATACATATCAAGATCACTT
TTCAAGTACTAGAACAAGTGGAGATCCCCGACTCATTTTCATTTGTTTTGTGTATTTGTAATTTTTTCAATTCCAATTTCTTGGGTTTGCAAAGAGGAATACAAAGTCCC
ACATTGGCTAGGGAAGGGATTGATCCATGGTATATAAGAGATGAGAATTATCTCCATTGGTTGTTCTTGGAGTCTTGGATAAGAGCACGCTCTCGTTATGCCCGAGAGTG
GACAGATGGTCACTTTCCTAGTCTACGGTGTGATCAAAATCTGAATCTAGATCACTTATCAAGTACTCTAACAAGTGGAGTTCTCCAACTGGTTTTCTTTTGTCTCGTGG
CATTTGTCATTTGTCACTTTCTTTGTCTAAGAGTTAGTCATGATCACTTATCAAGTACTCCAACAAGTGGAGATCTCCAACTCATTTTTCTCTGTCTTGTGGTGAGGTCA
AAATCCAAATCTAGATCACTTATCAAGTACAAGTGGAGATCTCCGATTCATTTTCCTGTGTTACTTTCCTTGTCTATGGGGGAGGATACATATCAAGTACTTCAACAAGT
GGAGAGGTCCAACTCGACTTCCTTTGTCTTGTGGTCACTTTCCTTGTCTATGGGTGAGGTCAAAGTCCAAATCATGATCACTTATCAAGAACTAGAGCAAGTGGAGATCT
CCAACTTATTTTCTTTCGTCATATGGCCACCTTCCTTGTCTACGAGTCAAAATCCAAATCAAGATCACTTATCATCTACAAGTGGAGATCTCCAATTCATTTTCCTTTGT
CCTGTGGTCAATTTTCTTTCCATGGGTAACGATACATATCTAGATCACTTATCAAGTACTCCAACAAGTGGAGATCTCCAACTCATTTTTCTCTGTCTTGTGGTCACTTT
CCTTGTCTATGGGTCACTTTCCTTACCTATGGGTGAGGATACATATCAAGATCACTTATCAAGTCACTTTCCTTGTTCACGGGTGAGATCAAAATCCAAAATCAAGACCA
ACTTATCAAGTACTACAAGTGAAGGCCTCCGACTCATTTTCCTTTGTCTTGCGGTCACTTCCCTTTTTATGGGTGAGGATACATATCAAGATCTTATCAAAATTCTTACA
ATAAGTGGAGATCTCAGACTCTTTTTCCTTTGTCTTGTGGTCACCTTTCTTGTCTACAGGTCACTTTCCTTATCTATGAGTAAGGCCAAAATTCAAATCAAGGTCACCTA
TCAAGTACTCCAGCAAGTAGAGATCCGCAACTCTTTTTGCCAGTACAAGTCATTTGTTCCGTCACTTTCCTTATCTATGGTTAGCATATCTTTGATCTTTCCACCTCCTC
TTCGTAGGTTACTTTTCTTGTCTATGGGTAAGGTCAATCCAAATCAAGATCACTTATCAAGTCACGTTCCTTGTCTATGGGTGAGACCAAAATCCAAATCAAGATCACTT
ATCAAGTCATTTTCTTTGTCTATGGGTGAGACCAAAATCGAAATCAAGATCGCTTATCAAGTACTACTACATGTGGAGATCTCCAACACATTTTCCTTTGTCTCATGGTC
ACTTTCCTTTTCTATGGGTGAGGATACATATCAAGATAACTTATCAAAAGTCCTACAACAAGTGAAGGTCACTTACCTTGTTTATGGGTTACTTTCCTTGTCAATAGAGA
GGCCAAAATCCAAATCAAGATTACTTATCAAGTACTACAGTCTTCAACTCATTTTCCTTTATCCTGTGGTCACTTTCCTTGTCTATAGGTCACTTTTCTTGTTTATTGGT
GAGGGTACATATCAAGATCACTTATCAAGTACTACAGCAAGTGGAGATCTCAAACTCAATTTCCTTTGTCTTGTGGTGAGGTCAAAATCCAAATCAAGATCACTTATAAG
CATATGGAGATTCCATATCATTTTCCTTTGTCCTGTGGTTACTTTCCATGTCCATGCTAGCTGGTTAAAAGGTGGATTGATTTCAGAAGGCAAGATCTTGGATATAACAC
TATCAATGCCTACCTCTGATCATTTCCCCATTCTTATGGAAACGGGAAGCTTCAAGTGGGTTCCTATATCTTCTAAGTTCAAAAACATGTGGCTCCTTCACAAGTCTTCC
CTGAAAGATCTCGAAGGCTGGTGGAAAGAAATCAACCCTTGGGGTTGCCTGGGTTTGCCCTTATCCAGGATCTTAAATATATCAAAGCGAAGTTGA
Protein sequence
MVLYEERKCWQKCKVRWLIEGDVRTPSSSIVCSTQEERGNRDPWFNRGTLWVTLTLSLGHFPGLWARIHIKITFQVLEQVEIPDSFSFVLCICNFFNSNFLGLQRGIQSP
TLAREGIDPWYIRDENYLHWLFLESWIRARSRYAREWTDGHFPSLRCDQNLNLDHLSSTLTSGVLQLVFFCLVAFVICHFLCLRVSHDHLSSTPTSGDLQLIFLCLVVRS
KSKSRSLIKYKWRSPIHFPVLLSLSMGEDTYQVLQQVERSNSTSFVLWSLSLSMGEVKVQIMITYQELEQVEISNLFSFVIWPPSLSTSQNPNQDHLSSTSGDLQFIFLC
PVVNFLSMGNDTYLDHLSSTPTSGDLQLIFLCLVVTFLVYGSLSLPMGEDTYQDHLSSHFPCSRVRSKSKIKTNLSSTTSEGLRLIFLCLAVTSLFMGEDTYQDLIKILT
ISGDLRLFFLCLVVTFLVYRSLSLSMSKAKIQIKVTYQVLQQVEIRNSFCQYKSFVPSLSLSMVSISLIFPPPLRRLLFLSMGKVNPNQDHLSSHVPCLWVRPKSKSRSL
IKSFSLSMGETKIEIKIAYQVLLHVEISNTFSFVSWSLSFSMGEDTYQDNLSKVLQQVKVTYLVYGLLSLSIERPKSKSRLLIKYYSLQLIFLYPVVTFLVYRSLFLFIG
EGTYQDHLSSTTASGDLKLNFLCLVVRSKSKSRSLISIWRFHIIFLCPVVTFHVHASWLKGGLISEGKILDITLSMPTSDHFPILMETGSFKWVPISSKFKNMWLLHKSS
LKDLEGWWKEINPWGCLGLPLSRILNISKRS
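
The protein sequence corresponds to a translation of the CDS with the standard genetic code. The sketch below is a minimal, dependency-free translator under that assumption; `translate` and `CODON_TABLE` are hypothetical names, and the two test fragments are the first and last codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 16 codons and last 6 codons of the CDS listed in this record.
print(translate("ATGGTCCTATATGAAGAAAGGAAGTGTTGGCAGAAATGTAAAGTTAGG"))
print(translate("ATATCAAAGCGAAGTTGA"))
```

Pasting the full CDS into `translate` (line breaks are ignored) gives a sequence that can be compared against the protein sequence above; a mismatch would point to a frame or annotation problem rather than to the translator.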