CuGenDBv2

Moc07g21960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g21960
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr7:16105198..16109732
RNA-Seq Expression: Moc07g21960
Synteny: Moc07g21960
Gene Ontology terms: NA
InterPro domains: NA
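The genome location in this record follows a chromosome:start..end pattern. A minimal sketch of parsing such a string is shown here, assuming the coordinates are 1-based and inclusive (the page does not state its convention). The names parse_location and span_length are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr7:16105198..16109732")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 4,535 bp on chr7.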


Homology
GenBank top hits (e value, % identity, alignment)
KAA0045262.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 4.0e-25; % identity: 44.59)
Query:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE
        L +  PLL+ATS    I+ DN  +K TP  F +IA HRS RF+A L+L  Q+FT+FS D +H+SKVSL +FH A+LD  + +S+ IHLLD  N++ LRF+
Subjt:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE

Query:  NDSSNDEPEKSHELDLSPPGEDEEQLG--IIDYGRFFGIDYQEFNRIIAEFSIYEDD
          SS  +P   HEL LSPP  ++ Q+G   +D  ++F +  +   RII +  I+++D
Subjt:  NDSSNDEPEKSHELDLSPPGEDEEQLG--IIDYGRFFGIDYQEFNRIIAEFSIYEDD

XP_008458682.1 PREDICTED: uncharacterized protein LOC103498010 [Cucumis melo] (e value: 2.0e-24; % identity: 47.1)
Query:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE
        L    PL++ATSL   +A D   VK TP    +I  +RS +FVA L+L R+ FT+FS D N +SKVSL  FH A+LD  + SS+ IHLLD+ N++ LRFE
Subjt:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE

Query:  NDSSNDEPEKSHELDLSPPGEDEEQLGIIDYGRFFGIDYQEFNRIIAEFSIYEDD
           S+D P   HEL LSPP    E LG ++YG FF +  +E  RII E  ++  D
Subjt:  NDSSNDEPEKSHELDLSPPGEDEEQLGIIDYGRFFGIDYQEFNRIIAEFSIYEDD

XP_008464344.1 PREDICTED: uncharacterized protein LOC103502250 [Cucumis melo] (e value: 4.0e-25; % identity: 44.59)
Query:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE
        L +  PLL+ATS    I+ DN  +K TP  F +IA HRS RF+A L+L  Q+FT+FS D +H+SKVSL +FH A+LD  + +S+ IHLLD  N++ LRF+
Subjt:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE

Query:  NDSSNDEPEKSHELDLSPPGEDEEQLG--IIDYGRFFGIDYQEFNRIIAEFSIYEDD
          SS  +P   HEL LSPP  ++ Q+G   +D  ++F +  +   RII +  I+++D
Subjt:  NDSSNDEPEKSHELDLSPPGEDEEQLG--IIDYGRFFGIDYQEFNRIIAEFSIYEDD

XP_022156149.1 uncharacterized protein LOC111023105 [Momordica charantia] (e value: 5.4e-38; % identity: 60.9)
Query:  HIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFEND
        +I  LL+ATS  T+IAD+ VTVKLTP  FSMIA HRSSRF  VLKLPR FFT++S  L HTS++S+H FHTALLDASTS S+ IH+ DS+NR  LRFEN 
Subjt:  HIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFEND

Query:  SSNDEPEKSHELDLS---PPGEDEEQLGIIDYGRFFGIDYQEFNRIIAEFSIYEDD
         S+ + EK HEL  S      +DE+  G +DYGRFFGI YQ+F +IIA FSI++D+
Subjt:  SSNDEPEKSHELDLS---PPGEDEEQLGIIDYGRFFGIDYQEFNRIIAEFSIYEDD

XP_031743987.1 uncharacterized protein LOC116404759 [Cucumis sativus] (e value: 1.3e-26; % identity: 46.5)
Query:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE
        L +  PLL+ATS F  I+ D+  VK TP  F +I+ HRS RF+A L+L  Q+FTSFS D +H+SKVSL +FH A+LD  + +S+ IHLLD  N++ LRF+
Subjt:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE

Query:  NDSSNDEPEKSHELDLSPPGEDEEQLG--IIDYGRFFGIDYQEFNRIIAEFSIYEDD
          SS  +P   HEL LSPP  ++ Q+G   +D G++F +  +   RII E  I+++D
Subjt:  NDSSNDEPEKSHELDLSPPGEDEEQLG--IIDYGRFFGIDYQEFNRIIAEFSIYEDD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K902 Uncharacterized protein (e value: 2.7e-27; % identity: 46.25)
Query:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE
        L +  PLL+ATS F  I+ D+  VK TP  F +I+ HRS RF+A L+L  Q+FTSFS D +H+SKVSL +FH A+LD  + +S+ IHLLD  N++ LRF+
Subjt:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE

Query:  NDSSNDEPEKSHELDLSPPGEDEEQLG--IIDYGRFFGIDYQEFNRIIAEFSIYEDDIRN
          SS  +P   HEL LSPP  ++ Q+G   +D G++F +  +   RII E  I+++D  N
Subjt:  NDSSNDEPEKSHELDLSPPGEDEEQLG--IIDYGRFFGIDYQEFNRIIAEFSIYEDDIRN

A0A1S3C8J1 uncharacterized protein LOC103498010 (e value: 9.7e-25; % identity: 47.1)
Query:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE
        L    PL++ATSL   +A D   VK TP    +I  +RS +FVA L+L R+ FT+FS D N +SKVSL  FH A+LD  + SS+ IHLLD+ N++ LRFE
Subjt:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE

Query:  NDSSNDEPEKSHELDLSPPGEDEEQLGIIDYGRFFGIDYQEFNRIIAEFSIYEDD
           S+D P   HEL LSPP    E LG ++YG FF +  +E  RII E  ++  D
Subjt:  NDSSNDEPEKSHELDLSPPGEDEEQLGIIDYGRFFGIDYQEFNRIIAEFSIYEDD

A0A1S3CL88 uncharacterized protein LOC103502250 (e value: 2.0e-25; % identity: 44.59)
Query:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE
        L +  PLL+ATS    I+ DN  +K TP  F +IA HRS RF+A L+L  Q+FT+FS D +H+SKVSL +FH A+LD  + +S+ IHLLD  N++ LRF+
Subjt:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE

Query:  NDSSNDEPEKSHELDLSPPGEDEEQLG--IIDYGRFFGIDYQEFNRIIAEFSIYEDD
          SS  +P   HEL LSPP  ++ Q+G   +D  ++F +  +   RII +  I+++D
Subjt:  NDSSNDEPEKSHELDLSPPGEDEEQLG--IIDYGRFFGIDYQEFNRIIAEFSIYEDD

A0A5D3DZ07 LINE-1 retrotransposable element ORF2 protein (e value: 2.0e-25; % identity: 44.59)
Query:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE
        L +  PLL+ATS    I+ DN  +K TP  F +IA HRS RF+A L+L  Q+FT+FS D +H+SKVSL +FH A+LD  + +S+ IHLLD  N++ LRF+
Subjt:  LHHIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFE

Query:  NDSSNDEPEKSHELDLSPPGEDEEQLG--IIDYGRFFGIDYQEFNRIIAEFSIYEDD
          SS  +P   HEL LSPP  ++ Q+G   +D  ++F +  +   RII +  I+++D
Subjt:  NDSSNDEPEKSHELDLSPPGEDEEQLG--IIDYGRFFGIDYQEFNRIIAEFSIYEDD

A0A6J1DSH6 uncharacterized protein LOC111023105 (e value: 2.6e-38; % identity: 60.9)
Query:  HIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFEND
        +I  LL+ATS  T+IAD+ VTVKLTP  FSMIA HRSSRF  VLKLPR FFT++S  L HTS++S+H FHTALLDASTS S+ IH+ DS+NR  LRFEN 
Subjt:  HIAPLLNATSLFTDIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFEND

Query:  SSNDEPEKSHELDLS---PPGEDEEQLGIIDYGRFFGIDYQEFNRIIAEFSIYEDD
         S+ + EK HEL  S      +DE+  G +DYGRFFGI YQ+F +IIA FSI++D+
Subjt:  SSNDEPEKSHELDLS---PPGEDEEQLGIIDYGRFFGIDYQEFNRIIAEFSIYEDD

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGTTGGTGCTAAGGCTTGACTCCATTGCTCCTCTTTTGAATGCAACCTCCATTCTCTCATTAGTTGCAGAAGCAGCCGATTTCAAATGCTCGCCGTCCATCATCTCTCT
AATTGTCTCACAACCACCGTCGATGTCGGGGTCGGAGTCGGAGTCGGCGTCCGCCCCCGCCCCCGCCCCTCGCTTTCGCTTCACCGTAGCCCTTCAAATGATGCCCCAAT
TCTTCACCCTATTCTCCTGCGACGGCCAGTTTTGCCATTACAGAATTCTCCTCCACCATTTCTACACAACCTTGTTGGATATGGAACAATACCGTTTCTCTTCTTTAACC
CTCTGTCTTCCCCAACTCCTCCATCGCCTCGTCCTTAAGTTTCAGAATTCTCAGCGTGCGCTGGAGATTCGAGAATTGGCATTGTCTGATCCAGAGGAGGACGACATAGG
AGAAATTGATTACACAACCTTTGTCTCAATTGATTTGATACAGTTCAGACATGTTATAGCTGAGCTAAATACTCCGGAGGATGAGACAGCTCTCGTTATTCTAACGTATT
CACAAGCCAAGTTCATTGGTGCAACTACAGAGATTATTCTTCCCAAAGAGACCACTCGAAAAGGGCTTCACCACATTGCTCCTCTTCTCAACGCCACCTCCCTCTTCACC
GATATCGCCGACGACAACGTCACCGTGAAACTCACCCCGGGATCGTTCTCGATGATTGCCCCGCACCGTTCCTCCCGCTTCGTCGCCGTGCTGAAATTGCCGCGCCAATT
CTTCACCTCCTTCTCGGATGACCTCAATCACACTTCAAAGGTTTCCCTCCACACTTTCCACACTGCTCTCTTGGATGCCTCGACTTCCTCCTCAATCGCCATCCATCTTC
TTGATTCCATGAATCGCTTGGCCCTTAGATTCGAGAATGATTCTTCAAATGATGAGCCAGAGAAAAGCCATGAATTGGATCTGTCACCTCCGGGAGAGGATGAAGAGCAA
TTAGGCATAATTGACTATGGAAGATTTTTTGGGATTGATTATCAAGAATTCAATCGCATTATAGCGGAATTTTCTATCTACGAAGATGACATACGTAATTAA
mRNA sequence
ATGTTGGTGCTAAGGCTTGACTCCATTGCTCCTCTTTTGAATGCAACCTCCATTCTCTCATTAGTTGCAGAAGCAGCCGATTTCAAATGCTCGCCGTCCATCATCTCTCT
AATTGTCTCACAACCACCGTCGATGTCGGGGTCGGAGTCGGAGTCGGCGTCCGCCCCCGCCCCCGCCCCTCGCTTTCGCTTCACCGTAGCCCTTCAAATGATGCCCCAAT
TCTTCACCCTATTCTCCTGCGACGGCCAGTTTTGCCATTACAGAATTCTCCTCCACCATTTCTACACAACCTTGTTGGATATGGAACAATACCGTTTCTCTTCTTTAACC
CTCTGTCTTCCCCAACTCCTCCATCGCCTCGTCCTTAAGTTTCAGAATTCTCAGCGTGCGCTGGAGATTCGAGAATTGGCATTGTCTGATCCAGAGGAGGACGACATAGG
AGAAATTGATTACACAACCTTTGTCTCAATTGATTTGATACAGTTCAGACATGTTATAGCTGAGCTAAATACTCCGGAGGATGAGACAGCTCTCGTTATTCTAACGTATT
CACAAGCCAAGTTCATTGGTGCAACTACAGAGATTATTCTTCCCAAAGAGACCACTCGAAAAGGGCTTCACCACATTGCTCCTCTTCTCAACGCCACCTCCCTCTTCACC
GATATCGCCGACGACAACGTCACCGTGAAACTCACCCCGGGATCGTTCTCGATGATTGCCCCGCACCGTTCCTCCCGCTTCGTCGCCGTGCTGAAATTGCCGCGCCAATT
CTTCACCTCCTTCTCGGATGACCTCAATCACACTTCAAAGGTTTCCCTCCACACTTTCCACACTGCTCTCTTGGATGCCTCGACTTCCTCCTCAATCGCCATCCATCTTC
TTGATTCCATGAATCGCTTGGCCCTTAGATTCGAGAATGATTCTTCAAATGATGAGCCAGAGAAAAGCCATGAATTGGATCTGTCACCTCCGGGAGAGGATGAAGAGCAA
TTAGGCATAATTGACTATGGAAGATTTTTTGGGATTGATTATCAAGAATTCAATCGCATTATAGCGGAATTTTCTATCTACGAAGATGACATACGTAATTAA
Protein sequence
MLVLRLDSIAPLLNATSILSLVAEAADFKCSPSIISLIVSQPPSMSGSESESASAPAPAPRFRFTVALQMMPQFFTLFSCDGQFCHYRILLHHFYTTLLDMEQYRFSSLT
LCLPQLLHRLVLKFQNSQRALEIRELALSDPEEDDIGEIDYTTFVSIDLIQFRHVIAELNTPEDETALVILTYSQAKFIGATTEIILPKETTRKGLHHIAPLLNATSLFT
DIADDNVTVKLTPGSFSMIAPHRSSRFVAVLKLPRQFFTSFSDDLNHTSKVSLHTFHTALLDASTSSSIAIHLLDSMNRLALRFENDSSNDEPEKSHELDLSPPGEDEEQ
LGIIDYGRFFGIDYQEFNRIIAEFSIYEDDIRN
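
The protein sequence in this record begins MLVLRLD and ends DDIRN, which is consistent with translating the listed CDS. A minimal sketch of that translation follows, assuming the standard nuclear genetic code and a forward-strand, in-frame CDS. The function translate is a hypothetical helper, and the two short test strings are the first seven and last five codons copied from the CDS in this record.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumption:
# the record uses the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTTGGTGCTAAGGCTTGAC"))  # first seven codons of the CDS
print(translate("GACATACGTAATTAA"))        # last five codons, ending in TAA
```

Passing the full CDS, with its line breaks left in, should work the same way, since whitespace is stripped before translation.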