; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g21910 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g21910
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr7:16075354..16081827
RNA-Seq ExpressionMoc07g21910
SyntenyMoc07g21910
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN45635.1 hypothetical protein Csa_005498 [Cucumis sativus]9.8e-1234.68Show/hide
Query:  IFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRV
        +F LE +D F++A  +L+      + + S +MFSL V HH+L  N+A Q+MP FF  Y F ++  +S   I+  F  +   +    +SLSF +   L R+
Subjt:  IFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRV

Query:  EL-ISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEE
         L  S        + +  +  A ++    ID   FVSID Q F+ V T   R  +VRVT +HS VRF+ E +E
Subjt:  EL-ISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEE

XP_022156120.1 uncharacterized protein LOC111023084 [Momordica charantia]9.2e-95100Show/hide
Query:  MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLP
        MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLP
Subjt:  MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLP

Query:  RVELISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQ
        RVELISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQ
Subjt:  RVELISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQ

XP_022959393.1 uncharacterized protein LOC111460379 [Cucurbita moschata]3.7e-1133.33Show/hide
Query:  MFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRVELISEG--SCLTQRVI--ELPLSPAEEKAST
        MF+L V   + R   ALQI+P FF  +N  +Q  +S F IE F+  + + +  G SS+ F L   + ++ L  E   S L  +V+  EL  S +E +   
Subjt:  MFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRVELISEG--SCLTQRVI--ELPLSPAEEKAST

Query:  EIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQLERTDEFLEAAAVLAGISTRCHFTLSKGIFSLHIFHHSLRCKIALQI
        ++DY  FVSID QDFK V    + AP V V+L+ S ++F    +EI LT    R +       + AG + R   TL   +F   + H S R  + + +
Subjt:  EIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQLERTDEFLEAAAVLAGISTRCHFTLSKGIFSLHIFHHSLRCKIALQI

XP_023006013.1 uncharacterized protein LOC111498890 [Cucurbita maxima]2.4e-1033.33Show/hide
Query:  ALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRVELISEG--SCLTQRVI--ELPLSPAEEKASTEIDYSVFVSIDLQDF
        ALQI+P +F  +N  +Q  +S F IE F+  + + +  G SS+ F L   + ++ L  E   S L  +V+  EL  SP+E +   ++DY  FVSID QDF
Subjt:  ALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRVELISEG--SCLTQRVI--ELPLSPAEEKASTEIDYSVFVSIDLQDF

Query:  KPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQLERTDEFLEAAAVLAGISTRCHFTLSKGIFSLHIFHHSLRCKIALQI
        K V    + AP V V+L+ S ++F    +EI LT    R +       + AG + R   TL   +F   + H S R  + + +
Subjt:  KPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQLERTDEFLEAAAVLAGISTRCHFTLSKGIFSLHIFHHSLRCKIALQI

XP_031744101.1 uncharacterized protein LOC116404781 [Cucumis sativus]1.2e-1235.43Show/hide
Query:  MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLP
        ML+F LE +D F++A  +L+      + + S +MFSL V HH+L  N+A Q+MP FF  Y F ++  +S   I+  F  +   +    +SLSF +   L 
Subjt:  MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLP

Query:  RVEL-ISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEE
        R+ L  S        + +  +  A ++    ID   FVSID Q F+ V T   R  +VRVT +HS VRF+ E +E
Subjt:  RVEL-ISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEE

TrEMBL top hitse value%identityAlignment
A0A0A0K7E0 Uncharacterized protein4.7e-1234.68Show/hide
Query:  IFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRV
        +F LE +D F++A  +L+      + + S +MFSL V HH+L  N+A Q+MP FF  Y F ++  +S   I+  F  +   +    +SLSF +   L R+
Subjt:  IFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRV

Query:  EL-ISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEE
         L  S        + +  +  A ++    ID   FVSID Q F+ V T   R  +VRVT +HS VRF+ E +E
Subjt:  EL-ISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEE

A0A6J1DTY7 uncharacterized protein LOC1110230844.5e-95100Show/hide
Query:  MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLP
        MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLP
Subjt:  MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLP

Query:  RVELISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQ
        RVELISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQ
Subjt:  RVELISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQ

A0A6J1H5T5 uncharacterized protein LOC1114603791.8e-1133.33Show/hide
Query:  MFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRVELISEG--SCLTQRVI--ELPLSPAEEKAST
        MF+L V   + R   ALQI+P FF  +N  +Q  +S F IE F+  + + +  G SS+ F L   + ++ L  E   S L  +V+  EL  S +E +   
Subjt:  MFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRVELISEG--SCLTQRVI--ELPLSPAEEKAST

Query:  EIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQLERTDEFLEAAAVLAGISTRCHFTLSKGIFSLHIFHHSLRCKIALQI
        ++DY  FVSID QDFK V    + AP V V+L+ S ++F    +EI LT    R +       + AG + R   TL   +F   + H S R  + + +
Subjt:  EIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQLERTDEFLEAAAVLAGISTRCHFTLSKGIFSLHIFHHSLRCKIALQI

A0A6J1H641 uncharacterized protein LOC1114603641.5e-1027.59Show/hide
Query:  IDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRVELI---
        ID   EAT +L  ++   +FK    M    +   S  C ++ Q+MP FFT Y   DQ+  ++F ++  FR L N ++ G  + +F+LD +   V L    
Subjt:  IDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRVELI---

Query:  -SEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQLERTDEFLEAAAVLAGISTRCHFT
           G+ +T     +P+ P +E+   ++DYS F SID + FK +  ++  A  V  TL+ S ++F     E  LT Q        E   ++ GI       
Subjt:  -SEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQLERTDEFLEAAAVLAGISTRCHFT

Query:  LSKGIFSLHIFHHSLRCKIALQIMPYFFTTYN
         S  +  +  F+     ++      +FF TY+
Subjt:  LSKGIFSLHIFHHSLRCKIALQIMPYFFTTYN

A0A6J1L3R9 uncharacterized protein LOC1114988901.2e-1033.33Show/hide
Query:  ALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRVELISEG--SCLTQRVI--ELPLSPAEEKASTEIDYSVFVSIDLQDF
        ALQI+P +F  +N  +Q  +S F IE F+  + + +  G SS+ F L   + ++ L  E   S L  +V+  EL  SP+E +   ++DY  FVSID QDF
Subjt:  ALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRVELISEG--SCLTQRVI--ELPLSPAEEKASTEIDYSVFVSIDLQDF

Query:  KPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQLERTDEFLEAAAVLAGISTRCHFTLSKGIFSLHIFHHSLRCKIALQI
        K V    + AP V V+L+ S ++F    +EI LT    R +       + AG + R   TL   +F   + H S R  + + +
Subjt:  KPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQLERTDEFLEAAAVLAGISTRCHFTLSKGIFSLHIFHHSLRCKIALQI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGATCTTCAAGCTCGAGCGGATAGACGGGTTCGTGGAGGCGACGGCAGTGCTGGCCGGAATATCGACTCGATGCCATTTCAAGTTGTCGGCGGCGATGTTCTCCCT
GTACGTGTACCACCATTCCCTCCGGTGCAACATAGCCCTCCAAATAATGCCTCCCTTCTTCACCACATACAATTTCTTCGACCAGAACCCCAACTCCTCATTCTACATCG
AAAACTTCTTCCGCGCTCTCCTCAACTTTCAGCGCAGCGGATGCTCTTCGCTGAGCTTCGCCCTCGACCACCACCTCCCCCGCGTGGAGCTGATCTCAGAAGGATCTTGC
CTCACGCAGCGCGTGATTGAATTGCCCCTCTCTCCTGCAGAGGAGAAGGCCTCCACAGAAATCGACTACTCAGTCTTTGTGTCCATTGATTTGCAAGACTTCAAGCCCGT
GGCAACCATGTTTGATCGCGCTCCTTATGTTCGCGTTACTTTGTCGCATTCGGGCGTGAGGTTTGCTTATGAAGACGAGGAGATTACTCTCACCGCACAGCTTGAGCGCA
CGGACGAATTTCTGGAGGCCGCCGCCGTACTGGCTGGAATCTCAACGCGATGCCATTTCACATTGTCCAAAGGAATCTTCTCCTTACACATCTTCCACCACTCCCTCCGC
TGCAAAATAGCCCTCCAAATAATGCCATATTTCTTCACCACATACAATTTCTTCAACCAAAATCCCCATCCCTCCTTTTTCATCGGCGCCTTCTTCACCGCTCTCCTCAA
CTTTCAACGCATCGGTTGCTCTTCCCTGACCTTCCACCTTGCCCAGATCCTCCCTCTCGCCGAGCTCATCTCCGACGAATCTTGTCAGGCTACAAATGAAATTGATTTGA
GAATCTTTGTCTCCATGGAGCTCCATCACTTCAAGCCCATTGCAACAACCTTTGATCAAGCTCCTTATGTTCGTGTTGCTGTAACCGATTCAAGAGTCAGGTTCTCTTAT
GAAGATTATATCATTACTCTTACCGTAGAGGGAAATGAATGTATAATTGGAGGTGTTGAGGGAGGAGATGAAGTGCAGTTAATACTCACACCAATTCCCATGACAGCCTT
CCATGGGATGAGTCGGGATCAAGTCAGGATGCGTCGGGATCGAAGTGGGGATGAAAACCAGGGGAAGTGGTCTCTGCAGGCCACAAGGGCCGAGGAGGCAGCACTGCGGC
GCTGTCCTGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGATCTTCAAGCTCGAGCGGATAGACGGGTTCGTGGAGGCGACGGCAGTGCTGGCCGGAATATCGACTCGATGCCATTTCAAGTTGTCGGCGGCGATGTTCTCCCT
GTACGTGTACCACCATTCCCTCCGGTGCAACATAGCCCTCCAAATAATGCCTCCCTTCTTCACCACATACAATTTCTTCGACCAGAACCCCAACTCCTCATTCTACATCG
AAAACTTCTTCCGCGCTCTCCTCAACTTTCAGCGCAGCGGATGCTCTTCGCTGAGCTTCGCCCTCGACCACCACCTCCCCCGCGTGGAGCTGATCTCAGAAGGATCTTGC
CTCACGCAGCGCGTGATTGAATTGCCCCTCTCTCCTGCAGAGGAGAAGGCCTCCACAGAAATCGACTACTCAGTCTTTGTGTCCATTGATTTGCAAGACTTCAAGCCCGT
GGCAACCATGTTTGATCGCGCTCCTTATGTTCGCGTTACTTTGTCGCATTCGGGCGTGAGGTTTGCTTATGAAGACGAGGAGATTACTCTCACCGCACAGCTTGAGCGCA
CGGACGAATTTCTGGAGGCCGCCGCCGTACTGGCTGGAATCTCAACGCGATGCCATTTCACATTGTCCAAAGGAATCTTCTCCTTACACATCTTCCACCACTCCCTCCGC
TGCAAAATAGCCCTCCAAATAATGCCATATTTCTTCACCACATACAATTTCTTCAACCAAAATCCCCATCCCTCCTTTTTCATCGGCGCCTTCTTCACCGCTCTCCTCAA
CTTTCAACGCATCGGTTGCTCTTCCCTGACCTTCCACCTTGCCCAGATCCTCCCTCTCGCCGAGCTCATCTCCGACGAATCTTGTCAGGCTACAAATGAAATTGATTTGA
GAATCTTTGTCTCCATGGAGCTCCATCACTTCAAGCCCATTGCAACAACCTTTGATCAAGCTCCTTATGTTCGTGTTGCTGTAACCGATTCAAGAGTCAGGTTCTCTTAT
GAAGATTATATCATTACTCTTACCGTAGAGGGAAATGAATGTATAATTGGAGGTGTTGAGGGAGGAGATGAAGTGCAGTTAATACTCACACCAATTCCCATGACAGCCTT
CCATGGGATGAGTCGGGATCAAGTCAGGATGCGTCGGGATCGAAGTGGGGATGAAAACCAGGGGAAGTGGTCTCTGCAGGCCACAAGGGCCGAGGAGGCAGCACTGCGGC
GCTGTCCTGATTAG
Protein sequenceShow/hide protein sequence
MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRVELISEGSC
LTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQLERTDEFLEAAAVLAGISTRCHFTLSKGIFSLHIFHHSLR
CKIALQIMPYFFTTYNFFNQNPHPSFFIGAFFTALLNFQRIGCSSLTFHLAQILPLAELISDESCQATNEIDLRIFVSMELHHFKPIATTFDQAPYVRVAVTDSRVRFSY
EDYIITLTVEGNECIIGGVEGGDEVQLILTPIPMTAFHGMSRDQVRMRRDRSGDENQGKWSLQATRAEEAALRRCPD