; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g10940 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g10940
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr4:8163424..8164197
RNA-Seq ExpressionMoc04g10940
SyntenyMoc04g10940
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036997.1 gypsy/ty3 element polyprotein [Cucumis melo var. makuwa]1.6e-4653.11Show/hide
Query:  CFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADE
        C++C+E +S GHRCK RELR+ VV +D    ED EM+    E +  E    VE ++ NS+VGLT PGT K KG ++ +E++++ DCGATHNFIS KL + 
Subjt:  CFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADE

Query:  LQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF
        L++   +T +YGVIMGS   ++G GICKG+ + LP ++I+EDFLPLELG++D+VLGMQWL++ G + VDW ALTMTF
Subjt:  LQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF

KAA0047644.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-4653.09Show/hide
Query:  RGRRRACFKCDEKYSVGHRCKKR---ELRVFVVHND----EHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCG
        R  +  CF+C+EKYSV HRCK +   EL++FVV  +    E  EE+T      + L+EEEK K    ++ NS+VGL  PGTMK KGKIQEREVI+L DCG
Subjt:  RGRRRACFKCDEKYSVGHRCKKR---ELRVFVVHND----EHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCG

Query:  ATHNFISTKLADELQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTFEKDG
        ATHNFIS KL + LQ+   +T  YGVI+GS T ++G GIC+ V + L    + E+FLPLELG +DVVLGMQWL  +G   VDW  LT+TF  +G
Subjt:  ATHNFISTKLADELQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTFEKDG

KAA0049174.1 gypsy/ty3 element polyprotein [Cucumis melo var. makuwa]1.6e-4653.11Show/hide
Query:  CFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADE
        C++C+E +S GHRCK RELR+ VV +D    ED EM+    E +  E    VE ++ NS+VGLT PGT K KG ++ +E++++ DCGATHNFIS KL + 
Subjt:  CFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADE

Query:  LQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF
        L++   +T +YGVIMGS   ++G GICKG+ + LP ++I+EDFLPLELG++D+VLGMQWL++ G + VDW ALTMTF
Subjt:  LQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF

TYK17386.1 gypsy/ty3 element polyprotein [Cucumis melo var. makuwa]1.6e-4653.11Show/hide
Query:  CFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADE
        C++C+E +S GHRCK RELR+ VV +D    ED EM+    E +  E    VE ++ NS+VGLT PGT K KG ++ +E++++ DCGATHNFIS KL + 
Subjt:  CFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADE

Query:  LQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF
        L++   +T +YGVIMGS   ++G GICKG+ + LP ++I+EDFLPLELG++D+VLGMQWL++ G + VDW ALTMTF
Subjt:  LQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF

XP_022154744.1 uncharacterized protein LOC111021922 [Momordica charantia]6.5e-5356.52Show/hide
Query:  RRGRRRACFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFI
        RR  +  CF+ +EKYS+GHRCK +EL+VFVVH+DE  E D E +I   E +E    + V  +A N++VG +TPGTMK +G I+++EV++L DCGATHNFI
Subjt:  RRGRRRACFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFI

Query:  STKLADELQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF
        S KL D   +   +T +YGVIMG+   +RG GICKG++L LPE+TI E+FLPLELG+LDVVLGMQWL   G ++VDW ALTM+F
Subjt:  STKLADELQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF

TrEMBL top hitse value%identityAlignment
A0A5A7T606 Gypsy/ty3 element polyprotein7.5e-4753.11Show/hide
Query:  CFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADE
        C++C+E +S GHRCK RELR+ VV +D    ED EM+    E +  E    VE ++ NS+VGLT PGT K KG ++ +E++++ DCGATHNFIS KL + 
Subjt:  CFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADE

Query:  LQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF
        L++   +T +YGVIMGS   ++G GICKG+ + LP ++I+EDFLPLELG++D+VLGMQWL++ G + VDW ALTMTF
Subjt:  LQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF

A0A5A7TXF0 Ty3/gypsy retrotransposon protein5.8e-4753.09Show/hide
Query:  RGRRRACFKCDEKYSVGHRCKKR---ELRVFVVHND----EHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCG
        R  +  CF+C+EKYSV HRCK +   EL++FVV  +    E  EE+T      + L+EEEK K    ++ NS+VGL  PGTMK KGKIQEREVI+L DCG
Subjt:  RGRRRACFKCDEKYSVGHRCKKR---ELRVFVVHND----EHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCG

Query:  ATHNFISTKLADELQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTFEKDG
        ATHNFIS KL + LQ+   +T  YGVI+GS T ++G GIC+ V + L    + E+FLPLELG +DVVLGMQWL  +G   VDW  LT+TF  +G
Subjt:  ATHNFISTKLADELQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTFEKDG

A0A5D3BD16 Ty3/gypsy retrotransposon protein7.5e-4753.11Show/hide
Query:  CFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADE
        C++C+E +S GHRCK RELR+ VV +D    ED EM+    E +  E    VE ++ NS+VGLT PGT K KG ++ +E++++ DCGATHNFIS KL + 
Subjt:  CFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADE

Query:  LQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF
        L++   +T +YGVIMGS   ++G GICKG+ + LP ++I+EDFLPLELG++D+VLGMQWL++ G + VDW ALTMTF
Subjt:  LQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF

A0A5D3D378 Gypsy/ty3 element polyprotein7.5e-4753.11Show/hide
Query:  CFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADE
        C++C+E +S GHRCK RELR+ VV +D    ED EM+    E +  E    VE ++ NS+VGLT PGT K KG ++ +E++++ DCGATHNFIS KL + 
Subjt:  CFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADE

Query:  LQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF
        L++   +T +YGVIMGS   ++G GICKG+ + LP ++I+EDFLPLELG++D+VLGMQWL++ G + VDW ALTMTF
Subjt:  LQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF

A0A6J1DN22 Reverse transcriptase3.2e-5356.52Show/hide
Query:  RRGRRRACFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFI
        RR  +  CF+ +EKYS+GHRCK +EL+VFVVH+DE  E D E +I   E +E    + V  +A N++VG +TPGTMK +G I+++EV++L DCGATHNFI
Subjt:  RRGRRRACFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFI

Query:  STKLADELQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF
        S KL D   +   +T +YGVIMG+   +RG GICKG++L LPE+TI E+FLPLELG+LDVVLGMQWL   G ++VDW ALTM+F
Subjt:  STKLADELQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVDWPALTMTF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G53705.1 aminoacyl-tRNA ligases;nucleotide binding;ATP binding5.4e-0538.24Show/hide
Query:  MGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELG--DLDVVLGMQWLRRVGRIQVDWPALTMTFEKD
        MG    I   G C G+ L + E  I+ED+L L+L   D DV+LG +WL ++G   ++    T TF  D
Subjt:  MGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELG--DLDVVLGMQWLRRVGRIQVDWPALTMTFEKD

AT1G53705.2 aminoacyl-tRNA ligases;nucleotide binding;ATP binding5.4e-0538.24Show/hide
Query:  MGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELG--DLDVVLGMQWLRRVGRIQVDWPALTMTFEKD
        MG    I   G C G+ L + E  I+ED+L L+L   D DV+LG +WL ++G   ++    T TF  D
Subjt:  MGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELG--DLDVVLGMQWLRRVGRIQVDWPALTMTFEKD

AT3G29750.1 Eukaryotic aspartyl protease family protein2.8e-1737.5Show/hide
Query:  IVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADELQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELG--DLDVVLGM
        ++ LT    M+F G I + +V+V  D GAT NFI  +LA  L++  + T    V++G R  I+  G C G+ L + E+ I E+FL L+L   D+DV+LG 
Subjt:  IVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADELQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELG--DLDVVLGM

Query:  QWLRRVGRIQVDWPALTMTF
        +WL ++G   V+W     +F
Subjt:  QWLRRVGRIQVDWPALTMTF

AT3G30770.1 Eukaryotic aspartyl protease family protein1.2e-1235.71Show/hide
Query:  EKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADELQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPL
        E  K +  V   S    T    M+F G I   +V+V+ D GAT+NFIS +LA  L++  + T    V++G R  I+  G C G+ L + E+ I E+FL L
Subjt:  EKGKAVEYVAFNSIVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADELQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPL

Query:  EL--GDLDVVLGMQWLRRVGRIQVDW
        +L   D+DV+LG    + + R  + W
Subjt:  EL--GDLDVVLGMQWLRRVGRIQVDW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAACAAGCTGCCCAAGAAGGTGAAGAGACGTCCGTTGGAAAGGGAACATGAATTAACTCAGGACAAGGACACGCGCCAGCTGGCAGACCAGAGCTGCAGCAAAAA
CGACTCATTGAGGCAGAATATCAGAAGAGGAAGGAGAAGAGCCTGTTTCAAGTGCGACGAGAAGTACTCTGTGGGGCATAGGTGTAAGAAACGCGAGTTACGTGTGTTTG
TTGTCCACAACGATGAACATGCTGAAGAAGACACGGAGATGATGATTCCCGAGGACGAGCTTAAAGAAGAGGAAAAGGGCAAGGCAGTGGAATATGTAGCTTTCAACTCC
ATTGTGGGACTCACAACCCCAGGGACAATGAAGTTCAAAGGCAAGATACAAGAAAGGGAAGTCATAGTTCTCTTCGATTGCGGTGCGACGCACAACTTTATCTCTACAAA
GTTGGCGGATGAATTACAGATGGTCAGGACCAAAACACCCAGTTATGGGGTTATCATGGGGTCGAGGACAACGATCAGGGGAGGAGGAATCTGTAAGGGGGTAGTCCTCG
CTTTGCCTGAAATGACTATAATGGAAGACTTTCTACCGCTGGAACTAGGGGATCTCGACGTAGTGCTGGGGATGCAGTGGCTGAGGAGAGTGGGGAGAATACAAGTCGAC
TGGCCGGCATTAACCATGACTTTCGAGAAGGATGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGAACAAGCTGCCCAAGAAGGTGAAGAGACGTCCGTTGGAAAGGGAACATGAATTAACTCAGGACAAGGACACGCGCCAGCTGGCAGACCAGAGCTGCAGCAAAAA
CGACTCATTGAGGCAGAATATCAGAAGAGGAAGGAGAAGAGCCTGTTTCAAGTGCGACGAGAAGTACTCTGTGGGGCATAGGTGTAAGAAACGCGAGTTACGTGTGTTTG
TTGTCCACAACGATGAACATGCTGAAGAAGACACGGAGATGATGATTCCCGAGGACGAGCTTAAAGAAGAGGAAAAGGGCAAGGCAGTGGAATATGTAGCTTTCAACTCC
ATTGTGGGACTCACAACCCCAGGGACAATGAAGTTCAAAGGCAAGATACAAGAAAGGGAAGTCATAGTTCTCTTCGATTGCGGTGCGACGCACAACTTTATCTCTACAAA
GTTGGCGGATGAATTACAGATGGTCAGGACCAAAACACCCAGTTATGGGGTTATCATGGGGTCGAGGACAACGATCAGGGGAGGAGGAATCTGTAAGGGGGTAGTCCTCG
CTTTGCCTGAAATGACTATAATGGAAGACTTTCTACCGCTGGAACTAGGGGATCTCGACGTAGTGCTGGGGATGCAGTGGCTGAGGAGAGTGGGGAGAATACAAGTCGAC
TGGCCGGCATTAACCATGACTTTCGAGAAGGATGGATAG
Protein sequenceShow/hide protein sequence
MSNKLPKKVKRRPLEREHELTQDKDTRQLADQSCSKNDSLRQNIRRGRRRACFKCDEKYSVGHRCKKRELRVFVVHNDEHAEEDTEMMIPEDELKEEEKGKAVEYVAFNS
IVGLTTPGTMKFKGKIQEREVIVLFDCGATHNFISTKLADELQMVRTKTPSYGVIMGSRTTIRGGGICKGVVLALPEMTIMEDFLPLELGDLDVVLGMQWLRRVGRIQVD
WPALTMTFEKDG