; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g14310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g14310
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:9638864..9639634
RNA-Seq ExpressionMoc03g14310
SyntenyMoc03g14310
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141796.1 uncharacterized protein LOC111012081 [Momordica charantia]4.6e-10881.64Show/hide
Query:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ
        +EAPIPPKFK PT+KPYDGSKDPKDYVEVFEGLMDFQA +DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QF SRHYD+KTATHLATIRQ
Subjt:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ

Query:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK
        KEGETLREYVTRFQEE+ KV HCSDDSAMCYFLTGLADE LT+KL EEAP+TF EVLQK KKVIDG ELLRTKTGRPE++I + +  ++  KTD KS DK
Subjt:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK

Query:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL
        GS SS  R EYRR+E+GP+ SRPYER+TPTTIPI EILT IEESGMEKLLK P+KL
Subjt:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL

XP_022149377.1 uncharacterized protein LOC111017807 [Momordica charantia]8.4e-11091.95Show/hide
Query:  KDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQKEGETLREYVTRFQEERPKV
        +DPKDYVEVFEGLMDFQAATDAIKCRAFQIALTG ARLWYRRLPARSISTYSQLRKEFISQF SRHYDRKTATHLATIRQKE ETLREYVTRFQEE+ KV
Subjt:  KDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQKEGETLREYVTRFQEERPKV

Query:  VHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDKGSSSSASRTEYRRSESGPSW
        VHCSDDSAMCYFLTGLADETLT+KL EEAPATF EVLQKAKKVIDG+ELLRTKTGRPEKQIDQKKL QEKRKTDSKS DKGSSSSASR E+RR ESGPS 
Subjt:  VHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDKGSSSSASRTEYRRSESGPSW

Query:  SRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL
        SRPYERYTPTTI ISEILTNIEESGMEKLLK P+KL
Subjt:  SRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL

XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]9.2e-10983.92Show/hide
Query:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ
        +EA IP KFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR FQIALTGSARLWYRRLPARSISTYSQLRKEFI QF SRHYDRKTATHL TIRQ
Subjt:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ

Query:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK
        KEGETLREYVTRFQEE+ KV HCSD SAMCYFLT LADETLT+KLEEEAPATFVEVLQKAKK+IDG+ELLRTKT RPEK+IDQ + +++K KTDSK+ DK
Subjt:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK

Query:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKK
        G SS +SR  YRRS++    SRPYERYTPTTIPISEILTNIE++GMEKLLK P+K
Subjt:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKK

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]9.9e-11184.38Show/hide
Query:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ
        +EA IPPKFKTPTMKPYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIALTGSARLWYRRLPAR ISTYSQLRKEFISQF SRHYDRKT THLATIRQ
Subjt:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ

Query:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK
        KEGETLREYVTRF EE+ KV HCSDDSAMCYFLTGLADETLT+KL EEAPATF EVLQK KKVIDG+ELLRTKTGRPEK IDQ +  ++K K DSKS DK
Subjt:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK

Query:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL
        G SSS+SR +YRRS S  + SRPYE YTPTTIPI EILTNIEE+GMEKLLK P+KL
Subjt:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]1.6e-10881.64Show/hide
Query:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ
        +EAPIPPKFK PT+KPYDG+KDPKDYVEVFEGLMDFQAA+DAIKCRAFQIALTGSARLWYRRLP RSISTYSQLR+EF++QF SRHYD+KTATHLATIRQ
Subjt:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ

Query:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK
        KEGETLREYVTRFQEE+ KV HCSDDSAMCYFLTGLADE LT+KL EEAPATF EVLQKAKKVIDG+ELLRTKTGRPE++I + +  ++  + D KS DK
Subjt:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK

Query:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL
        GS SS  R EYRR+E+GP+ SRPYER+TPTTIPI EILTNIEESGMEKLLK P+KL
Subjt:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL

TrEMBL top hitse value%identityAlignment
A0A6J1CKB3 uncharacterized protein LOC1110120812.2e-10881.64Show/hide
Query:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ
        +EAPIPPKFK PT+KPYDGSKDPKDYVEVFEGLMDFQA +DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QF SRHYD+KTATHLATIRQ
Subjt:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ

Query:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK
        KEGETLREYVTRFQEE+ KV HCSDDSAMCYFLTGLADE LT+KL EEAP+TF EVLQK KKVIDG ELLRTKTGRPE++I + +  ++  KTD KS DK
Subjt:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK

Query:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL
        GS SS  R EYRR+E+GP+ SRPYER+TPTTIPI EILT IEESGMEKLLK P+KL
Subjt:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL

A0A6J1D7S8 uncharacterized protein LOC1110178074.0e-11091.95Show/hide
Query:  KDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQKEGETLREYVTRFQEERPKV
        +DPKDYVEVFEGLMDFQAATDAIKCRAFQIALTG ARLWYRRLPARSISTYSQLRKEFISQF SRHYDRKTATHLATIRQKE ETLREYVTRFQEE+ KV
Subjt:  KDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQKEGETLREYVTRFQEERPKV

Query:  VHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDKGSSSSASRTEYRRSESGPSW
        VHCSDDSAMCYFLTGLADETLT+KL EEAPATF EVLQKAKKVIDG+ELLRTKTGRPEKQIDQKKL QEKRKTDSKS DKGSSSSASR E+RR ESGPS 
Subjt:  VHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDKGSSSSASRTEYRRSESGPSW

Query:  SRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL
        SRPYERYTPTTI ISEILTNIEESGMEKLLK P+KL
Subjt:  SRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL

A0A6J1DDW5 uncharacterized protein LOC1110196344.5e-10983.92Show/hide
Query:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ
        +EA IP KFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR FQIALTGSARLWYRRLPARSISTYSQLRKEFI QF SRHYDRKTATHL TIRQ
Subjt:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ

Query:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK
        KEGETLREYVTRFQEE+ KV HCSD SAMCYFLT LADETLT+KLEEEAPATFVEVLQKAKK+IDG+ELLRTKT RPEK+IDQ + +++K KTDSK+ DK
Subjt:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK

Query:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKK
        G SS +SR  YRRS++    SRPYERYTPTTIPISEILTNIE++GMEKLLK P+K
Subjt:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKK

A0A6J1DHB3 uncharacterized protein LOC1110204794.8e-11184.38Show/hide
Query:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ
        +EA IPPKFKTPTMKPYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIALTGSARLWYRRLPAR ISTYSQLRKEFISQF SRHYDRKT THLATIRQ
Subjt:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ

Query:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK
        KEGETLREYVTRF EE+ KV HCSDDSAMCYFLTGLADETLT+KL EEAPATF EVLQK KKVIDG+ELLRTKTGRPEK IDQ +  ++K K DSKS DK
Subjt:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK

Query:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL
        G SSS+SR +YRRS S  + SRPYE YTPTTIPI EILTNIEE+GMEKLLK P+KL
Subjt:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL

A0A6J1DS95 uncharacterized protein LOC1110234217.6e-10981.64Show/hide
Query:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ
        +EAPIPPKFK PT+KPYDG+KDPKDYVEVFEGLMDFQAA+DAIKCRAFQIALTGSARLWYRRLP RSISTYSQLR+EF++QF SRHYD+KTATHLATIRQ
Subjt:  MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQ

Query:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK
        KEGETLREYVTRFQEE+ KV HCSDDSAMCYFLTGLADE LT+KL EEAPATF EVLQKAKKVIDG+ELLRTKTGRPE++I + +  ++  + D KS DK
Subjt:  KEGETLREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDK

Query:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL
        GS SS  R EYRR+E+GP+ SRPYER+TPTTIPI EILTNIEESGMEKLLK P+KL
Subjt:  GSSSSASRTEYRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCACCATGAAGCCCTATGATGGGTCTAAGGACCCCAAAGACTATGTTGAGGTCTTCGAGGGCCTCATGGAT
TTTCAAGCGGCAACGGATGCAATAAAATGCCGCGCCTTCCAAATCGCGCTCACAGGCAGCGCGCGCTTGTGGTACCGAAGACTGCCGGCCAGGTCGATCTCGACC
TACTCCCAGCTGAGGAAGGAGTTCATCAGTCAGTTCTTCTCTCGGCATTATGATAGAAAGACAGCGACTCACCTGGCCACCATCAGGCAGAAGGAAGGAGAGACG
CTGAGAGAATATGTCACAAGGTTTCAGGAGGAGCGGCCGAAGGTTGTGCACTGCTCCGACGATTCGGCCATGTGCTACTTCCTCACCGGCCTGGCTGATGAGACC
CTCACTCTGAAGCTCGAAGAGGAGGCTCCGGCAACCTTCGTCGAAGTCTTGCAGAAGGCAAAAAAGGTCATTGATGGGAAAGAGCTCCTCCGAACCAAGACTGGC
CGACCTGAAAAGCAGATCGATCAGAAGAAATTGACCCAAGAGAAGAGGAAGACTGATTCCAAGTCCGGAGACAAGGGATCGTCTTCTTCCGCCAGCAGAACAGAA
TACCGTAGGTCGGAGAGCGGCCCCAGCTGGAGCCGACCTTATGAACGGTACACACCAACCACCATCCCCATCTCTGAGATACTCACGAACATCGAGGAGAGCGGG
ATGGAAAAGCTCCTGAAGCTACCTAAGAAGCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCACCATGAAGCCCTATGATGGGTCTAAGGACCCCAAAGACTATGTTGAGGTCTTCGAGGGCCTCATGGAT
TTTCAAGCGGCAACGGATGCAATAAAATGCCGCGCCTTCCAAATCGCGCTCACAGGCAGCGCGCGCTTGTGGTACCGAAGACTGCCGGCCAGGTCGATCTCGACC
TACTCCCAGCTGAGGAAGGAGTTCATCAGTCAGTTCTTCTCTCGGCATTATGATAGAAAGACAGCGACTCACCTGGCCACCATCAGGCAGAAGGAAGGAGAGACG
CTGAGAGAATATGTCACAAGGTTTCAGGAGGAGCGGCCGAAGGTTGTGCACTGCTCCGACGATTCGGCCATGTGCTACTTCCTCACCGGCCTGGCTGATGAGACC
CTCACTCTGAAGCTCGAAGAGGAGGCTCCGGCAACCTTCGTCGAAGTCTTGCAGAAGGCAAAAAAGGTCATTGATGGGAAAGAGCTCCTCCGAACCAAGACTGGC
CGACCTGAAAAGCAGATCGATCAGAAGAAATTGACCCAAGAGAAGAGGAAGACTGATTCCAAGTCCGGAGACAAGGGATCGTCTTCTTCCGCCAGCAGAACAGAA
TACCGTAGGTCGGAGAGCGGCCCCAGCTGGAGCCGACCTTATGAACGGTACACACCAACCACCATCCCCATCTCTGAGATACTCACGAACATCGAGGAGAGCGGG
ATGGAAAAGCTCCTGAAGCTACCTAAGAAGCTCTGA
Protein sequenceShow/hide protein sequence
MEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTATHLATIRQKEGET
LREYVTRFQEERPKVVHCSDDSAMCYFLTGLADETLTLKLEEEAPATFVEVLQKAKKVIDGKELLRTKTGRPEKQIDQKKLTQEKRKTDSKSGDKGSSSSASRTE
YRRSESGPSWSRPYERYTPTTIPISEILTNIEESGMEKLLKLPKKL