CuGenDBv2

Moc09g18780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g18780
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr9:14635727..14638796
RNA-Seq Expression: Moc09g18780
Synteny: Moc09g18780
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
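
The genome location string above can be split into a sequence name and two coordinates. The following is a minimal sketch, not part of the CuGenDB record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr9:14635727..14638796")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 3070 bases on chr9.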


Homology
GenBank top hits (e value, % identity, alignment)
XP_022147761.1 uncharacterized protein LOC111016619 [Momordica charantia] | e value: 1.3e-50 | % identity: 70.55
Query:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF
        +VFS TLTGS R+WFQQLKRKSISSF ELARAF+ QF GG NRS+ VAYLLTI++K TESLKDYVAR N+EKLQVE LTDAV LL F+S VKDE LVFSF
Subjt:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF

Query:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHGEKRHRSRMKASSSKRSK
        GKRT  TFSEA SRAQ YMS  ELIHS+ DP+ + A YN KR+R GEKRH  R + S S + +
Subjt:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHGEKRHRSRMKASSSKRSK

XP_022150035.1 uncharacterized protein LOC111018307 [Momordica charantia] | e value: 5.3e-44 | % identity: 67.32
Query:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF
        +VFS TL+GSARVWF+QLKR SISSF  LA+AF+ QF+GGR+RS+ VAYLLTI+++ TESL DYVAR NEEKLQVE LT+AV+L+AF+S ++DEHL FSF
Subjt:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF

Query:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHGEKRHRSR
        GK+TPSTFSEALSRAQKYMSA E   S+ +PE + +  N  RER G+K   SR
Subjt:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHGEKRHRSR

XP_022158344.1 uncharacterized protein LOC111024851 [Momordica charantia] | e value: 9.0e-52 | % identity: 74.84
Query:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF
        +VFS TLTGSAR WF+QLKR SISSF ELA AF+ QFVGGR +SK V YLLTI++K TESLK+YVAR NEEKLQVE LTDAVAL+AFVS VKDE LVFSF
Subjt:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF

Query:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHGEKRHRSRMKASSS
        GKRTP TF EALSRAQKYMSA ELIH   D E E A Y+ KRER GEKRHRS  + S S
Subjt:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHGEKRHRSRMKASSS

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia] | e value: 5.8e-67 | % identity: 58.94
Query:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF
        +VFS TL GSAR+WF+QLKR SISSF  LARAF+ QFVGGR RS+ VAYLLTI+++TTESL+DYVAR NEEKLQVE LTDAV+LLAF+S V+DEHL FSF
Subjt:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF

Query:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHG--------EKRHRS------------------------------------RMKA
        GKRTP+TFSEALSRAQ+YMSA E  +S+ +P+ +    + KRER G        EKR RS                                    RMKA
Subjt:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHG--------EKRHRS------------------------------------RMKA

Query:  SSSKRSKGKYCLFHRDHGHTTQTCFDLKKEIEDLMRRGYLKEYVEESKETPDTDRNDMSPTQK
        SS+KRSKG+YCLFHRDHGH TQ CFDLK+E+E L+RRGYLKEYVEE K T + + +D SP ++
Subjt:  SSSKRSKGKYCLFHRDHGHTTQTCFDLKKEIEDLMRRGYLKEYVEESKETPDTDRNDMSPTQK

XP_022159368.1 uncharacterized protein LOC111025785 [Momordica charantia] | e value: 9.7e-54 | % identity: 54.07
Query:  LKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSFGKRTPSTFSEALSRAQK
        +KR SISSF  LARAF+ QFVGGR RS+ VAYLLTI+++TTESL DYVAR N+EKLQ+ESLTD V+LLAF+S V+DEHL FSFGK+TPSTFSE LSRAQ+
Subjt:  LKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSFGKRTPSTFSEALSRAQK

Query:  YMSARELIHSRYDPEEECAYYNAKRERHG--------EKRHRS------------------------------------RMKASSSKRSKGKYCLFHRDH
        YMSA E  +S+ +P+ +    + KRER G        EKR RS                                    RMK  S+KRSKG+YCLFHRDH
Subjt:  YMSARELIHSRYDPEEECAYYNAKRERHG--------EKRHRS------------------------------------RMKASSSKRSKGKYCLFHRDH

Query:  GHTTQTCFDLKKEIEDLMRRGYLKEYVEESKETPDTDRNDMSPTQK
         H TQ  FDLK+E+E L+RRGYL+EYVEE K T + + ++ SP ++
Subjt:  GHTTQTCFDLKKEIEDLMRRGYLKEYVEESKETPDTDRNDMSPTQK

TrEMBL top hits (e value, % identity, alignment)
A0A6J1D3B7 uncharacterized protein LOC111016619 | e value: 6.3e-51 | % identity: 70.55
Query:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF
        +VFS TLTGS R+WFQQLKRKSISSF ELARAF+ QF GG NRS+ VAYLLTI++K TESLKDYVAR N+EKLQVE LTDAV LL F+S VKDE LVFSF
Subjt:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF

Query:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHGEKRHRSRMKASSSKRSK
        GKRT  TFSEA SRAQ YMS  ELIHS+ DP+ + A YN KR+R GEKRH  R + S S + +
Subjt:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHGEKRHRSRMKASSSKRSK

A0A6J1D7D2 uncharacterized protein LOC111018307 | e value: 2.6e-44 | % identity: 67.32
Query:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF
        +VFS TL+GSARVWF+QLKR SISSF  LA+AF+ QF+GGR+RS+ VAYLLTI+++ TESL DYVAR NEEKLQVE LT+AV+L+AF+S ++DEHL FSF
Subjt:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF

Query:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHGEKRHRSR
        GK+TPSTFSEALSRAQKYMSA E   S+ +PE + +  N  RER G+K   SR
Subjt:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHGEKRHRSR

A0A6J1DWY0 uncharacterized protein LOC111025293 | e value: 2.8e-67 | % identity: 58.94
Query:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF
        +VFS TL GSAR+WF+QLKR SISSF  LARAF+ QFVGGR RS+ VAYLLTI+++TTESL+DYVAR NEEKLQVE LTDAV+LLAF+S V+DEHL FSF
Subjt:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF

Query:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHG--------EKRHRS------------------------------------RMKA
        GKRTP+TFSEALSRAQ+YMSA E  +S+ +P+ +    + KRER G        EKR RS                                    RMKA
Subjt:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHG--------EKRHRS------------------------------------RMKA

Query:  SSSKRSKGKYCLFHRDHGHTTQTCFDLKKEIEDLMRRGYLKEYVEESKETPDTDRNDMSPTQK
        SS+KRSKG+YCLFHRDHGH TQ CFDLK+E+E L+RRGYLKEYVEE K T + + +D SP ++
Subjt:  SSSKRSKGKYCLFHRDHGHTTQTCFDLKKEIEDLMRRGYLKEYVEESKETPDTDRNDMSPTQK

A0A6J1DYL6 uncharacterized protein LOC111025785 | e value: 4.7e-54 | % identity: 54.07
Query:  LKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSFGKRTPSTFSEALSRAQK
        +KR SISSF  LARAF+ QFVGGR RS+ VAYLLTI+++TTESL DYVAR N+EKLQ+ESLTD V+LLAF+S V+DEHL FSFGK+TPSTFSE LSRAQ+
Subjt:  LKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSFGKRTPSTFSEALSRAQK

Query:  YMSARELIHSRYDPEEECAYYNAKRERHG--------EKRHRS------------------------------------RMKASSSKRSKGKYCLFHRDH
        YMSA E  +S+ +P+ +    + KRER G        EKR RS                                    RMK  S+KRSKG+YCLFHRDH
Subjt:  YMSARELIHSRYDPEEECAYYNAKRERHG--------EKRHRS------------------------------------RMKASSSKRSKGKYCLFHRDH

Query:  GHTTQTCFDLKKEIEDLMRRGYLKEYVEESKETPDTDRNDMSPTQK
         H TQ  FDLK+E+E L+RRGYL+EYVEE K T + + ++ SP ++
Subjt:  GHTTQTCFDLKKEIEDLMRRGYLKEYVEESKETPDTDRNDMSPTQK

A0A6J1DZ49 uncharacterized protein LOC111024851 | e value: 4.4e-52 | % identity: 74.84
Query:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF
        +VFS TLTGSAR WF+QLKR SISSF ELA AF+ QFVGGR +SK V YLLTI++K TESLK+YVAR NEEKLQVE LTDAVAL+AFVS VKDE LVFSF
Subjt:  KVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLKDYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSF

Query:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHGEKRHRSRMKASSS
        GKRTP TF EALSRAQKYMSA ELIH   D E E A Y+ KRER GEKRHRS  + S S
Subjt:  GKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHGEKRHRSRMKASSS

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
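
The % identity values in the hit lists can be related to the alignment rows. In BLAST-style protein alignments the middle row usually repeats the residue where query and subject agree, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts letter columns in the middle rows and divides by the number of alignment columns. The function name `percent_identity` is hypothetical, and treating the page's middle rows this way is an assumption about how the site renders its alignments. As a worked figure, 115 identical columns out of 163 would give 70.55%.

```python
def percent_identity(query_rows: list[str], midline_rows: list[str]) -> float:
    """Percent of alignment columns whose middle-row character is a letter.

    Rows are passed without the 'Query:' prefix. Each middle row is padded
    to the width of its query row, because trailing blanks are often
    stripped when a page is copied as text.
    """
    identical = 0
    columns = 0
    for query, midline in zip(query_rows, midline_rows):
        padded = midline.ljust(len(query))[: len(query)]
        identical += sum(1 for ch in padded if ch.isalpha())
        columns += len(query)
    if columns == 0:
        raise ValueError("no alignment columns supplied")
    return 100.0 * identical / columns


# Toy rows (not taken from this record): 3 identical columns out of 5.
print(round(percent_identity(["MKVLA"], ["MK+ A"]), 2))  # prints 60.0
```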

Sequences
CDS sequence
ATGGCCTGGTCGGTGCCGAGACCTTCTCGACACCAAGCAGTTCTCGGCCCCACGACAGAGAACTATAACGGTCCATCCTACCCTTTCGCAGTTACACTCGTATTAACTCG
TCTTAATTCTCAATCTGACCTGAAATCGACAATGAAAGTGTTCTCTCTCACCTTGACTGGCTCGGCACGCGTCTGGTTTCAACAGTTAAAAAGAAAATCGATCTCAAGCT
TCAGTGAATTGGCAAGAGCATTTATCATCCAATTCGTCGGAGGACGAAATAGAAGCAAGTCTGTAGCATACCTACTGACTATTGAGCGAAAGACAACAGAAAGCCTCAAA
GACTATGTGGCTCGGTTAAATGAAGAAAAACTACAAGTGGAGAGTTTGACAGATGCAGTGGCACTGCTAGCCTTTGTATCTTGTGTGAAAGACGAACACTTGGTATTCTC
TTTTGGGAAGAGAACTCCAAGCACTTTCTCGGAAGCTTTAAGTCGAGCCCAGAAGTACATGAGTGCTAGAGAATTGATTCACTCGAGATATGACCCTGAAGAAGAATGTG
CATATTATAACGCAAAAAGAGAGAGGCATGGAGAAAAGCGACACAGATCGAGGATGAAAGCATCCTCTAGTAAAAGGAGCAAAGGAAAATATTGTTTGTTCCATAGAGAT
CATGGCCACACGACACAGACTTGTTTTGACCTCAAAAAAGAAATAGAGGACCTCATGCGAAGAGGTTACCTCAAAGAATATGTAGAAGAGTCCAAGGAAACTCCAGACAC
AGATCGAAACGACATGTCACCTACCCAGAAGCAGGCCGATAAGTGGGATGTCAGTCAGTTAGTAAAACCAAGGAGAAATCCCAAAGGTGCTGAAAGACACGCATATAGGC
CCAGCAAGAGTAAGGGCTACTGTTATGGGAAGATAAAAGGAGAAGAGACCAAGCGACGGGAGGCCGAAAGGAACCCGAGAAGAAAGCACGAGCCAAGAAAGGGGGCCGGC
ACCCAACGACCCGAGCTAGCCCCCGACCGATCACTGGCTGGCGCCGAGCAGTTCGGGGTCGAAAACTATAACGATCCATCCGACCATTTCGCAGTTACACCCGTATTAAC
TCGCCTTGATTCTCAATCCGACCTAAAACCGACAATGAAGTAA
mRNA sequence
ATGGCCTGGTCGGTGCCGAGACCTTCTCGACACCAAGCAGTTCTCGGCCCCACGACAGAGAACTATAACGGTCCATCCTACCCTTTCGCAGTTACACTCGTATTAACTCG
TCTTAATTCTCAATCTGACCTGAAATCGACAATGAAAGTGTTCTCTCTCACCTTGACTGGCTCGGCACGCGTCTGGTTTCAACAGTTAAAAAGAAAATCGATCTCAAGCT
TCAGTGAATTGGCAAGAGCATTTATCATCCAATTCGTCGGAGGACGAAATAGAAGCAAGTCTGTAGCATACCTACTGACTATTGAGCGAAAGACAACAGAAAGCCTCAAA
GACTATGTGGCTCGGTTAAATGAAGAAAAACTACAAGTGGAGAGTTTGACAGATGCAGTGGCACTGCTAGCCTTTGTATCTTGTGTGAAAGACGAACACTTGGTATTCTC
TTTTGGGAAGAGAACTCCAAGCACTTTCTCGGAAGCTTTAAGTCGAGCCCAGAAGTACATGAGTGCTAGAGAATTGATTCACTCGAGATATGACCCTGAAGAAGAATGTG
CATATTATAACGCAAAAAGAGAGAGGCATGGAGAAAAGCGACACAGATCGAGGATGAAAGCATCCTCTAGTAAAAGGAGCAAAGGAAAATATTGTTTGTTCCATAGAGAT
CATGGCCACACGACACAGACTTGTTTTGACCTCAAAAAAGAAATAGAGGACCTCATGCGAAGAGGTTACCTCAAAGAATATGTAGAAGAGTCCAAGGAAACTCCAGACAC
AGATCGAAACGACATGTCACCTACCCAGAAGCAGGCCGATAAGTGGGATGTCAGTCAGTTAGTAAAACCAAGGAGAAATCCCAAAGGTGCTGAAAGACACGCATATAGGC
CCAGCAAGAGTAAGGGCTACTGTTATGGGAAGATAAAAGGAGAAGAGACCAAGCGACGGGAGGCCGAAAGGAACCCGAGAAGAAAGCACGAGCCAAGAAAGGGGGCCGGC
ACCCAACGACCCGAGCTAGCCCCCGACCGATCACTGGCTGGCGCCGAGCAGTTCGGGGTCGAAAACTATAACGATCCATCCGACCATTTCGCAGTTACACCCGTATTAAC
TCGCCTTGATTCTCAATCCGACCTAAAACCGACAATGAAGTAA
Protein sequence
MAWSVPRPSRHQAVLGPTTENYNGPSYPFAVTLVLTRLNSQSDLKSTMKVFSLTLTGSARVWFQQLKRKSISSFSELARAFIIQFVGGRNRSKSVAYLLTIERKTTESLK
DYVARLNEEKLQVESLTDAVALLAFVSCVKDEHLVFSFGKRTPSTFSEALSRAQKYMSARELIHSRYDPEEECAYYNAKRERHGEKRHRSRMKASSSKRSKGKYCLFHRD
HGHTTQTCFDLKKEIEDLMRRGYLKEYVEESKETPDTDRNDMSPTQKQADKWDVSQLVKPRRNPKGAERHAYRPSKSKGYCYGKIKGEETKRREAERNPRRKHEPRKGAG
TQRPELAPDRSLAGAEQFGVENYNDPSDHFAVTPVLTRLDSQSDLKPTMK
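
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the record itself does not say which table was used, and the function name `translate` is hypothetical. Codons containing characters outside A, C, G, T are reported as `X`.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    sequence = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE.get(sequence[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 36 bases of the CDS listed above.
print(translate("ATGGCCTGGTCGGTGCCGAGACCTTCTCGACACCAA"))  # prints MAWSVPRPSRHQ
```

Passing the full CDS, with its line breaks left in, should reproduce the protein sequence shown above if the standard-code assumption holds.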