CuGenDBv2

Moc10g20450 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc10g20450
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr10:15083750..15087995
RNA-Seq Expression: Moc10g20450
Synteny: Moc10g20450
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
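The genome location field can be split into its parts programmatically. The following is a minimal Python sketch, not part of the database record; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return chrom, start, end


def span_length(location: str) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    loc = "chr10:15083750..15087995"
    print(parse_location(loc))
    print(span_length(loc))
```

Under the inclusive-coordinate assumption, the span for this gene is 4246 bp.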


Homology
GenBank top hits (hit, e-value, % identity, alignment)
XP_022149799.1 uncharacterized protein LOC111018145 [Momordica charantia]; e-value 2.4e-57; identity 63.02%
Query:  ADPPPPPA----DPQVALLPEVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS---------------------CDDQFKVKGA
        A P  PPA    +PQ+ LL E LQA+INN  GV  VQA+PP+H H PQSEA+FIKDFK YGP TFD  S                     C+DQFKVKGA
Subjt:  ADPPPPPA----DPQVALLPEVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS---------------------CDDQFKVKGA

Query:  VFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPIESVKIKRCEK
        V MLR E LN WDS+A  EDHANVP+ W RFKDLL DYY+P+TVKD KEAEFLHL QGTL+VAQYERKF E SCFALELIP E++KIKR  K
Subjt:  VFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPIESVKIKRCEK

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]; e-value 2.3e-55; identity 42.03%
Query:  MSPRRSMRLPANVNPTLNGENVADPPPPPADPQVALLPEVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS-------------
        M PR SMRL A+          ADP P                      GV  VQA PP+H H PQSEA+FIKDFK YGP TFDG S             
Subjt:  MSPRRSMRLPANVNPTLNGENVADPPPPPADPQVALLPEVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS-------------

Query:  --------CDDQFKVKGAVFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPI
                C+DQFKVKGAVFMLR E LNWWDS+AA ED+ANVP+ WARFK+LL DYY+P+TVKD KEAEFLHL QGTL+VAQYERKFTE S FALELIP 
Subjt:  --------CDDQFKVKGAVFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPI

Query:  ESVKIKR--------------------------------------------------CEKKSPSEFCRLDSAVRERTAFCKG------------------
        E++KIKR                                                   ++K PS +  L     +R A  +G                  
Subjt:  ESVKIKR--------------------------------------------------CEKKSPSEFCRLDSAVRERTAFCKG------------------

Query:  -------------------VSHVGLERIGVRAAGLPTVSTQGGNQKARVFALTGKEAANAEAVV
                           +S    +R+G R    P VSTQG NQ+ARVFALT KEAA+AE VV
Subjt:  -------------------VSHVGLERIGVRAAGLPTVSTQGGNQKARVFALTGKEAANAEAVV

XP_022156330.1 uncharacterized protein LOC111023250 [Momordica charantia]; e-value 1.2e-53; identity 58%
Query:  PPPPPADPQVAL------------LP--EVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS---------------------CD
        P PPP++    +            LP  E L  +    +    +   PPRHFH PQSEAQFIKDFK YGP TFDGGS                     C+
Subjt:  PPPPPADPQVAL------------LP--EVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS---------------------CD

Query:  DQFKVKGAVFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPIESVKIKRCEK
        DQFKVKGAVFMLR + LNWWDS+AA EDHAN+PVTWARFKDLL DYY+P+TVKD KEAEFLH +QGTLTVAQYERKFTE S FA ELIP E++KIKR  K
Subjt:  DQFKVKGAVFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPIESVKIKRCEK

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]; e-value 4.4e-67; identity 63.89%
Query:  MSPRRSMRLPANVNPTLNGENVADPPPPPADPQVALLPEVLQ------ALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS-------
        M PRRSMRL A+V+P   GENVADPPPPP   Q  ++P          ALINNT GV   Q +PPRH H PQSEAQFIKDFK YGP TF GGS       
Subjt:  MSPRRSMRLPANVNPTLNGENVADPPPPPADPQVALLPEVLQ------ALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS-------

Query:  --------------CDDQFKVKGAVFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFA
                      C+DQFKVKGAVFMLR E LNWWDS+AATEDHANVPV WARFK+LL D+Y+ +TV+D KE EFLHL QGTLTVAQYERKFTE S FA
Subjt:  --------------CDDQFKVKGAVFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFA

Query:  LELIPIESVKIKRCEK
        LELIP E++KIKR  K
Subjt:  LELIPIESVKIKRCEK

XP_022158637.1 uncharacterized protein LOC111025088 [Momordica charantia]; e-value 2.0e-64; identity 69.95%
Query:  PPADPQVALLPEVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS---------------------CDDQFKVKGAVFMLRDEPL
        P  +PQVALL E LQALINNT GV   QA PPRHFH PQSEAQFIKDFK YGP TFDGGS                     C+DQFKVKG VFMLR E L
Subjt:  PPADPQVALLPEVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS---------------------CDDQFKVKGAVFMLRDEPL

Query:  NWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPIESVKIKRCEK
        NWWDS+A  EDHANVPV WARFKDLL DYY+P+TVKD KEAEFLHL QGTLTVAQYERKFTE S FALE IP E++KIKR  K
Subjt:  NWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPIESVKIKRCEK

TrEMBL top hits (hit, e-value, % identity, alignment)
A0A6J1D841 uncharacterized protein LOC111018145; e-value 1.2e-57; identity 63.02%
Query:  ADPPPPPA----DPQVALLPEVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS---------------------CDDQFKVKGA
        A P  PPA    +PQ+ LL E LQA+INN  GV  VQA+PP+H H PQSEA+FIKDFK YGP TFD  S                     C+DQFKVKGA
Subjt:  ADPPPPPA----DPQVALLPEVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS---------------------CDDQFKVKGA

Query:  VFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPIESVKIKRCEK
        V MLR E LN WDS+A  EDHANVP+ W RFKDLL DYY+P+TVKD KEAEFLHL QGTL+VAQYERKF E SCFALELIP E++KIKR  K
Subjt:  VFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPIESVKIKRCEK

A0A6J1DQ01 uncharacterized protein LOC111023250; e-value 6.0e-54; identity 58%
Query:  PPPPPADPQVAL------------LP--EVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS---------------------CD
        P PPP++    +            LP  E L  +    +    +   PPRHFH PQSEAQFIKDFK YGP TFDGGS                     C+
Subjt:  PPPPPADPQVAL------------LP--EVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS---------------------CD

Query:  DQFKVKGAVFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPIESVKIKRCEK
        DQFKVKGAVFMLR + LNWWDS+AA EDHAN+PVTWARFKDLL DYY+P+TVKD KEAEFLH +QGTLTVAQYERKFTE S FA ELIP E++KIKR  K
Subjt:  DQFKVKGAVFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPIESVKIKRCEK

A0A6J1DUM2 uncharacterized protein LOC111023247; e-value 1.1e-55; identity 42.03%
Query:  MSPRRSMRLPANVNPTLNGENVADPPPPPADPQVALLPEVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS-------------
        M PR SMRL A+          ADP P                      GV  VQA PP+H H PQSEA+FIKDFK YGP TFDG S             
Subjt:  MSPRRSMRLPANVNPTLNGENVADPPPPPADPQVALLPEVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS-------------

Query:  --------CDDQFKVKGAVFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPI
                C+DQFKVKGAVFMLR E LNWWDS+AA ED+ANVP+ WARFK+LL DYY+P+TVKD KEAEFLHL QGTL+VAQYERKFTE S FALELIP 
Subjt:  --------CDDQFKVKGAVFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPI

Query:  ESVKIKR--------------------------------------------------CEKKSPSEFCRLDSAVRERTAFCKG------------------
        E++KIKR                                                   ++K PS +  L     +R A  +G                  
Subjt:  ESVKIKR--------------------------------------------------CEKKSPSEFCRLDSAVRERTAFCKG------------------

Query:  -------------------VSHVGLERIGVRAAGLPTVSTQGGNQKARVFALTGKEAANAEAVV
                           +S    +R+G R    P VSTQG NQ+ARVFALT KEAA+AE VV
Subjt:  -------------------VSHVGLERIGVRAAGLPTVSTQGGNQKARVFALTGKEAANAEAVV

A0A6J1DVA0 uncharacterized protein LOC111023424; e-value 2.1e-67; identity 63.89%
Query:  MSPRRSMRLPANVNPTLNGENVADPPPPPADPQVALLPEVLQ------ALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS-------
        M PRRSMRL A+V+P   GENVADPPPPP   Q  ++P          ALINNT GV   Q +PPRH H PQSEAQFIKDFK YGP TF GGS       
Subjt:  MSPRRSMRLPANVNPTLNGENVADPPPPPADPQVALLPEVLQ------ALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS-------

Query:  --------------CDDQFKVKGAVFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFA
                      C+DQFKVKGAVFMLR E LNWWDS+AATEDHANVPV WARFK+LL D+Y+ +TV+D KE EFLHL QGTLTVAQYERKFTE S FA
Subjt:  --------------CDDQFKVKGAVFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFA

Query:  LELIPIESVKIKRCEK
        LELIP E++KIKR  K
Subjt:  LELIPIESVKIKRCEK

A0A6J1DXQ7 uncharacterized protein LOC111025088; e-value 9.9e-65; identity 69.95%
Query:  PPADPQVALLPEVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS---------------------CDDQFKVKGAVFMLRDEPL
        P  +PQVALL E LQALINNT GV   QA PPRHFH PQSEAQFIKDFK YGP TFDGGS                     C+DQFKVKG VFMLR E L
Subjt:  PPADPQVALLPEVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGS---------------------CDDQFKVKGAVFMLRDEPL

Query:  NWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPIESVKIKRCEK
        NWWDS+A  EDHANVPV WARFKDLL DYY+P+TVKD KEAEFLHL QGTLTVAQYERKFTE S FALE IP E++KIKR  K
Subjt:  NWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPIESVKIKRCEK

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAGCGTCGAGGCCTCGGGTATAAATGGTCGGGGGTCGATGCGAGGAGTCTTTCGAAAGGAGAACTATTGGGCCTTGTGTACAAATGGTCAAGGGCCAGTAGAC
GGTGAAGTCATTGGGGCCTCGGATAAGGGCTGCTTACTGAGTACTGTGGTTGTACTCATCCCTCTTTTTCCCCTCCAGTTCGTAGGTATCGAGCTAGCTCATGGG
ATGGTGATGGCGAGAAGGAGGCTTGATCGGAAAGACCTGAAATTCGGGGGCGTTACAGTTAGTATCAGAGCCAAAACGTTCCTGTGGACTGACCTAGTAACTAGG
GTGTATAGGAGTAGTGGTCCTGGTCGACCTCCTTGTCCTTACCAGACAATGTCACCCCGTCGTAGTATGAGGTTGCCTGCAAATGTCAATCCAACCCTCAATGGT
GAGAATGTGGCAGACCCACCGCCCCCTCCGGCTGATCCTCAGGTGGCGTTGCTTCCGGAGGTGTTGCAGGCGCTGATCAATAACACAGTTGGAGTTAGCAGTGTA
CAAGCTAAGCCACCCCGACATTTTCATGCTCCTCAAAGCGAAGCCCAATTCATCAAGGATTTCAAGCATTACGGACCCTCTACCTTTGATGGAGGAAGTTGCGAT
GACCAGTTCAAGGTTAAGGGTGCGGTTTTTATGTTGAGGGATGAGCCCCTGAATTGGTGGGACTCACTAGCAGCGACAGAAGACCATGCTAATGTACCGGTCACG
TGGGCAAGGTTCAAGGATTTGTTGTGTGACTACTATTTCCCGAAGACCGTGAAAGATGCAAAGGAGGCAGAGTTCCTCCATCTCACCCAAGGAACCCTGACGGTA
GCACAATATGAAAGAAAGTTTACAGAATTCTCCTGTTTTGCTCTAGAATTAATTCCCATCGAGTCAGTAAAGATCAAGAGGTGTGAAAAGAAAAGTCCCTCCGAA
TTTTGCCGACTAGACTCAGCAGTGCGGGAGAGAACTGCATTTTGCAAGGGAGTGTCACATGTCGGCCTCGAACGCATAGGGGTTAGGGCAGCAGGCCTCCCAACA
GTTTCGACGCAGGGAGGTAACCAGAAGGCTCGTGTTTTCGCACTTACCGGCAAGGAAGCAGCGAATGCCGAAGCCGTTGTCATAGCCGCTCAAAAACTTAATGGC
GAGATTTACAAACAATGGAAGTCGAATTTAAACACTATTCTCGTGATAGATGATCTTAGGTTCGTCTGGCAAGAGAATTGTCTTCAAGCTCCTGTGCCTAACGCC
ATTGTGGCAGTTCGTAACGTCTATGACAGGTGGATCAAGGCCAATGACAAAGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAG
GACACGGTCACTGCTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCCTCACAGGCTCGACATGAAGCTCTTAAGTTCGTTACAACTCCCGCA
TGA
mRNA sequence
ATGAGCGTCGAGGCCTCGGGTATAAATGGTCGGGGGTCGATGCGAGGAGTCTTTCGAAAGGAGAACTATTGGGCCTTGTGTACAAATGGTCAAGGGCCAGTAGAC
GGTGAAGTCATTGGGGCCTCGGATAAGGGCTGCTTACTGAGTACTGTGGTTGTACTCATCCCTCTTTTTCCCCTCCAGTTCGTAGGTATCGAGCTAGCTCATGGG
ATGGTGATGGCGAGAAGGAGGCTTGATCGGAAAGACCTGAAATTCGGGGGCGTTACAGTTAGTATCAGAGCCAAAACGTTCCTGTGGACTGACCTAGTAACTAGG
GTGTATAGGAGTAGTGGTCCTGGTCGACCTCCTTGTCCTTACCAGACAATGTCACCCCGTCGTAGTATGAGGTTGCCTGCAAATGTCAATCCAACCCTCAATGGT
GAGAATGTGGCAGACCCACCGCCCCCTCCGGCTGATCCTCAGGTGGCGTTGCTTCCGGAGGTGTTGCAGGCGCTGATCAATAACACAGTTGGAGTTAGCAGTGTA
CAAGCTAAGCCACCCCGACATTTTCATGCTCCTCAAAGCGAAGCCCAATTCATCAAGGATTTCAAGCATTACGGACCCTCTACCTTTGATGGAGGAAGTTGCGAT
GACCAGTTCAAGGTTAAGGGTGCGGTTTTTATGTTGAGGGATGAGCCCCTGAATTGGTGGGACTCACTAGCAGCGACAGAAGACCATGCTAATGTACCGGTCACG
TGGGCAAGGTTCAAGGATTTGTTGTGTGACTACTATTTCCCGAAGACCGTGAAAGATGCAAAGGAGGCAGAGTTCCTCCATCTCACCCAAGGAACCCTGACGGTA
GCACAATATGAAAGAAAGTTTACAGAATTCTCCTGTTTTGCTCTAGAATTAATTCCCATCGAGTCAGTAAAGATCAAGAGGTGTGAAAAGAAAAGTCCCTCCGAA
TTTTGCCGACTAGACTCAGCAGTGCGGGAGAGAACTGCATTTTGCAAGGGAGTGTCACATGTCGGCCTCGAACGCATAGGGGTTAGGGCAGCAGGCCTCCCAACA
GTTTCGACGCAGGGAGGTAACCAGAAGGCTCGTGTTTTCGCACTTACCGGCAAGGAAGCAGCGAATGCCGAAGCCGTTGTCATAGCCGCTCAAAAACTTAATGGC
GAGATTTACAAACAATGGAAGTCGAATTTAAACACTATTCTCGTGATAGATGATCTTAGGTTCGTCTGGCAAGAGAATTGTCTTCAAGCTCCTGTGCCTAACGCC
ATTGTGGCAGTTCGTAACGTCTATGACAGGTGGATCAAGGCCAATGACAAAGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAG
GACACGGTCACTGCTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCCTCACAGGCTCGACATGAAGCTCTTAAGTTCGTTACAACTCCCGCA
TGA
Protein sequence
MSVEASGINGRGSMRGVFRKENYWALCTNGQGPVDGEVIGASDKGCLLSTVVVLIPLFPLQFVGIELAHGMVMARRRLDRKDLKFGGVTVSIRAKTFLWTDLVTR
VYRSSGPGRPPCPYQTMSPRRSMRLPANVNPTLNGENVADPPPPPADPQVALLPEVLQALINNTVGVSSVQAKPPRHFHAPQSEAQFIKDFKHYGPSTFDGGSCD
DQFKVKGAVFMLRDEPLNWWDSLAATEDHANVPVTWARFKDLLCDYYFPKTVKDAKEAEFLHLTQGTLTVAQYERKFTEFSCFALELIPIESVKIKRCEKKSPSE
FCRLDSAVRERTAFCKGVSHVGLERIGVRAAGLPTVSTQGGNQKARVFALTGKEAANAEAVVIAAQKLNGEIYKQWKSNLNTILVIDDLRFVWQENCLQAPVPNA
IVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFVTTPA
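The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal Python sketch, not the database's own pipeline; it assumes the standard genetic code (NCBI translation table 1), and the `translate` helper is hypothetical. The short inputs in the example are the first 18 and last 15 nucleotides of the CDS shown in this record.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and other whitespace so multi-line sequences work.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGAGCGTCGAGGCCTCG"))  # start of the CDS
    print(translate("ACAACTCCCGCATGA"))     # end of the CDS, including TGA
```

Translating these two fragments gives MSVEAS and TTPA, which match the start and end of the protein sequence listed above. Passing the full CDS, with or without line breaks, should reproduce the whole protein if the standard-code assumption holds.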