; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g15840 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g15840
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr2:11923575..11927324
RNA-Seq ExpressionMoc02g15840
SyntenyMoc02g15840
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia]1.2e-3447.28Show/hide
Query:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRL--------RPS---------------------
        MS  KQL KS+VDRLVEIEEQLL+LRE+P+ L  +E+ +DE S K   ID VN RIDGL ++++ +R+        RP                      
Subjt:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRL--------RPS---------------------

Query:  ---------KMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE
                 ++ +E+T++FK  ID ++AE+ E+ TRVNLTMRAVGN AP    + FNK+KVPEPKPF G RDAK LENF+FD+E
Subjt:  ---------KMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia]2.2e-3345.65Show/hide
Query:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRL--------RPS---------------------
        MS  KQL KS+VDRLVEIEEQLL+LRE+P++L  +E+ +DE S K   ID VN R+DGL ++++ +R+        RP                      
Subjt:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRL--------RPS---------------------

Query:  ---------KMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE
                 ++ +E+T++FK  ID ++AE+ E+ TRVNLTMRAVGN AP    + FNK+KVPEPKPF G R  K LENF FD+E
Subjt:  ---------KMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia]1.5e-3452.17Show/hide
Query:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRLRP------------------------------
        MS  KQL KS++DRLVEIEE+LLFLREIP+NL YVES LDEISTKADGIDVVN RIDGLA+RELMLR+                                
Subjt:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRLRP------------------------------

Query:  --------SKMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE
                 +MVSELTD+FKA +D+M+AEIAELGT                             KPFCGARDAKALENFIFDLE
Subjt:  --------SKMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE

XP_022940258.1 uncharacterized protein LOC111445936 [Cucurbita moschata]2.1e-3143.48Show/hide
Query:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRLRP------------------------------
        MS  KQL KS  DRLV+IE+QLL+  E+ + +  +ES ++EI+TKAD I+ +  R++ ++V+ELM+R+                                
Subjt:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRLRP------------------------------

Query:  --------SKMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE
                 +MVSE++   +A I  +KAE+A+L T++N+TMRAV N  P GG IQ+ K+KVPEPKPFCG RDAKALENFIFD E
Subjt:  --------SKMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE

XP_023525752.1 uncharacterized protein LOC111789266 [Cucurbita pepo subsp. pepo]2.4e-3244.57Show/hide
Query:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRLRP------------------------------
        MS  KQL KS  DRLV+IE+QLL+  E+ + +  +ES ++EI+TKAD I+ +  R++ ++VRELM+R+                                
Subjt:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRLRP------------------------------

Query:  --------SKMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE
                 +MVSE++   +A I  +KAE+A+L T++N+TMRAVGN  P GG IQ+ K+KVPEPKPFCG RDAKALENFIFD E
Subjt:  --------SKMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE

TrEMBL top hitse value%identityAlignment
A0A6J1D906 Reverse transcriptase5.6e-3547.28Show/hide
Query:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRL--------RPS---------------------
        MS  KQL KS+VDRLVEIEEQLL+LRE+P+ L  +E+ +DE S K   ID VN RIDGL ++++ +R+        RP                      
Subjt:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRL--------RPS---------------------

Query:  ---------KMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE
                 ++ +E+T++FK  ID ++AE+ E+ TRVNLTMRAVGN AP    + FNK+KVPEPKPF G RDAK LENF+FD+E
Subjt:  ---------KMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE

A0A6J1DK29 uncharacterized protein LOC1110218291.1e-3345.65Show/hide
Query:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRL--------RPS---------------------
        MS  KQL KS+VDRLVEIEEQLL+LRE+P++L  +E+ +DE S K   ID VN R+DGL ++++ +R+        RP                      
Subjt:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRL--------RPS---------------------

Query:  ---------KMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE
                 ++ +E+T++FK  ID ++AE+ E+ TRVNLTMRAVGN AP    + FNK+KVPEPKPF G R  K LENF FD+E
Subjt:  ---------KMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE

A0A6J1DLQ6 uncharacterized protein LOC1110223207.4e-3552.17Show/hide
Query:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRLRP------------------------------
        MS  KQL KS++DRLVEIEE+LLFLREIP+NL YVES LDEISTKADGIDVVN RIDGLA+RELMLR+                                
Subjt:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRLRP------------------------------

Query:  --------SKMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE
                 +MVSELTD+FKA +D+M+AEIAELGT                             KPFCGARDAKALENFIFDLE
Subjt:  --------SKMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE

A0A6J1FNS3 uncharacterized protein LOC1114459361.0e-3143.48Show/hide
Query:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRLRP------------------------------
        MS  KQL KS  DRLV+IE+QLL+  E+ + +  +ES ++EI+TKAD I+ +  R++ ++V+ELM+R+                                
Subjt:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRLRP------------------------------

Query:  --------SKMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE
                 +MVSE++   +A I  +KAE+A+L T++N+TMRAV N  P GG IQ+ K+KVPEPKPFCG RDAKALENFIFD E
Subjt:  --------SKMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE

A0A6J1IU93 uncharacterized protein LOC1114806311.3e-3144.02Show/hide
Query:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRLRP------------------------------
        MS  KQL KS  DRLV+IE+QLL+  E+ + +  +ES ++EI+ KAD I+ +  R++ ++VRELM+R+                                
Subjt:  MSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDGLAVRELMLRLRP------------------------------

Query:  --------SKMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE
                 +MVSE++   +A I  +KAE+A+L T++NLT RAVGN  P GG IQ+ K+KVPEPKPFCG RDAKALENFIFD E
Subjt:  --------SKMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGAAGCGGCGGTTTCTGAGAGGCAAACCAGGCCGGCAGACGACGGCGGAAGGCAGAGCTCCGGCAACCGAACCATGTATCCGGGGCCGTTCTTTAGTTCTCCGAA
CAACCCTCCGAGCCTAGTTGGCTCATCGAGTTGGAAAACAGTAACCATGTCTGGGATAAAACAGTTGGACAAGTCCTACGTCGACAGACTCGTCGAGATCGAAGAACAAC
TGTTGTTCTTGAGGGAAATTCCTAACAACCTTGGATATGTGGAATCTTGGCTGGATGAGATCTCCACCAAAGCTGACGGAATTGACGTCGTAAATGTCCGCATAGATGGG
CTTGCTGTACGTGAGTTGATGCTTCGGTTGAGACCCTCGAAGATGGTCAGTGAGCTGACCGACAACTTCAAAGCCGTCATTGACGACATGAAGGCAGAGATTGCCGAATT
AGGCACCAGAGTAAATCTCACCATGAGAGCAGTGGGAAATCACGCCCCAACTGGGGGACCTATTCAGTTCAACAAGGTGAAAGTTCCCGAACCCAAGCCCTTTTGTGGGG
CGCGAGATGCTAAAGCCCTTGAGAACTTCATCTTCGACCTTGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTGAAGCGGCGGTTTCTGAGAGGCAAACCAGGCCGGCAGACGACGGCGGAAGGCAGAGCTCCGGCAACCGAACCATGTATCCGGGGCCGTTCTTTAGTTCTCCGAA
CAACCCTCCGAGCCTAGTTGGCTCATCGAGTTGGAAAACAGTAACCATGTCTGGGATAAAACAGTTGGACAAGTCCTACGTCGACAGACTCGTCGAGATCGAAGAACAAC
TGTTGTTCTTGAGGGAAATTCCTAACAACCTTGGATATGTGGAATCTTGGCTGGATGAGATCTCCACCAAAGCTGACGGAATTGACGTCGTAAATGTCCGCATAGATGGG
CTTGCTGTACGTGAGTTGATGCTTCGGTTGAGACCCTCGAAGATGGTCAGTGAGCTGACCGACAACTTCAAAGCCGTCATTGACGACATGAAGGCAGAGATTGCCGAATT
AGGCACCAGAGTAAATCTCACCATGAGAGCAGTGGGAAATCACGCCCCAACTGGGGGACCTATTCAGTTCAACAAGGTGAAAGTTCCCGAACCCAAGCCCTTTTGTGGGG
CGCGAGATGCTAAAGCCCTTGAGAACTTCATCTTCGACCTTGAGTAG
Protein sequenceShow/hide protein sequence
MSEAAVSERQTRPADDGGRQSSGNRTMYPGPFFSSPNNPPSLVGSSSWKTVTMSGIKQLDKSYVDRLVEIEEQLLFLREIPNNLGYVESWLDEISTKADGIDVVNVRIDG
LAVRELMLRLRPSKMVSELTDNFKAVIDDMKAEIAELGTRVNLTMRAVGNHAPTGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLE