; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g19340 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g19340
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr2:14383504..14384430
RNA-Seq ExpressionMoc02g19340
SyntenyMoc02g19340
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB5561215.1 hypothetical protein DKX38_006172 [Salix brachista]1.5e-4050Show/hide
Query:  GSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKVVISNSCGKEKLKFE
        G +F YWK QI DYL+ K+  LP L +KP+ M++ +W  LDR+VLG+IRL LT  V  +V  E TT  LM  LS I+E      +  ISNS GK KL ++
Subjt:  GSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKVVISNSCGKEKLKFE

Query:  DVRDASLGKEIRRKDSGIAPTS---------GHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVA-NHGK
        D+RD  LG+E+RR+D+G + TS         GH + NC  +KK E  +  AN VAEE H+AL+L+V S  ++WV+DSGASFHTT  ++IL+NYVA ++G 
Subjt:  DVRDASLGKEIRRKDSGIAPTS---------GHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVA-NHGK

Query:  VYLADG
        VYLADG
Subjt:  VYLADG

RVW62119.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]5.6e-4041.11Show/hide
Query:  FYGSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKV------------
        F G++F+YW+ QI DYL+ ++  LP L  KP+ M+  +W  LDR+VLG+IRL L+ +V  +V  E TT +LM ALS +YEKPS NNKV            
Subjt:  FYGSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKV------------

Query:  ------------------------------------------------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS-----------GHRRR
                                                         +SNS GKEKLK+ D+RD  L +EIRR+D+G    S           GH +R
Subjt:  ------------------------------------------------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS-----------GHRRR

Query:  NCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVA-NHGKVYLADG
         CK+ KK + ++  AN V EE+HDAL+LAV+S  + WV+DSGASFHTT  R+I++NYVA + GKVYLADG
Subjt:  NCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVA-NHGKVYLADG

RVW76343.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]8.4e-4447.6Show/hide
Query:  FYGSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKV------------
        F G++F+YW+ QI DYL+ ++  LP L  KP+ M+  +W  LDR+VLG+IRL L+ +V  +V  E TT +LM A S +YEKPS NNKV            
Subjt:  FYGSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKV------------

Query:  -------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS-----------GHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDS
                +SNS GKEKLK+ D+RD  L +EIRR+D+G    S           GH +R CK+ KK + ++  AN V EE+HDAL+ AV+S  + WV+DS
Subjt:  -------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS-----------GHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDS

Query:  GASFHTTGQRDILENYVA-NHGKVYLADG
        GASFHTT  R+I++NYVA + GKVYLADG
Subjt:  GASFHTTGQRDILENYVA-NHGKVYLADG

RVX04667.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]5.6e-4042.46Show/hide
Query:  FYGSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNK-----VVISNSCG
        F G++F+YW+ QI DYL+ ++  LP L  KP+ M+  +W  LDR+VLG+IRL L+ +V  +V  E TT +LM ALS +YEKPS NNK     + +SNS G
Subjt:  FYGSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNK-----VVISNSCG

Query:  KEKLKFEDVRDASLGKEIRRKDSGIAPTS------------------------------------------------GHRRRNCKALKKAEGKEAGANVV
        KEKLK+ D+RD  L +EIR++D+G    S                                                GH +R CK+ KK + ++  AN V
Subjt:  KEKLKFEDVRDASLGKEIRRKDSGIAPTS------------------------------------------------GHRRRNCKALKKAEGKEAGANVV

Query:  AEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENY-VANHGKVYLADG
         EE+ DAL+LAV+S  + WV+DSGASFHTT  R+I++NY V + GKVYLADG
Subjt:  AEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENY-VANHGKVYLADG

XP_022152845.1 uncharacterized protein LOC111020469 [Momordica charantia]4.4e-4545.27Show/hide
Query:  SNFSYWKDQIVDYLHSKQFELPLDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKV----------------
        SN ++    ++DYLHSK+ E PL+ KPDDM E +WKKLDRKVLG IRL LT NVQSSVA +TTTM LMNAL+N+YEK SVNNKV                
Subjt:  SNFSYWKDQIVDYLHSKQFELPLDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKV----------------

Query:  --------------------------------------------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS--------------------
                                                     ISNSC KEKLKFEDVRDA+L +EIRRKDSGIAPTS                    
Subjt:  --------------------------------------------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS--------------------

Query:  ----------------------GHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVANHGKVYLADG
                              GH + NCKA KK EG EA AN VAE+IHDALV+AVESAH+TWV+DSG                 NHGKVYLADG
Subjt:  ----------------------GHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVANHGKVYLADG

TrEMBL top hitse value%identityAlignment
A0A2N9GHB5 Uncharacterized protein3.8e-4240.41Show/hide
Query:  MTGEDKLVI----FYGSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNK
        MTGE+  V     F G++F YW+ QI DYL+ K+  LP L +KP+DME+ +W  LDR+VLG+IRL L+  V  +V  E TT ELM AL  +YEKPS NNK
Subjt:  MTGEDKLVI----FYGSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNK

Query:  V------------------------------------------------------------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS---
        V                                                             +SNS GK KLK+ D+RD  LG+E+RR+D+G   +S   
Subjt:  V------------------------------------------------------------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS---

Query:  ------------------GHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVA-NHGKVYLAD
                          GH R+NC  LKK + +   ANVV EE+HDAL+L+V+S   +WV+DSGASFHTT  R+I++NYVA + GKVYLAD
Subjt:  ------------------GHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVA-NHGKVYLAD

A0A2N9IKQ5 Uncharacterized protein5.3e-4442.75Show/hide
Query:  MTGEDKLVI----FYGSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNK
        MTGE+  V     F G++F YW+ QI DYL+ K+  LP L +KP+DME+ +W  LDR+VLG+IRL L+  V  +V  E TT ELM AL  +YEKPS NNK
Subjt:  MTGEDKLVI----FYGSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNK

Query:  V--------------------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS-------------------------------------------
        V                     +SNS GK KLK+ D+RD  LG+E+RR+D+G   +S                                           
Subjt:  V--------------------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS-------------------------------------------

Query:  --GHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVA-NHGKVYLAD
          GH R+NC  LKK + +   ANVV EE+HDAL+L+V+S   +WV+DSGASFHTT  R+I++NYVA + GKVYLAD
Subjt:  --GHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVA-NHGKVYLAD

A0A2N9JB15 Uncharacterized protein1.8e-4443.38Show/hide
Query:  MTGEDKLVI----FYGSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNK
        MTGE+  V     F G++F YW+ QI DYL+ K+  LP L +KP+DME+ +W  LDR+VLG+IRL L+  V  +V  E TT ELM AL  +YEKPS NNK
Subjt:  MTGEDKLVI----FYGSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNK

Query:  V----------------------------------------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS---------------------GH
        V                                         +SNS GK KLK+ D+RD  LG+E+RR+D+G   +S                     GH
Subjt:  V----------------------------------------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS---------------------GH

Query:  RRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVA-NHGKVYLAD
         R+NC  LKK + +   ANVV EE+HDAL+L+V+S   +WV+DSGASFHTT  R+I++NYVA + GKVYLAD
Subjt:  RRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVA-NHGKVYLAD

A0A438GVW5 Retrovirus-related Pol polyprotein from transposon TNT 1-944.1e-4447.6Show/hide
Query:  FYGSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKV------------
        F G++F+YW+ QI DYL+ ++  LP L  KP+ M+  +W  LDR+VLG+IRL L+ +V  +V  E TT +LM A S +YEKPS NNKV            
Subjt:  FYGSNFSYWKDQIVDYLHSKQFELP-LDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKV------------

Query:  -------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS-----------GHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDS
                +SNS GKEKLK+ D+RD  L +EIRR+D+G    S           GH +R CK+ KK + ++  AN V EE+HDAL+ AV+S  + WV+DS
Subjt:  -------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS-----------GHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDS

Query:  GASFHTTGQRDILENYVA-NHGKVYLADG
        GASFHTT  R+I++NYVA + GKVYLADG
Subjt:  GASFHTTGQRDILENYVA-NHGKVYLADG

A0A6J1DF43 uncharacterized protein LOC1110204692.1e-4545.27Show/hide
Query:  SNFSYWKDQIVDYLHSKQFELPLDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKV----------------
        SN ++    ++DYLHSK+ E PL+ KPDDM E +WKKLDRKVLG IRL LT NVQSSVA +TTTM LMNAL+N+YEK SVNNKV                
Subjt:  SNFSYWKDQIVDYLHSKQFELPLDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKV----------------

Query:  --------------------------------------------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS--------------------
                                                     ISNSC KEKLKFEDVRDA+L +EIRRKDSGIAPTS                    
Subjt:  --------------------------------------------VISNSCGKEKLKFEDVRDASLGKEIRRKDSGIAPTS--------------------

Query:  ----------------------GHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVANHGKVYLADG
                              GH + NCKA KK EG EA AN VAE+IHDALV+AVESAH+TWV+DSG                 NHGKVYLADG
Subjt:  ----------------------GHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVANHGKVYLADG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein2.4e-1239.22Show/hide
Query:  GSNFSYWKDQIVDYLHSKQFELPLDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKVVISNSCGKEKLKFED
        G+++S+ + +I DYL+ K+   PL KK + M +  W  L R+VL +IRL ++ N+  +VA E +   LM  LS+IY+KPS NN V+      +E +  ED
Subjt:  GSNFSYWKDQIVDYLHSKQFELPLDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKVVISNSCGKEKLKFED

Query:  VR
         R
Subjt:  VR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGGAGAAGATAAATTGGTTATTTTTTATGGATCTAATTTTTCGTACTGGAAGGATCAGATAGTAGATTATCTACATTCCAAGCAATTTGAATTACCATTAGACAA
GAAGCCGGATGACATGGAAGAAGTCAAATGGAAAAAGTTGGATAGGAAGGTGTTGGGTATGATTCGCCTAATATTAACAAACAATGTGCAGAGCAGCGTAGCTAATGAGA
CTACCACAATGGAGCTAATGAATGCACTGTCCAACATATATGAGAAGCCCTCGGTAAATAATAAGGTAGTTATTTCAAATTCTTGTGGAAAAGAGAAATTGAAATTTGAA
GATGTCAGAGATGCATCTCTTGGAAAGGAGATTCGAAGAAAGGATTCTGGTATTGCGCCTACTTCTGGACATCGGAGGAGGAACTGCAAAGCCCTAAAGAAAGCTGAGGG
TAAAGAAGCTGGTGCAAATGTTGTTGCTGAAGAAATACATGATGCTCTAGTTCTTGCAGTTGAAAGCGCTCATAACACATGGGTGGTGGATTCAGGTGCGTCTTTTCACA
CTACAGGACAACGTGATATTCTTGAAAACTATGTTGCAAATCATGGCAAGGTGTATCTTGCTGATGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGACTGGAGAAGATAAATTGGTTATTTTTTATGGATCTAATTTTTCGTACTGGAAGGATCAGATAGTAGATTATCTACATTCCAAGCAATTTGAATTACCATTAGACAA
GAAGCCGGATGACATGGAAGAAGTCAAATGGAAAAAGTTGGATAGGAAGGTGTTGGGTATGATTCGCCTAATATTAACAAACAATGTGCAGAGCAGCGTAGCTAATGAGA
CTACCACAATGGAGCTAATGAATGCACTGTCCAACATATATGAGAAGCCCTCGGTAAATAATAAGGTAGTTATTTCAAATTCTTGTGGAAAAGAGAAATTGAAATTTGAA
GATGTCAGAGATGCATCTCTTGGAAAGGAGATTCGAAGAAAGGATTCTGGTATTGCGCCTACTTCTGGACATCGGAGGAGGAACTGCAAAGCCCTAAAGAAAGCTGAGGG
TAAAGAAGCTGGTGCAAATGTTGTTGCTGAAGAAATACATGATGCTCTAGTTCTTGCAGTTGAAAGCGCTCATAACACATGGGTGGTGGATTCAGGTGCGTCTTTTCACA
CTACAGGACAACGTGATATTCTTGAAAACTATGTTGCAAATCATGGCAAGGTGTATCTTGCTGATGGATAG
Protein sequenceShow/hide protein sequence
MTGEDKLVIFYGSNFSYWKDQIVDYLHSKQFELPLDKKPDDMEEVKWKKLDRKVLGMIRLILTNNVQSSVANETTTMELMNALSNIYEKPSVNNKVVISNSCGKEKLKFE
DVRDASLGKEIRRKDSGIAPTSGHRRRNCKALKKAEGKEAGANVVAEEIHDALVLAVESAHNTWVVDSGASFHTTGQRDILENYVANHGKVYLADG