; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g30440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g30440
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr6:22875811..22880245
RNA-Seq ExpressionMoc06g30440
SyntenyMoc06g30440
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAT38747.2 Polyprotein, putative [Solanum demissum]1.0e-3850.6Show/hide
Query:  DDHITDDPSKDESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNAL
        DDH+  DP  D++KKAWL D+ARL+LQI NSI+ +++GLVN CE+VKEL+ YLE+LYSGKGN+SRI+E+ K  Y+ E   +SLT YFME K+ Y E N L
Subjt:  DDHITDDPSKDESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNAL

Query:  LPDSNDPKVRLAQCEQLTVIGFLLGLPARFDSGE----RSQEDDDILVYFIVSPSTEELPSNTSSYVL
        LP S D KV+ AQ EQ+ ++ FL GLP+ F++ +     S E   +   F     TE  P+N  + VL
Subjt:  LPDSNDPKVRLAQCEQLTVIGFLLGLPARFDSGE----RSQEDDDILVYFIVSPSTEELPSNTSSYVL

KAF9661591.1 hypothetical protein SADUNF_Sadunf19G0084700 [Salix dunnii]1.9e-3757.14Show/hide
Query:  DDHITDDPSKD-ESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNA
        DDH+ DDP  D  +KKAWL D+AR+ LQI NSI+ ++I LVN CE+VK+LL YL FLYSGKGN+SRI+++CK  Y+P+  ++SLT YFM+ KR+Y E N+
Subjt:  DDHITDDPSKD-ESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNA

Query:  LLPDSNDPKVRLAQCEQLTVIGFLLGLPARFDS
        LLP S D K + +Q EQ+ V+ FL GLP  FD+
Subjt:  LLPDSNDPKVRLAQCEQLTVIGFLLGLPARFDS

KAF9662291.1 hypothetical protein SADUNF_Sadunf18G0037700 [Salix dunnii]1.9e-3757.14Show/hide
Query:  DDHITDDPSKD-ESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNA
        DDH+ DDP  D  +KKAWL D+AR+ LQI NSI+ ++I LVN CE+VK+LL YL FLYSGKGN+SRI+++CK  Y+P+  ++SLT YFM+ KR+Y E N+
Subjt:  DDHITDDPSKD-ESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNA

Query:  LLPDSNDPKVRLAQCEQLTVIGFLLGLPARFDS
        LLP S D K + +Q EQ+ V+ FL GLP  FD+
Subjt:  LLPDSNDPKVRLAQCEQLTVIGFLLGLPARFDS

KAF9689200.1 hypothetical protein SADUNF_Sadunf01G0066900 [Salix dunnii]1.9e-3757.14Show/hide
Query:  DDHITDDPSKD-ESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNA
        DDH+ DDP  D  +KKAWL D+AR+ LQI NSI+ ++I LVN CE+VK+LL YL FLYSGKGN+SRI+++CK  Y+P+  ++SLT YFM+ KR+Y E N+
Subjt:  DDHITDDPSKD-ESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNA

Query:  LLPDSNDPKVRLAQCEQLTVIGFLLGLPARFDS
        LLP S D K + +Q EQ+ V+ FL GLP  FD+
Subjt:  LLPDSNDPKVRLAQCEQLTVIGFLLGLPARFDS

XP_038882618.1 uncharacterized protein LOC120073824 [Benincasa hispida]3.2e-3755.3Show/hide
Query:  MDDHITDDPSKDESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNA
        MDDHIT++   D +KK W  D++R++LQI NSI+ +I+ LVN CE VK+LL+YL+FLYSGK N++R+F++CK  YQP+ G++SLT+YFME K   AEFNA
Subjt:  MDDHITDDPSKDESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNA

Query:  LLPDSNDPKVRLAQCEQLTVIGFLLGLPARFD
        L+P S DPKV +A+CE+L ++ FL+GL  +++
Subjt:  LLPDSNDPKVRLAQCEQLTVIGFLLGLPARFD

TrEMBL top hitse value%identityAlignment
A0A438DZQ8 Retrovirus-related Pol polyprotein from transposon TNT 1-946.6e-3645.83Show/hide
Query:  DDHITDDPSKDESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNAL
        DDH+T++P  D ++K W+ D+ARL LQ+ NSI  DI+GL++ CE+VKEL+ YL+FLYSGKGNVS+++++    + PE G +SLT YFM+ K++Y E NAL
Subjt:  DDHITDDPSKDESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNAL

Query:  LPDSNDPKVRLAQCEQLTVIGFLLGLPARFDSGE----RSQEDDDILVYFIVSPSTEELPSNTSSYVL
        +P S D +V+ AQ EQ+TV+ FL GLP+ F++ +       + D +   F     TE + S+  + VL
Subjt:  LPDSNDPKVRLAQCEQLTVIGFLLGLPARFDSGE----RSQEDDDILVYFIVSPSTEELPSNTSSYVL

A0A438HPS2 Retrovirus-related Pol polyprotein from transposon TNT 1-948.6e-3652.24Show/hide
Query:  DDHITDDPSKDESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNAL
        DDH+T++P  D ++K W+ D+ARL+LQ+ NSI  DI+GL + CE+VKEL+ YL+FLYSGKGNVSR++++    + PE G +SLT YFM+ K++Y E NAL
Subjt:  DDHITDDPSKDESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNAL

Query:  LPDSNDPKVRLAQCEQLTVIGFLLGLPARFDSGE
        +P S D +V+ AQ EQ+ V+ FL GLP+ F++ +
Subjt:  LPDSNDPKVRLAQCEQLTVIGFLLGLPARFDSGE

A0A5N5KWB0 Uncharacterized protein1.7e-3638.03Show/hide
Query:  DDHITDDPSKD-ESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNA
        DDH+ DDP  D  +KKAWL D+AR+ LQI NSI+ ++IGLVN CE+VK LL YL FLYSGKGN+SRI+++CK  Y PE  D+ LT YFM+ KR+Y E N+
Subjt:  DDHITDDPSKD-ESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNA

Query:  LLPDSNDPKVRLAQCEQLTVIGFLL-----------GLPARFDSGERSQEDDD---------------------------ILVYFIVSP-----------
        LLP S D K + +Q EQ+ V+ FL             L +R  S E  +   D                           ++ Y+   P           
Subjt:  LLPDSNDPKVRLAQCEQLTVIGFLL-----------GLPARFDSGERSQEDDD---------------------------ILVYFIVSP-----------

Query:  ---------------------------------STEE----------LPSNTSSYVLDPSLPTITQVYSRRQPPTDSCPIPTASSSEDPGTSDDLPIALR
                                         S +E          L SN S+  +D SL  ITQVYSRR PP +SCP P A +S DP  S DLPIA+R
Subjt:  ---------------------------------STEE----------LPSNTSSYVLDPSLPTITQVYSRRQPPTDSCPIPTASSSEDPGTSDDLPIALR

Query:  KGSTN
        K S +
Subjt:  KGSTN

A5AWD0 Uncharacterized protein1.1e-3552.24Show/hide
Query:  DDHITDDPSKDESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNAL
        DDH+T++P  D ++K W+ D+ARL LQ+ NSI  DI+GL++ CE+VKEL+ YL+FLYSGKGNVSR++++    + PE G +SLT YFM+ K++Y E NAL
Subjt:  DDHITDDPSKDESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNAL

Query:  LPDSNDPKVRLAQCEQLTVIGFLLGLPARFDSGE
        +P S D +V+ AQ EQ+ V+ FL GLP+ F++ +
Subjt:  LPDSNDPKVRLAQCEQLTVIGFLLGLPARFDSGE

Q6L3Q0 Polyprotein, putative4.9e-3950.6Show/hide
Query:  DDHITDDPSKDESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNAL
        DDH+  DP  D++KKAWL D+ARL+LQI NSI+ +++GLVN CE+VKEL+ YLE+LYSGKGN+SRI+E+ K  Y+ E   +SLT YFME K+ Y E N L
Subjt:  DDHITDDPSKDESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNAL

Query:  LPDSNDPKVRLAQCEQLTVIGFLLGLPARFDSGE----RSQEDDDILVYFIVSPSTEELPSNTSSYVL
        LP S D KV+ AQ EQ+ ++ FL GLP+ F++ +     S E   +   F     TE  P+N  + VL
Subjt:  LPDSNDPKVRLAQCEQLTVIGFLLGLPARFDSGE----RSQEDDDILVYFIVSPSTEELPSNTSSYVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGATCACATAACTGATGATCCATCTAAGGATGAGAGCAAGAAAGCTTGGTTGTGGGATAATGCTCGACTGGTTCTGCAGATCAATAATTCGATCGAGGGT
GACATTATTGGCTTGGTTAATGAATGTGAGTATGTTAAAGAGTTGCTTAAATATTTAGAATTCCTTTATTCTGGAAAAGGAAATGTTAGTCGAATATTTGAAATC
TGCAAGCGCTGCTACCAACCTGAGTTTGGTGACCAATCTCTTACAAATTACTTTATGGAACACAAACGAATTTATGCAGAGTTTAATGCATTACTCCCAGATAGT
AATGATCCAAAAGTTCGGCTTGCTCAATGCGAACAACTAACAGTTATCGGTTTTCTTCTTGGTCTTCCAGCTAGATTTGATTCAGGGGAGAGGTCACAAGAAGAT
GATGACATTCTTGTCTATTTCATTGTCTCTCCTTCTACTGAAGAGCTTCCTAGCAATACATCTTCCTATGTGCTTGATCCTTCTCTTCCCACCATTACTCAAGTT
TATTCTCGTCGGCAACCTCCTACGGACTCATGCCCTATACCAACAGCTTCTTCGTCCGAGGATCCAGGAACAAGTGATGACCTTCCTATTGCTCTTAGAAAAGGG
TCAACCAATGAAGAAGAGGATTTCCACTTCAACATGATGAGAACTATCAAGATTCATAAAGATGGAGAACAAGTGGAGAAAGGAACTTCAACGGAACATGAAGAC
ACCAAACATGAGAGCTCCAATTTAGGGATTATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACGATCACATAACTGATGATCCATCTAAGGATGAGAGCAAGAAAGCTTGGTTGTGGGATAATGCTCGACTGGTTCTGCAGATCAATAATTCGATCGAGGGT
GACATTATTGGCTTGGTTAATGAATGTGAGTATGTTAAAGAGTTGCTTAAATATTTAGAATTCCTTTATTCTGGAAAAGGAAATGTTAGTCGAATATTTGAAATC
TGCAAGCGCTGCTACCAACCTGAGTTTGGTGACCAATCTCTTACAAATTACTTTATGGAACACAAACGAATTTATGCAGAGTTTAATGCATTACTCCCAGATAGT
AATGATCCAAAAGTTCGGCTTGCTCAATGCGAACAACTAACAGTTATCGGTTTTCTTCTTGGTCTTCCAGCTAGATTTGATTCAGGGGAGAGGTCACAAGAAGAT
GATGACATTCTTGTCTATTTCATTGTCTCTCCTTCTACTGAAGAGCTTCCTAGCAATACATCTTCCTATGTGCTTGATCCTTCTCTTCCCACCATTACTCAAGTT
TATTCTCGTCGGCAACCTCCTACGGACTCATGCCCTATACCAACAGCTTCTTCGTCCGAGGATCCAGGAACAAGTGATGACCTTCCTATTGCTCTTAGAAAAGGG
TCAACCAATGAAGAAGAGGATTTCCACTTCAACATGATGAGAACTATCAAGATTCATAAAGATGGAGAACAAGTGGAGAAAGGAACTTCAACGGAACATGAAGAC
ACCAAACATGAGAGCTCCAATTTAGGGATTATTTAG
Protein sequenceShow/hide protein sequence
MDDHITDDPSKDESKKAWLWDNARLVLQINNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRCYQPEFGDQSLTNYFMEHKRIYAEFNALLPDS
NDPKVRLAQCEQLTVIGFLLGLPARFDSGERSQEDDDILVYFIVSPSTEELPSNTSSYVLDPSLPTITQVYSRRQPPTDSCPIPTASSSEDPGTSDDLPIALRKG
STNEEEDFHFNMMRTIKIHKDGEQVEKGTSTEHEDTKHESSNLGII