CuGenDBv2

Moc03g06600 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g06600
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr3:4729055..4732122
RNA-Seq Expression: Moc03g06600
Synteny: Moc03g06600
Gene Ontology terms: NA
InterPro domains: NA
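
The genome location field can be split into chromosome, start, and end for downstream use. The sketch below is a minimal example, not part of the database: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("chr3:4729055..4732122")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 3068 bp on chr3.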


Homology
GenBank top hits | e value | % identity | Alignment
RVW99369.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 3.6e-09 | 30.72
Query:  GVLKFNGENFSFWKMQVKDLLTCKKIH-KTLERDQQGMVDKDWNEMDKQAVSNIRMSLSM----NVVLEG----------------------MSVKIDEE
        G+ KF+G NF++W+MQ++D    +K+H   L    + M  ++W  +D+Q +  IR++LS     NVV E                       + +  D+E
Subjt:  GVLKFNGENFSFWKMQVKDLLTCKKIH-KTLERDQQGMVDKDWNEMDKQAVSNIRMSLSM----NVVLEG----------------------MSVKIDEE

Query:  VKAMRLLTSLPDSWETMKTAVKFKEDLEKGNTTTNVVTKEERIEEFLACVQRNISVDQWVVDTAAS
        ++A+ +L SLP+SWE M+ A K ++D      + N VT+E      LA    +  +D WV+D+ AS
Subjt:  VKAMRLLTSLPDSWETMKTAVKFKEDLEKGNTTTNVVTKEERIEEFLACVQRNISVDQWVVDTAAS

VFQ58251.1 unnamed protein product [Cuscuta campestris] | 3.6e-09 | 37.84
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNV------------VLEGMS--------VKIDEEVKAMRLLTSL
        KF+G +FS+W+MQ++DLL  K +   L    + M D DW  +D++ +S IR+SL+ NV            ++E +S        +K D+EV+A+ LL+SL
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNV------------VLEGMS--------VKIDEEVKAMRLLTSL

Query:  PDSWETMKTAV
        PD+W    TAV
Subjt:  PDSWETMKTAV

VFQ85624.1 unnamed protein product [Cuscuta campestris] | 2.5e-10 | 40.57
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNVVLE---------------GMSVKIDEEVKAMRLLTSLPDSWE
        KF+G +FS+WKMQ++DLL  K +   L    + M D DW  +D++A+S IR+SL+ NV                   + +K D+EV+A+ LL+SLPDSW 
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNVVLE---------------GMSVKIDEEVKAMRLLTSLPDSWE

Query:  TMKTAV
           TAV
Subjt:  TMKTAV

VFQ90050.1 unnamed protein product, partial [Cuscuta campestris] | 9.4e-10 | 39.09
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNV-------------------VLEGMSVKIDEEVKAMRLLTSLP
        KF+G +FS+WKMQ++DLL  K +   L    + M D +W  +D++A+S IR+SL+ NV                   +L  + +K D+EV+A+ LL+SLP
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNV-------------------VLEGMSVKIDEEVKAMRLLTSLP

Query:  DSWETMKTAV
        DSW    TAV
Subjt:  DSWETMKTAV

VFQ93322.1 unnamed protein product [Cuscuta campestris] | 1.2e-09 | 38.18
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNV-------------------VLEGMSVKIDEEVKAMRLLTSLP
        KF+G +FS+W+MQ++DLL  K +   L    + M D DW  +D++A+S IR+SL+ NV                   +L  + +K D+EV+A+ LL+SLP
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNV-------------------VLEGMSVKIDEEVKAMRLLTSLP

Query:  DSWETMKTAV
        D+W    TAV
Subjt:  DSWETMKTAV

TrEMBL top hits | e value | % identity | Alignment
A0A438IRM6 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.7e-09 | 30.72
Query:  GVLKFNGENFSFWKMQVKDLLTCKKIH-KTLERDQQGMVDKDWNEMDKQAVSNIRMSLSM----NVVLEG----------------------MSVKIDEE
        G+ KF+G NF++W+MQ++D    +K+H   L    + M  ++W  +D+Q +  IR++LS     NVV E                       + +  D+E
Subjt:  GVLKFNGENFSFWKMQVKDLLTCKKIH-KTLERDQQGMVDKDWNEMDKQAVSNIRMSLSM----NVVLEG----------------------MSVKIDEE

Query:  VKAMRLLTSLPDSWETMKTAVKFKEDLEKGNTTTNVVTKEERIEEFLACVQRNISVDQWVVDTAAS
        ++A+ +L SLP+SWE M+ A K ++D      + N VT+E      LA    +  +D WV+D+ AS
Subjt:  VKAMRLLTSLPDSWETMKTAVKFKEDLEKGNTTTNVVTKEERIEEFLACVQRNISVDQWVVDTAAS

A0A484JXZ8 CCHC-type domain-containing protein | 1.7e-09 | 37.84
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNV------------VLEGMS--------VKIDEEVKAMRLLTSL
        KF+G +FS+W+MQ++DLL  K +   L    + M D DW  +D++ +S IR+SL+ NV            ++E +S        +K D+EV+A+ LL+SL
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNV------------VLEGMS--------VKIDEEVKAMRLLTSL

Query:  PDSWETMKTAV
        PD+W    TAV
Subjt:  PDSWETMKTAV

A0A484MBE7 CCHC-type domain-containing protein | 1.2e-10 | 40.57
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNVVLE---------------GMSVKIDEEVKAMRLLTSLPDSWE
        KF+G +FS+WKMQ++DLL  K +   L    + M D DW  +D++A+S IR+SL+ NV                   + +K D+EV+A+ LL+SLPDSW 
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNVVLE---------------GMSVKIDEEVKAMRLLTSLPDSWE

Query:  TMKTAV
           TAV
Subjt:  TMKTAV

A0A484MMH6 Uncharacterized protein (Fragment) | 4.5e-10 | 39.09
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNV-------------------VLEGMSVKIDEEVKAMRLLTSLP
        KF+G +FS+WKMQ++DLL  K +   L    + M D +W  +D++A+S IR+SL+ NV                   +L  + +K D+EV+A+ LL+SLP
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNV-------------------VLEGMSVKIDEEVKAMRLLTSLP

Query:  DSWETMKTAV
        DSW    TAV
Subjt:  DSWETMKTAV

A0A484MWS8 CCHC-type domain-containing protein | 5.9e-10 | 38.18
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNV-------------------VLEGMSVKIDEEVKAMRLLTSLP
        KF+G +FS+W+MQ++DLL  K +   L    + M D DW  +D++A+S IR+SL+ NV                   +L  + +K D+EV+A+ LL+SLP
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNV-------------------VLEGMSVKIDEEVKAMRLLTSLP

Query:  DSWETMKTAV
        D+W    TAV
Subjt:  DSWETMKTAV

SwissProt top hits | e value | % identity | Alignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 5.5e-05 | 26.55
Query:  VLKFNGEN-FSFWKMQVKDLLTCKKIHKTLERDQQ---GMVDKDWNEMDKQAVSNIRMSLSMNVV-----------------------------------
        V KFNG+N FS W+ +++DLL  + +HK L+ D +    M  +DW ++D++A S IR+ LS +VV                                   
Subjt:  VLKFNGEN-FSFWKMQVKDLLTCKKIHKTLERDQQ---GMVDKDWNEMDKQAVSNIRMSLSMNVV-----------------------------------

Query:  ------------------------LEGMSVKIDEEVKAMRLLTSLPDSWETMKTAVKF-KEDLEKGNTTTNVVTKEE
                                L  + VKI+EE KA+ LL SLP S++ + T +   K  +E  + T+ ++  E+
Subjt:  ------------------------LEGMSVKIDEEVKAMRLLTSLPDSWETMKTAVKF-KEDLEKGNTTTNVVTKEE

Arabidopsis top hits | e value | % identity | Alignment
AT3G29785.1 unknown protein | 5.7e-05 | 26.8
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNVVLEGMSVKIDEEVKAMRLLTSLPDSWETMKTAVKFKEDL
        K +G ++SF +M+++D L  KK+H+ L +  + M   DWN + +Q +  IR+++S N+       K  + +  M++L+ +     T  T +  +E +
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNVVLEGMSVKIDEEVKAMRLLTSLPDSWETMKTAVKFKEDL


Sequences
CDS sequence
ATGTCAAGGGTCGATGCACAGTTCAAGGCTTTGGGGATAAATGGCAAGGTCGAACGCCAAGCTTCTGTAGATGAGTGTCGGGCCTTGGGGATAGATGGCAAGGCC
CGATATGATTCGAGGAGTCCAACGTCGAGCACCGTAGAGAATTGTGGATGTCTTGGCTATGCGAGGTCAGGGAGCCAGAAGGCTGACACGTCACTAGTTGGGTAC
GGTATCGAGGGTCTTCAATTTCCCAACAAGTGGTATCAGAGCTGTAAGGTTGCAGCATCATGGGTGACAGATATCAAGTACCACCATTATTATGAAGGGGTCCTC
AAGTTCAATGGAGAGAATTTCAGTTTTTGGAAGATGCAAGTAAAGGATCTTCTTACATGCAAAAAGATACACAAGACTTTGGAGAGAGACCAGCAGGGAATGGTA
GACAAGGATTGGAATGAGATGGATAAGCAGGCCGTTTCGAACATCAGAATGTCGTTGTCGATGAACGTTGTATTAGAAGGGATGAGTGTCAAGATTGACGAGGAG
GTAAAAGCTATGAGGCTGTTGACGTCTTTGCCTGACAGTTGGGAGACGATGAAGACCGCGGTGAAGTTCAAAGAAGATCTTGAGAAGGGGAACACTACTACAAAT
GTTGTAACAAAAGAAGAACGGATTGAAGAGTTTCTGGCTTGTGTTCAGAGGAACATATCTGTAGACCAGTGGGTGGTTGACACTGCAGCATCAGCGCACATTTAG
mRNA sequence
ATGTCAAGGGTCGATGCACAGTTCAAGGCTTTGGGGATAAATGGCAAGGTCGAACGCCAAGCTTCTGTAGATGAGTGTCGGGCCTTGGGGATAGATGGCAAGGCC
CGATATGATTCGAGGAGTCCAACGTCGAGCACCGTAGAGAATTGTGGATGTCTTGGCTATGCGAGGTCAGGGAGCCAGAAGGCTGACACGTCACTAGTTGGGTAC
GGTATCGAGGGTCTTCAATTTCCCAACAAGTGGTATCAGAGCTGTAAGGTTGCAGCATCATGGGTGACAGATATCAAGTACCACCATTATTATGAAGGGGTCCTC
AAGTTCAATGGAGAGAATTTCAGTTTTTGGAAGATGCAAGTAAAGGATCTTCTTACATGCAAAAAGATACACAAGACTTTGGAGAGAGACCAGCAGGGAATGGTA
GACAAGGATTGGAATGAGATGGATAAGCAGGCCGTTTCGAACATCAGAATGTCGTTGTCGATGAACGTTGTATTAGAAGGGATGAGTGTCAAGATTGACGAGGAG
GTAAAAGCTATGAGGCTGTTGACGTCTTTGCCTGACAGTTGGGAGACGATGAAGACCGCGGTGAAGTTCAAAGAAGATCTTGAGAAGGGGAACACTACTACAAAT
GTTGTAACAAAAGAAGAACGGATTGAAGAGTTTCTGGCTTGTGTTCAGAGGAACATATCTGTAGACCAGTGGGTGGTTGACACTGCAGCATCAGCGCACATTTAG
Protein sequence
MSRVDAQFKALGINGKVERQASVDECRALGIDGKARYDSRSPTSSTVENCGCLGYARSGSQKADTSLVGYGIEGLQFPNKWYQSCKVAASWVTDIKYHHYYEGVL
KFNGENFSFWKMQVKDLLTCKKIHKTLERDQQGMVDKDWNEMDKQAVSNIRMSLSMNVVLEGMSVKIDEEVKAMRLLTSLPDSWETMKTAVKFKEDLEKGNTTTN
VVTKEERIEEFLACVQRNISVDQWVVDTAASAHI
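
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below is a minimal example that assumes the standard genetic code (NCBI translation table 1); the `translate` helper and the `CODON_TABLE` name are hypothetical and not part of CuGenDB. For brevity it translates only the first and last 18 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code (assumed), laid out in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


head = translate("ATGTCAAGGGTCGATGCA")  # first 18 nt of the CDS above
tail = translate("GCATCAGCGCACATTTAG")  # last 18 nt of the CDS above, including the stop codon
print(head, tail)
```

The two fragments translate to the start (MSRVDA) and end (ASAHI) of the listed protein sequence. Passing the full CDS to `translate` gives the same kind of check for the whole record.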