; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g05120 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g05120
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE2
Genome locationchr9:3945907..3947269
RNA-Seq ExpressionMoc09g05120
SyntenyMoc09g05120
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_021717245.1 uncharacterized protein LOC110685089 [Chenopodium quinoa]1.5e-3955.56Show/hide
Query:  KIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWD
        KI+TNSPFYLGP +R GDFIT  RLKL           ++         L+GTIT   P    DDW+ IH MLVSW M+TI PEV SMLS Y++AK LWD
Subjt:  KIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWD

Query:  YLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISCDRRKCTC
         L E F ++N P I QLK +ISRCE  K+MS+AVY++KL VL DELDKHE LI C   KC C
Subjt:  YLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISCDRRKCTC

XP_021746636.1 uncharacterized protein LOC110712479 [Chenopodium quinoa]1.8e-3753.09Show/hide
Query:  KIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWD
        KI+ NSPFYL   ++LG++IT +RLKL           ++         L+GTI  VVP    DDWV IH MLVSW M TIDPEV S+LSNY++AK LWD
Subjt:  KIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWD

Query:  YLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISCDRRKCTC
         LNE F ++N P I QLK +I+RCE +KNMS+A+YF KL VL D+L K+E LIS    +CTC
Subjt:  YLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISCDRRKCTC

XP_021746757.1 uncharacterized protein LOC110712595 [Chenopodium quinoa]4.2e-4256.44Show/hide
Query:  TKIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLW
        TKI+ NSP+YLGP +R GDFIT  RLKL           ++         L+GTIT   P    DDWV IH MLVSW M+TIDPEV +MLSNY +AK LW
Subjt:  TKIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLW

Query:  DYLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISCDRRKCTC
        D L E F ++N P I Q+K +I+RCE SK+M +AVYF+KL VL DELDKHE LI C R KCTC
Subjt:  DYLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISCDRRKCTC

XP_021756883.1 uncharacterized protein LOC110721955 [Chenopodium quinoa]6.7e-4054.55Show/hide
Query:  KIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWD
        KI++NSP+YLGP +R GDFIT  RLKL           ++         L+GTIT   P    DDWV IH MLVS  M+TIDPEV S LSNY++AK LWD
Subjt:  KIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWD

Query:  YLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISCDRRKCTCRLE
         L ELF ++N P I Q+K +ISRC+  K+M +AVYF+KL VL DELDKHE LI+C   KCTC +E
Subjt:  YLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISCDRRKCTCRLE

XP_021762963.1 uncharacterized protein LOC110727691 [Chenopodium quinoa]2.4e-3748.65Show/hide
Query:  KIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWD
        KI+TNSPFYLG  +R GDFITPIRLKL           ++         L+GTIT  V     DDWV++  MLVSW M+TIDPEV SMLSNY++AK LWD
Subjt:  KIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWD

Query:  YLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISCDRRKCTC---RLENYMKNDVSMIKYFINFC
         L+E F ++  P I QLK  I+RCE  KNMS+A+YF K  VL DE+ K E LI+C   KC C   R     + D  + ++ +  C
Subjt:  YLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISCDRRKCTC---RLENYMKNDVSMIKYFINFC

TrEMBL top hitse value%identityAlignment
A0A3Q7IBW4 Uncharacterized protein1.3e-3644.51Show/hide
Query:  TKIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLW
        +KI+ ++PFYLG  +R GDFITPIRLKL    +     +++         L+GTI   V     +DW+++H MLVSW M+TIDPEV SMLSNY++AK LW
Subjt:  TKIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLW

Query:  DYLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISCDRRKCTCRL--ENYMKNDVSMIKYFI
        D L+E F ++N P I+QLK  I++CE +K MS+A+Y+ KL VL D+L   + LI+C+  KC+C +  ++  + +  M++ F+
Subjt:  DYLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISCDRRKCTCRL--ENYMKNDVSMIKYFI

A0A438DPL1 Retrovirus-related Pol polyprotein from transposon RE24.0e-3046.45Show/hide
Query:  KIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWD
        K + NSPF+LG  +R GDFITP RL+     +     +L+         L GTIT   P +   DW  ++ MLVSW  +TIDPEV S LS + DAK LW+
Subjt:  KIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWD

Query:  YLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISC
        +L + ++++N P I QLK  I++CE SK+MS+  Y+ KL VL +EL KHE LISC
Subjt:  YLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISC

A0A438EGZ1 Retrovirus-related Pol polyprotein from transposon TNT 1-943.0e-3046.45Show/hide
Query:  KIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWD
        K + NSPF+LG  +R GDFITP RL+     +     +L+         L GTIT   P +   DW  ++ MLVSW  +TIDPEV S LS + DAK LW+
Subjt:  KIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWD

Query:  YLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISC
        +L + ++++N P I QLK  I++CE SK+MS+  Y+ KL VL +EL KHE LISC
Subjt:  YLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISC

A0A438GG88 Retrovirus-related Pol polyprotein from transposon RE23.0e-3046.45Show/hide
Query:  KIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWD
        K + NSPF+LG  +R GDFITP RL+     +     +L+         L GTIT   P +   DW  ++ MLVSW  +TIDPEV S LS + DAK LW+
Subjt:  KIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWD

Query:  YLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISC
        +L + ++++N P I QLK  I++CE SK+MS+  Y+ KL VL +EL KHE LISC
Subjt:  YLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISC

A0A438ILD3 Uncharacterized protein6.7e-3047.02Show/hide
Query:  NSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWDYLNE
        NSPF+LG  +R GDFITP RL+     +     +L+         L GTIT   P +   DW  ++ MLVSW  +TIDPEV S LS + DAK LW++L +
Subjt:  NSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAKLLWDYLNE

Query:  LFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISC
         ++++N P I QLK  I++CE SK+MS+  Y+ KL VL +EL KHE LISC
Subjt:  LFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCAAAGATTTGGCGCCGTTGCCGGTCGCCGCTACTTCACCAGTCGCCGGTCGCCGATTTGCACCGGTTTGTCGTTGCACCGATTTGCAACGATTGGTTGTT
CTTGGGGTTGCCGTGTGCAACGATTGGCGCCGGTCGCCGATTTGCACCGATTTGTACCGTTCGACTTGTGTTCGTGGTGTTCGTAGTGGTTGCCGCTCACGCTTC
GTGACCAATTTCGTGATTTCATCGCTGAATTTGGGGGAAATTTGGAGGATTTCATCGCTGCAGATTCAAGGAAGATTTCGTACTGAGGACGACAAAACTTCAGTT
GCAAAAACTCTATTAGGAGTTGCCACGAAGATTGATACAAATTCGCCGTTTTACCTAGGGCCACTAGAACGTCTTGGTGATTTCATTACTCCAATTCGTCTAAAG
TTGATAATTTTGACAAATGGTCTCATGCAATTCGAGTTGTCTTTTCGTCTCGACGAAAATTTGGTTATTCTTAATGGAACTATCACTAAGGTCGTTCCGTCTTGG
ATAAATGATGATTGGGTCATGATTCATTATATGCTAGTTTCATGGTACATGGATACAATCGATCCTGAGGTAGGTTCTATGCTATCTAATTATAATGATGCCAAG
CTGTTATGGGATTATTTGAATGAACTATTCTCCATTATGAACGATCCTCATATTTACCAACTTAAGGGAGATATTAGTCGTTGTGAACATAGCAAAAATATGTCA
ATTGCTGTCTACTTTAATAAATTGTATGTCTTGTTGGATGAGTTAGATAAACATGAACTTCTGATTTCTTGCGACCGTAGAAAGTGTACTTGTCGGTTGGAAAAC
TACATGAAAAACGACGTTTCGATGATAAAGTATTTCATCAATTTTTGCTTGGCTCGTGTTCTAATTATTATGGTCAGTCGAGGACTACTCTTTTATCACAAGACC
CTCTCCCATCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTCAAAGATTTGGCGCCGTTGCCGGTCGCCGCTACTTCACCAGTCGCCGGTCGCCGATTTGCACCGGTTTGTCGTTGCACCGATTTGCAACGATTGGTTGTT
CTTGGGGTTGCCGTGTGCAACGATTGGCGCCGGTCGCCGATTTGCACCGATTTGTACCGTTCGACTTGTGTTCGTGGTGTTCGTAGTGGTTGCCGCTCACGCTTC
GTGACCAATTTCGTGATTTCATCGCTGAATTTGGGGGAAATTTGGAGGATTTCATCGCTGCAGATTCAAGGAAGATTTCGTACTGAGGACGACAAAACTTCAGTT
GCAAAAACTCTATTAGGAGTTGCCACGAAGATTGATACAAATTCGCCGTTTTACCTAGGGCCACTAGAACGTCTTGGTGATTTCATTACTCCAATTCGTCTAAAG
TTGATAATTTTGACAAATGGTCTCATGCAATTCGAGTTGTCTTTTCGTCTCGACGAAAATTTGGTTATTCTTAATGGAACTATCACTAAGGTCGTTCCGTCTTGG
ATAAATGATGATTGGGTCATGATTCATTATATGCTAGTTTCATGGTACATGGATACAATCGATCCTGAGGTAGGTTCTATGCTATCTAATTATAATGATGCCAAG
CTGTTATGGGATTATTTGAATGAACTATTCTCCATTATGAACGATCCTCATATTTACCAACTTAAGGGAGATATTAGTCGTTGTGAACATAGCAAAAATATGTCA
ATTGCTGTCTACTTTAATAAATTGTATGTCTTGTTGGATGAGTTAGATAAACATGAACTTCTGATTTCTTGCGACCGTAGAAAGTGTACTTGTCGGTTGGAAAAC
TACATGAAAAACGACGTTTCGATGATAAAGTATTTCATCAATTTTTGCTTGGCTCGTGTTCTAATTATTATGGTCAGTCGAGGACTACTCTTTTATCACAAGACC
CTCTCCCATCCTTAA
Protein sequenceShow/hide protein sequence
MLKDLAPLPVAATSPVAGRRFAPVCRCTDLQRLVVLGVAVCNDWRRSPICTDLYRSTCVRGVRSGCRSRFVTNFVISSLNLGEIWRISSLQIQGRFRTEDDKTSV
AKTLLGVATKIDTNSPFYLGPLERLGDFITPIRLKLIILTNGLMQFELSFRLDENLVILNGTITKVVPSWINDDWVMIHYMLVSWYMDTIDPEVGSMLSNYNDAK
LLWDYLNELFSIMNDPHIYQLKGDISRCEHSKNMSIAVYFNKLYVLLDELDKHELLISCDRRKCTCRLENYMKNDVSMIKYFINFCLARVLIIMVSRGLLFYHKT
LSHP