CuGenDBv2

Moc03g22660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g22660
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ty3/gypsy retrotransposon protein
Genome location: chr3:16006797..16007845
RNA-Seq Expression: Moc03g22660
Synteny: Moc03g22660
Gene Ontology terms:
    GO:0009987 - cellular process (biological process)
    GO:0016020 - membrane (cellular component)
InterPro domains: IPR021109 - Aspartic peptidase domain superfamily
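
The genome location in this record uses a `chromosome:start..end` string. As a minimal sketch of how such a string could be split and its span computed, the Python below parses it with a regular expression. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive at both ends, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chr3:16006797..16007845' style string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span in bp.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return abs(end - start) + 1


chrom, start, end = parse_location("chr3:16006797..16007845")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption, the span for this gene comes out to 1049 bp.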


Homology
GenBank top hits (e value, % identity, alignment)
KAA0036997.1 gypsy/ty3 element polyprotein [Cucumis melo var. makuwa] (e value: 3.0e-33; % identity: 53.38)
Query:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP
        +G +++++++++VDCGATHNFI  KL E L +   ET +YGVIMGSG  V+G GICKG+ + LP I++ E+FL LEL  +D++LGMQWL+  G M VDW 
Subjt:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP

Query:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR
        ALTMTF     KV ++GDPSLTRME + + L +
Subjt:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR

TYK10423.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 3.0e-33; % identity: 53.38)
Query:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP
        +G +++++++++VDCGATHNFI  KL E L +   ET +YGVIMGSG  V+G GICKG+ + LP I++ E+FL LEL  +D++LGMQWL+  G M VDW 
Subjt:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP

Query:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR
        ALTMTF     KV ++GDPSLTRME + + L +
Subjt:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR

TYK17386.1 gypsy/ty3 element polyprotein [Cucumis melo var. makuwa] (e value: 3.0e-33; % identity: 53.38)
Query:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP
        +G +++++++++VDCGATHNFI  KL E L +   ET +YGVIMGSG  V+G GICKG+ + LP I++ E+FL LEL  +D++LGMQWL+  G M VDW 
Subjt:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP

Query:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR
        ALTMTF     KV ++GDPSLTRME + + L +
Subjt:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR

TYK21209.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 3.0e-33; % identity: 53.38)
Query:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP
        +G +++++++++VDCGATHNFI  KL E L +   ET +YGVIMGSG  V+G GICKG+ + LP I++ E+FL LEL  +D++LGMQWL+  G M VDW 
Subjt:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP

Query:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR
        ALTMTF     KV ++GDPSLTRME + + L +
Subjt:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR

XP_022154744.1 uncharacterized protein LOC111021922 [Momordica charantia] (e value: 5.0e-41; % identity: 44.25)
Query:  KPEPLQKRLTEAEYQKRKDKGLVRR----------------EIF-----GGAQVDHE-----TEGQ------------------------------IQDK
        + E  QK+LTE EYQ+RKDKGL  R                ++F      G ++D E     TEG+                              I+DK
Subjt:  KPEPLQKRLTEAEYQKRKDKGLVRR----------------EIF-----GGAQVDHE-----TEGQ------------------------------IQDK

Query:  DVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWPALTMTFE
        +V++L+DCGATHNFI QKL +  N+   ET +YGVIMG+G +VRG GICKG++L LPE+T++ENFL LEL  LDV+LGMQWL   G M+VDW ALTM+F 
Subjt:  DVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWPALTMTFE

Query:  KNGQKVKIQGDPSLTRMEATFQRLAR
            ++ ++GDP+L RME T ++LAR
Subjt:  KNGQKVKIQGDPSLTRMEATFQRLAR

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T606 Gypsy/ty3 element polyprotein (e value: 1.4e-33; % identity: 53.38)
Query:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP
        +G +++++++++VDCGATHNFI  KL E L +   ET +YGVIMGSG  V+G GICKG+ + LP I++ E+FL LEL  +D++LGMQWL+  G M VDW 
Subjt:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP

Query:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR
        ALTMTF     KV ++GDPSLTRME + + L +
Subjt:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR

A0A5D3BD16 Ty3/gypsy retrotransposon protein (e value: 1.4e-33; % identity: 53.38)
Query:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP
        +G +++++++++VDCGATHNFI  KL E L +   ET +YGVIMGSG  V+G GICKG+ + LP I++ E+FL LEL  +D++LGMQWL+  G M VDW 
Subjt:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP

Query:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR
        ALTMTF     KV ++GDPSLTRME + + L +
Subjt:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR

A0A5D3CEX8 Ty3/gypsy retrotransposon protein (e value: 1.4e-33; % identity: 53.38)
Query:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP
        +G +++++++++VDCGATHNFI  KL E L +   ET +YGVIMGSG  V+G GICKG+ + LP I++ E+FL LEL  +D++LGMQWL+  G M VDW 
Subjt:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP

Query:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR
        ALTMTF     KV ++GDPSLTRME + + L +
Subjt:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR

A0A5D3DRT3 Ty3/gypsy retrotransposon protein (e value: 1.4e-33; % identity: 53.38)
Query:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP
        +G +++++++++VDCGATHNFI  KL E L +   ET +YGVIMGSG  V+G GICKG+ + LP I++ E+FL LEL  +D++LGMQWL+  G M VDW 
Subjt:  EGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWP

Query:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR
        ALTMTF     KV ++GDPSLTRME + + L +
Subjt:  ALTMTFEKNGQKVKIQGDPSLTRMEATFQRLAR

A0A6J1DN22 Reverse transcriptase (e value: 2.4e-41; % identity: 44.25)
Query:  KPEPLQKRLTEAEYQKRKDKGLVRR----------------EIF-----GGAQVDHE-----TEGQ------------------------------IQDK
        + E  QK+LTE EYQ+RKDKGL  R                ++F      G ++D E     TEG+                              I+DK
Subjt:  KPEPLQKRLTEAEYQKRKDKGLVRR----------------EIF-----GGAQVDHE-----TEGQ------------------------------IQDK

Query:  DVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWPALTMTFE
        +V++L+DCGATHNFI QKL +  N+   ET +YGVIMG+G +VRG GICKG++L LPE+T++ENFL LEL  LDV+LGMQWL   G M+VDW ALTM+F 
Subjt:  DVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLELETLDVMLGMQWLRHVGRMQVDWPALTMTFE

Query:  KNGQKVKIQGDPSLTRMEATFQRLAR
            ++ ++GDP+L RME T ++LAR
Subjt:  KNGQKVKIQGDPSLTRMEATFQRLAR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G29750.1 Eukaryotic aspartyl protease family protein (e value: 3.3e-14; % identity: 35.34)
Query:  GQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLEL--ETLDVMLGMQWLRHVGRMQVDW
        G I D  V+V +D GAT NFI  +LA  L +  + T    V++G    ++  G C G+ L + E+ + ENFL L+L    +DV+LG +WL  +G   V+W
Subjt:  GQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLEL--ETLDVMLGMQWLRHVGRMQVDW

Query:  PALTMTFEKNGQKVKI
             +F  N Q + +
Subjt:  PALTMTFEKNGQKVKI

AT3G30770.1 Eukaryotic aspartyl protease family protein (e value: 1.1e-09; % identity: 32.76)
Query:  GQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLEL--ETLDVMLGMQWLRHVGRMQVDW
        G I    V+V++D GAT+NFI  +LA  L +  + T    V++G    ++  G C G+ L + E+ + ENFL L+L    +DV+LG    +++ R  + W
Subjt:  GQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVKENFLSLEL--ETLDVMLGMQWLRHVGRMQVDW

Query:  PALTMTFEKNGQKVKI
             +F  N Q V +
Subjt:  PALTMTFEKNGQKVKI


Sequences
CDS sequence
ATGGTTTCGAGCAAACCAGAACCACTGCAGAAACGCCTTACGGAAGCAGAGTATCAAAAACGGAAGGATAAGGGCCTGGTGCGACGAGAAATATTCGGTGGAGCACAAGT
GGACCATGAAACTGAAGGTCAAATCCAGGACAAGGACGTGATCGTGTTGGTGGACTGCGGCGCAACACACAACTTCATTTTCCAGAAGCTAGCGGAAGAACTCAACATTG
TGAGAACAGAAACTCCCAGCTATGGGGTAATCATGGGGTCGGGAACGATGGTCAGGGGAGGAGGGATATGTAAGGGAGTAGTTCTCACCCTCCCAGAAATAACGGTCAAG
GAGAACTTCTTGTCGTTGGAGTTGGAGACCCTAGATGTAATGCTGGGCATGCAGTGGCTACGACATGTAGGAAGAATGCAAGTTGACTGGCCGGCGTTGACTATGACTTT
TGAAAAGAACGGACAGAAGGTGAAGATCCAGGGGGACCCCTCTTTGACACGCATGGAGGCCACGTTTCAACGGCTAGCAAGGGCGGGAGGAGAGATATGTAAGGGAGTAG
TTCTCACCCTCCCAGAAATAACGGTCAAGGAGGACTTCCTGTCGTTGGAGTTGGGGACCCTAGACGTAATGCTGGACATGCAGTGGCTACGGCAAGTAGGAAGAATGCAA
GTTGACTGGCCGGCGTTGACTATGACTTTTGAAAAGGACGGGCAGAAGGTGAACATCCAAGGGGACCCCTCTTTGACACGCATGGAGATCACGTTTCAAGGGTTCTTGAT
AGAACTACGAGCTCTATTGACTAAAGAGGAATAA
mRNA sequence
ATGGTTTCGAGCAAACCAGAACCACTGCAGAAACGCCTTACGGAAGCAGAGTATCAAAAACGGAAGGATAAGGGCCTGGTGCGACGAGAAATATTCGGTGGAGCACAAGT
GGACCATGAAACTGAAGGTCAAATCCAGGACAAGGACGTGATCGTGTTGGTGGACTGCGGCGCAACACACAACTTCATTTTCCAGAAGCTAGCGGAAGAACTCAACATTG
TGAGAACAGAAACTCCCAGCTATGGGGTAATCATGGGGTCGGGAACGATGGTCAGGGGAGGAGGGATATGTAAGGGAGTAGTTCTCACCCTCCCAGAAATAACGGTCAAG
GAGAACTTCTTGTCGTTGGAGTTGGAGACCCTAGATGTAATGCTGGGCATGCAGTGGCTACGACATGTAGGAAGAATGCAAGTTGACTGGCCGGCGTTGACTATGACTTT
TGAAAAGAACGGACAGAAGGTGAAGATCCAGGGGGACCCCTCTTTGACACGCATGGAGGCCACGTTTCAACGGCTAGCAAGGGCGGGAGGAGAGATATGTAAGGGAGTAG
TTCTCACCCTCCCAGAAATAACGGTCAAGGAGGACTTCCTGTCGTTGGAGTTGGGGACCCTAGACGTAATGCTGGACATGCAGTGGCTACGGCAAGTAGGAAGAATGCAA
GTTGACTGGCCGGCGTTGACTATGACTTTTGAAAAGGACGGGCAGAAGGTGAACATCCAAGGGGACCCCTCTTTGACACGCATGGAGATCACGTTTCAAGGGTTCTTGAT
AGAACTACGAGCTCTATTGACTAAAGAGGAATAA
Protein sequence
MVSSKPEPLQKRLTEAEYQKRKDKGLVRREIFGGAQVDHETEGQIQDKDVIVLVDCGATHNFIFQKLAEELNIVRTETPSYGVIMGSGTMVRGGGICKGVVLTLPEITVK
ENFLSLELETLDVMLGMQWLRHVGRMQVDWPALTMTFEKNGQKVKIQGDPSLTRMEATFQRLARAGGEICKGVVLTLPEITVKEDFLSLELGTLDVMLDMQWLRQVGRMQ
VDWPALTMTFEKDGQKVNIQGDPSLTRMEITFQGFLIELRALLTKEE
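
The CDS and protein sequences above can be cross-checked against each other by translating the CDS. The Python below is a minimal sketch of that check, assuming the standard nuclear genetic code (the page does not say which translation table was used). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence copied
    from the wrapped listing can be pasted in directly.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nt and last 9 nt of the CDS listed in this record.
cds_start = "ATGGTTTCGAGCAAACCAGAACCACTGCAGAAA"
cds_end = "GAGGAATAA"
print(translate(cds_start))  # MVSSKPEPLQK
print(translate(cds_end))    # EE
```

The two fragments translate to `MVSSKPEPLQK` and `EE`, which match the first eleven and last two residues of the listed protein sequence. Passing the full CDS to `translate` would extend the same comparison to the whole protein.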