; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g16010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g16010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr8:12299547..12301402
RNA-Seq ExpressionMoc08g16010
SyntenyMoc08g16010
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149836.1 uncharacterized protein LOC111018172 [Momordica charantia]9.3e-1836.4Show/hide
Query:  GGNDRPRGQDDLCSHPSSPN---------GRGLYKALNQKYSTKKVERLTRSDSGKPVLDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHV
        G  D P G+     HP  P           R L   L  K + KKV R T         +D++ D+PPF QDIL A IS K    +FNKYDG  D VDHV
Subjt:  GGNDRPRGQDDLCSHPSSPN---------GRGLYKALNQKYSTKKVERLTRSDSGKPVLDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHV

Query:  ETYKSLMDFHAYLDAIKCRALFVTLQGSARK--------RQKSGEKLR------------------DYVRRFLTAQITM-LCNEEFACSTPTSSSRHGKT
        ETY+ +MDFHAY DA+KCRAL +TLQG ARK           S ++LR                  DY++RFL+ QI +  C +  A S   +   H K 
Subjt:  ETYKSLMDFHAYLDAIKCRALFVTLQGSARK--------RQKSGEKLR------------------DYVRRFLTAQITM-LCNEEFACSTPTSSSRHGKT

Query:  HRTELARLEREEVWKLKIKHEESAKPREERAKNFRDRTD
                     W L  K E + K   +RA  F +  D
Subjt:  HRTELARLEREEVWKLKIKHEESAKPREERAKNFRDRTD

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]2.4e-1031.46Show/hide
Query:  HPSSPNG---RGLYKALNQKYSTKKVERLTRSDSGKPVLDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHVETYKSLMDFHAYLDAIKCRA
        +P +P G   R  +  L  K+  +      R +  +   DD    + PFT DIL ASI  K K  +   YDG  D  D+VE ++ LMDF A  DAIKCRA
Subjt:  HPSSPNG---RGLYKALNQKYSTKKVERLTRSDSGKPVLDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHVETYKSLMDFHAYLDAIKCRA

Query:  LFVTLQGSAR--KRQKSGEKLRDYV---RRFLTAQITMLCNEEFACSTPTSSSRHGKTHRTELARLEREEVWKLKIKH
          + L GSAR   R+     +  Y    + F++   +   + +      T   + GKT +  + R + E+   LK+ H
Subjt:  LFVTLQGSAR--KRQKSGEKLRDYV---RRFLTAQITMLCNEEFACSTPTSSSRHGKTHRTELARLEREEVWKLKIKH

XP_022158651.1 uncharacterized protein LOC111025110 [Momordica charantia]8.4e-1140.31Show/hide
Query:  VDHVETYKSLMDFHAYLDAIKCRALFVTLQGSARK--------RQKSGEKLRDYVRRFLTAQITMLCNEEFACSTPTSSSRHGKTHRTELARLEREEVWK
        +DHVETY+SLMDFHAY DA+KCRA     Q   +         RQ++GE LR+YVRRFL AQIT+ CN+EFA           + H+ +L        W 
Subjt:  VDHVETYKSLMDFHAYLDAIKCRALFVTLQGSARK--------RQKSGEKLRDYVRRFLTAQITMLCNEEFACSTPTSSSRHGKTHRTELARLEREEVWK

Query:  LKIKHEESAKPREERAKNFRDRTD--ESW
           K + + K   E A+ F    +  +SW
Subjt:  LKIKHEESAKPREERAKNFRDRTD--ESW

XP_022158652.1 uncharacterized protein LOC111025109 [Momordica charantia]2.0e-1244.55Show/hide
Query:  LDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHVETYKSLMDFHAYLDAIKCRALFVTLQGSAR-------KRQKSGEKLRDYVRRFLTAQI
        L+D    +PPFT D+L A I  K K  +   YDG  D  D+VE ++ LMDF A  DAIKCRA  + L GSAR        RQK  E LR+YV RF   Q+
Subjt:  LDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHVETYKSLMDFHAYLDAIKCRALFVTLQGSAR-------KRQKSGEKLRDYVRRFLTAQI

Query:  TML-CNEEFA
         +  C+++ A
Subjt:  TML-CNEEFA

XP_022159160.1 uncharacterized protein LOC111025585 [Momordica charantia]2.4e-1036.57Show/hide
Query:  KKVERL-TRSDSGKPVLDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHVETYKSLMDFHAYLDAIKCRALFVTLQGS--------------
        ++VE L  + +  +   +D +  + PFT D+L   I  K K+ +   YDG  D  D+VE ++SLMDF A  DAIKCRA  + L G+              
Subjt:  KKVERL-TRSDSGKPVLDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHVETYKSLMDFHAYLDAIKCRALFVTLQGS--------------

Query:  -ARKRQKSGEKLRDYVRRFLTAQITML-CNEEFA
         A  RQK GE LR+YV RF   Q+ +  C+++ A
Subjt:  -ARKRQKSGEKLRDYVRRFLTAQITML-CNEEFA

TrEMBL top hitse value%identityAlignment
A0A6J1D9M1 uncharacterized protein LOC1110181724.5e-1836.4Show/hide
Query:  GGNDRPRGQDDLCSHPSSPN---------GRGLYKALNQKYSTKKVERLTRSDSGKPVLDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHV
        G  D P G+     HP  P           R L   L  K + KKV R T         +D++ D+PPF QDIL A IS K    +FNKYDG  D VDHV
Subjt:  GGNDRPRGQDDLCSHPSSPN---------GRGLYKALNQKYSTKKVERLTRSDSGKPVLDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHV

Query:  ETYKSLMDFHAYLDAIKCRALFVTLQGSARK--------RQKSGEKLR------------------DYVRRFLTAQITM-LCNEEFACSTPTSSSRHGKT
        ETY+ +MDFHAY DA+KCRAL +TLQG ARK           S ++LR                  DY++RFL+ QI +  C +  A S   +   H K 
Subjt:  ETYKSLMDFHAYLDAIKCRALFVTLQGSARK--------RQKSGEKLR------------------DYVRRFLTAQITM-LCNEEFACSTPTSSSRHGKT

Query:  HRTELARLEREEVWKLKIKHEESAKPREERAKNFRDRTD
                     W L  K E + K   +RA  F +  D
Subjt:  HRTELARLEREEVWKLKIKHEESAKPREERAKNFRDRTD

A0A6J1DPN4 uncharacterized protein LOC1110230601.2e-1031.46Show/hide
Query:  HPSSPNG---RGLYKALNQKYSTKKVERLTRSDSGKPVLDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHVETYKSLMDFHAYLDAIKCRA
        +P +P G   R  +  L  K+  +      R +  +   DD    + PFT DIL ASI  K K  +   YDG  D  D+VE ++ LMDF A  DAIKCRA
Subjt:  HPSSPNG---RGLYKALNQKYSTKKVERLTRSDSGKPVLDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHVETYKSLMDFHAYLDAIKCRA

Query:  LFVTLQGSAR--KRQKSGEKLRDYV---RRFLTAQITMLCNEEFACSTPTSSSRHGKTHRTELARLEREEVWKLKIKH
          + L GSAR   R+     +  Y    + F++   +   + +      T   + GKT +  + R + E+   LK+ H
Subjt:  LFVTLQGSAR--KRQKSGEKLRDYV---RRFLTAQITMLCNEEFACSTPTSSSRHGKTHRTELARLEREEVWKLKIKH

A0A6J1DXR9 uncharacterized protein LOC1110251099.7e-1344.55Show/hide
Query:  LDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHVETYKSLMDFHAYLDAIKCRALFVTLQGSAR-------KRQKSGEKLRDYVRRFLTAQI
        L+D    +PPFT D+L A I  K K  +   YDG  D  D+VE ++ LMDF A  DAIKCRA  + L GSAR        RQK  E LR+YV RF   Q+
Subjt:  LDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHVETYKSLMDFHAYLDAIKCRALFVTLQGSAR-------KRQKSGEKLRDYVRRFLTAQI

Query:  TML-CNEEFA
         +  C+++ A
Subjt:  TML-CNEEFA

A0A6J1DXW4 uncharacterized protein LOC1110255851.2e-1036.57Show/hide
Query:  KKVERL-TRSDSGKPVLDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHVETYKSLMDFHAYLDAIKCRALFVTLQGS--------------
        ++VE L  + +  +   +D +  + PFT D+L   I  K K+ +   YDG  D  D+VE ++SLMDF A  DAIKCRA  + L G+              
Subjt:  KKVERL-TRSDSGKPVLDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHVETYKSLMDFHAYLDAIKCRALFVTLQGS--------------

Query:  -ARKRQKSGEKLRDYVRRFLTAQITML-CNEEFA
         A  RQK GE LR+YV RF   Q+ +  C+++ A
Subjt:  -ARKRQKSGEKLRDYVRRFLTAQITML-CNEEFA

A0A6J1E1K0 uncharacterized protein LOC1110251104.1e-1140.31Show/hide
Query:  VDHVETYKSLMDFHAYLDAIKCRALFVTLQGSARK--------RQKSGEKLRDYVRRFLTAQITMLCNEEFACSTPTSSSRHGKTHRTELARLEREEVWK
        +DHVETY+SLMDFHAY DA+KCRA     Q   +         RQ++GE LR+YVRRFL AQIT+ CN+EFA           + H+ +L        W 
Subjt:  VDHVETYKSLMDFHAYLDAIKCRALFVTLQGSARK--------RQKSGEKLRDYVRRFLTAQITMLCNEEFACSTPTSSSRHGKTHRTELARLEREEVWK

Query:  LKIKHEESAKPREERAKNFRDRTD--ESW
           K + + K   E A+ F    +  +SW
Subjt:  LKIKHEESAKPREERAKNFRDRTD--ESW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACAAGTAGCTACCCCATTGACCACTAAAAATACACCATCTATCCACCAATACAGGTACCACAGAGGAAGCTCGCCCCAAGTAGTCCATAGACTTGACAGATCGCT
TATAAGCGTTAGTTATGGCAGAAGGTTCTCAGACCTCAAGAAAGAGACAGCCCGAAGGTCAGGGACAATCACTGGGTCCCAAAGACCCAAAGGCACGATCAGGTGCGTCC
AAGTCGCCAATCACTCGGAAGCCCAAGACTCGAGCAGATTCCCAGAGGTCGAAGTCTTGACTCTAGGTCGAGGAGGGAACGACCGACCTCGAGGCCAAGATGACCTTTGC
TCCCACCCCTCCTCACCAAATGGCCGAGGCCTCTACAAGGCCTTGAACCAGAAGTACTCCACCAAGAAGGTCGAGAGGCTGACGAGATCAGACTCAGGAAAGCCAGTGTT
AGATGATAATAAGCATGATAAGCCACCTTTCACCCAGGACATCTTGACTGCATCGATCTCCTCCAAAGGAAAGCTTTATAGCTTCAACAAGTACGATGGCTGGGTCGACT
TGGTGGACCACGTGGAGACCTACAAGTCTCTGATGGATTTCCATGCTTACCTGGATGCAATAAAGTGTCGGGCGTTATTTGTGACGTTGCAAGGTTCCGCTAGGAAACGG
CAGAAGTCAGGTGAAAAACTCAGAGATTATGTGAGAAGGTTCCTCACTGCACAAATCACGATGTTGTGCAACGAGGAGTTCGCTTGCTCAACGCCAACCAGCTCATCCAG
ACATGGGAAGACCCACCGGACGGAGCTAGCGCGCCTCGAGAGAGAAGAGGTTTGGAAGCTGAAAATTAAGCATGAAGAGAGTGCTAAGCCTAGAGAAGAGCGAGCGAAGA
ATTTCCGAGACAGAACAGATGAAAGCTGGAGCGAAGTAGAGTTGGAGCATGGATTCATTACGCAACAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGACAAGTAGCTACCCCATTGACCACTAAAAATACACCATCTATCCACCAATACAGGTACCACAGAGGAAGCTCGCCCCAAGTAGTCCATAGACTTGACAGATCGCT
TATAAGCGTTAGTTATGGCAGAAGGTTCTCAGACCTCAAGAAAGAGACAGCCCGAAGGTCAGGGACAATCACTGGGTCCCAAAGACCCAAAGGCACGATCAGGTGCGTCC
AAGTCGCCAATCACTCGGAAGCCCAAGACTCGAGCAGATTCCCAGAGGTCGAAGTCTTGACTCTAGGTCGAGGAGGGAACGACCGACCTCGAGGCCAAGATGACCTTTGC
TCCCACCCCTCCTCACCAAATGGCCGAGGCCTCTACAAGGCCTTGAACCAGAAGTACTCCACCAAGAAGGTCGAGAGGCTGACGAGATCAGACTCAGGAAAGCCAGTGTT
AGATGATAATAAGCATGATAAGCCACCTTTCACCCAGGACATCTTGACTGCATCGATCTCCTCCAAAGGAAAGCTTTATAGCTTCAACAAGTACGATGGCTGGGTCGACT
TGGTGGACCACGTGGAGACCTACAAGTCTCTGATGGATTTCCATGCTTACCTGGATGCAATAAAGTGTCGGGCGTTATTTGTGACGTTGCAAGGTTCCGCTAGGAAACGG
CAGAAGTCAGGTGAAAAACTCAGAGATTATGTGAGAAGGTTCCTCACTGCACAAATCACGATGTTGTGCAACGAGGAGTTCGCTTGCTCAACGCCAACCAGCTCATCCAG
ACATGGGAAGACCCACCGGACGGAGCTAGCGCGCCTCGAGAGAGAAGAGGTTTGGAAGCTGAAAATTAAGCATGAAGAGAGTGCTAAGCCTAGAGAAGAGCGAGCGAAGA
ATTTCCGAGACAGAACAGATGAAAGCTGGAGCGAAGTAGAGTTGGAGCATGGATTCATTACGCAACAATAG
Protein sequenceShow/hide protein sequence
MGQVATPLTTKNTPSIHQYRYHRGSSPQVVHRLDRSLISVSYGRRFSDLKKETARRSGTITGSQRPKGTIRCVQVANHSEAQDSSRFPEVEVLTLGRGGNDRPRGQDDLC
SHPSSPNGRGLYKALNQKYSTKKVERLTRSDSGKPVLDDNKHDKPPFTQDILTASISSKGKLYSFNKYDGWVDLVDHVETYKSLMDFHAYLDAIKCRALFVTLQGSARKR
QKSGEKLRDYVRRFLTAQITMLCNEEFACSTPTSSSRHGKTHRTELARLEREEVWKLKIKHEESAKPREERAKNFRDRTDESWSEVELEHGFITQQ