CuGenDBv2

Moc02g16800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g16800
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Transposon Ty3-I Gag-Pol polyprotein
Genome location: chr2:12617523..12618310
RNA-Seq Expression: Moc02g16800
Synteny: Moc02g16800
Gene Ontology terms: NA
InterPro domains: NA
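
The genome location string can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re

def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr2:12617523..12618310")
print(chrom, start, end, span_length(start, end))  # chr2 12617523 12618310 788
```

Under that assumption the gene spans 788 bp on chr2.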


Homology
GenBank top hits (e value, % identity, alignment)
XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]; e value: 1.1e-39; identity: 43.08%
Query:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVA-----------------------TTSS
        MV+DA ANGA+LSKSY E  +ILERI+SNNY W  +RA   +   G      +T L  Q+A++TN++KNM++                        T  +
Subjt:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVA-----------------------TTSS

Query:  TNSGSSKVMAI------------SCSYCKGWRQHPNFSWGGNQGGHGNQNAGQYQQKTMQQPGLLMPSQPQGARPAANPSNSMEALMRDYMVKNYALIQS
          S  + V  +            S SY   W+ HPNFSWGG       Q    +     QQP    P QPQG++     ++S+E+LMRDYM KN  +IQS
Subjt:  TNSGSSKVMAI------------SCSYCKGWRQHPNFSWGGNQGGHGNQNAGQYQQKTMQQPGLLMPSQPQGARPAANPSNSMEALMRDYMVKNYALIQS

Query:  QAATLRNMEVQLGQFTIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKAL
        QAA+LRN+EVQLGQ   ++KN PQGTLPS+T+NPRR+GKE C+ +TLRSGK +
Subjt:  QAATLRNMEVQLGQFTIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKAL

XP_030498047.1 uncharacterized protein LOC115713707 [Cannabis sativa]; e value: 4.9e-40; identity: 42.75%
Query:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSKVMAISCSYC-------
        MV+DA ANGA+ SKSY E  +I+ERI+SNNY W  +RA   +   G      +T L  Q+A++TN++KNM++  +    +   +   ISC YC       
Subjt:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSKVMAISCSYC-------

Query:  -----------------------------KGWRQHPNFSWGGNQGGHGNQNAGQYQQKTMQQPGLLMPSQPQGA-RPAANPSNSMEALMRDYMVKNYALI
                                       W+ HPNFSWGG QG   + +  Q Q K    PG     +PQ + +P  + ++S+E+LMRDYM KN A+I
Subjt:  -----------------------------KGWRQHPNFSWGGNQGGHGNQNAGQYQQKTMQQPGLLMPSQPQGA-RPAANPSNSMEALMRDYMVKNYALI

Query:  QSQAATLRNMEVQLGQFTIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKAL
        QSQAA+LRN+EVQLGQ    +KN PQGTLPS+T+NPRR+GKE C+ +TLRSGK L
Subjt:  QSQAATLRNMEVQLGQFTIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKAL

XP_030503898.1 uncharacterized protein LOC115719117 [Cannabis sativa]; e value: 4.7e-43; identity: 46.44%
Query:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSV-ATTSSTNSGSSKVMAISC---------
        MV+DA ANGA+LSKSY E  +ILERI+SNNY W  +RA   +   G      +T L  Q+A++TN++KNM++  +         ++ +  C         
Subjt:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSV-ATTSSTNSGSSKVMAISC---------

Query:  -------SYCKGWRQHPNFSWGGNQGGHGNQNAGQYQQK----TMQQPGLLMPSQPQGARPAANPSNSMEALMRDYMVKNYALIQSQAATLRNMEVQLGQ
               SY   W+ HPNFSWGG QG   +   GQ +Q       QQP    P QPQG++     ++S+E+LMRDYM KN A+IQSQAA+LRN+EVQLGQ
Subjt:  -------SYCKGWRQHPNFSWGGNQGGHGNQNAGQYQQK----TMQQPGLLMPSQPQGARPAANPSNSMEALMRDYMVKNYALIQSQAATLRNMEVQLGQ

Query:  FTIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKAL
           ++KN PQGTLPS+T+NPRR+GKE C+ +TLRSGK +
Subjt:  FTIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKAL

XP_030507648.1 uncharacterized protein LOC115722545 [Cannabis sativa]; e value: 8.9e-42; identity: 43.7%
Query:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSKVMAISCSYC-------
        MV+DA ANGA+LSKSY E  +ILERI+SNNY W  +RA   +   G      +T L  Q+A++TN++KNM++  +    +   +   ISC YC       
Subjt:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSKVMAISCSYC-------

Query:  -----------------------------KGWRQHPNFSWGGNQGGHGNQNAGQYQQKTMQQPGLLMPSQPQGARPAANPSNSMEALMRDYMVKNYALIQ
                                       W+ HPNFSWGG   G  +  A Q Q K    PG     QP+  +P  + ++S+E+LMRDYM KN A+IQ
Subjt:  -----------------------------KGWRQHPNFSWGGNQGGHGNQNAGQYQQKTMQQPGLLMPSQPQGARPAANPSNSMEALMRDYMVKNYALIQ

Query:  SQAATLRNMEVQLGQFTIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKAL
        SQAA+LRN+EVQLGQ   ++KN PQGTLPS+T+NPRR+GKE C+ +TLRSGK L
Subjt:  SQAATLRNMEVQLGQFTIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKAL

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa]; e value: 2.6e-41; identity: 43.14%
Query:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSKVMAISCSYC-------
        MV+DA ANGA+LSKSY E  +ILERI+SNNY W  +RA   +   G      +T L  Q+A++TN++KNM++  +    +   +   ISC YC       
Subjt:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSKVMAISCSYC-------

Query:  -----------------------------KGWRQHPNFSWGGNQGGHGNQNAGQYQQKTMQQPGLLMPSQPQGA-RPAANPSNSMEALMRDYMVKNYALI
                                       W+ HPNFSWGG   G  +  A Q Q K    PG     +PQ   +P  + ++S+E+LMRDYM KN A+I
Subjt:  -----------------------------KGWRQHPNFSWGGNQGGHGNQNAGQYQQKTMQQPGLLMPSQPQGA-RPAANPSNSMEALMRDYMVKNYALI

Query:  QSQAATLRNMEVQLGQFTIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKAL
        QSQAA+LRN+EVQLGQ   ++KN PQGTLPS+T+NPRR+GKE C+ +TLRSGK +
Subjt:  QSQAATLRNMEVQLGQFTIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKAL

TrEMBL top hits (e value, % identity, alignment)
A0A5B6VWJ0 Retroelement pol polyprotein-like; e value: 6.9e-32; identity: 39.46%
Query:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSK-------VMAISCS--
        MVVDA ANGALLSKSY E  +I+ERI+SNNY W  SRA + +   G      +T L  Q++++++M KN+   TT+ +NS +++       +  + C   
Subjt:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSK-------VMAISCS--

Query:  --------------------------------YCKGWRQHPNFSWGGNQGGHGNQNAGQYQQKTMQQPGLLMPSQPQGARP--AANPSNSMEALMRDYMV
                                        Y   WR H +FSW        NQ AG     T  +P  L P+ PQ  +    A  SNS+E+L++ YM 
Subjt:  --------------------------------YCKGWRQHPNFSWGGNQGGHGNQNAGQYQQKTMQQPGLLMPSQPQGARP--AANPSNSMEALMRDYMV

Query:  KNYALIQSQAATLRNMEVQLGQFTIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKAL
        KN ALIQSQAATL+N+E Q+GQ   E++N  QG LPS+T+NPR  GKE C+ LTLRS K +
Subjt:  KNYALIQSQAATLRNMEVQLGQFTIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKAL

A0A6J1DVZ9 uncharacterized protein LOC111024970; e value: 8.4e-30; identity: 52.9%
Query:  DSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSKVMAISCSYCKGWRQHPNFSWGGNQGGHGNQNAGQYQQKTMQQPGLLMPSQ
        DS+A+N++ N+ A  N  M  L DQIANLTNMVKNM+ ATTSS + G  + +           + P            NQN  QYQQK  QQPGLLMP+Q
Subjt:  DSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSKVMAISCSYCKGWRQHPNFSWGGNQGGHGNQNAGQYQQKTMQQPGLLMPSQ

Query:  PQGARPAANPSNSMEALMRDYMVKNYALIQSQAATLRNMEVQLGQFTIEMKNSPQ
         QG RPAAN SNSME +MR+YM +N ALIQSQAA  RN+EVQLGQ   ++KN P+
Subjt:  PQGARPAANPSNSMEALMRDYMVKNYALIQSQAATLRNMEVQLGQFTIEMKNSPQ

A0A6J1DW02 uncharacterized protein LOC111024897; e value: 5.6e-26; identity: 35.68%
Query:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSR---AINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSKVMAISCSY---CK
        M+++  ANGA   K++ E+VDIL  ++S+N  W   R   A   +   G  +    T ++ ++  +   +K M++   +   +    V +  C++   C+
Subjt:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSR---AINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSKVMAISCSY---CK

Query:  -----GWRQHPNFSWGGNQGGHGNQNAGQYQQKTM-----QQPGLLMPSQPQGAR----PAANPSNSMEALMRDYMVKNYALIQSQAATLRNMEVQLGQF
              WR HPNFSWGG QGG    N GQ QQ         Q  +  P Q    R    P  N ++++E +M++YM +  A+IQSQAA++RN   QLG  
Subjt:  -----GWRQHPNFSWGGNQGGHGNQNAGQYQQKTM-----QQPGLLMPSQPQGAR----PAANPSNSMEALMRDYMVKNYALIQSQAATLRNMEVQLGQF

Query:  TIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKALPHP
          E+KN PQG+ P +T+ PRREGKEQC+ +TLRSG     P
Subjt:  TIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKALPHP

A0A6J1DWK1 uncharacterized protein LOC111025053; e value: 2.1e-36; identity: 42.22%
Query:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSKVMAISCSYCKGWRQHP
        +V+DA  NGALL K Y + ++ILERISS+N+ W D RAI  K +     +   T L  +I  LT++               +++  + S +Y    R HP
Subjt:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSKVMAISCSYCKGWRQHP

Query:  NFSWGGNQGGH--GNQNAGQYQQKTMQQPGLLMPSQPQGARPAANPSNSMEALMRDYMVKNYALIQSQAATLRNMEVQLGQFTIEMKNSPQGTLPSNTKN
        NF W GNQGGH  G  NA  +QQK    PG     Q      +     S+E +M+ YM  N A +QSQAA+LRN+E+Q+GQ  +++K+ P G LPS+T+ 
Subjt:  NFSWGGNQGGH--GNQNAGQYQQKTMQQPGLLMPSQPQGARPAANPSNSMEALMRDYMVKNYALIQSQAATLRNMEVQLGQFTIEMKNSPQGTLPSNTKN

Query:  PRREGKEQCQTLTLRSGKALPHPYP
        P+R+ KEQC  LTLRSGKALP  +P
Subjt:  PRREGKEQCQTLTLRSGKALPHPYP

A0A6J1DXK5 uncharacterized protein LOC111025500; e value: 4.9e-30; identity: 40.97%
Query:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMV-KNMSVATTSSTNSGS---SKVMAISCSYCKGW
        +V+DA ANGALL+K Y E  +ILERISSNN  W D RAI+ KG+ G   +   T L  +I NLT++V ++M+  +T   ++G    S +  ISCS+C G 
Subjt:  MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMV-KNMSVATTSSTNSGS---SKVMAISCSYCKGW

Query:  RQHPNFSWGGNQGGHGNQNAGQYQQKTMQQPGLLMPSQPQGARPAANPSNSMEALMRDYMVKNYALIQSQAATLRNMEVQLGQFTIEMKNSPQGTLPSNT
         ++ N   G  +  H   NA    Q     P  +  +   G           +  M  YM  N   +QSQA +LRN+E+Q+GQ   ++K+ P+G LPS+ 
Subjt:  RQHPNFSWGGNQGGHGNQNAGQYQQKTMQQPGLLMPSQPQGARPAANPSNSMEALMRDYMVKNYALIQSQAATLRNMEVQLGQFTIEMKNSPQGTLPSNT

Query:  KNPRREGKEQCQTLTLRSGKALPHPYP
        K P+R+GKEQC  LTLRSGK LP  +P
Subjt:  KNPRREGKEQCQTLTLRSGKALPHPYP

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found
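
The identity figures listed for each hit can be approximated from the middle line of an alignment block. The sketch below assumes the middle line follows the usual BLAST pairwise convention (a letter marks an identical residue, `+` a conservative substitution, a blank a mismatch or gap); `midline_stats` is a hypothetical helper. Whitespace in copied alignment text may have been altered, so values recomputed this way may not match the reported percentages exactly.

```python
def midline_stats(midline):
    """Count identities and positives in one BLAST-style midline segment.

    ASSUMPTION: letters mark identical residues, '+' marks conservative
    substitutions, and every other character is a mismatch or gap column.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# First eight columns of the first GenBank alignment above:
# query "MVVDAYAN" against midline "MV+DA AN".
identities, positives, length = midline_stats("MV+DA AN")
print(f"{100 * identities / length:.1f}% identity over {length} columns")
```

To estimate the identity of a whole hit, the counts from each segment would be summed before dividing by the total alignment length.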

Sequences
CDS sequence
ATGGTTGTTGATGCATATGCTAATGGAGCGTTGTTATCTAAATCATATACCGAAGTAGTGGACATATTGGAGAGAATCTCTTCCAATAACTACCACTGGTTAGATTCTAG
AGCTATAAATGACAAAGGAAACTATGGGGCTGCCAGCAATACGGAGATGACTTATCTGAAAGATCAAATAGCAAACCTAACCAACATGGTAAAAAACATGAGTGTTGCTA
CAACATCGTCCACTAACTCAGGATCAAGCAAAGTAATGGCAATCTCATGTTCCTACTGTAAAGGATGGAGGCAACACCCAAATTTTAGCTGGGGAGGAAATCAAGGTGGT
CATGGGAATCAGAATGCCGGGCAGTACCAACAGAAAACAATGCAGCAGCCAGGACTATTGATGCCTAGCCAACCGCAAGGAGCAAGGCCAGCAGCTAACCCCTCAAACTC
TATGGAAGCCCTGATGAGAGATTATATGGTCAAGAATTATGCATTGATACAAAGCCAAGCTGCCACCTTAAGAAACATGGAAGTTCAACTAGGGCAGTTTACTATTGAAA
TGAAAAATAGCCCACAAGGCACCCTGCCAAGTAATACTAAGAATCCTAGGAGGGAAGGAAAAGAGCAGTGTCAAACATTGACCCTTCGTAGTGGAAAAGCATTACCGCAT
CCCTACCCAGCTTTGATGAGAGAGGATAATGTTGTACAGTAG
mRNA sequence
ATGGTTGTTGATGCATATGCTAATGGAGCGTTGTTATCTAAATCATATACCGAAGTAGTGGACATATTGGAGAGAATCTCTTCCAATAACTACCACTGGTTAGATTCTAG
AGCTATAAATGACAAAGGAAACTATGGGGCTGCCAGCAATACGGAGATGACTTATCTGAAAGATCAAATAGCAAACCTAACCAACATGGTAAAAAACATGAGTGTTGCTA
CAACATCGTCCACTAACTCAGGATCAAGCAAAGTAATGGCAATCTCATGTTCCTACTGTAAAGGATGGAGGCAACACCCAAATTTTAGCTGGGGAGGAAATCAAGGTGGT
CATGGGAATCAGAATGCCGGGCAGTACCAACAGAAAACAATGCAGCAGCCAGGACTATTGATGCCTAGCCAACCGCAAGGAGCAAGGCCAGCAGCTAACCCCTCAAACTC
TATGGAAGCCCTGATGAGAGATTATATGGTCAAGAATTATGCATTGATACAAAGCCAAGCTGCCACCTTAAGAAACATGGAAGTTCAACTAGGGCAGTTTACTATTGAAA
TGAAAAATAGCCCACAAGGCACCCTGCCAAGTAATACTAAGAATCCTAGGAGGGAAGGAAAAGAGCAGTGTCAAACATTGACCCTTCGTAGTGGAAAAGCATTACCGCAT
CCCTACCCAGCTTTGATGAGAGAGGATAATGTTGTACAGTAG
Protein sequence
MVVDAYANGALLSKSYTEVVDILERISSNNYHWLDSRAINDKGNYGAASNTEMTYLKDQIANLTNMVKNMSVATTSSTNSGSSKVMAISCSYCKGWRQHPNFSWGGNQGG
HGNQNAGQYQQKTMQQPGLLMPSQPQGARPAANPSNSMEALMRDYMVKNYALIQSQAATLRNMEVQLGQFTIEMKNSPQGTLPSNTKNPRREGKEQCQTLTLRSGKALPH
PYPALMREDNVVQ
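
The protein sequence can be cross-checked against the CDS by translating it codon by codon. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies and that the input starts in frame; `translate` is a hypothetical helper, not a CuGenDB tool. It raises `KeyError` on codons containing characters other than A, C, G, or T.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence up to (not including) the first stop."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and the final line of the CDS above.
print(translate("ATGGTTGTTGATGCATATGCTAATGGAGCG"))              # MVVDAYANGA
print(translate("CCCTACCCAGCTTTGATGAGAGAGGATAATGTTGTACAGTAG"))  # PYPALMREDNVVQ
```

Both fragments agree with the start and end of the protein sequence listed above.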