CuGenDBv2

Moc06g31070 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g31070
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr6:23376463..23383816
RNA-Seq Expression: Moc06g31070
Synteny: Moc06g31070
Gene Ontology terms: NA
InterPro domains: NA
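The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be split into its parts and the genomic span computed. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page does not state explicitly.

```python
import re


def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr6:23376463..23383816")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the span is much longer than the CDS listed under Sequences, which would be consistent with a gene model containing introns, though the page does not show the exon structure.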


Homology
GenBank top hits (e value, % identity, alignment)
GAU28547.1 hypothetical protein TSUD_268860 [Trifolium subterraneum] (e value: 1.1e-05; identity: 28.33%)
Query:  NVATDSTSDGSVRPSLSPIASSHGNVALSEVVGNVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALL-------NGDCFPTVTTKHP
        N+  D T+  +   S SP+ SSH              T T I  S  +  P  +    LR+S R++ PP YL+++HC LL       + D   + T+K+P
Subjt:  NVATDSTSDGSVRPSLSPIASSHGNVALSEVVGNVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALL-------NGDCFPTVTTKHP

Query:  LHNYVSYSRLAPSHSHLALAV-SITDLLTYKKIHKTVKERPAIGMSDKEWTKMDEQAITIIRLCLVMNLMSLVASQKKAM
        L +++SY  L+P+H H  L + S+++  +Y+K            +SD+ W    +  +  +      NL+ L  S KKA+
Subjt:  LHNYVSYSRLAPSHSHLALAV-SITDLLTYKKIHKTVKERPAIGMSDKEWTKMDEQAITIIRLCLVMNLMSLVASQKKAM

KZV50756.1 hypothetical protein F511_19388 [Dorcoceras hygrometricum] (e value: 5.1e-06; identity: 34.15%)
Query:  NVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALLNGDCFPTVTTKHPLHNYVSYSRLAPSHSHLALAVS
        N+ P  + + +S  +  P++ +++   RS R+ +PP +L++YHC +L+    P+ +T HPL N+V+YS+L+P H +L   +S
Subjt:  NVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALLNGDCFPTVTTKHPLHNYVSYSRLAPSHSHLALAVS

XP_020255452.1 uncharacterized protein LOC109832512 [Asparagus officinalis] (e value: 3.9e-06; identity: 26.52%)
Query:  QQLLQLLQSQMNVSKVSTSDAASNHMTGITACSLNAANL-WILDSGASAHICCSQSLFETVSPISPIH-VNLPDKSRFVVHLSDTSIAADNSGSLVNSDN
        QQL+ LL + M     + S A++  ++G  AC   + ++ W++D+GA+ HI CS+SLF ++ P S I  V LP+      ++S T   A  +G  ++ D 
Subjt:  QQLLQLLQSQMNVSKVSTSDAASNHMTGITACSLNAANL-WILDSGASAHICCSQSLFETVSPISPIH-VNLPDKSRFVVHLSDTSIAADNSGSLVNSDN

Query:  VATDSTSDGSVRPSLSPIASS-------------HGNVAL----------SEVVGNVCPTATDIVDSAIVGSP---VAVEAQLL---RRSNRLSRPPSYL
         +  +   G +   L  + S+             HG + +           ++    C   T   D A+V  P   + V  +++   RRS R+SRPP YL
Subjt:  VATDSTSDGSVRPSLSPIASS-------------HGNVAL----------SEVVGNVCPTATDIVDSAIVGSP---VAVEAQLL---RRSNRLSRPPSYL

Query:  REYHCALLNGDCFPTVTTKHPLHNYVSYSRLAPSH-SHLALAVSITDLLTYKKIHKTVKERPAI
        +++  +++ G      +T HP+ +Y+ YS L+ +H S++A   S    +T+ + +++ + R A+
Subjt:  REYHCALLNGDCFPTVTTKHPLHNYVSYSRLAPSH-SHLALAVSITDLLTYKKIHKTVKERPAI

XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia] (e value: 2.1e-07; identity: 45.56%)
Query:  NPDVVAQCQQLLQLLQSQMNVSK-VSTSDAASNHMTGITACSLNAANLWILDSGASAHICCSQSLFETVSPISPIHVNLPDKSRFVVHLS
        N   ++  QQL QLLQSQ++  K V+ +D  +++       SL      ILD GASAHIC  + LF+ +  ISP+HVNLP+K RFVV  S
Subjt:  NPDVVAQCQQLLQLLQSQMNVSK-VSTSDAASNHMTGITACSLNAANLWILDSGASAHICCSQSLFETVSPISPIHVNLPDKSRFVVHLS

XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia] (e value: 1.7e-04; identity: 35.79%)
Query:  PSLSPIA-SSHGNVALSEVVGNVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALLNGDCFPTVTTKHPLHNYVSYSRLAPSH
        P+  P A  S     +SE + N  P+ +  + +   GS +     + RRS R S+ PSYL+++HC+LL        +T+HPL  Y+SYSRL+ +H
Subjt:  PSLSPIA-SSHGNVALSEVVGNVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALLNGDCFPTVTTKHPLHNYVSYSRLAPSH

XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia] (e value: 3.9e-06; identity: 26.96%)
Query:  WILDSGASAHICCSQSLFETVSPISPIHVNLPDKSRF------VVHLSDTSIAAD-----------NSGSLVNSDNVATDSTSDGSVRPSLSPIASSH--
        WI+DSGA++H+C   +LF     ++ + V+LP+ +R        +HLS + I  D             G L+++  +      D S   +LS  A  H  
Subjt:  WILDSGASAHICCSQSLFETVSPISPIHVNLPDKSRF------VVHLSDTSIAAD-----------NSGSLVNSDNVATDSTSDGSVRPSLSPIASSH--

Query:  GNVA-----------------LSEVVGNVCPTATDIVDSAI----------VGSPVAVEAQLLR---------RSNRLSRPPSYLREYHCALLNGDCFPT
        G++A                 L  +  +  P+   ++DS             G+ ++ EA  L          R  R S+ PSYL +YHCAL+     PT
Subjt:  GNVA-----------------LSEVVGNVCPTATDIVDSAI----------VGSPVAVEAQLLR---------RSNRLSRPPSYLREYHCALLNGDCFPT

Query:  --VTTKHPLHNYVSYSRLAPSHSHLALAVS
           TT +P+ N++SYS+ +PS+    L++S
Subjt:  --VTTKHPLHNYVSYSRLAPSHSHLALAVS

TrEMBL top hits (e value, % identity, alignment)
A0A2Z6NG25 Integrase catalytic domain-containing protein (e value: 5.6e-06; identity: 28.33%)
Query:  NVATDSTSDGSVRPSLSPIASSHGNVALSEVVGNVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALL-------NGDCFPTVTTKHP
        N+  D T+  +   S SP+ SSH              T T I  S  +  P  +    LR+S R++ PP YL+++HC LL       + D   + T+K+P
Subjt:  NVATDSTSDGSVRPSLSPIASSHGNVALSEVVGNVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALL-------NGDCFPTVTTKHP

Query:  LHNYVSYSRLAPSHSHLALAV-SITDLLTYKKIHKTVKERPAIGMSDKEWTKMDEQAITIIRLCLVMNLMSLVASQKKAM
        L +++SY  L+P+H H  L + S+++  +Y+K            +SD+ W    +  +  +      NL+ L  S KKA+
Subjt:  LHNYVSYSRLAPSHSHLALAV-SITDLLTYKKIHKTVKERPAIGMSDKEWTKMDEQAITIIRLCLVMNLMSLVASQKKAM

A0A2Z7D0U1 Integrase catalytic domain-containing protein (e value: 2.5e-06; identity: 34.15%)
Query:  NVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALLNGDCFPTVTTKHPLHNYVSYSRLAPSHSHLALAVS
        N+ P  + + +S  +  P++ +++   RS R+ +PP +L++YHC +L+    P+ +T HPL N+V+YS+L+P H +L   +S
Subjt:  NVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALLNGDCFPTVTTKHPLHNYVSYSRLAPSHSHLALAVS

A0A6J1CR17 uncharacterized protein LOC111013441 (e value: 1.0e-07; identity: 45.56%)
Query:  NPDVVAQCQQLLQLLQSQMNVSK-VSTSDAASNHMTGITACSLNAANLWILDSGASAHICCSQSLFETVSPISPIHVNLPDKSRFVVHLS
        N   ++  QQL QLLQSQ++  K V+ +D  +++       SL      ILD GASAHIC  + LF+ +  ISP+HVNLP+K RFVV  S
Subjt:  NPDVVAQCQQLLQLLQSQMNVSK-VSTSDAASNHMTGITACSLNAANLWILDSGASAHICCSQSLFETVSPISPIHVNLPDKSRFVVHLS

A0A6J1CR17 uncharacterized protein LOC111013441 (e value: 8.0e-05; identity: 35.79%)
Query:  PSLSPIA-SSHGNVALSEVVGNVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALLNGDCFPTVTTKHPLHNYVSYSRLAPSH
        P+  P A  S     +SE + N  P+ +  + +   GS +     + RRS R S+ PSYL+++HC+LL        +T+HPL  Y+SYSRL+ +H
Subjt:  PSLSPIA-SSHGNVALSEVVGNVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALLNGDCFPTVTTKHPLHNYVSYSRLAPSH

A0A6J1CR17 uncharacterized protein LOC111013441 (e value: 5.0e-07; identity: 27.03%)
Query:  QLLQLLQS-------QMNVSKVSTSDAASNHMTGITACSLNAANLWILDSGASAHICCSQSLFETVSPISPIHVNLPDKSRFVVHLSDTSIAADNSGSLV
        QLL LL +       Q NV+      AA++  T +T   LN A  W++D+GA+ H+  +   + ++  +  ++V LP+    VV                
Subjt:  QLLQLLQS-------QMNVSKVSTSDAASNHMTGITACSLNAANLWILDSGASAHICCSQSLFETVSPISPIHVNLPDKSRFVVHLSDTSIAADNSGSLV

Query:  NSDNVATDSTSDGSVRPSLSPIASSHGNVALSEVVGNVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALLNGDCFPTVTTK----HP
                 T  GS++ + S + +   +V  S++     PT T    S I  SP       LR+S R+++PP YL+++HC L++     T ++     +P
Subjt:  NSDNVATDSTSDGSVRPSLSPIASSHGNVALSEVVGNVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALLNGDCFPTVTTK----HP

Query:  LHNYVSYSRLAPSHSHLALAVS
        L + + Y++L+PSH + AL+V+
Subjt:  LHNYVSYSRLAPSHSHLALAVS

A0A803QGJ5 Uncharacterized protein (e value: 4.4e-11; identity: 31.58%)
Query:  QCQQLLQLLQSQMNVSKVSTSDAASNHMTGITACS---LNAANLWILDSGASAHICCSQSLFETVSPISPIHVNLPDKSRFVVHLSDTSIAADNSGSLVN
        Q QQLL LL SQ N S  +     S   +GI   +   L     WI+DSGA+ HIC     F++V  ++   + LP+ +  +VH S     + N   LV 
Subjt:  QCQQLLQLLQSQMNVSKVSTSDAASNHMTGITACS---LNAANLWILDSGASAHICCSQSLFETVSPISPIHVNLPDKSRFVVHLSDTSIAADNSGSLVN

Query:  SD---------NVATDSTSDGSVRPS--LSPIASSHGNVALSEVVGNVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALLNGDCFPT
         D         N+ +D  ++    P   L  IA+               P ATD+ + A    P      L RR+ R+SRPPSYL+++ C  L  D    
Subjt:  SD---------NVATDSTSDGSVRPS--LSPIASSHGNVALSEVVGNVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALLNGDCFPT

Query:  VTTKHPLHNYVSYSRLAPSHSHLALAVS
         +T +P+  YVSYS+L+ ++    LAV+
Subjt:  VTTKHPLHNYVSYSRLAPSHSHLALAVS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCCGAATCCTGATGTTGTTGCTCAATGTCAACAGCTTCTCCAGCTTCTTCAGTCTCAAATGAATGTCTCGAAGGTTTCTACTTCTGATGCAGCTTCGAATCAT
ATGACAGGTATTACTGCTTGTTCTTTAAATGCTGCTAATTTGTGGATTCTGGACTCTGGTGCTTCAGCGCATATTTGCTGTTCTCAGTCATTGTTTGAAACTGTT
TCGCCTATTTCTCCAATCCATGTCAACTTGCCTGATAAGTCTCGGTTTGTTGTGCACTTAAGTGATACCTCTATAGCTGCTGATAATTCGGGTTCTTTAGTTAAT
TCAGACAATGTTGCTACTGACAGTACTTCTGATGGTTCTGTTCGGCCATCTCTCTCTCCAATTGCAAGTTCTCATGGGAATGTTGCTCTTTCTGAGGTTGTTGGT
AATGTTTGTCCAACTGCCACTGATATTGTTGATTCTGCTATTGTTGGCTCTCCGGTTGCTGTTGAGGCTCAGCTACTTCGACGTTCTAATCGTCTATCTCGACCT
CCTTCATACCTCAGAGAATATCATTGTGCTCTCTTGAATGGTGATTGTTTTCCTACTGTCACTACTAAACATCCTTTGCACAATTATGTATCTTACTCTCGGTTG
GCTCCGTCTCATTCTCATTTGGCCTTAGCTGTTTCTATTACAGATCTTCTTACATACAAGAAGATTCACAAGACTGTAAAAGAACGACCGGCCATTGGGATGTCA
GATAAAGAGTGGACAAAGATGGATGAACAAGCGATAACAATCATCAGGCTGTGTTTGGTGATGAATCTGATGAGTCTCGTGGCGAGCCAGAAAAAAGCAATGAGA
CTGATGAAAGCTTTGACTGATAGGTTTGGCGAGCTGATGAAGTCGCGTAGAAGAAGGAGTGCATCGAGGAAAAAGACCACAGTTGATGCTGAGGTCGAGGAGGAA
GTCTCTAAAGTGACAACAGACTTGGGTGGGAATGCCAAGTCATTAGAGGGGAAATCTTTCTTTAGGAGTCATTGGGTGTGA
mRNA sequence
ATGCCGAATCCTGATGTTGTTGCTCAATGTCAACAGCTTCTCCAGCTTCTTCAGTCTCAAATGAATGTCTCGAAGGTTTCTACTTCTGATGCAGCTTCGAATCAT
ATGACAGGTATTACTGCTTGTTCTTTAAATGCTGCTAATTTGTGGATTCTGGACTCTGGTGCTTCAGCGCATATTTGCTGTTCTCAGTCATTGTTTGAAACTGTT
TCGCCTATTTCTCCAATCCATGTCAACTTGCCTGATAAGTCTCGGTTTGTTGTGCACTTAAGTGATACCTCTATAGCTGCTGATAATTCGGGTTCTTTAGTTAAT
TCAGACAATGTTGCTACTGACAGTACTTCTGATGGTTCTGTTCGGCCATCTCTCTCTCCAATTGCAAGTTCTCATGGGAATGTTGCTCTTTCTGAGGTTGTTGGT
AATGTTTGTCCAACTGCCACTGATATTGTTGATTCTGCTATTGTTGGCTCTCCGGTTGCTGTTGAGGCTCAGCTACTTCGACGTTCTAATCGTCTATCTCGACCT
CCTTCATACCTCAGAGAATATCATTGTGCTCTCTTGAATGGTGATTGTTTTCCTACTGTCACTACTAAACATCCTTTGCACAATTATGTATCTTACTCTCGGTTG
GCTCCGTCTCATTCTCATTTGGCCTTAGCTGTTTCTATTACAGATCTTCTTACATACAAGAAGATTCACAAGACTGTAAAAGAACGACCGGCCATTGGGATGTCA
GATAAAGAGTGGACAAAGATGGATGAACAAGCGATAACAATCATCAGGCTGTGTTTGGTGATGAATCTGATGAGTCTCGTGGCGAGCCAGAAAAAAGCAATGAGA
CTGATGAAAGCTTTGACTGATAGGTTTGGCGAGCTGATGAAGTCGCGTAGAAGAAGGAGTGCATCGAGGAAAAAGACCACAGTTGATGCTGAGGTCGAGGAGGAA
GTCTCTAAAGTGACAACAGACTTGGGTGGGAATGCCAAGTCATTAGAGGGGAAATCTTTCTTTAGGAGTCATTGGGTGTGA
Protein sequence
MPNPDVVAQCQQLLQLLQSQMNVSKVSTSDAASNHMTGITACSLNAANLWILDSGASAHICCSQSLFETVSPISPIHVNLPDKSRFVVHLSDTSIAADNSGSLVN
SDNVATDSTSDGSVRPSLSPIASSHGNVALSEVVGNVCPTATDIVDSAIVGSPVAVEAQLLRRSNRLSRPPSYLREYHCALLNGDCFPTVTTKHPLHNYVSYSRL
APSHSHLALAVSITDLLTYKKIHKTVKERPAIGMSDKEWTKMDEQAITIIRLCLVMNLMSLVASQKKAMRLMKALTDRFGELMKSRRRRSASRKKTTVDAEVEEE
VSKVTTDLGGNAKSLEGKSFFRSHWV
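
As a quick consistency check, the CDS can be translated and compared against the protein sequence above. The sketch below is a minimal, dependency-free illustration that assumes the standard genetic code applies to this nuclear gene; the function name `translate` is hypothetical, and only the first and last CDS lines are used for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the CDS listed above.
first_cds_line = (
    "ATGCCGAATCCTGATGTTGTTGCTCAATGTCAACAGCTTCTCCAGCTTCTTCAG"
    "TCTCAAATGAATGTCTCGAAGGTTTCTACTTCTGATGCAGCTTCGAATCAT"
)
last_cds_line = (
    "GTCTCTAAAGTGACAACAGACTTGGGTGGGAATGCCAAGTCATTAGAGGGG"
    "AAATCTTTCTTTAGGAGTCATTGGGTGTGA"
)

print(translate(first_cds_line))  # start of the protein sequence
print(translate(last_cds_line))   # end of the protein sequence
```

Both fragments are in frame because each full CDS line holds 105 nucleotides, a multiple of three. Passing the whole CDS as one string should reproduce the full protein sequence in the same way.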