CuGenDBv2

Moc10g28480 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc10g28480
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr10:21362425..21365627
RNA-Seq Expression: Moc10g28480
Synteny: Moc10g28480
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
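
The genome location above uses a `chromosome:start..end` layout. As a minimal sketch, the coordinates can be split and the span length computed as below. The function name `parse_location` is hypothetical, and the length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into its parts plus the span length.

    Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr10:21362425..21365627")
print(chrom, start, end, span)
```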


Homology
GenBank top hits (e value, % identity, alignment)
KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa], e value 1.1e-32, identity 63.89%
Query:  WAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWKVTKGSTVIT
        W +DSGASFHT    +I+ENYV GN GKVYLA+G PLDI+GIGD+NLKM++G VWKI KVRHV  +M+NLIS+GQLD+ G  ++F  G WKV KGS V+ 
Subjt:  WAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWKVTKGSTVIT

Query:  RGRKLGTL
        RG K G+L
Subjt:  RGRKLGTL

KAG7584790.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa], e value 3.9e-33, identity 64.81%
Query:  WAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWKVTKGSTVIT
        W +DSGASFHT    +I+ENYVAGN GKVYLA+G PLDI+GIGD+NLKM++G VWKI KVRHV  +M+NLIS+GQLD+ G  ++F  G WKV KGS V+ 
Subjt:  WAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWKVTKGSTVIT

Query:  RGRKLGTL
        RG K G+L
Subjt:  RGRKLGTL

PON60333.1 Zinc finger, CCHC-type [Parasponia andersonii], e value 4.4e-37, identity 67.59%
Query:  WAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWKVTKGSTVIT
        W +DSGASFHT    D+LENY+AGN GKVYLADGEPLDI+G+GD+ LKM+NGSVWKI+KVRHV  +M+NLIS+GQLD EG  ++F+ G+WKV+KG+ V+ 
Subjt:  WAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWKVTKGSTVIT

Query:  RGRKLGTL
        RG K GTL
Subjt:  RGRKLGTL

PON72113.1 hypothetical protein PanWU01x14_069550, partial [Parasponia andersonii], e value 6.0e-34, identity 60.68%
Query:  LEFEGAHNIWAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWK
        L  +   N W +DSGASFHT    D+ ENYV GN GKVY+ADGEPL+IIG+GD+ LKM+NGSVWKI+KVRHV  +++NLIS+GQLD EG  ++F+ G+WK
Subjt:  LEFEGAHNIWAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWK

Query:  VTKGSTVITRGRKLGTL
        V+KG+ V+TR  K  TL
Subjt:  VTKGSTVITRGRKLGTL

XP_022152845.1 uncharacterized protein LOC111020469 [Momordica charantia], e value 3.9e-33, identity 49.25%
Query:  QKGYGIAFASDIALNVDRGRNNNKDHGKHGKSRNNRSKSKNSRLEFEGAHNIWAVDSG-----------ASFHTIGQ-HDIL--------ENYV--AGNQ
        +K  GIA  S   LNVDRGRNNN+ +G  GKS+NNRS+S+NSR E      I  + S            A  +   Q HD L        + +V  +GN 
Subjt:  QKGYGIAFASDIALNVDRGRNNNKDHGKHGKSRNNRSKSKNSRLEFEGAHNIWAVDSG-----------ASFHTIGQ-HDIL--------ENYV--AGNQ

Query:  GKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWKVTKGSTVITRGRKLGTLNKVDSGKTELDGSIEKTQ
        GKVYLADGEPLDIIGIG+VNLKMANGSVWKIRK                LDNEGCEISF QGNWKVTKG+ VI RG K GTL  V++   ++   ++ + 
Subjt:  GKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWKVTKGSTVITRGRKLGTLNKVDSGKTELDGSIEKTQ

Query:  Y
        Y
Subjt:  Y

TrEMBL top hits (e value, % identity, alignment)
A0A0D2ZYN2 Uncharacterized protein, e value 1.1e-33, identity 58.87%
Query:  SKSKNSRLEFEGAHNIWAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEIS
        S +K S    +   + W +DSG SFHT   H+I+ENYVAGN GKVYLADG PLDI+GIGD+NLKM++  VWKI KVRHV  +M+NLIS+GQLD+ G +++
Subjt:  SKSKNSRLEFEGAHNIWAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEIS

Query:  FSQGNWKVTKGSTVITRGRKLGTL
        F  G+W+V KGS V+ RG K GTL
Subjt:  FSQGNWKVTKGSTVITRGRKLGTL

A0A0D3CS45 Uncharacterized protein, e value 4.4e-35, identity 60.48%
Query:  SKSKNSRLEFEGAHNIWAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEIS
        S +K S    + + + W +DSGASFHT   H+I+ENYVAGN GKVYLADG PLDI+GIGD+NLKM++G VWKI KVRHV  +M+NLIS+GQLD+ G +++
Subjt:  SKSKNSRLEFEGAHNIWAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEIS

Query:  FSQGNWKVTKGSTVITRGRKLGTL
        F  G W+V KGS V+ RG K GTL
Subjt:  FSQGNWKVTKGSTVITRGRKLGTL

A0A2P5CH01 Zinc finger, CCHC-type, e value 2.1e-37, identity 67.59%
Query:  WAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWKVTKGSTVIT
        W +DSGASFHT    D+LENY+AGN GKVYLADGEPLDI+G+GD+ LKM+NGSVWKI+KVRHV  +M+NLIS+GQLD EG  ++F+ G+WKV+KG+ V+ 
Subjt:  WAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWKVTKGSTVIT

Query:  RGRKLGTL
        RG K GTL
Subjt:  RGRKLGTL

A0A2P5DFR2 Uncharacterized protein (Fragment), e value 2.9e-34, identity 60.68%
Query:  LEFEGAHNIWAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWK
        L  +   N W +DSGASFHT    D+ ENYV GN GKVY+ADGEPL+IIG+GD+ LKM+NGSVWKI+KVRHV  +++NLIS+GQLD EG  ++F+ G+WK
Subjt:  LEFEGAHNIWAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWK

Query:  VTKGSTVITRGRKLGTL
        V+KG+ V+TR  K  TL
Subjt:  VTKGSTVITRGRKLGTL

A0A6J1DF43 uncharacterized protein LOC111020469, e value 1.9e-33, identity 49.25%
Query:  QKGYGIAFASDIALNVDRGRNNNKDHGKHGKSRNNRSKSKNSRLEFEGAHNIWAVDSG-----------ASFHTIGQ-HDIL--------ENYV--AGNQ
        +K  GIA  S   LNVDRGRNNN+ +G  GKS+NNRS+S+NSR E      I  + S            A  +   Q HD L        + +V  +GN 
Subjt:  QKGYGIAFASDIALNVDRGRNNNKDHGKHGKSRNNRSKSKNSRLEFEGAHNIWAVDSG-----------ASFHTIGQ-HDIL--------ENYV--AGNQ

Query:  GKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWKVTKGSTVITRGRKLGTLNKVDSGKTELDGSIEKTQ
        GKVYLADGEPLDIIGIG+VNLKMANGSVWKIRK                LDNEGCEISF QGNWKVTKG+ VI RG K GTL  V++   ++   ++ + 
Subjt:  GKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWKVTKGSTVITRGRKLGTLNKVDSGKTELDGSIEKTQ

Query:  Y
        Y
Subjt:  Y

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94, e value 3.8e-15, identity 36.89%
Query:  LEFEGAHNIWAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWK
        +   G  + W VD+ AS H     D+   YVAG+ G V + +     I GIGD+ +K   G    ++ VRHV ++  NLIS   LD +G E  F+   W+
Subjt:  LEFEGAHNIWAVDSGASFHTIGQHDILENYVAGNQGKVYLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWK

Query:  VTKGSTVITRGRKLGTLNKVDS
        +TKGS VI +G   GTL + ++
Subjt:  VTKGSTVITRGRKLGTLNKVDS

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAGAGGATGGGGCCCCTTGTTCAAGTCCCGGAGTCAGCATTTAAGGGAACACTCATATACTCCCTTAAAGTCGGAGAGGAGTGAATTCCATCTTGTGGAGTTACGTTC
CCAGCTCCCCACTCGGTCTCGTCCCCAAAATGGTAGGAATATTGAGTCGGCAACTTGGGCCACTCTCACTCATACAGATCAAAGGACGAGCCCTCACGGGCAGGAGTCCA
TAACTCACTCAAGATTGAGAATGAGTTGTCTGGTCGTCCTAAGAAACAGCAATCTATTAGTTAACGGTGTTACATCAAAAGATTGGCTATTTCGTGGTCCGTTGGGAGCC
TATGAAGGCAGCTATATCAAATTCTTATGGAAAAGAGAAATTGAAATTCGCAGATATCAGAGGTGCAGCTCTTGGAGAGGATATTCGCAGAAAGGATATGGTATTGCATT
TGCTTCTGATATAGCATTGAATGTGGATAGAGGAAGAAATAATAACAAAGACCACGGAAAACATGGAAAGTCAAGAAACAACAGAAGCAAGTCTAAAAACAGCAGACTAG
AATTTGAGGGCGCTCATAACATATGGGCGGTCGATTCAGGTGCGTCTTTTCATACAATAGGACAACATGACATTCTTGAGAATTATGTTGCAGGAAATCAGGGAAAGGTG
TATCTTGCTGATGGAGAGCCTTTAGACATCATTGGGATTGGTGATGTTAATTTAAAAATGGCGAACGGTTCAGTCTGGAAGATTCGCAAGGTACGTCACGTTCAGAATAT
GATGAAGAATCTGATTTCCATGGGGCAGCTAGATAATGAAGGATGTGAAATATCCTTCAGTCAAGGAAACTGGAAAGTTACAAAGGGTTCCACGGTGATCACTCGAGGAA
GAAAGTTAGGAACTTTAAATAAAGTTGATTCAGGAAAGACAGAGTTAGATGGAAGCATAGAGAAGACTCAGTATATAAGTCCTCCTGAGGTTGAGACTAAAACAACAGAG
ATTGAAGATCATAATACAATTACTCCTGAAGAAACAGTTGTGGAATCTGATGAATAG
mRNA sequence
ATGAGAGGATGGGGCCCCTTGTTCAAGTCCCGGAGTCAGCATTTAAGGGAACACTCATATACTCCCTTAAAGTCGGAGAGGAGTGAATTCCATCTTGTGGAGTTACGTTC
CCAGCTCCCCACTCGGTCTCGTCCCCAAAATGGTAGGAATATTGAGTCGGCAACTTGGGCCACTCTCACTCATACAGATCAAAGGACGAGCCCTCACGGGCAGGAGTCCA
TAACTCACTCAAGATTGAGAATGAGTTGTCTGGTCGTCCTAAGAAACAGCAATCTATTAGTTAACGGTGTTACATCAAAAGATTGGCTATTTCGTGGTCCGTTGGGAGCC
TATGAAGGCAGCTATATCAAATTCTTATGGAAAAGAGAAATTGAAATTCGCAGATATCAGAGGTGCAGCTCTTGGAGAGGATATTCGCAGAAAGGATATGGTATTGCATT
TGCTTCTGATATAGCATTGAATGTGGATAGAGGAAGAAATAATAACAAAGACCACGGAAAACATGGAAAGTCAAGAAACAACAGAAGCAAGTCTAAAAACAGCAGACTAG
AATTTGAGGGCGCTCATAACATATGGGCGGTCGATTCAGGTGCGTCTTTTCATACAATAGGACAACATGACATTCTTGAGAATTATGTTGCAGGAAATCAGGGAAAGGTG
TATCTTGCTGATGGAGAGCCTTTAGACATCATTGGGATTGGTGATGTTAATTTAAAAATGGCGAACGGTTCAGTCTGGAAGATTCGCAAGGTACGTCACGTTCAGAATAT
GATGAAGAATCTGATTTCCATGGGGCAGCTAGATAATGAAGGATGTGAAATATCCTTCAGTCAAGGAAACTGGAAAGTTACAAAGGGTTCCACGGTGATCACTCGAGGAA
GAAAGTTAGGAACTTTAAATAAAGTTGATTCAGGAAAGACAGAGTTAGATGGAAGCATAGAGAAGACTCAGTATATAAGTCCTCCTGAGGTTGAGACTAAAACAACAGAG
ATTGAAGATCATAATACAATTACTCCTGAAGAAACAGTTGTGGAATCTGATGAATAG
Protein sequence
MRGWGPLFKSRSQHLREHSYTPLKSERSEFHLVELRSQLPTRSRPQNGRNIESATWATLTHTDQRTSPHGQESITHSRLRMSCLVVLRNSNLLVNGVTSKDWLFRGPLGA
YEGSYIKFLWKREIEIRRYQRCSSWRGYSQKGYGIAFASDIALNVDRGRNNNKDHGKHGKSRNNRSKSKNSRLEFEGAHNIWAVDSGASFHTIGQHDILENYVAGNQGKV
YLADGEPLDIIGIGDVNLKMANGSVWKIRKVRHVQNMMKNLISMGQLDNEGCEISFSQGNWKVTKGSTVITRGRKLGTLNKVDSGKTELDGSIEKTQYISPPEVETKTTE
IEDHNTITPEETVVESDE
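
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and do not come from CuGenDB. It is shown on the first 18 nucleotides of the CDS only, but the same function accepts the full sequence with line breaks included.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the nuclear standard code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first; codons that are not in
    the table (for example those containing N) are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nucleotides of the CDS sequence listed above.
cds_start = "ATGAGAGGATGGGGCCCC"
print(translate(cds_start))  # MRGWGP
```

The printed prefix matches the first six residues of the protein sequence above (MRGWGP).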