CuGenDBv2

Tan0002766 (gene) of Snake gourd v1 genome

Gene ID: Tan0002766
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG10:27534138..27534482
RNA-Seq Expression: Tan0002766
Synteny: Tan0002766
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
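The genome location field can be split into a sequence name and a coordinate pair. The sketch below is illustrative only: `parse_location` and `span_length` are hypothetical helper names, and it assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re


def parse_location(text):
    """Split a string like 'LG10:27534138..27534482' into (name, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1


name, start, end = parse_location("LG10:27534138..27534482")
print(name, start, end, span_length(start, end))
```

Under that assumption the span is 345 bp, which is the length a 114-residue protein plus a stop codon requires (115 × 3).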


Homology
GenBank top hits (e value, % identity, alignment)
XP_022143540.1 uncharacterized protein LOC111013417 [Momordica charantia] (e value: 5.9e-31, % identity: 65.66)
Query:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR
        M+SSI+ L  +EKL G N   WKN LNTILVVDD +FVLTEECP  P +NA+RNVR+A+DRW+KANDKA+VY+L SM+D+LAKKHE ++TAK+IMD ++
Subjt:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR

XP_022157095.1 uncharacterized protein LOC111023904 [Momordica charantia] (e value: 4.5e-31, % identity: 66.67)
Query:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR
        M+SSI+ LL +EKL G N   WKN LNTILVVDD +FVLTEECP  P  NA+RNVR+A+DRW+KANDKA+VY+L SM+D+LAKKHE ++TAK+IMD ++
Subjt:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR

XP_022157632.1 uncharacterized protein LOC111024294 [Momordica charantia] (e value: 3.4e-31, % identity: 66.67)
Query:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR
        M+SSI+ LL +EKL G N   WKN LNTILVVDD +FVLTEECP  P  NA+RNVR+A+DRW+KANDKA+VY+L SM+D+LAKKHE ++TAK+IMD ++
Subjt:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR

XP_022157844.1 uncharacterized protein LOC111024457 [Momordica charantia] (e value: 2.0e-31, % identity: 66.67)
Query:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR
        M+SSI+ LL +EKL G N   WKN LNTILVVDD +FVLTEECP  P +NA+RNVR+A+DRW+KANDKA+VY+L SM+D+LAKKHE ++TAK+IMD ++
Subjt:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR

XP_038882358.1 uncharacterized protein LOC120073622 [Benincasa hispida] (e value: 1.3e-30, % identity: 66.67)
Query:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR
        M+SSII LL +EKL G+N   WK+ LNTILVVDD +FVLTEECP  P SNA+R VR+AYDRW+KAN+KA++Y+L SMSD+LAKKHE + TAK+I+D +R
Subjt:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CP29 uncharacterized protein LOC111013417 (e value: 2.8e-31, % identity: 65.66)
Query:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR
        M+SSI+ L  +EKL G N   WKN LNTILVVDD +FVLTEECP  P +NA+RNVR+A+DRW+KANDKA+VY+L SM+D+LAKKHE ++TAK+IMD ++
Subjt:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR

A0A6J1DS54 uncharacterized protein LOC111023904 (e value: 2.2e-31, % identity: 66.67)
Query:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR
        M+SSI+ LL +EKL G N   WKN LNTILVVDD +FVLTEECP  P  NA+RNVR+A+DRW+KANDKA+VY+L SM+D+LAKKHE ++TAK+IMD ++
Subjt:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR

A0A6J1DUZ9 uncharacterized protein LOC111024294 (e value: 1.7e-31, % identity: 66.67)
Query:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR
        M+SSI+ LL +EKL G N   WKN LNTILVVDD +FVLTEECP  P  NA+RNVR+A+DRW+KANDKA+VY+L SM+D+LAKKHE ++TAK+IMD ++
Subjt:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR

A0A6J1DXQ5 uncharacterized protein LOC111024457 (e value: 9.8e-32, % identity: 66.67)
Query:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR
        M+SSI+ LL +EKL G N   WKN LNTILVVDD +FVLTEECP  P +NA+RNVR+A+DRW+KANDKA+VY+L SM+D+LAKKHE ++TAK+IMD ++
Subjt:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR

E2GK51 Gag/pol protein (Fragment) (e value: 4.1e-30, % identity: 64.65)
Query:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR
        M++SI+ LL +EKL G+N   WK+ LNTILVVDD +FVLTEECP  P  NA+R VR+AYDRW+KANDKA+VY+L SM+D+LAKKH+ + TAK IMD +R
Subjt:  MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMR
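The % identity values listed with the hits can be related to the middle line of each alignment. The sketch below is an illustration, not the database's own calculation: `percent_identity` is a hypothetical helper, and it assumes the usual BLAST-style convention in which the middle line repeats the residue letter at identical positions, shows `+` for conservative substitutions, and leaves a space elsewhere. It also assumes an ungapped alignment, so the query length serves as the alignment length.

```python
def percent_identity(query, midline):
    """Percent identity for one ungapped BLAST-style alignment block."""
    # Trailing spaces are often lost in copied text, so pad the midline.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# First ten columns of the first alignment shown above.
query = "MSSSIISLLG"
midline = "M+SSI+ L  "
print(round(percent_identity(query, midline), 2))
```

Each alignment shown covers 99 query residues, so the reported values of 64.65, 65.66 and 66.67 are consistent with 64, 65 and 66 identical positions out of 99.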

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTCTAGCTCTATAATTTCCTTATTGGGTGCCGAAAAGTTAACCGGAGAGAATATGATGAGATGGAAAAACAAACTCAACACTATTTTGGTAGTGGATGATCCGAAGTT
TGTGCTAACTGAGGAATGTCCTCATGTGCCAGGCTCGAATGCGTCACGAAATGTTCGTGATGCATATGATCGATGGATCAAGGCCAATGATAAGGCCAAGGTCTACATGC
TGGTAAGTATGTCCGACATATTAGCCAAGAAGCATGAGGACATGATTACCGCCAAGAAAATCATGGATTACATGCGGTGGGTATGTTTGGACAACAATCCACACAAGCCC
GACATAATGCCCTAA
mRNA sequence
ATGTCTAGCTCTATAATTTCCTTATTGGGTGCCGAAAAGTTAACCGGAGAGAATATGATGAGATGGAAAAACAAACTCAACACTATTTTGGTAGTGGATGATCCGAAGTT
TGTGCTAACTGAGGAATGTCCTCATGTGCCAGGCTCGAATGCGTCACGAAATGTTCGTGATGCATATGATCGATGGATCAAGGCCAATGATAAGGCCAAGGTCTACATGC
TGGTAAGTATGTCCGACATATTAGCCAAGAAGCATGAGGACATGATTACCGCCAAGAAAATCATGGATTACATGCGGTGGGTATGTTTGGACAACAATCCACACAAGCCC
GACATAATGCCCTAA
Protein sequence
MSSSIISLLGAEKLTGENMMRWKNKLNTILVVDDPKFVLTEECPHVPGSNASRNVRDAYDRWIKANDKAKVYMLVSMSDILAKKHEDMITAKKIMDYMRWVCLDNNPHKP
DIMP
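The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below is a minimal illustration: it assumes the standard nuclear genetic code, and `translate` is a hypothetical helper, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCTAGCTCTATAATTTCC"))  # first seven codons of the CDS
print(translate("GACATAATGCCCTAA"))        # last line of the CDS
```

The two calls print `MSSSIIS` and `DIMP`, matching the start and end of the protein sequence above. Passing the full CDS, with or without line breaks, works the same way.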