CuGenDBv2

Tan0013931 (gene) of Snake gourd v1 genome

Gene ID: Tan0013931
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG11:44817093..44817437
RNA-Seq Expression: Tan0013931
Synteny: Tan0013931
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits | e value | % identity
XP_022143540.1 uncharacterized protein LOC111013417 [Momordica charantia] | 1.0e-30 | 65.66
Query:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR
        M+SSI+ L  +E L G N  TWKN LNTILVVD+L+FVLTEECPQ P +NA+RNVR+++DRW+KAN+KA+VY+LASM+D+LAK HE ++TAKEIMD ++
Subjt:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR

XP_022157095.1 uncharacterized protein LOC111023904 [Momordica charantia] | 7.7e-31 | 66.67
Query:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR
        M+SSI+ LL +E L G N  TWKN LNTILVVD+L+FVLTEECPQ P  NA+RNVR+++DRW+KAN+KA+VY+LASM+D+LAK HE ++TAKEIMD ++
Subjt:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR

XP_022157632.1 uncharacterized protein LOC111024294 [Momordica charantia] | 5.9e-31 | 66.67
Query:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR
        M+SSI+ LL +E L G N  TWKN LNTILVVD+L+FVLTEECPQ P  NA+RNVR+++DRW+KAN+KA+VY+LASM+D+LAK HE ++TAKEIMD ++
Subjt:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR

XP_022157844.1 uncharacterized protein LOC111024457 [Momordica charantia] | 3.4e-31 | 66.67
Query:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR
        M+SSI+ LL +E L G N  TWKN LNTILVVD+L+FVLTEECPQ P +NA+RNVR+++DRW+KAN+KA+VY+LASM+D+LAK HE ++TAKEIMD ++
Subjt:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR

XP_038882358.1 uncharacterized protein LOC120073622 [Benincasa hispida] | 2.0e-31 | 67.33
Query:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVRE
        M+SSII LL +E L G+N   WK+ LNTILVVD+L+FVLTEECPQ P SNA+R VR++YDRW+KANEKA++Y+LASMSD+LAK HE + TAKEI+D +RE
Subjt:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVRE

Query:  V
        +
Subjt:  V
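
The % identity column can be related to the alignment midline, where a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch. The following is a minimal sketch, assuming identity is reported as identical positions divided by alignment length; the function name percent_identity is hypothetical, and the query and midline strings are copied from the XP_022143540.1 alignment above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style midline (letters mark identities)."""
    # Trailing spaces of the midline are often stripped, so pad it back
    # to the length of the aligned query segment.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the aligned query segment")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query)


# Aligned query segment and midline for the XP_022143540.1 hit.
QUERY = (
    "MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSN"
    "ASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR"
)
MIDLINE = (
    "M+SSI+ L  +E L G N  TWKN LNTILVVD+L+FVLTEECPQ P +N"
    "A+RNVR+++DRW+KAN+KA+VY+LASM+D+LAK HE ++TAKEIMD ++"
)

print(f"{percent_identity(QUERY, MIDLINE):.2f}")
```

Under that assumption the value agrees with the 65.66 listed for this hit (65 identical positions over 99 aligned residues).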

TrEMBL top hits | e value | % identity
A0A5A7U2U6 Gag/pol protein | 4.1e-30 | 63.37
Query:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVRE
        M++SI+ LL ++ L G+N  TWK+ LN ILV+D+L+FVLTEE PQ P SNA++NVR++YDRW+KANEKA+VY+ ASMSD+LAK HE + TAKEIMD +RE
Subjt:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVRE

Query:  V
        +
Subjt:  V

A0A6J1CP29 uncharacterized protein LOC111013417 | 4.9e-31 | 65.66
Query:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR
        M+SSI+ L  +E L G N  TWKN LNTILVVD+L+FVLTEECPQ P +NA+RNVR+++DRW+KAN+KA+VY+LASM+D+LAK HE ++TAKEIMD ++
Subjt:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR

A0A6J1DS54 uncharacterized protein LOC111023904 | 3.7e-31 | 66.67
Query:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR
        M+SSI+ LL +E L G N  TWKN LNTILVVD+L+FVLTEECPQ P  NA+RNVR+++DRW+KAN+KA+VY+LASM+D+LAK HE ++TAKEIMD ++
Subjt:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR

A0A6J1DUZ9 uncharacterized protein LOC111024294 | 2.8e-31 | 66.67
Query:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR
        M+SSI+ LL +E L G N  TWKN LNTILVVD+L+FVLTEECPQ P  NA+RNVR+++DRW+KAN+KA+VY+LASM+D+LAK HE ++TAKEIMD ++
Subjt:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR

A0A6J1DXQ5 uncharacterized protein LOC111024457 | 1.7e-31 | 66.67
Query:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR
        M+SSI+ LL +E L G N  TWKN LNTILVVD+L+FVLTEECPQ P +NA+RNVR+++DRW+KAN+KA+VY+LASM+D+LAK HE ++TAKEIMD ++
Subjt:  MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVR

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGTCTAGCTCTATAATTTCCTTATTGGGTGCGGAAATGTTAACCGGAGAGAATATGATGACATGGAAAAACAAACTCAACACTATTTTGGTAGTGGATAATCTGAAGTT
TGTGCTAACTGAGGAGTGTCCTCAGGTGCCAGGCTCGAATGCGTCACGAAATGTTCGTGATTCATATGATCGATGGATCAAGGCCAATGAAAAGGCCAAGGTCTACATGT
TGGCAAGTATGTCTGACATATTAGCCAAGAATCATGAGGGCATGATTACCGCCAAGGAAATCATGGATTACGTGCGCGAGGTATGTTTGGACAACAGTCCACACAAGCCC
GACATAATGCCCTAA
mRNA sequence
ATGTCTAGCTCTATAATTTCCTTATTGGGTGCGGAAATGTTAACCGGAGAGAATATGATGACATGGAAAAACAAACTCAACACTATTTTGGTAGTGGATAATCTGAAGTT
TGTGCTAACTGAGGAGTGTCCTCAGGTGCCAGGCTCGAATGCGTCACGAAATGTTCGTGATTCATATGATCGATGGATCAAGGCCAATGAAAAGGCCAAGGTCTACATGT
TGGCAAGTATGTCTGACATATTAGCCAAGAATCATGAGGGCATGATTACCGCCAAGGAAATCATGGATTACGTGCGCGAGGTATGTTTGGACAACAGTCCACACAAGCCC
GACATAATGCCCTAA
Protein sequence
MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVREVCLDNSPHKP
DIMP
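
The three sequence fields and the genome location can be cross-checked against each other. The sketch below assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene and that the gene has a single exon, so the coordinate span equals the CDS length; the names CODON_TABLE and translate are hypothetical helpers, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein copied from the record above.
CDS = (
    "ATGTCTAGCTCTATAATTTCCTTATTGGGTGCGGAAATGTTAACCGGAGAGAATATGATGACATGGAAAAACAAACTCAACACTATTTTGGTAGTGGATAATCTGAAGTT"
    "TGTGCTAACTGAGGAGTGTCCTCAGGTGCCAGGCTCGAATGCGTCACGAAATGTTCGTGATTCATATGATCGATGGATCAAGGCCAATGAAAAGGCCAAGGTCTACATGT"
    "TGGCAAGTATGTCTGACATATTAGCCAAGAATCATGAGGGCATGATTACCGCCAAGGAAATCATGGATTACGTGCGCGAGGTATGTTTGGACAACAGTCCACACAAGCCC"
    "GACATAATGCCCTAA"
)
PROTEIN = (
    "MSSSIISLLGAEMLTGENMMTWKNKLNTILVVDNLKFVLTEECPQVPGSNASRNVRDSYDRWIKANEKAKVYMLASMSDILAKNHEGMITAKEIMDYVREVCLDNSPHKP"
    "DIMP"
)

# Genome location LG11:44817093..44817437 (1-based, inclusive).
START, END = 44817093, 44817437

print(len(CDS), END - START + 1)
print(translate(CDS) == PROTEIN + "*")
```

With these inputs the CDS length matches the coordinate span, and the translation reproduces the listed protein followed by a stop codon.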