CuGenDBv2

Tan0004129 (gene) of Snake gourd v1 genome

Gene ID: Tan0004129
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG05:38882194..38883106
RNA-Seq Expression: Tan0004129
Synteny: Tan0004129
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
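
The genome location is written as `scaffold:start..end`. A minimal sketch for splitting that string and computing the span it covers is given here. The helper name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the page does not state.

```python
import re

def parse_location(loc):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

scaffold, start, end = parse_location("LG05:38882194..38883106")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(scaffold, start, end, span)
```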


Homology
GenBank top hits (e value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]; e value: 4.0e-80; % identity: 52.17
Query:  WKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNSR
        WK  LN ILVVDDL+FVLTEECPQAP  NA+R VR+AYDRW+KANDKA+VY+LASM+D+L KKH+ + TAK IMDS++ MFGQ S   RH A+K+I+  R
Subjt:  WKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNSR

Query:  MLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKVK---------------------------------------KVGKGKQADKAAA-------------
        M EGTSVR+HVLDMM+ FNIAE NG  IDE  +V                                         + KGK+ +   A             
Subjt:  MLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKVK---------------------------------------KVGKGKQADKAAA-------------

Query:  -----------QKGK-------KVKDVADKGKCFHCNEDGHWKRNCPKYIAEKKKE----DKYDLLCLEACLVDNDKTTWILDSGATNHVCSSFQGIDSW
                   +KGK       KVK  ADKGKCFHCN+DGHWKRNCPKY+AEKK E     KYDLL +E CLV+ D +TWILDSGATNH+C SFQ   SW
Subjt:  -----------QKGK-------KVKDVADKGKCFHCNEDGHWKRNCPKYIAEKKKE----DKYDLLCLEACLVDNDKTTWILDSGATNHVCSSFQGIDSW

Query:  QQLQQGEITLRVGNGEVVSFES
        ++L++GEITL+VG GEVVS E+
Subjt:  QQLQQGEITLRVGNGEVVSFES

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 2.6e-79; % identity: 49.24
Query:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS
        +WK  +N +L++DDL+FVL EECPQ P +NA+R VR+ Y+RW KAN+KA+ Y+LAS+S++L KKHE M+TA+EIMDS+Q MFGQ S Q +H+ALKYI+N+
Subjt:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS

Query:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------
        RM EG SVR+HVL+MMV FN+AE NGA IDE  +V                                                                 
Subjt:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------

Query:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV
                        KK G+G +A+ AAA+  KK K  A KG CFHCN++GHWKRNCPKY+AEKK  K+ KYDLL LE CLV+ND + WI+DSGATNHV
Subjt:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV

Query:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS
        CSSFQGI SW+QL+ GE+T+RVG G VVS
Subjt:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 9.0e-80; % identity: 49.54
Query:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS
        +WK  +N +L++DDL+FVL EECPQ P +NA+R VR+ Y+RW KAN+KA+ Y+LAS+S++L KKHE M+TA+EIMDS+Q MFGQ S Q +H+ALKYI+N+
Subjt:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS

Query:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------
        RM EG SVR+HVL+MMV FN+AE NGA IDE  +V                                                                 
Subjt:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------

Query:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV
                        KK G+G +A+ AAA+  KK K  A KG CFHCN++GHWKRNCPKY+AEKK  K+ KYDLL LE CLV+ND + WI+DSGATNHV
Subjt:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV

Query:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS
        CSSFQGI SWQQL+ GE+T+RVG G VVS
Subjt:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 2.6e-79; % identity: 49.24
Query:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS
        +WK  +N +L++DDL+FVL EECPQ P +NA+R VR+ Y+RW KAN+KA+ Y+LAS+S++L KKHE M+TA+EIMDS+Q MFGQ S Q +H+ALKYI+N+
Subjt:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS

Query:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------
        RM EG SVR+HVL+MMV FN+AE NGA IDE  +V                                                                 
Subjt:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------

Query:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV
                        KK G+G +A+ AAA+  KK K  A KG CFHCN++GHWKRNCPKY+AEKK  K+ KYDLL LE CLV+ND + WI+DSGATNHV
Subjt:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV

Query:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS
        CSSFQGI SW+QL+ GE+T+RVG G VVS
Subjt:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 2.6e-79; % identity: 49.24
Query:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS
        +WK  +N +L++DDL+FVL EECPQ P +NA+R VR+ Y+RW KAN+KA+ Y+LAS+S++L KKHE M+TA+EIMDS+Q MFGQ S Q +H+ALKYI+N+
Subjt:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS

Query:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------
        RM EG SVR+HVL+MMV FN+AE NGA IDE  +V                                                                 
Subjt:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------

Query:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV
                        KK G+G +A+ AAA+  KK K  A KG CFHCN++GHWKRNCPKY+AEKK  K+ KYDLL LE CLV+ND + WI+DSGATNHV
Subjt:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV

Query:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS
        CSSFQGI SW+QL+ GE+T+RVG G VVS
Subjt:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein; e value: 1.3e-79; % identity: 49.24
Query:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS
        +WK  +N +L++DDL+FVL EECPQ P +NA+R VR+ Y+RW KAN+KA+ Y+LAS+S++L KKHE M+TA+EIMDS+Q MFGQ S Q +H+ALKYI+N+
Subjt:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS

Query:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------
        RM EG SVR+HVL+MMV FN+AE NGA IDE  +V                                                                 
Subjt:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------

Query:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV
                        KK G+G +A+ AAA+  KK K  A KG CFHCN++GHWKRNCPKY+AEKK  K+ KYDLL LE CLV+ND + WI+DSGATNHV
Subjt:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV

Query:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS
        CSSFQGI SW+QL+ GE+T+RVG G VVS
Subjt:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS

A0A5A7TU93 Gag/pol protein; e value: 4.3e-80; % identity: 49.54
Query:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS
        +WK  +N +L++DDL+FVL EECPQ P +NA+R VR+ Y+RW KAN+KA+ Y+LAS+S++L KKHE M+TA+EIMDS+Q MFGQ S Q +H+ALKYI+N+
Subjt:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS

Query:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------
        RM EG SVR+HVL+MMV FN+AE NGA IDE  +V                                                                 
Subjt:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------

Query:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV
                        KK G+G +A+ AAA+  KK K  A KG CFHCN++GHWKRNCPKY+AEKK  K+ KYDLL LE CLV+ND + WI+DSGATNHV
Subjt:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV

Query:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS
        CSSFQGI SWQQL+ GE+T+RVG G VVS
Subjt:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS

A0A5A7TWB9 Gag/pol protein; e value: 1.3e-79; % identity: 49.24
Query:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS
        +WK  +N +L++DDL+FVL EECPQ P +NA+R VR+ Y+RW KAN+KA+ Y+LAS+S++L KKHE M+TA+EIMDS+Q MFGQ S Q +H+ALKYI+N+
Subjt:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS

Query:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------
        RM EG SVR+HVL+MMV FN+AE NGA IDE  +V                                                                 
Subjt:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------

Query:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV
                        KK G+G +A+ AAA+  KK K  A KG CFHCN++GHWKRNCPKY+AEKK  K+ KYDLL LE CLV+ND + WI+DSGATNHV
Subjt:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV

Query:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS
        CSSFQGI SW+QL+ GE+T+RVG G VVS
Subjt:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS

A0A5D3CPJ6 Gag/pol protein; e value: 1.3e-79; % identity: 49.24
Query:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS
        +WK  +N +L++DDL+FVL EECPQ P +NA+R VR+ Y+RW KAN+KA+ Y+LAS+S++L KKHE M+TA+EIMDS+Q MFGQ S Q +H+ALKYI+N+
Subjt:  TWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNS

Query:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------
        RM EG SVR+HVL+MMV FN+AE NGA IDE  +V                                                                 
Subjt:  RMLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKV-----------------------------------------------------------------

Query:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV
                        KK G+G +A+ AAA+  KK K  A KG CFHCN++GHWKRNCPKY+AEKK  K+ KYDLL LE CLV+ND + WI+DSGATNHV
Subjt:  ----------------KKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKK--KEDKYDLLCLEACLVDNDKTTWILDSGATNHV

Query:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS
        CSSFQGI SW+QL+ GE+T+RVG G VVS
Subjt:  CSSFQGIDSWQQLQQGEITLRVGNGEVVS

E2GK51 Gag/pol protein (Fragment); e value: 2.0e-80; % identity: 52.17
Query:  WKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNSR
        WK  LN ILVVDDL+FVLTEECPQAP  NA+R VR+AYDRW+KANDKA+VY+LASM+D+L KKH+ + TAK IMDS++ MFGQ S   RH A+K+I+  R
Subjt:  WKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNSR

Query:  MLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKVK---------------------------------------KVGKGKQADKAAA-------------
        M EGTSVR+HVLDMM+ FNIAE NG  IDE  +V                                         + KGK+ +   A             
Subjt:  MLEGTSVRDHVLDMMVRFNIAESNGASIDEDRKVK---------------------------------------KVGKGKQADKAAA-------------

Query:  -----------QKGK-------KVKDVADKGKCFHCNEDGHWKRNCPKYIAEKKKE----DKYDLLCLEACLVDNDKTTWILDSGATNHVCSSFQGIDSW
                   +KGK       KVK  ADKGKCFHCN+DGHWKRNCPKY+AEKK E     KYDLL +E CLV+ D +TWILDSGATNH+C SFQ   SW
Subjt:  -----------QKGK-------KVKDVADKGKCFHCNEDGHWKRNCPKYIAEKKKE----DKYDLLCLEACLVDNDKTTWILDSGATNHVCSSFQGIDSW

Query:  QQLQQGEITLRVGNGEVVSFES
        ++L++GEITL+VG GEVVS E+
Subjt:  QQLQQGEITLRVGNGEVVSFES

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
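
In the alignments above, the line between each Query and Subjt row appears to follow the BLAST-style match-line convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch for tallying these from a single match line follows. `midline_counts` is a hypothetical helper, and counts from one segment are not the whole-alignment % identity reported for each hit.

```python
def midline_counts(midline):
    """Return (identical, positive, other) column counts for one match line."""
    identical = sum(ch.isalpha() for ch in midline)  # letters: identical residues
    positive = midline.count("+")                    # '+': conservative substitutions
    other = len(midline) - identical - positive      # spaces: mismatches or gaps
    return identical, positive, other

# Final segment of the ADJ18449.1 alignment, aligned to QQLQQGEITLRVGNGEVVSFES.
segment = "++L++GEITL+VG GEVVS E+"
print(midline_counts(segment))  # (14, 6, 2)
```

Summing the three counts over every segment of one alignment gives the aligned length; dividing the identical count by that length approximates the identity figure, assuming no segments were lost in extraction.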

Sequences
CDS sequence
ATGATGACATGGAAAAAAAAACTCAACCCTATTTTGGTAGTGGATGATCTGAAGTTTGTGCTAACTGAGGAGTGTCCTCAGGCGCCAGGCTCGAATGCGTCACGAAATGT
TCGTGATGCATATGATCGATGGATCAAGGCCAATGATAAGGCCAAGGTCTACATGCTGGCAAGTATGTCTGACATATTAACCAAGAAGCATGAGGGCATGATTACCGCAA
AGGAAATCATGGATTCTGTGCAGGGTATGTTTGGACAACAGTCCACACAAGCCCGACATAATGCTTTAAAGTACATATTCAACTCGAGGATGCTAGAGGGTACATCTGTT
CGGGATCATGTTCTGGATATGATGGTACGCTTTAACATCGCAGAGTCGAATGGTGCTTCCATCGATGAGGATCGAAAGGTGAAGAAGGTTGGTAAAGGGAAACAAGCTGA
CAAAGCTGCCGCCCAAAAGGGCAAGAAAGTCAAAGACGTTGCTGACAAAGGAAAGTGTTTCCACTGCAACGAAGACGGGCATTGGAAACGGAACTGTCCGAAGTACATTG
CAGAAAAAAAGAAGGAAGATAAATATGATTTACTTTGCCTAGAAGCTTGTTTAGTGGATAATGATAAAACAACTTGGATACTTGATTCAGGCGCCACTAATCATGTTTGT
TCTTCTTTTCAGGGAATTGATTCCTGGCAGCAGCTACAACAAGGAGAGATAACGCTCCGGGTTGGAAATGGAGAAGTCGTCTCATTTGAAAGCGATGAGGCACAATGA
mRNA sequence
ATGATGACATGGAAAAAAAAACTCAACCCTATTTTGGTAGTGGATGATCTGAAGTTTGTGCTAACTGAGGAGTGTCCTCAGGCGCCAGGCTCGAATGCGTCACGAAATGT
TCGTGATGCATATGATCGATGGATCAAGGCCAATGATAAGGCCAAGGTCTACATGCTGGCAAGTATGTCTGACATATTAACCAAGAAGCATGAGGGCATGATTACCGCAA
AGGAAATCATGGATTCTGTGCAGGGTATGTTTGGACAACAGTCCACACAAGCCCGACATAATGCTTTAAAGTACATATTCAACTCGAGGATGCTAGAGGGTACATCTGTT
CGGGATCATGTTCTGGATATGATGGTACGCTTTAACATCGCAGAGTCGAATGGTGCTTCCATCGATGAGGATCGAAAGGTGAAGAAGGTTGGTAAAGGGAAACAAGCTGA
CAAAGCTGCCGCCCAAAAGGGCAAGAAAGTCAAAGACGTTGCTGACAAAGGAAAGTGTTTCCACTGCAACGAAGACGGGCATTGGAAACGGAACTGTCCGAAGTACATTG
CAGAAAAAAAGAAGGAAGATAAATATGATTTACTTTGCCTAGAAGCTTGTTTAGTGGATAATGATAAAACAACTTGGATACTTGATTCAGGCGCCACTAATCATGTTTGT
TCTTCTTTTCAGGGAATTGATTCCTGGCAGCAGCTACAACAAGGAGAGATAACGCTCCGGGTTGGAAATGGAGAAGTCGTCTCATTTGAAAGCGATGAGGCACAATGA
Protein sequence
MMTWKKKLNPILVVDDLKFVLTEECPQAPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDILTKKHEGMITAKEIMDSVQGMFGQQSTQARHNALKYIFNSRMLEGTSV
RDHVLDMMVRFNIAESNGASIDEDRKVKKVGKGKQADKAAAQKGKKVKDVADKGKCFHCNEDGHWKRNCPKYIAEKKKEDKYDLLCLEACLVDNDKTTWILDSGATNHVC
SSFQGIDSWQQLQQGEITLRVGNGEVVSFESDEAQ
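
The protein sequence corresponds to the CDS read in frame from its first codon up to the stop codon. A minimal sketch of that translation with the standard genetic code, using no external library, is given here. `translate` and `CODON_TABLE` are names introduced for illustration, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGATGACATGG"))                 # first 4 codons of the CDS: MMTW
print(translate("TCATTTGAAAGCGATGAGGCACAATGA"))  # last 9 codons of the CDS: SFESDEAQ
```

Passing the full CDS with its line breaks removed should reproduce the listed protein, provided the sequence was extracted intact.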