; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005956 (gene) of Snake gourd v1 genome

Gene IDTan0005956
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG06:22331438..22331842
RNA-Seq ExpressionTan0005956
SyntenyTan0005956
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-4064Show/hide
Query:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE
        M+S+ + +L +DKL G+NY +WKN INT+LI DDL+FVL EECPQV  + A+R+VR+ Y++W +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM SLQE
Subjt:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE

Query:  MFGQQSFQVRHDSLKHVFNARMKEG
        MFGQ S+Q++HD+LK+++NARM EG
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEG

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-4064Show/hide
Query:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE
        M+S+ + +L +DKL G+NY +WKN INT+LI DDL+FVL EECPQV  + A+R+VR+ Y++W +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM SLQE
Subjt:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE

Query:  MFGQQSFQVRHDSLKHVFNARMKEG
        MFGQ S+Q++HD+LK+++NARM EG
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEG

TYK07761.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-4064Show/hide
Query:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE
        M+S+ + +L +DKL G+NY +WKN INT+LI DDL+FVL EECPQV  + A+R+VR+ Y++W +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM SLQE
Subjt:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE

Query:  MFGQQSFQVRHDSLKHVFNARMKEG
        MFGQ S+Q++HD+LK+++NARM EG
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEG

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-4064Show/hide
Query:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE
        M+S+ + +L +DKL G+NY +WKN INT+LI DDL+FVL EECPQV  + A+R+VR+ Y++W +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM SLQE
Subjt:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE

Query:  MFGQQSFQVRHDSLKHVFNARMKEG
        MFGQ S+Q++HD+LK+++NARM EG
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEG

XP_038882358.1 uncharacterized protein LOC120073622 [Benincasa hispida]1.3e-4063.2Show/hide
Query:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE
        M+SSI+ LL S+KL GDNY  WK+N+NTIL+ DDL+FVLTEECPQ   S A+R+VR+AYD+W++ANEKA++YI+AS+S+VLAKKHE + TAKEI+ SL+E
Subjt:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE

Query:  MFGQQSFQVRHDSLKHVFNARMKEG
        +FGQ S+ +RH+++KH++  RMKEG
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEG

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.8e-4064Show/hide
Query:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE
        M+S+ + +L +DKL G+NY +WKN INT+LI DDL+FVL EECPQV  + A+R+VR+ Y++W +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM SLQE
Subjt:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE

Query:  MFGQQSFQVRHDSLKHVFNARMKEG
        MFGQ S+Q++HD+LK+++NARM EG
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEG

A0A5A7TU93 Gag/pol protein1.8e-4064Show/hide
Query:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE
        M+S+ + +L +DKL G+NY +WKN INT+LI DDL+FVL EECPQV  + A+R+VR+ Y++W +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM SLQE
Subjt:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE

Query:  MFGQQSFQVRHDSLKHVFNARMKEG
        MFGQ S+Q++HD+LK+++NARM EG
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEG

A0A5A7TWB9 Gag/pol protein1.8e-4064Show/hide
Query:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE
        M+S+ + +L +DKL G+NY +WKN INT+LI DDL+FVL EECPQV  + A+R+VR+ Y++W +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM SLQE
Subjt:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE

Query:  MFGQQSFQVRHDSLKHVFNARMKEG
        MFGQ S+Q++HD+LK+++NARM EG
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEG

A0A5D3CPJ6 Gag/pol protein1.8e-4064Show/hide
Query:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE
        M+S+ + +L +DKL G+NY +WKN INT+LI DDL+FVL EECPQV  + A+R+VR+ Y++W +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM SLQE
Subjt:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE

Query:  MFGQQSFQVRHDSLKHVFNARMKEG
        MFGQ S+Q++HD+LK+++NARM EG
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEG

A0A5D3CSZ6 Gag/pol protein1.8e-4064Show/hide
Query:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE
        M+S+ + +L +DKL G+NY +WKN INT+LI DDL+FVL EECPQV  + A+R+VR+ Y++W +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM SLQE
Subjt:  MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQE

Query:  MFGQQSFQVRHDSLKHVFNARMKEG
        MFGQ S+Q++HD+LK+++NARM EG
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAGTTCAATTGTCGCTCTCTTAGTTTCAGATAAACTCGTAGGAGATAATTATCAAACGTGGAAGAATAATATCAACACGATTTTGATAGCTGACGACCTAAAGTT
CGTGCTCACTGAGGAGTGTCCTCAGGTGTCGGGCTCGACCGCATCGCGAAGTGTTCGTGATGCATACGATCAGTGGATTAGGGCCAATGAAAAGGCCAAGGTCTACATAA
TTGCCAGCTTGTCTGAAGTCTTGGCAAAGAAGCATGAGGTGATGATCACCGCTAAAGAGATCATGAAATCTCTGCAGGAAATGTTTGGACAACAATCTTTTCAGGTCCGA
CACGATTCCCTCAAACACGTTTTCAATGCGAGGATGAAGGAGGGACGTCTGTCCGTGAACATGTTTTGGACATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTAGTTCAATTGTCGCTCTCTTAGTTTCAGATAAACTCGTAGGAGATAATTATCAAACGTGGAAGAATAATATCAACACGATTTTGATAGCTGACGACCTAAAGTT
CGTGCTCACTGAGGAGTGTCCTCAGGTGTCGGGCTCGACCGCATCGCGAAGTGTTCGTGATGCATACGATCAGTGGATTAGGGCCAATGAAAAGGCCAAGGTCTACATAA
TTGCCAGCTTGTCTGAAGTCTTGGCAAAGAAGCATGAGGTGATGATCACCGCTAAAGAGATCATGAAATCTCTGCAGGAAATGTTTGGACAACAATCTTTTCAGGTCCGA
CACGATTCCCTCAAACACGTTTTCAATGCGAGGATGAAGGAGGGACGTCTGTCCGTGAACATGTTTTGGACATGA
Protein sequenceShow/hide protein sequence
MSSSIVALLVSDKLVGDNYQTWKNNINTILIADDLKFVLTEECPQVSGSTASRSVRDAYDQWIRANEKAKVYIIASLSEVLAKKHEVMITAKEIMKSLQEMFGQQSFQVR
HDSLKHVFNARMKEGRLSVNMFWT