; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013893 (gene) of Snake gourd v1 genome

Gene IDTan0013893
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionbasic 7S globulin-like
Genome locationLG09:7378939..7379777
RNA-Seq ExpressionTan0013893
SyntenyTan0013893
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597463.1 putative aspartic proteinase GIP2, partial [Cucurbita argyrosperma subsp. sororia]2.3e-3461.04Show/hide
Query:  PRSLTATNGFSLPFAERLHAFRRH----LLNNDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDV
        P   +A   F+  FA  L    R        N  YHFLPNI                   GVFSAGEK  EYFIGVKSI+INSKTVP N TLLKIDSN +
Subjt:  PRSLTATNGFSLPFAERLHAFRRH----LLNNDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDV

Query:  GGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT
        GGTKISTVDPYTVLESSIYN VLKTFTTEL+ V RVAAVAPFGA F AKS S T
Subjt:  GGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT

KAG7028922.1 Basic 7S globulin, partial [Cucurbita argyrosperma subsp. argyrosperma]1.0e-3461.69Show/hide
Query:  PRSLTATNGFSLPFAERLHAFRRH----LLNNDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDV
        P   +A   F+  FA  L    R        N  YHFLPNI                   GVFSAGEK  EYFIGVKSIVINSKTVP N TLLKIDSN +
Subjt:  PRSLTATNGFSLPFAERLHAFRRH----LLNNDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDV

Query:  GGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT
        GGTKISTVDPYTVLESSIYN VLKTFTTEL+ V RVAAVAPFGA F AKS S T
Subjt:  GGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT

XP_022936844.1 basic 7S globulin-like [Cucurbita moschata]3.0e-3461.04Show/hide
Query:  PRSLTATNGFSLPFAERLHAFRRH----LLNNDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDV
        P   +A   F+  FA  L    R        N  YHFLPNI                   GVF+AGEK  EYFIGVKSIVINSKTVP N TLLKIDSN +
Subjt:  PRSLTATNGFSLPFAERLHAFRRH----LLNNDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDV

Query:  GGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT
        GGTKISTVDPYTVLESSIYN VLKTFTTEL+ V RVAAVAPFGA F AKS S T
Subjt:  GGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT

XP_022973769.1 basic 7S globulin-like [Cucurbita maxima]2.2e-3269.11Show/hide
Query:  NDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQ
        N  YHFLPNI                   GVF+ GEK  EYFIGVKSIVINSKTVP N TLLKIDSN VGGTKISTVDPYTVLESSIYN VLKTFTT L+
Subjt:  NDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQ

Query:  KVSRVAAVAPFGASFKAKSFSCT
         + RVAAVAPFGA F AKS S T
Subjt:  KVSRVAAVAPFGASFKAKSFSCT

XP_023538677.1 basic 7S globulin-like [Cucurbita pepo subsp. pepo]2.3e-3461.69Show/hide
Query:  PRSLTATNGFSLPFAERLHAFRRH----LLNNDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDV
        P   +A   F+  FA  L    R        N  YHFLPNI                   GVFSAGEK  EYFIGVKSI+INSKTVP N TLLKIDSN V
Subjt:  PRSLTATNGFSLPFAERLHAFRRH----LLNNDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDV

Query:  GGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT
        GGTKISTVDPYTVLESSIYN VLKTFTTEL+ V RVAAVAPFGA F AKS S T
Subjt:  GGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT

TrEMBL top hitse value%identityAlignment
A0A1S3AXR0 basic 7S globulin-like2.7e-2852.6Show/hide
Query:  PRSLTATNGFSLPFAERLHAFRRH----LLNNDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDV
        P    A   F+  FA  L    R        N  YHFLPN+                   GV ++GEK  EYFIGVKSIV NSKTVP N TLLKID N  
Subjt:  PRSLTATNGFSLPFAERLHAFRRH----LLNNDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDV

Query:  GGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT
        GGTKIST+ PYTVLESSIYN ++KT TTEL+ + RVAAVAPFG  +K+KSF  T
Subjt:  GGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT

A0A6J1F9F8 basic 7S globulin-like1.5e-3461.04Show/hide
Query:  PRSLTATNGFSLPFAERLHAFRRH----LLNNDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDV
        P   +A   F+  FA  L    R        N  YHFLPNI                   GVF+AGEK  EYFIGVKSIVINSKTVP N TLLKIDSN +
Subjt:  PRSLTATNGFSLPFAERLHAFRRH----LLNNDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDV

Query:  GGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT
        GGTKISTVDPYTVLESSIYN VLKTFTTEL+ V RVAAVAPFGA F AKS S T
Subjt:  GGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT

A0A6J1GV86 basic 7S globulin-like2.7e-2863.41Show/hide
Query:  NDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQ
        N  Y+FLPNI                   GV SAGEK  EYFIGVKSIVINSKTVP N TLLKI+S  +GGTKISTV+PYTVLESSIY  VLKTFTTEL+
Subjt:  NDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQ

Query:  KVSRVAAVAPFGASFKAKSFSCT
        K+ RV AVAPF   F A SF  T
Subjt:  KVSRVAAVAPFGASFKAKSFSCT

A0A6J1IFL5 basic 7S globulin-like1.0e-3269.11Show/hide
Query:  NDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQ
        N  YHFLPNI                   GVF+ GEK  EYFIGVKSIVINSKTVP N TLLKIDSN VGGTKISTVDPYTVLESSIYN VLKTFTT L+
Subjt:  NDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQ

Query:  KVSRVAAVAPFGASFKAKSFSCT
         + RVAAVAPFGA F AKS S T
Subjt:  KVSRVAAVAPFGASFKAKSFSCT

A0A6J1IGY1 basic 7S globulin-like1.0e-3269.11Show/hide
Query:  NDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQ
        N  YHFLPNI                   GVF+ GEK  EYFIGVKSIVINSKTVP N TLLKIDSN VGGTKISTVDPYTVLESSIYN VLKTFTT L+
Subjt:  NDLYHFLPNI------------------VGVFSAGEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQ

Query:  KVSRVAAVAPFGASFKAKSFSCT
         + RVAAVAPFGA F AKS S T
Subjt:  KVSRVAAVAPFGASFKAKSFSCT

SwissProt top hitse value%identityAlignment
F5B8W7 Gamma conglutin 25.6e-0738.37Show/hide
Query:  EYFIGVKSIVINSKTVPP--NITLLKIDSND--VGGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFS
        EY + V +I +N   V P  N ++L  +  D  +GG  I+T +PYT+L  SIY V  + F   + K ++V AV PFG  F +K  S
Subjt:  EYFIGVKSIVINSKTVPP--NITLLKIDSND--VGGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFS

I1JNS6 Probable aspartic proteinase GIP11.6e-0945Show/hide
Query:  EYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQ--KVSRVAAVAPFGASFKA
        EYFI + SI IN K +  N ++L +D    GGTKIST +PYTVLE+SIY + ++ F  E     ++   AV PFG  + A
Subjt:  EYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQ--KVSRVAAVAPFGASFKA

P0DO21 Probable aspartic proteinase GIP28.0e-2244.05Show/hide
Query:  MLAIGSGRPSGILQGSDLYAEPR-------SLTATNGFSLPFAERLHAF--RRHLLNNDLYHFLP------NIVGVFSAGEKFVEYFIGVKSIVINSKTV
        M  +G  R S   Q S  ++ PR       S T + G  L F +  + F   R   NND + + P      +    FS+ E   EYFIGVKSI IN K V
Subjt:  MLAIGSGRPSGILQGSDLYAEPR-------SLTATNGFSLPFAERLHAF--RRHLLNNDLYHFLP------NIVGVFSAGEKFVEYFIGVKSIVINSKTV

Query:  PPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT
        P N TLL ID+  VGGTKISTV+PYT+LE+SIYN V   F  EL  ++RVA+VAPF A F +++ + T
Subjt:  PPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT

P82952 Gamma conglutin 16.2e-0639.74Show/hide
Query:  EYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKA
        EY+I V+S  IN+  +P       I     GG  IST  PYT L++ I+  + + F  +L+ V  V  VAPFGA F A
Subjt:  EYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKA

Q9FSH9 Gamma conglutin 12.8e-0635.16Show/hide
Query:  EYFIGVKSIVINSKTV---------PPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFS
        EYFI V +I +N   V         P + +    +S+++GG  I+T +PYTVL  SI+ V  + F   + K ++V AV PFG  +  K  S
Subjt:  EYFIGVKSIVINSKTV---------PPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFS

Arabidopsis top hitse value%identityAlignment
AT1G03220.1 Eukaryotic aspartyl protease family protein1.1e-1856.84Show/hide
Query:  FSAGEKFVEYFIGVKSIVINSKTVPPNITLLKID-SNDVGGTKISTVDPYTVLESSIYNVVLKTFTTE--LQKVSRVAAVAPFGASFKAKSFSCT
        FS GEK  EYFIGV +I I  KTVP N TLLKI+ S  +GGTKIS+V+PYTVLESSIYN     F  +   + + RVA+V PFGA F  K+   T
Subjt:  FSAGEKFVEYFIGVKSIVINSKTVPPNITLLKID-SNDVGGTKISTVDPYTVLESSIYNVVLKTFTTE--LQKVSRVAAVAPFGASFKAKSFSCT

AT1G03230.1 Eukaryotic aspartyl protease family protein2.7e-1739.69Show/hide
Query:  KLRRLSVSVESSERERMLAIGSGRPSGILQGSDLYAEPRSLTATNGFSLPFAERLHAFRR-HLLNNDLYHFLPNI-----------------VGVFSAGE
        K+  L  S  S+   + LA G+   +G+  G      P    A   F+  FA  L + R      N  Y FLP I                 V  FS GE
Subjt:  KLRRLSVSVESSERERMLAIGSGRPSGILQGSDLYAEPRSLTATNGFSLPFAERLHAFRR-HLLNNDLYHFLPNI-----------------VGVFSAGE

Query:  KFVEYFIGVKSIVINSKTVPPNITLLKID-SNDVGGTKISTVDPYTVLESSIYNVVLKTFTTEL------QKVSRVAAVAPFGASFKAKSFSCT
        K  EYFIGV +I I  KT+P + TLLKI+ S  +GGTKIS+V+PYTVLESSIY    K FT+E       + + RVA+V PFGA F  K+   T
Subjt:  KFVEYFIGVKSIVINSKTVPPNITLLKID-SNDVGGTKISTVDPYTVLESSIYNVVLKTFTTEL------QKVSRVAAVAPFGASFKAKSFSCT

AT5G19100.1 Eukaryotic aspartyl protease family protein2.7e-0936.89Show/hide
Query:  LPFAERLHAFRRHL-LNNDLYHFLP---NIVGVFSA-----GEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLK
        LP  ER  +    L +    Y++LP   ++  +F++       K  EY I VKSI I +KTVP             G TKIST+ PYTV ++S+Y  +L 
Subjt:  LPFAERLHAFRRHL-LNNDLYHFLP---NIVGVFSA-----GEKFVEYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLK

Query:  TFTTELQKVSRVAAVAPFGASF
         FT  + K+++  AV PFGA F
Subjt:  TFTTELQKVSRVAAVAPFGASF

AT5G19110.1 Eukaryotic aspartyl protease family protein1.8e-0536.59Show/hide
Query:  EYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQK--VSRVAAVAPFGASFKAKS
        +Y I VKSI +    +  N  LL       GG K+STV  YTVL++ IYN + ++FT + +   +++V +VAPF   F +++
Subjt:  EYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQK--VSRVAAVAPFGASFKAKS

AT5G48430.1 Eukaryotic aspartyl protease family protein2.8e-0633.33Show/hide
Query:  KFVEYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPF
        K   YF+G+K I +N   +         D N  GG  +ST+ P+T+L S IY V ++ F+     + RV++  PF
Subjt:  KFVEYFIGVKSIVINSKTVPPNITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGAAAGTTTGGAGAATAAATTAAGAAGGCTTTCGGTTTCGGTGGAGTCGAGTGAGAGGGAGAGAATGTTGGCAATTGGCAGCGGGCGGCCTAGTGGGATTCTTCA
GGGTTCGGACTTATACGCCGAACCAAGGAGCTTAACTGCCACGAACGGCTTCTCTTTGCCATTTGCTGAGCGGCTCCACGCATTTCGGCGTCATCTTCTAAACAACGACC
TCTACCATTTCTTACCCAACATCGTCGGCGTCTTCTCTGCCGGCGAAAAATTCGTCGAATATTTCATCGGCGTCAAATCCATCGTCATCAACTCTAAAACCGTCCCACCC
AACATCACCCTCCTCAAAATCGACAGCAACGACGTCGGCGGCACCAAAATCAGCACCGTCGATCCCTACACCGTTTTAGAATCCTCGATCTACAACGTCGTCTTGAAAAC
CTTCACCACCGAGCTCCAGAAAGTTTCGAGAGTGGCGGCGGTGGCGCCGTTCGGGGCTTCTTTCAAAGCGAAAAGCTTTTCGTGTACCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAGAAAGTTTGGAGAATAAATTAAGAAGGCTTTCGGTTTCGGTGGAGTCGAGTGAGAGGGAGAGAATGTTGGCAATTGGCAGCGGGCGGCCTAGTGGGATTCTTCA
GGGTTCGGACTTATACGCCGAACCAAGGAGCTTAACTGCCACGAACGGCTTCTCTTTGCCATTTGCTGAGCGGCTCCACGCATTTCGGCGTCATCTTCTAAACAACGACC
TCTACCATTTCTTACCCAACATCGTCGGCGTCTTCTCTGCCGGCGAAAAATTCGTCGAATATTTCATCGGCGTCAAATCCATCGTCATCAACTCTAAAACCGTCCCACCC
AACATCACCCTCCTCAAAATCGACAGCAACGACGTCGGCGGCACCAAAATCAGCACCGTCGATCCCTACACCGTTTTAGAATCCTCGATCTACAACGTCGTCTTGAAAAC
CTTCACCACCGAGCTCCAGAAAGTTTCGAGAGTGGCGGCGGTGGCGCCGTTCGGGGCTTCTTTCAAAGCGAAAAGCTTTTCGTGTACCTAG
Protein sequenceShow/hide protein sequence
MSESLENKLRRLSVSVESSERERMLAIGSGRPSGILQGSDLYAEPRSLTATNGFSLPFAERLHAFRRHLLNNDLYHFLPNIVGVFSAGEKFVEYFIGVKSIVINSKTVPP
NITLLKIDSNDVGGTKISTVDPYTVLESSIYNVVLKTFTTELQKVSRVAAVAPFGASFKAKSFSCT