; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021100 (gene) of Snake gourd v1 genome

Gene IDTan0021100
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCACTA en-spm transposon protein
Genome locationLG05:14431577..14432734
RNA-Seq ExpressionTan0021100
SyntenyTan0021100
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015383029.1 uncharacterized protein LOC112495473 isoform X2 [Citrus sinensis]2.5e-1236.51Show/hide
Query:  SNDEDNIVEMNEAASSNSSQVRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLNKSAK
        SN   N+  + E   +   + RG S G+GL R+++A G+R+ +S+  ++ +P+ + AS F +EIG+  R+F P++Y +   IP++    + ERLL ++ K
Subjt:  SNDEDNIVEMNEAASSNSSQVRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLNKSAK

Query:  NKASRSKLPFNHCCGTKSFLTYREEK
        N  +R KL +NH  G+ SFL++RE+K
Subjt:  NKASRSKLPFNHCCGTKSFLTYREEK

XP_024951273.1 uncharacterized protein LOC112495473 isoform X3 [Citrus sinensis]8.4e-1336.72Show/hide
Query:  SNDEDNIVEMNEAASSNSSQVRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLNKSAK
        SN   N+  + E   +   + RG S G+GL R+++A G+R+ +S+  ++ +P+ + AS F +EIG+  R+F P++Y +   IP++    + ERLL ++ K
Subjt:  SNDEDNIVEMNEAASSNSSQVRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLNKSAK

Query:  NKASRSKLPFNHCCGTKSFLTYREEKAK
        N  +R KL +NH  G+ SFL++RE+K K
Subjt:  NKASRSKLPFNHCCGTKSFLTYREEKAK

XP_038895319.1 uncharacterized protein LOC120083572 isoform X1 [Benincasa hispida]5.5e-2032.27Show/hide
Query:  QVRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLN-----------------------
        +VRG S G+ L +   AT  R++V+W+  QGKP+G +ASLFN EIG+L R F+PLKY  + DIPNE++  + E+LLN                       
Subjt:  QVRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLN-----------------------

Query:  -----------------------------------------------KSAKNKASRSKLPFNHCCGTKSFLTYREEKAK---------------------
                                                       KSA+NK SRSK+ FNHC G+KSFL+ R +K K                     
Subjt:  -----------------------------------------------KSAKNKASRSKLPFNHCCGTKSFLTYREEKAK---------------------

Query:  -MVALAEEQATSSTPMM--------IYEIVASVLGTRPSYVKGMGYGPKPP
          V  A ++A  +  M+          EI+  VLG R SY+ G GYGPKPP
Subjt:  -MVALAEEQATSSTPMM--------IYEIVASVLGTRPSYVKGMGYGPKPP

XP_038895320.1 uncharacterized protein LOC120083572 isoform X2 [Benincasa hispida]3.1e-1534.08Show/hide
Query:  QVRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLN-----------------------
        +VRG S G+ L +   AT  R++V+W+  QGKP+G +ASLFN EIG+L R F+PLKY  + DIPNE++  + E+LLN                       
Subjt:  QVRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLN-----------------------

Query:  -----------------------------------------------KSAKNKASRSKLPFNHCCGTKSFLTYREEKAK
                                                       KSA+NK SRSK+ FNHC G+KSFL+ R +K K
Subjt:  -----------------------------------------------KSAKNKASRSKLPFNHCCGTKSFLTYREEKAK

XP_038895321.1 uncharacterized protein LOC120083572 isoform X3 [Benincasa hispida]3.1e-1534.08Show/hide
Query:  QVRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLN-----------------------
        +VRG S G+ L +   AT  R++V+W+  QGKP+G +ASLFN EIG+L R F+PLKY  + DIPNE++  + E+LLN                       
Subjt:  QVRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLN-----------------------

Query:  -----------------------------------------------KSAKNKASRSKLPFNHCCGTKSFLTYREEKAK
                                                       KSA+NK SRSK+ FNHC G+KSFL+ R +K K
Subjt:  -----------------------------------------------KSAKNKASRSKLPFNHCCGTKSFLTYREEKAK

TrEMBL top hitse value%identityAlignment
A0A5P1F7M3 Peroxidase1.7e-1134.93Show/hide
Query:  VRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLNKSAKNKASRSKLPFNHCCGTKSFL
        VRG + G G  R++   G  + V      G P G  A+   +EIG   R   P++     +I   +   II R+  KS K K SRSKLP+NH  G++SF 
Subjt:  VRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLNKSAKNKASRSKLPFNHCCGTKSFL

Query:  TYREEKAKMV--ALAEEQATSSTPMMIYEIVASVLGTRPSYVKGMG
               ++V   + + Q   ++PM  +EI   VLG RP Y+KG G
Subjt:  TYREEKAKMV--ALAEEQATSSTPMMIYEIVASVLGTRPSYVKGMG

A0A6A1WGC8 Uncharacterized protein9.4e-1028.49Show/hide
Query:  AASSNSSQVRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLN------------KSAK
        A++S S + RG + G+ L + + +   ++ V     +   V S A++ +++IG+L R  +P+      D+P EV   I++R+L+            +S +
Subjt:  AASSNSSQVRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLN------------KSAK

Query:  NKASRSKLPFNHCCGTKSFLTYREEKAKMVALAEEQATSSTPMMIYEIVASVLGTRPSYVKGMGYGPKPPLS
        NK +RSKL  NH  G++SF   R    +M A   +           E+ + +LG +  YV+G+G   KPP S
Subjt:  NKASRSKLPFNHCCGTKSFLTYREEKAKMVALAEEQATSSTPMMIYEIVASVLGTRPSYVKGMGYGPKPPLS

A0A6J1CQT5 uncharacterized protein LOC1110134761.0e-0850.68Show/hide
Query:  KSAKNKASRSKLPFNHCCGTKSFLTYREEKAKMVALAEEQATSSTPMMIYEIVASVLGTRPSYVKGMGYGPKP
        KSAKNK +RSKL FNH  G K F  +RE+  +M+ L    A+        EI+ +VLG R +YV GMGYGPKP
Subjt:  KSAKNKASRSKLPFNHCCGTKSFLTYREEKAKMVALAEEQATSSTPMMIYEIVASVLGTRPSYVKGMGYGPKP

A0A6J1D6S9 uncharacterized protein LOC1110174611.0e-0850.68Show/hide
Query:  KSAKNKASRSKLPFNHCCGTKSFLTYREEKAKMVALAEEQATSSTPMMIYEIVASVLGTRPSYVKGMGYGPKP
        KSAKNK +RSKL FNH  G K F  +RE+  +M+ L    A+        EI+ +VLG R +YV GMGYGPKP
Subjt:  KSAKNKASRSKLPFNHCCGTKSFLTYREEKAKMVALAEEQATSSTPMMIYEIVASVLGTRPSYVKGMGYGPKP

A0A6V7NJ46 Uncharacterized protein7.2e-1031.21Show/hide
Query:  QGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLNKSAKNKASRSKLPFNHCCGTKSFLTYREE---------------KAKMVALA
        + +PVG  +   + EIGI+ R F P+K +  ++I +     + ER++ +S  N ++R KLP+ H  GT++F+  R +               K +   + 
Subjt:  QGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLNKSAKNKASRSKLPFNHCCGTKSFLTYREE---------------KAKMVALA

Query:  EEQA-----TSSTPMMIYEIVASVLGTRPSYVKGMGYGPKP
          Q+      +  PM   EI A VLGTR  Y+ G G+GPKP
Subjt:  EEQA-----TSSTPMMIYEIVASVLGTRPSYVKGMGYGPKP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTTCAAATGATGAGGACAACATTGTAGAAATGAATGAAGCGGCATCCTCTAATAGTAGTCAGGTTCGTGGTGTTTCATGTGGATTGGGTCTGGCGAGAATTATTGA
GGCCACTGGGGATAGAGTGCGCGTTTCATGGAGTTTACAACAAGGCAAACCGGTTGGAAGTGTTGCTAGCCTCTTTAATAGTGAAATTGGAATTCTGACGAGGGTGTTCG
TCCCACTGAAGTATGCAACTAAGTATGACATTCCAAATGAAGTTTTCACAAACATAATTGAGCGATTGCTGAACAAATCGGCAAAGAACAAGGCTAGTAGAAGCAAGCTC
CCTTTCAATCATTGCTGTGGAACAAAGTCATTTCTCACTTATAGAGAAGAAAAGGCAAAGATGGTAGCACTTGCAGAAGAGCAAGCCACGTCGAGTACACCAATGATGAT
TTACGAAATTGTGGCTAGCGTTCTTGGAACACGACCATCCTATGTTAAAGGAATGGGGTATGGACCAAAACCACCACTATCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTTTCAAATGATGAGGACAACATTGTAGAAATGAATGAAGCGGCATCCTCTAATAGTAGTCAGGTTCGTGGTGTTTCATGTGGATTGGGTCTGGCGAGAATTATTGA
GGCCACTGGGGATAGAGTGCGCGTTTCATGGAGTTTACAACAAGGCAAACCGGTTGGAAGTGTTGCTAGCCTCTTTAATAGTGAAATTGGAATTCTGACGAGGGTGTTCG
TCCCACTGAAGTATGCAACTAAGTATGACATTCCAAATGAAGTTTTCACAAACATAATTGAGCGATTGCTGAACAAATCGGCAAAGAACAAGGCTAGTAGAAGCAAGCTC
CCTTTCAATCATTGCTGTGGAACAAAGTCATTTCTCACTTATAGAGAAGAAAAGGCAAAGATGGTAGCACTTGCAGAAGAGCAAGCCACGTCGAGTACACCAATGATGAT
TTACGAAATTGTGGCTAGCGTTCTTGGAACACGACCATCCTATGTTAAAGGAATGGGGTATGGACCAAAACCACCACTATCTTAG
Protein sequenceShow/hide protein sequence
MLSNDEDNIVEMNEAASSNSSQVRGVSCGLGLARIIEATGDRVRVSWSLQQGKPVGSVASLFNSEIGILTRVFVPLKYATKYDIPNEVFTNIIERLLNKSAKNKASRSKL
PFNHCCGTKSFLTYREEKAKMVALAEEQATSSTPMMIYEIVASVLGTRPSYVKGMGYGPKPPLS