; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010228 (gene) of Snake gourd v1 genome

Gene IDTan0010228
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCACTA en-spm transposon protein
Genome locationLG04:28305322..28309245
RNA-Seq ExpressionTan0010228
SyntenyTan0010228
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RWR95813.1 structure-specific endonuclease subunit slx1 [Cinnamomum micranthum f. kanehirae]7.6e-0632.89Show/hide
Query:  IFFTRLEIDIRTTDPGYSSTTENWNK-------------STKNKASRSKLSFNHCCGTKSFLTYREEKAKMVAL-AEEQATLSIPMTDEEIVASVLGTRP
        +++  L  D R   P  +   E+W +             ST NKA R+K   +HC G++SFL +R +K KM+ + AE     S  MT+E+I   VLGT+P
Subjt:  IFFTRLEIDIRTTDPGYSSTTENWNK-------------STKNKASRSKLSFNHCCGTKSFLTYREEKAKMVAL-AEEQATLSIPMTDEEIVASVLGTRP

Query:  SYVKGMGYGPKPPPAQKT-SYSEDYTQRQENEKLEQRMEQRIEQRIEERMEI
         Y+ G+G+G  P  +  +  Y E    R+  E  E + E+   Q  E R E+
Subjt:  SYVKGMGYGPKPPPAQKT-SYSEDYTQRQENEKLEQRMEQRIEQRIEERMEI

XP_020260850.1 uncharacterized protein LOC109837148 [Asparagus officinalis]4.0e-0733.56Show/hide
Query:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAKMVALAEE--QATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKPPPAQ--KTSYSEDYTQRQENEKLEQ
        +S KN  +RSKL+++H  G+KSF++++E+  +M+ +  +  Q   +  MTDEEI A VL  +  Y+KG G+GP+PPP +  ++S SE   + +E ++  Q
Subjt:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAKMVALAEE--QATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKPPPAQ--KTSYSEDYTQRQENEKLEQ

Query:  RMEQRI---EQRIEERMEI--QHSEYERRFEQMGELLTRFTGTNPS
          +Q +   +Q+I+ + E+  +  E  ++FE   E +  F+  +PS
Subjt:  RMEQRI---EQRIEERMEI--QHSEYERRFEQMGELLTRFTGTNPS

XP_022143616.1 uncharacterized protein LOC111013476 [Momordica charantia]9.3e-1237.41Show/hide
Query:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAKMVALAEEQATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKPP--PAQKTSYSEDYTQRQEN--EKLEQ
        KS KNK +RSKL FNH  G K F  +RE+  +M+ L    A+     T+EEI+ +VLG R +YV GMGYGPKP       + YS++Y +  E   +K E+
Subjt:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAKMVALAEEQATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKPP--PAQKTSYSEDYTQRQEN--EKLEQ

Query:  RMEQRIEQRIEERMEIQHSEYERRFEQMGELLTRFTGTN
         M +  +++I++ +E Q  E+ R+  +M ++     G++
Subjt:  RMEQRIEQRIEERMEIQHSEYERRFEQMGELLTRFTGTN

XP_022148911.1 uncharacterized protein LOC111017461 [Momordica charantia]4.2e-1235.98Show/hide
Query:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAKMVALAEEQATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKPP--PAQKTSYSEDYTQRQEN--EKLEQ
        KS KNK +RSKL FNH  G K F  +RE+  +M+ L    A+     T+EEI+ +VLG R +YV GMGYGPKP       + YS++Y +  E   +K E+
Subjt:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAKMVALAEEQATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKPP--PAQKTSYSEDYTQRQEN--EKLEQ

Query:  RMEQRIEQRIEERMEIQHSEYERRFEQMGELLTRFTGTNP--------SFNKVWF----LNIYV
         M +  +++I++ +E Q  E+ R+  +M ++     G++         SF   W     LN+YV
Subjt:  RMEQRIEQRIEERMEIQHSEYERRFEQMGELLTRFTGTNP--------SFNKVWF----LNIYV

XP_022153681.1 uncharacterized protein LOC111021138 [Momordica charantia]2.4e-0747.95Show/hide
Query:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAKMVALAEEQATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKP
        KS KNK + SKL FNH    K F  +RE+  +M+ L    A+     T+EEI+ +VLG R +Y+ GMGYGPKP
Subjt:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAKMVALAEEQATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKP

TrEMBL top hitse value%identityAlignment
A0A3S4PWR1 Structure-specific endonuclease subunit slx13.7e-0632.89Show/hide
Query:  IFFTRLEIDIRTTDPGYSSTTENWNK-------------STKNKASRSKLSFNHCCGTKSFLTYREEKAKMVAL-AEEQATLSIPMTDEEIVASVLGTRP
        +++  L  D R   P  +   E+W +             ST NKA R+K   +HC G++SFL +R +K KM+ + AE     S  MT+E+I   VLGT+P
Subjt:  IFFTRLEIDIRTTDPGYSSTTENWNK-------------STKNKASRSKLSFNHCCGTKSFLTYREEKAKMVAL-AEEQATLSIPMTDEEIVASVLGTRP

Query:  SYVKGMGYGPKPPPAQKT-SYSEDYTQRQENEKLEQRMEQRIEQRIEERMEI
         Y+ G+G+G  P  +  +  Y E    R+  E  E + E+   Q  E R E+
Subjt:  SYVKGMGYGPKPPPAQKT-SYSEDYTQRQENEKLEQRMEQRIEQRIEERMEI

A0A5A7U7V3 CACTA en-spm transposon protein1.4e-0533.57Show/hide
Query:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAK------MVALAEEQATL--SIPMTDEEIVASVLGTRPSYVKGMGYGPKPPPAQKTSYSEDYTQRQENE
        +S  NKA+R K S+NH  G+KSFL  + E A+       +   + Q T   S P+ ++EI   VLG R  Y +G+G+GPKP   + TS S   T   ++ 
Subjt:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAK------MVALAEEQATL--SIPMTDEEIVASVLGTRPSYVKGMGYGPKPPPAQKTSYSEDYTQRQENE

Query:  KLEQRMEQRIEQRIEERMEIQ---HSEYERRFEQMGELLTRFT
        + E  ++ ++ + + ER+E+Q   H     + EQ+ +L+  FT
Subjt:  KLEQRMEQRIEQRIEERMEIQ---HSEYERRFEQMGELLTRFT

A0A5A7UQ38 CACTA en-spm transposon protein2.4e-0530.77Show/hide
Query:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAK---------------------MVALAEEQATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKPPPAQKT
        +S  NKA+R K  +NH  G+KSFL  + E A+                      V+ A E A  S P++++EI   VLG RP Y+KG+G+GPKP   +  
Subjt:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAK---------------------MVALAEEQATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKPPPAQKT

Query:  SYSEDYTQRQENEKLEQRMEQRIEQRIEERMEIQ---HSEYERRFEQMGELLTRFT
        S S   T   ++ + E  ++ ++ + + ER+E+Q   H     + E M +++   T
Subjt:  SYSEDYTQRQENEKLEQRMEQRIEQRIEERMEIQ---HSEYERRFEQMGELLTRFT

A0A6J1CQT5 uncharacterized protein LOC1110134764.5e-1237.41Show/hide
Query:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAKMVALAEEQATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKPP--PAQKTSYSEDYTQRQEN--EKLEQ
        KS KNK +RSKL FNH  G K F  +RE+  +M+ L    A+     T+EEI+ +VLG R +YV GMGYGPKP       + YS++Y +  E   +K E+
Subjt:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAKMVALAEEQATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKPP--PAQKTSYSEDYTQRQEN--EKLEQ

Query:  RMEQRIEQRIEERMEIQHSEYERRFEQMGELLTRFTGTN
         M +  +++I++ +E Q  E+ R+  +M ++     G++
Subjt:  RMEQRIEQRIEERMEIQHSEYERRFEQMGELLTRFTGTN

A0A6J1D6S9 uncharacterized protein LOC1110174612.0e-1235.98Show/hide
Query:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAKMVALAEEQATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKPP--PAQKTSYSEDYTQRQEN--EKLEQ
        KS KNK +RSKL FNH  G K F  +RE+  +M+ L    A+     T+EEI+ +VLG R +YV GMGYGPKP       + YS++Y +  E   +K E+
Subjt:  KSTKNKASRSKLSFNHCCGTKSFLTYREEKAKMVALAEEQATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKPP--PAQKTSYSEDYTQRQEN--EKLEQ

Query:  RMEQRIEQRIEERMEIQHSEYERRFEQMGELLTRFTGTNP--------SFNKVWF----LNIYV
         M +  +++I++ +E Q  E+ R+  +M ++     G++         SF   W     LN+YV
Subjt:  RMEQRIEQRIEERMEIQHSEYERRFEQMGELLTRFTGTNP--------SFNKVWF----LNIYV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGAAGCACAATTTTCAGAATGGTTTAAAAATAAAGTAAATGAGGAAGATGAGGAAGATGATGAAGACGAGGAAGACGAGGAAGACGAGGAAGATAAGAAAGATCA
GGAAGATGATGAAGACGAGGAAGACGAGGACTACGAGGAAGAAAAGAAAGATGAGGAAGATGATTTTGATGAAAAAATGCTTTCAAATGATGAGGACGACATTGTAGAAG
TCAATGAAGCGACATCCTCTAATAGTAGCCAGGTCCGTGGTGTTTCACGTGGACTCGGTCTAGCGAGGATTATTGAGGCCATTGGTGATAGAGTGCGGGTTCATGGAGTG
TACAACAAGGCAAACAAAATTATATTCTTTACGAGATTGGAAATCGATATAAGGACTACAGATCCAGGTTATTCCAGTACTACAGAAAATTGGAACAAATCGACAAAGAA
CAAGGCTAGTAGAAGCAAGCTCTCTTTCAATCATTGCTGTGGAACAAAGTCATTTCTCACTTATAGAGAAGAAAAGGCAAAAATGGTAGCACTTGCAGAAGAGCAAGCCA
CGTTGAGTATACCAATGACTGACGAAGAAATTGTGGCTAGCGTTCTTGGAACACGACCATCATATGTTAAAGGAATGGGGTATGGACCAAAACCACCACCAGCCCAGAAG
ACATCATACTCAGAAGATTACACTCAACGCCAAGAAAATGAGAAATTGGAGCAGCGAATGGAACAAAGAATAGAACAACGAATTGAAGAACGAATGGAGATACAACACAG
TGAGTATGAACGTAGATTTGAACAGATGGGTGAGCTTTTGACAAGGTTTACTGGAACAAATCCGTCATTTAATAAGGTATGGTTTCTAAATATTTATGTTATTGAAATTT
TGATAGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGCATGAAGCACAATTTTCAGAATGGTTTAAAAATAAAGTAAATGAGGAAGATGAGGAAGATGATGAAGACGAGGAAGACGAGGAAGACGAGGAAGATAAGAAAGATCA
GGAAGATGATGAAGACGAGGAAGACGAGGACTACGAGGAAGAAAAGAAAGATGAGGAAGATGATTTTGATGAAAAAATGCTTTCAAATGATGAGGACGACATTGTAGAAG
TCAATGAAGCGACATCCTCTAATAGTAGCCAGGTCCGTGGTGTTTCACGTGGACTCGGTCTAGCGAGGATTATTGAGGCCATTGGTGATAGAGTGCGGGTTCATGGAGTG
TACAACAAGGCAAACAAAATTATATTCTTTACGAGATTGGAAATCGATATAAGGACTACAGATCCAGGTTATTCCAGTACTACAGAAAATTGGAACAAATCGACAAAGAA
CAAGGCTAGTAGAAGCAAGCTCTCTTTCAATCATTGCTGTGGAACAAAGTCATTTCTCACTTATAGAGAAGAAAAGGCAAAAATGGTAGCACTTGCAGAAGAGCAAGCCA
CGTTGAGTATACCAATGACTGACGAAGAAATTGTGGCTAGCGTTCTTGGAACACGACCATCATATGTTAAAGGAATGGGGTATGGACCAAAACCACCACCAGCCCAGAAG
ACATCATACTCAGAAGATTACACTCAACGCCAAGAAAATGAGAAATTGGAGCAGCGAATGGAACAAAGAATAGAACAACGAATTGAAGAACGAATGGAGATACAACACAG
TGAGTATGAACGTAGATTTGAACAGATGGGTGAGCTTTTGACAAGGTTTACTGGAACAAATCCGTCATTTAATAAGGTATGGTTTCTAAATATTTATGTTATTGAAATTT
TGATAGACTAG
Protein sequenceShow/hide protein sequence
MHEAQFSEWFKNKVNEEDEEDDEDEEDEEDEEDKKDQEDDEDEEDEDYEEEKKDEEDDFDEKMLSNDEDDIVEVNEATSSNSSQVRGVSRGLGLARIIEAIGDRVRVHGV
YNKANKIIFFTRLEIDIRTTDPGYSSTTENWNKSTKNKASRSKLSFNHCCGTKSFLTYREEKAKMVALAEEQATLSIPMTDEEIVASVLGTRPSYVKGMGYGPKPPPAQK
TSYSEDYTQRQENEKLEQRMEQRIEQRIEERMEIQHSEYERRFEQMGELLTRFTGTNPSFNKVWFLNIYVIEILID