; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011123 (gene) of Snake gourd v1 genome

Gene IDTan0011123
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransposase
Genome locationLG09:22032794..22037595
RNA-Seq ExpressionTan0011123
SyntenyTan0011123
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041462.1 putative serine/threonine-protein kinase nek2 [Cucumis melo var. makuwa]1.6e-2158.76Show/hide
Query:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVEELGKSCDLAVENI
        AGISFRQFK+ L TKYI+P +D P+ L+ PP  Y+ IEQ HW+EFV   LSE FQ    LQ+DRRSKNKYNHR++RKGYAN  EE+  S ++ V+ +
Subjt:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVEELGKSCDLAVENI

KAA0054128.1 pol protein [Cucumis melo var. makuwa]4.5e-2437.34Show/hide
Query:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQV-TSLQKDRRSKNKYNHRLARKGYANFVEEL---GKSC-------
        AG +FRQFK  LR KYI+PF++ P+ LK PP  Y++I+Q  W+EFV   L   F+V   LQ++RR KNKYNHRL+RKGYAN  EEL    K+C       
Subjt:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQV-TSLQKDRRSKNKYNHRLARKGYANFVEEL---GKSC-------

Query:  -DLAVENINNIVASGMV--YERLSPHEVVYGVPLRSNDVRVLITLVSDFNAALPIPIAG----SIGTVSNAIGSHAH------GQRILYLCQSQPIILEQ
         D+  + +    +SG V            +    RS      I  +S+ N  L + +       I T S    +H        G R      SQ   L  
Subjt:  -DLAVENINNIVASGMV--YERLSPHEVVYGVPLRSNDVRVLITLVSDFNAALPIPIAG----SIGTVSNAIGSHAH------GQRILYLCQSQPIILEQ

Query:  YAFLQPSLISYASGPEEQRCRFLCNRLRETK-NKLLICPCN
        Y F+ PSLIS     +E R R LC+RL  +K N+L++ P N
Subjt:  YAFLQPSLISYASGPEEQRCRFLCNRLRETK-NKLLICPCN

TYK02903.1 transposase [Cucumis melo var. makuwa]1.8e-2532.34Show/hide
Query:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVE----------ELGKSC
        AG +FRQFK  L  KYI+PF++ PE LK PP  Y++I+Q HW+EFV   L   F+    LQ++RR KN YNHRL+RKGYAN  E          ELG++ 
Subjt:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVE----------ELGKSC

Query:  ---------------DLAVENINNI--VASGMVYERLSPHEVV------------------YGVPLRSN---DVRVLITLVSD--------------FNA
                       +   E +N I  ++     +  SP++V+                  +  P++     DV +L  L  D                 
Subjt:  ---------------DLAVENINNI--VASGMVYERLSPHEVV------------------YGVPLRSN---DVRVLITLVSD--------------FNA

Query:  ALPIPIAGSIGTVSNAIGSHAHGQRILYLCQSQPIILEQYAFLQPSLISYASGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLKTSTIWSV
         LP  + G IG  +  +         +YL       L  Y F+ PSLIS     +E R R LCNRL  +K N+L++ P N G HW L+ I++   T++ +
Subjt:  ALPIPIAGSIGTVSNAIGSHAHGQRILYLCQSQPIILEQYAFLQPSLISYASGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLKTSTIWSV

Query:  DSI
        DS+
Subjt:  DSI

TYK04702.1 transposase [Cucumis melo var. makuwa]1.6e-2152.17Show/hide
Query:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVEELGK-SCDLAVENINN
        AGISFRQFK+ L TKYIIP +D P+ L+ PP  Y+ IEQ HW+EFV   LSE FQ    LQ+DRRSKNKYNHR++RKGYAN  EE+ + S +    ++N 
Subjt:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVEELGK-SCDLAVENINN

Query:  IVASGMVYERLSPHE
          ++ ++ + L   E
Subjt:  IVASGMVYERLSPHE

XP_022148697.1 uncharacterized protein LOC111017298 [Momordica charantia]7.3e-2264.04Show/hide
Query:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVEELGKS
        AG SFRQFKS L T +IIPF+D P  L++PP TY+HIE  HW +FV   LSEEF+ + +LQ  RR+KNKYNHRL+RKGYAN +EEL KS
Subjt:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVEELGKS

TrEMBL top hitse value%identityAlignment
A0A5A7TDG0 Putative serine/threonine-protein kinase nek27.8e-2258.76Show/hide
Query:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVEELGKSCDLAVENI
        AGISFRQFK+ L TKYI+P +D P+ L+ PP  Y+ IEQ HW+EFV   LSE FQ    LQ+DRRSKNKYNHR++RKGYAN  EE+  S ++ V+ +
Subjt:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVEELGKSCDLAVENI

A0A5A7UGF9 Pol protein2.2e-2437.34Show/hide
Query:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQV-TSLQKDRRSKNKYNHRLARKGYANFVEEL---GKSC-------
        AG +FRQFK  LR KYI+PF++ P+ LK PP  Y++I+Q  W+EFV   L   F+V   LQ++RR KNKYNHRL+RKGYAN  EEL    K+C       
Subjt:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQV-TSLQKDRRSKNKYNHRLARKGYANFVEEL---GKSC-------

Query:  -DLAVENINNIVASGMV--YERLSPHEVVYGVPLRSNDVRVLITLVSDFNAALPIPIAG----SIGTVSNAIGSHAH------GQRILYLCQSQPIILEQ
         D+  + +    +SG V            +    RS      I  +S+ N  L + +       I T S    +H        G R      SQ   L  
Subjt:  -DLAVENINNIVASGMV--YERLSPHEVVYGVPLRSNDVRVLITLVSDFNAALPIPIAG----SIGTVSNAIGSHAH------GQRILYLCQSQPIILEQ

Query:  YAFLQPSLISYASGPEEQRCRFLCNRLRETK-NKLLICPCN
        Y F+ PSLIS     +E R R LC+RL  +K N+L++ P N
Subjt:  YAFLQPSLISYASGPEEQRCRFLCNRLRETK-NKLLICPCN

A0A5A7UWH7 Transposase7.8e-2261.05Show/hide
Query:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVEELGKSCDLAVE
        AGISFRQFK+ L TKYIIP +D P+ L+ PP  Y+ IEQ HW+EFV   LSE FQ    LQ+DRRSKNKYNHR++RKGYAN  EE+  S ++ V+
Subjt:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVEELGKSCDLAVE

A0A5D3BTB1 Transposase8.9e-2632.34Show/hide
Query:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVE----------ELGKSC
        AG +FRQFK  L  KYI+PF++ PE LK PP  Y++I+Q HW+EFV   L   F+    LQ++RR KN YNHRL+RKGYAN  E          ELG++ 
Subjt:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVE----------ELGKSC

Query:  ---------------DLAVENINNI--VASGMVYERLSPHEVV------------------YGVPLRSN---DVRVLITLVSD--------------FNA
                       +   E +N I  ++     +  SP++V+                  +  P++     DV +L  L  D                 
Subjt:  ---------------DLAVENINNI--VASGMVYERLSPHEVV------------------YGVPLRSN---DVRVLITLVSD--------------FNA

Query:  ALPIPIAGSIGTVSNAIGSHAHGQRILYLCQSQPIILEQYAFLQPSLISYASGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLKTSTIWSV
         LP  + G IG  +  +         +YL       L  Y F+ PSLIS     +E R R LCNRL  +K N+L++ P N G HW L+ I++   T++ +
Subjt:  ALPIPIAGSIGTVSNAIGSHAHGQRILYLCQSQPIILEQYAFLQPSLISYASGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLKTSTIWSV

Query:  DSI
        DS+
Subjt:  DSI

A0A6J1D4R5 uncharacterized protein LOC1110172983.5e-2264.04Show/hide
Query:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVEELGKS
        AG SFRQFKS L T +IIPF+D P  L++PP TY+HIE  HW +FV   LSEEF+ + +LQ  RR+KNKYNHRL+RKGYAN +EEL KS
Subjt:  AGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQ-VTSLQKDRRSKNKYNHRLARKGYANFVEELGKS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGCTGGAATTTCGTTCCGGCAATTTAAGAGCATTTTGAGGACTAAATATATTATTCCATTTCAAGATATGCCTGAATGTTTGAAGTCTCCACCGCCGACATATAA
CCACATCGAGCAAGTTCATTGGGATGAATTCGTTGGTAAAGTTCTATCTGAAGAGTTTCAAGTAACCAGTTTGCAAAAGGACAGACGTTCCAAAAATAAATACAATCATA
GGCTTGCTCGTAAAGGATACGCAAATTTTGTGGAAGAGTTGGGAAAATCTTGTGATTTAGCTGTTGAAAATATAAACAATATTGTGGCATCAGGAATGGTTTATGAGAGG
TTAAGTCCACATGAAGTTGTGTACGGTGTGCCTCTCAGATCGAATGATGTAAGAGTACTTATCACTCTTGTATCTGATTTCAATGCTGCATTGCCTATACCGATAGCAGG
GAGTATTGGGACAGTTTCAAATGCGATTGGTTCTCACGCCCATGGCCAAAGAATCTTGTACTTGTGCCAATCTCAACCTATCATATTAGAACAATATGCATTCCTTCAAC
CATCTTTGATTTCATATGCCTCTGGACCTGAAGAGCAACGTTGTCGATTCCTATGTAATAGGTTACGAGAGACCAAGAATAAACTATTGATCTGTCCTTGTAATTCAGGA
CATCATTGGTTGTTGGTGGTTATATCATTGAAAACATCTACAATTTGGTCGGTTGATTCCATAGGACATGGCATTCGAGATTACGTGAAAAATATAGTTAATACGAACCA
CCTCCTTCTTCCTTCTTTCACGATAACCTCCTGCCGCAAAAGCGGTTCATCACGTTGCCGCCGACCACGCCCCTCGCGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGCTGGAATTTCGTTCCGGCAATTTAAGAGCATTTTGAGGACTAAATATATTATTCCATTTCAAGATATGCCTGAATGTTTGAAGTCTCCACCGCCGACATATAA
CCACATCGAGCAAGTTCATTGGGATGAATTCGTTGGTAAAGTTCTATCTGAAGAGTTTCAAGTAACCAGTTTGCAAAAGGACAGACGTTCCAAAAATAAATACAATCATA
GGCTTGCTCGTAAAGGATACGCAAATTTTGTGGAAGAGTTGGGAAAATCTTGTGATTTAGCTGTTGAAAATATAAACAATATTGTGGCATCAGGAATGGTTTATGAGAGG
TTAAGTCCACATGAAGTTGTGTACGGTGTGCCTCTCAGATCGAATGATGTAAGAGTACTTATCACTCTTGTATCTGATTTCAATGCTGCATTGCCTATACCGATAGCAGG
GAGTATTGGGACAGTTTCAAATGCGATTGGTTCTCACGCCCATGGCCAAAGAATCTTGTACTTGTGCCAATCTCAACCTATCATATTAGAACAATATGCATTCCTTCAAC
CATCTTTGATTTCATATGCCTCTGGACCTGAAGAGCAACGTTGTCGATTCCTATGTAATAGGTTACGAGAGACCAAGAATAAACTATTGATCTGTCCTTGTAATTCAGGA
CATCATTGGTTGTTGGTGGTTATATCATTGAAAACATCTACAATTTGGTCGGTTGATTCCATAGGACATGGCATTCGAGATTACGTGAAAAATATAGTTAATACGAACCA
CCTCCTTCTTCCTTCTTTCACGATAACCTCCTGCCGCAAAAGCGGTTCATCACGTTGCCGCCGACCACGCCCCTCGCGGTAG
Protein sequenceShow/hide protein sequence
MKAGISFRQFKSILRTKYIIPFQDMPECLKSPPPTYNHIEQVHWDEFVGKVLSEEFQVTSLQKDRRSKNKYNHRLARKGYANFVEELGKSCDLAVENINNIVASGMVYER
LSPHEVVYGVPLRSNDVRVLITLVSDFNAALPIPIAGSIGTVSNAIGSHAHGQRILYLCQSQPIILEQYAFLQPSLISYASGPEEQRCRFLCNRLRETKNKLLICPCNSG
HHWLLVVISLKTSTIWSVDSIGHGIRDYVKNIVNTNHLLLPSFTITSCRKSGSSRCRRPRPSR