; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004384 (gene) of Snake gourd v1 genome

Gene IDTan0004384
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransposase
Genome locationLG01:95753268..95754089
RNA-Seq ExpressionTan0004384
SyntenyTan0004384
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR029480 - Transposase-associated domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046884.1 transposase [Cucumis melo var. makuwa]3.4e-12578.1Show/hide
Query:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGE-ELPNSSVHGDFSKLDTHMYEENDIG
        MD+SWM KSRL KD+ELG ENFI FGFSNT   SIRCPCLKCGNCEK S   IRDHLYVN IDESYKIWFWHGE +LP SS++ + SK DTHMYE ND+G
Subjt:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGE-ELPNSSVHGDFSKLDTHMYEENDIG

Query:  SIKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIY
         I EM++VAHEEYSKDPN FEKLLNDAEKSL EGCKKFTKLSTLVKLYNLKVRYGW+DISFSELLKTLK ILP  NE+  S+YE KKTLGALGM YEKI+
Subjt:  SIKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIY

Query:  ACPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER
        ACPNDCCLYRK++A+A ECPECG SRWK+  N NEGKK+IP KV+ YFPPIPRFKR+FRSI+ AKNLIWH +ER
Subjt:  ACPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER

TYK03264.1 transposase [Cucumis melo var. makuwa]3.4e-12578.1Show/hide
Query:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGE-ELPNSSVHGDFSKLDTHMYEENDIG
        MD+SWM KSRL KD+ELG ENFI FGFSNT   SIRCPCLKCGNCEK S   IRDHLYVN IDESYKIWFWHGE +LP SS++ + SK DTHMYE ND+G
Subjt:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGE-ELPNSSVHGDFSKLDTHMYEENDIG

Query:  SIKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIY
         I EM++VAHEEYSKDPN FEKLLNDAEKSL EGCKKFTKLSTLVKLYNLKVRYGW+DISFSELLKTLK ILP  NE+  S+YE KKTLGALGM YEKI+
Subjt:  SIKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIY

Query:  ACPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER
        ACPNDCCLYRK++A+A ECPECG SRWK+  N NEGKK+IP KV+ YFPPIPRFKR+FRSI+ AKNLIWH +ER
Subjt:  ACPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER

XP_031739988.1 uncharacterized protein LOC116403337 [Cucumis sativus]4.1e-12376.64Show/hide
Query:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGEELPNSSVHGDFSKLDTHMYEEN-DIG
        MDKSWM KSRL+KD+ELG ENFIKFGFSNT+S+ IRCPCLKCGNCEKHS   +RDHLYVN IDESYKIWFWHGEELPNSS + + SK   H  E++ D+G
Subjt:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGEELPNSSVHGDFSKLDTHMYEEN-DIG

Query:  SIKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIY
        S+KEM++VAHEEYSKDP GFEKLL DAEK L EGCKK+TKLSTLVKLYNLKVRYGW+D SFSELL+TLK ILP TNEL  S+YE KKTLGALGMEYEKI+
Subjt:  SIKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIY

Query:  ACPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER
        ACPN+CCLYRK++A+AIECPECG SRWK+ K+TNE +K+IP+KVI YFP IPRFKR+FRSI+CA+NL WH  ER
Subjt:  ACPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER

XP_031742381.1 uncharacterized protein LOC116404332 [Cucumis sativus]1.4e-12376.56Show/hide
Query:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGEELPNSSVHGDFSKLDTHMYEENDIGS
        MDKSWM KSRL+KD+ELG ENFIKFGFSNT S+ IRCPCLKCGNCEKHS   +RDHLYVN IDESYKIWFWHGEELPNSS + + SK D H  E+ D+GS
Subjt:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGEELPNSSVHGDFSKLDTHMYEENDIGS

Query:  IKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIYA
        +KEM++VAHEEYSKDP GFEKLL DAEK L EGCKK+TKLSTLVKLYNLKVRYGW+D SFSELL+TLK I+P TNEL  S+YE KKTLGALGMEYEKI+A
Subjt:  IKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIYA

Query:  CPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER
        CPN+CCLYRK++A+AIEC ECG SRWK+ K+TNE +K+IP+KVI YFP IPRFKR+FRSI+CA+NL WH  ER
Subjt:  CPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER

XP_031745762.1 uncharacterized protein LOC116406207 [Cucumis sativus]2.3e-12677.66Show/hide
Query:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGEELPNSSVHGDFSKLDTHMYEENDIGS
        MDKSWMSKSRL+K+FELG +NFI+FGFSNTN+TSIRCPCLKCGNC+KH +NDIRDHLY N IDESYKIWFWHGEELPNSS HG+ SK    MYEEND+G+
Subjt:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGEELPNSSVHGDFSKLDTHMYEENDIGS

Query:  IKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIYA
        IKEMV++AHE+YSKDP+GFEKLLND+EK L EGCKKFTKLSTLVKLYNLKV++GW++ISFSELLK LK ILP+ NEL  SMYE KK LGALGMEY+KI+A
Subjt:  IKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIYA

Query:  CPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER
        CPNDCCLYRK+YA+AI CP+CG SRWK+ K+ NE KK+IPAK++ YFPPIPRF+RMFRS++CAKNL WH +ER
Subjt:  CPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER

TrEMBL top hitse value%identityAlignment
A0A5A7TUX7 Transposase1.6e-12578.1Show/hide
Query:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGE-ELPNSSVHGDFSKLDTHMYEENDIG
        MD+SWM KSRL KD+ELG ENFI FGFSNT   SIRCPCLKCGNCEK S   IRDHLYVN IDESYKIWFWHGE +LP SS++ + SK DTHMYE ND+G
Subjt:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGE-ELPNSSVHGDFSKLDTHMYEENDIG

Query:  SIKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIY
         I EM++VAHEEYSKDPN FEKLLNDAEKSL EGCKKFTKLSTLVKLYNLKVRYGW+DISFSELLKTLK ILP  NE+  S+YE KKTLGALGM YEKI+
Subjt:  SIKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIY

Query:  ACPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER
        ACPNDCCLYRK++A+A ECPECG SRWK+  N NEGKK+IP KV+ YFPPIPRFKR+FRSI+ AKNLIWH +ER
Subjt:  ACPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER

A0A5A7U2S8 Transposase1.1e-12174.36Show/hide
Query:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGEELPNSSVHGDFSKLDTHMYEENDIGS
        MDK WM KSRL+K++ELG E+FI FGFSNT+++ IRCPCLKCGNCEKHS  D+RDHLYVN IDESYKIWFWHG+     S +G+ SK DTH  EEND+GS
Subjt:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGEELPNSSVHGDFSKLDTHMYEENDIGS

Query:  IKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIYA
        +KE+++VAHEEYSKDPNGFEKLL DAEK L EGCKK+TKLSTLVKLYNLK RYGW DISFSELLKTLK ILP TNEL  S+YE KKTLGALGMEYE+I+A
Subjt:  IKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIYA

Query:  CPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER
        CPN+CCLYRK++A+A ECPECG SRWK+ K+ NE +K+ P+KVI YFPPIPRFKR+FRSI+CA+NL WH  ER
Subjt:  CPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER

A0A5A7UL63 Transposase2.4e-10871.97Show/hide
Query:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGEELPNSSVHGDFSKLDTHMYEENDIGS
        MD+SWM KSRL+KD+ELG ENFI FGFSNT   SIRCP LKCGNCEK S   IRDHLYVN IDESYK+WFWHGE+LP S ++ + SK DT+MYEEND+GS
Subjt:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGEELPNSSVHGDFSKLDTHMYEENDIGS

Query:  IKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIYA
        I EM++VAHEEYSKDPN F KLLNDAEK L EGCKKFTKLSTLVKLYNL+VRYGW+DISFSELLKTLK ILP +NE+  SMYE KKTLGAL M YEKI+A
Subjt:  IKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIYA

Query:  CPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAK
        CPN+CCLYRK++A+A ECP+CG SRWK+  N N GKK+I        PPIP FKR+FR+    K
Subjt:  CPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAK

A0A5D3BVS7 Transposase1.6e-12578.1Show/hide
Query:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGE-ELPNSSVHGDFSKLDTHMYEENDIG
        MD+SWM KSRL KD+ELG ENFI FGFSNT   SIRCPCLKCGNCEK S   IRDHLYVN IDESYKIWFWHGE +LP SS++ + SK DTHMYE ND+G
Subjt:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGE-ELPNSSVHGDFSKLDTHMYEENDIG

Query:  SIKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIY
         I EM++VAHEEYSKDPN FEKLLNDAEKSL EGCKKFTKLSTLVKLYNLKVRYGW+DISFSELLKTLK ILP  NE+  S+YE KKTLGALGM YEKI+
Subjt:  SIKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIY

Query:  ACPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER
        ACPNDCCLYRK++A+A ECPECG SRWK+  N NEGKK+IP KV+ YFPPIPRFKR+FRSI+ AKNLIWH +ER
Subjt:  ACPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER

A0A5D3CRI9 Transposase2.9e-12276.56Show/hide
Query:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGEELPNSSVHGDFSKLDTHMYEENDIGS
        MD SWM KSRL+KD+ LG ENFI FGFSNT   SIRCPCLK GNCEK S   IRDHLYVN IDESYKIWFWHGE+LP SS++ + SK DTHMYEEND+GS
Subjt:  MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGEELPNSSVHGDFSKLDTHMYEENDIGS

Query:  IKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIYA
        I EM++VAHEEYSKDPN FEKLLNDA+K L EGCK FTKLSTLVKLYNLKVRYGW DISFSELLKTLK I P +NE+  SMYE KKTLGALGM YEKI+A
Subjt:  IKEMVDVAHEEYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIYA

Query:  CPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER
        CPNDCCLYRK++A+A ECPECG SRWK+  N N GKK+IP KV+ YFPPI RFKR+FRSID AKNLIW  +ER
Subjt:  CPNDCCLYRKDYADAIECPECGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAAATCATGGATGAGCAAGAGTAGACTAACCAAAGATTTTGAGTTGGGTTTTGAAAACTTCATCAAATTTGGATTTTCTAATACAAATAGTACATCTATTCGTTG
TCCTTGTTTAAAATGTGGGAATTGTGAAAAACATAGTAGCAATGATATTAGAGATCATTTATATGTCAATGACATTGATGAAAGTTATAAAATTTGGTTTTGGCATGGTG
AAGAACTTCCCAATTCATCTGTACATGGAGATTTTTCTAAGTTGGATACCCATATGTATGAAGAGAACGATATTGGAAGTATAAAAGAAATGGTTGACGTTGCTCATGAG
GAATATTCTAAAGACCCAAATGGATTTGAGAAGTTGCTTAATGATGCTGAAAAATCCTTATGTGAAGGATGTAAAAAATTCACCAAGTTGTCTACGTTGGTAAAATTGTA
TAATTTGAAAGTTAGATATGGATGGAATGATATTAGTTTTTCAGAACTACTCAAAACTTTAAAGATGATCTTGCCTGCTACTAATGAGCTCGCACCATCAATGTATGAAA
CAAAGAAAACATTAGGTGCATTAGGAATGGAATATGAAAAAATTTATGCATGTCCTAATGATTGTTGTTTGTATAGAAAGGATTATGCTGATGCAATTGAATGTCCAGAA
TGTGGTGCATCAAGGTGGAAACATGATAAGAATACAAATGAAGGAAAAAAAGAAATTCCTGCAAAAGTAATAAGGTATTTTCCACCTATTCCAAGATTCAAAAGGATGTT
TAGAAGTATTGATTGTGCTAAAAACTTGATCTGGCACGTGGATGAGAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACAAATCATGGATGAGCAAGAGTAGACTAACCAAAGATTTTGAGTTGGGTTTTGAAAACTTCATCAAATTTGGATTTTCTAATACAAATAGTACATCTATTCGTTG
TCCTTGTTTAAAATGTGGGAATTGTGAAAAACATAGTAGCAATGATATTAGAGATCATTTATATGTCAATGACATTGATGAAAGTTATAAAATTTGGTTTTGGCATGGTG
AAGAACTTCCCAATTCATCTGTACATGGAGATTTTTCTAAGTTGGATACCCATATGTATGAAGAGAACGATATTGGAAGTATAAAAGAAATGGTTGACGTTGCTCATGAG
GAATATTCTAAAGACCCAAATGGATTTGAGAAGTTGCTTAATGATGCTGAAAAATCCTTATGTGAAGGATGTAAAAAATTCACCAAGTTGTCTACGTTGGTAAAATTGTA
TAATTTGAAAGTTAGATATGGATGGAATGATATTAGTTTTTCAGAACTACTCAAAACTTTAAAGATGATCTTGCCTGCTACTAATGAGCTCGCACCATCAATGTATGAAA
CAAAGAAAACATTAGGTGCATTAGGAATGGAATATGAAAAAATTTATGCATGTCCTAATGATTGTTGTTTGTATAGAAAGGATTATGCTGATGCAATTGAATGTCCAGAA
TGTGGTGCATCAAGGTGGAAACATGATAAGAATACAAATGAAGGAAAAAAAGAAATTCCTGCAAAAGTAATAAGGTATTTTCCACCTATTCCAAGATTCAAAAGGATGTT
TAGAAGTATTGATTGTGCTAAAAACTTGATCTGGCACGTGGATGAGAGGTAA
Protein sequenceShow/hide protein sequence
MDKSWMSKSRLTKDFELGFENFIKFGFSNTNSTSIRCPCLKCGNCEKHSSNDIRDHLYVNDIDESYKIWFWHGEELPNSSVHGDFSKLDTHMYEENDIGSIKEMVDVAHE
EYSKDPNGFEKLLNDAEKSLCEGCKKFTKLSTLVKLYNLKVRYGWNDISFSELLKTLKMILPATNELAPSMYETKKTLGALGMEYEKIYACPNDCCLYRKDYADAIECPE
CGASRWKHDKNTNEGKKEIPAKVIRYFPPIPRFKRMFRSIDCAKNLIWHVDER