CuGenDBv2

CmUC09G166540 (gene) of Watermelon (USVL531) v1 genome

Gene ID: CmUC09G166540
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Titin-like isoform X2
Genome location: CmU531Chr09:5820947..5826413
RNA-Seq Expression: CmUC09G166540
Synteny: CmUC09G166540
Gene Ontology terms: NA
InterPro domains: NA
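
The genome location in this record follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention), the location can be parsed and the gene span computed in Python. The names `parse_location` and `span_length` are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re

# Pattern for strings such as "CmU531Chr09:5820947..5826413".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CmU531Chr09:5820947..5826413")
print(seqid, span_length(start, end))
```

Under that assumption the gene spans 5467 bp on CmU531Chr09. If the coordinates were 0-based and half-open instead, the `+ 1` would have to be dropped.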


Homology
GenBank top hits (e-value, % identity, alignment)
XP_008451360.1 PREDICTED: uncharacterized protein LOC103492673 isoform X1 [Cucumis melo] (e-value: 6.8e-99; identity: 78.63%)
Query:  MRKK-TQAAKTGANAELSTEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDH
        MRKK +QAAK+ ANA L TEAA ES+MT SERP+ TN S KK+ SPL SQSTSKKKVNRFSIRRS RIQNS+ R+ KIQ+V+EEITLSESDEE+E+PT+H
Subjt:  MRKK-TQAAKTGANAELSTEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDH

Query:  EKSLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAY
        EKSLPP  QE D P LMMKERKFEGK+DYI+KL E HG  LDSIKTEVIKRSF  E +PTPEMNYK+MYIASQKKIEELAEENRVLTQKLENALDR++AY
Subjt:  EKSLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAY

Query:  KNGNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN
        KNGNHDAFEMLEKLKDVIVI  GLRVSES QATS  ELDK   +D G VPPPSKRKKL KQN
Subjt:  KNGNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN

XP_011659306.1 uncharacterized protein LOC105436161 isoform X1 [Cucumis sativus] (e-value: 4.2e-101; identity: 79.23%)
Query:  RKKTQAAKTGANAELSTEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDHEK
        +K++QAAK+ ANA L TEAA ESTMT SERP+ TN SL K+ SPLTSQSTSK+KVNRFSIRRS RIQNS+PRN KIQ+V+EEITLSESDEEDELPT+HEK
Subjt:  RKKTQAAKTGANAELSTEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDHEK

Query:  SLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAYKN
        SLPP  QE D P LM+KERK EGK+DYIV L E HG+ LDSIKTEVIKRSF  E +PTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDR++AYKN
Subjt:  SLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAYKN

Query:  GNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN
        GNHDAFEMLEKLKDVIVI  GLRVS+S QATS  EL+K TS+D G VPPPSKRKK  KQN
Subjt:  GNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN

XP_031744191.1 uncharacterized protein LOC105436161 isoform X2 [Cucumis sativus] (e-value: 2.8e-97; identity: 80.74%)
Query:  TEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDHEKSLPPPDQENDWPGLMM
        TEAA ESTMT SERP+ TN SL K+ SPLTSQSTSK+KVNRFSIRRS RIQNS+PRN KIQ+V+EEITLSESDEEDELPT+HEKSLPP  QE D P LM+
Subjt:  TEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDHEKSLPPPDQENDWPGLMM

Query:  KERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAYKNGNHDAFEMLEKLKDVI
        KERK EGK+DYIV L E HG+ LDSIKTEVIKRSF  E +PTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDR++AYKNGNHDAFEMLEKLKDVI
Subjt:  KERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAYKNGNHDAFEMLEKLKDVI

Query:  VISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN
        VI  GLRVS+S QATS  EL+K TS+D G VPPPSKRKK  KQN
Subjt:  VISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN

XP_031744192.1 uncharacterized protein LOC105436161 isoform X3 [Cucumis sativus] (e-value: 2.8e-97; identity: 80.74%)
Query:  TEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDHEKSLPPPDQENDWPGLMM
        TEAA ESTMT SERP+ TN SL K+ SPLTSQSTSK+KVNRFSIRRS RIQNS+PRN KIQ+V+EEITLSESDEEDELPT+HEKSLPP  QE D P LM+
Subjt:  TEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDHEKSLPPPDQENDWPGLMM

Query:  KERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAYKNGNHDAFEMLEKLKDVI
        KERK EGK+DYIV L E HG+ LDSIKTEVIKRSF  E +PTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDR++AYKNGNHDAFEMLEKLKDVI
Subjt:  KERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAYKNGNHDAFEMLEKLKDVI

Query:  VISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN
        VI  GLRVS+S QATS  EL+K TS+D G VPPPSKRKK  KQN
Subjt:  VISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN

XP_038899521.1 uncharacterized protein LOC120086802 [Benincasa hispida] (e-value: 1.5e-109; identity: 83.91%)
Query:  MRKKTQAAKTGANAELSTEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDHE
        MRKK +A KTG +AELSTEAAHESTMTTSE  +GTNTSLKK+ SPLTSQSTSK++ NRFSIRRSGRI+NS P++ KIQSV+EEITLSESD EDELPTDHE
Subjt:  MRKKTQAAKTGANAELSTEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDHE

Query:  KSLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAYK
        KSLPPPDQENDW  LM KERKFEGKIDYIVKLLE HGY LDSIKTEVIKRS S E VPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALD+++AYK
Subjt:  KSLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAYK

Query:  NGNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN
        NGNHDAFEMLEKLKDVIVISN L+VSESTQA S  ELDK TS+DAGDVPPP K+KKL  QN
Subjt:  NGNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0K4Z7 Uncharacterized protein (e-value: 2.1e-101; identity: 79.23%)
Query:  RKKTQAAKTGANAELSTEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDHEK
        +K++QAAK+ ANA L TEAA ESTMT SERP+ TN SL K+ SPLTSQSTSK+KVNRFSIRRS RIQNS+PRN KIQ+V+EEITLSESDEEDELPT+HEK
Subjt:  RKKTQAAKTGANAELSTEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDHEK

Query:  SLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAYKN
        SLPP  QE D P LM+KERK EGK+DYIV L E HG+ LDSIKTEVIKRSF  E +PTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDR++AYKN
Subjt:  SLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAYKN

Query:  GNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN
        GNHDAFEMLEKLKDVIVI  GLRVS+S QATS  EL+K TS+D G VPPPSKRKK  KQN
Subjt:  GNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN

A0A1S3BS36 uncharacterized protein LOC103492673 isoform X1 (e-value: 3.3e-99; identity: 78.63%)
Query:  MRKK-TQAAKTGANAELSTEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDH
        MRKK +QAAK+ ANA L TEAA ES+MT SERP+ TN S KK+ SPL SQSTSKKKVNRFSIRRS RIQNS+ R+ KIQ+V+EEITLSESDEE+E+PT+H
Subjt:  MRKK-TQAAKTGANAELSTEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDH

Query:  EKSLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAY
        EKSLPP  QE D P LMMKERKFEGK+DYI+KL E HG  LDSIKTEVIKRSF  E +PTPEMNYK+MYIASQKKIEELAEENRVLTQKLENALDR++AY
Subjt:  EKSLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAY

Query:  KNGNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN
        KNGNHDAFEMLEKLKDVIVI  GLRVSES QATS  ELDK   +D G VPPPSKRKKL KQN
Subjt:  KNGNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN

A0A6J1GMZ1 uncharacterized protein LOC111455873 isoform X2 (e-value: 5.8e-96; identity: 76.63%)
Query:  MRKKTQAAKTGANAELSTEAAHESTMTTSERPK-GTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDH
        MR K+QAAK+G NAELSTEA HESTMT SER + GTN SLKKV SP TS+S+SKKKVN  SIRRS RIQNS+P N KIQ+V+EEI LSESD+EDELPTDH
Subjt:  MRKKTQAAKTGANAELSTEAAHESTMTTSERPK-GTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDH

Query:  EKSLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAY
        EKS P P +EN W  LMM  R+FE KIDYI+KL E HG+ LDSIKTEVIKRSF TE VPTP+MNYKSMYIASQKKIEELAEENRVLT+KLENA + F+AY
Subjt:  EKSLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAY

Query:  KNGNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQ
        KNGN DAFEMLEKLKDV++ISNGLR SESTQATS  ELDK+  +DA +VPP SKRKKLTKQ
Subjt:  KNGNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQ

A0A6J1JW00 uncharacterized protein LOC111488838 isoform X2 (e-value: 8.4e-95; identity: 76.25%)
Query:  MRKKTQAAKTGANAELSTEAAHESTMTTSERPK-GTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDH
        MR K+QAAK+G NAELSTEA HESTMT SE+ + GTN SLKKV SP TS+S+SKKKVN  SIRRS RIQNS+P N KIQ+VVEEITLSESD+EDELPTDH
Subjt:  MRKKTQAAKTGANAELSTEAAHESTMTTSERPK-GTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDH

Query:  EKSLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAY
        EKS P P ++N WP LMM  R FE KID I+KL E HG+ LDSIKTEVIKRSF  E VPTP+MNYKSMYIASQKKIEELAEENRVLT+KLENA +RF+AY
Subjt:  EKSLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAY

Query:  KNGNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQ
        KNGN DAF MLEKLKDV++I NGLR SESTQATS  ELDK+T ++AG+VPP SKRKKLTKQ
Subjt:  KNGNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQ

A0A6J1KXG8 uncharacterized protein LOC111498023 (e-value: 4.0e-97; identity: 77.86%)
Query:  MRKKTQAAKTGANAELSTEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDHE
        MRKK+ AAK+G NAELSTEAAHES MTTSERP+ TN SLKKV SPL S+S+SKKK+N FS+RRS RIQNS P+N KIQSV+EEITLSESD +DELPTDHE
Subjt:  MRKKTQAAKTGANAELSTEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDHE

Query:  KSLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTE-AVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAY
        KS PPPD+EN WP LMM+ RKFEGKIDYIVKLLE HG+ LDSIKTEVIKRS STE  VPTP+MNYKS+YIASQKKIEELAEEN+VLT KLE AL  F+AY
Subjt:  KSLPPPDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTE-AVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAY

Query:  KNGNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN
        KNGN DAFEMLEKLKDVI+ISN L+VSESTQATS  EL+KITS+   DV P SK+KK +KQN
Subjt:  KNGNHDAFEMLEKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN

SwissProt top hits: no hits found
Arabidopsis top hits: no hits found

Sequences
CDS sequence
ATGAGGAAGAAAACTCAGGCTGCAAAAACTGGGGCAAATGCAGAATTGAGCACAGAAGCAGCACACGAAAGCACAATGACTACCTCTGAGAGACCAAAAGGAACC
AATACATCCTTGAAGAAAGTAACATCACCACTGACCAGCCAAAGTACTTCTAAAAAGAAAGTGAATAGGTTCTCTATCCGTCGATCTGGACGTATCCAAAATTCT
TCACCTCGAAACCGCAAAATACAAAGCGTCGTTGAAGAGATTACTCTTAGCGAAAGTGATGAAGAAGATGAATTGCCTACTGACCATGAGAAAAGTTTGCCACCT
CCTGACCAAGAGAATGATTGGCCAGGGTTAATGATGAAGGAAAGGAAATTTGAAGGGAAAATTGACTATATTGTAAAACTACTTGAAGAACATGGCTATAACTTA
GACTCAATAAAGACTGAGGTTATCAAAAGATCCTTCTCGACAGAAGCGGTACCTACACCTGAAATGAATTACAAGAGCATGTACATAGCATCCCAGAAGAAGATT
GAAGAACTAGCTGAAGAAAATCGAGTTCTTACTCAAAAATTGGAGAACGCTCTTGACCGATTCAAAGCATACAAGAATGGGAATCATGATGCTTTTGAAATGTTG
GAGAAGTTGAAGGATGTCATCGTGATCTCGAACGGTTTGAGAGTTTCTGAATCAACTCAAGCTACGTCTCATGGAGAACTAGACAAGATTACGTCTGTCGACGCT
GGAGATGTTCCTCCTCCAAGCAAGAGAAAGAAGCTCACTAAACAAAATTGA
mRNA sequence
GTGAACTCCAAATGCTCAGACGTTCTGAAATTTGAAGATTCCATTTTCAGAAAGCGATTCTTAATGACGCTCTTTTTGATTCTCTTCTGTAATAAATTGGATGAT
CATCAAAATGGGTTTTGTGAATCCTCCATAATAAATTAGCTGTCCTTCTCCTTCTCCCCAATCTCTCAAATGGGGTTCTTCTCATTCGATTGAAACGTAAAGTTT
GGTGGGAAATTTCTATGCTTTGATCTCTTTTTTTTTTTTTCTCAAAAGAGTTTTCTCTTTACGGCTATAAAATGAGGAAGAAAACTCAGGCTGCAAAAACTGGGG
CAAATGCAGAATTGAGCACAGAAGCAGCACACGAAAGCACAATGACTACCTCTGAGAGACCAAAAGGAACCAATACATCCTTGAAGAAAGTAACATCACCACTGA
CCAGCCAAAGTACTTCTAAAAAGAAAGTGAATAGGTTCTCTATCCGTCGATCTGGACGTATCCAAAATTCTTCACCTCGAAACCGCAAAATACAAAGCGTCGTTG
AAGAGATTACTCTTAGCGAAAGTGATGAAGAAGATGAATTGCCTACTGACCATGAGAAAAGTTTGCCACCTCCTGACCAAGAGAATGATTGGCCAGGGTTAATGA
TGAAGGAAAGGAAATTTGAAGGGAAAATTGACTATATTGTAAAACTACTTGAAGAACATGGCTATAACTTAGACTCAATAAAGACTGAGGTTATCAAAAGATCCT
TCTCGACAGAAGCGGTACCTACACCTGAAATGAATTACAAGAGCATGTACATAGCATCCCAGAAGAAGATTGAAGAACTAGCTGAAGAAAATCGAGTTCTTACTC
AAAAATTGGAGAACGCTCTTGACCGATTCAAAGCATACAAGAATGGGAATCATGATGCTTTTGAAATGTTGGAGAAGTTGAAGGATGTCATCGTGATCTCGAACG
GTTTGAGAGTTTCTGAATCAACTCAAGCTACGTCTCATGGAGAACTAGACAAGATTACGTCTGTCGACGCTGGAGATGTTCCTCCTCCAAGCAAGAGAAAGAAGC
TCACTAAACAAAATTGAACTATAATATGGGTTCCCTCCTCATGGTTATAACTCTTTCCTTGCTTGCTCTTTAGCTAAACTTGTTTTGTCAACACAAGCCTTATAT
AGCCCTCTCATTAGCCATGAAAATGAATGAAGGAGGTTAGATTTT
Protein sequence
MRKKTQAAKTGANAELSTEAAHESTMTTSERPKGTNTSLKKVTSPLTSQSTSKKKVNRFSIRRSGRIQNSSPRNRKIQSVVEEITLSESDEEDELPTDHEKSLPP
PDQENDWPGLMMKERKFEGKIDYIVKLLEEHGYNLDSIKTEVIKRSFSTEAVPTPEMNYKSMYIASQKKIEELAEENRVLTQKLENALDRFKAYKNGNHDAFEML
EKLKDVIVISNGLRVSESTQATSHGELDKITSVDAGDVPPPSKRKKLTKQN
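
The CDS and protein sequences listed in this record can be checked against each other by translating the CDS. The following is a minimal Python sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene. The function `translate` and the constant names are hypothetical helpers for illustration, not a CuGenDB tool. Only the first and last CDS lines from the record are used as examples, so both start at a codon boundary.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the CDS in this record (105 nt, 35 codons).
CDS_FIRST_LINE = (
    "ATGAGGAAGAAAACTCAGGCTGCAAAAACTGGGGCAAATGCAGAATTGAGCACAGAAGCAGCACACGAAAGC"
    "ACAATGACTACCTCTGAGAGACCAAAAGGAACC"
)
# Last line of the CDS in this record (51 nt, ends with the TGA stop codon).
CDS_LAST_LINE = "GGAGATGTTCCTCCTCCAAGCAAGAGAAAGAAGCTCACTAAACAAAATTGA"

print(translate(CDS_FIRST_LINE))
print(translate(CDS_LAST_LINE))
```

The two fragments translate to the first 35 residues (MRKKTQAAKTGANAELSTEAAHESTMTTSERPKGT) and the last 16 residues (GDVPPPSKRKKLTKQN) of the protein sequence given in this record. Passing the full CDS, with its line breaks, to `translate` would allow the same check over the whole sequence.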