; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G005060 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G005060
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionTITAN-like protein isoform X1
Genome locationchr02:4304664..4308925
RNA-Seq ExpressionLsi02G005060
SyntenyLsi02G005060
Gene Ontology termsGO:0009793 - embryo development ending in seed dormancy (biological process)
GO:0009960 - endosperm development (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsIPR028015 - CCDC84-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011652149.1 TITAN-like protein isoform X3 [Cucumis sativus]1.1e-22689.77Show/hide
Query:  MRNMNKKE-KKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLA
        M NM KKE KKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLS+FLSRFEIKLSDVRFFL TP  LSPE+ASHNRFWCIFCDVQVDENDSSFACSNAIKHLA
Subjt:  MRNMNKKE-KKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLA

Query:  SADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSS
        SADHLKNLKHF WKYGGDVERLD+YRILDADVAKWEKKCKVQSV+ASSSLG  NDI NQV+Y NFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSS
Subjt:  SADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSS

Query:  YSGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGG
        YSGSS VSN VSFPHDTTVSLH GSCSGAH+WSSK+LT SE NKHYQLD GRTC ANG  SGQGMY MHQNERT N ESHPEGFQTLTRISNIVSGDSGG
Subjt:  YSGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGG

Query:  NVHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKE
        N++SGMLPPWLE PEDSGF V+IRP+VGGGVSSL ES+KS KLNPKRVGAAWAEKRK ELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKE
Subjt:  NVHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKE

Query:  KSKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS
        KSK LMVENSPETNVNIQPY+SKRMRRD+ENE+D ANHTS
Subjt:  KSKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS

XP_022924143.1 TITAN-like protein isoform X4 [Cucurbita moschata]3.0e-22488.61Show/hide
Query:  MRNMNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLAS
        MRNM KKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTP RL+PEYASHNRFWCIFC+V+VDENDSSFACSNAIKHLAS
Subjt:  MRNMNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLAS

Query:  ADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSY
        ADHLKNLKHFLWKYGGD+ERL+NYRIL+AD AKWE KCKVQSVAASSSLG  NDI NQV+YG FDNFGNNNIHSVESSSSISVLPL SYTNEYQVSNSSY
Subjt:  ADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSY

Query:  SGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGN
        SGSS VSN VS PH+TT SLHAGSCSGAHVWS K+L  ++DNKHY L+SGRTC ANGHFSGQGM  MHQ+ER +NEESHPEGFQTLTRISNIVSGDSGGN
Subjt:  SGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGN

Query:  VHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEK
        V SGMLPPWLEN EDSGFKV+IRPVVGGGVSSLNES+KSKKLNPKRVGAAWAEKRK+E+EMEKRGEIVQSY D+NWLPNFGRVWQSGSRKESRKEFEKEK
Subjt:  VHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEK

Query:  SKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS
        SKFLMVEN  E NVNIQPY+SKRMRRDRE+ DDTANH S
Subjt:  SKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS

XP_023519525.1 TITAN-like protein isoform X3 [Cucurbita pepo subsp. pepo]2.4e-22689.29Show/hide
Query:  MRNMNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLAS
        MRNM KKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTP RL+PEYASHNRFWCIFC+V+VDENDSSFACSNAIKHLAS
Subjt:  MRNMNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLAS

Query:  ADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSY
        ADHLKNLKHFLWKYGGD+ERL+NYRIL+AD AKWEKKCKVQSVAASSSLG  NDI NQV+YG FDNFGNNNIHSVESSSSISVLPL SYTNEYQVSNSSY
Subjt:  ADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSY

Query:  SGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGN
        SGSS VSN VS PH+TT SLHAGSCSGAHVWS K+L  +EDNKHY L+SGRTC ANGHFSGQGM  MHQ+ERT+NEESHPEGFQTLTRISNIVSGDSGGN
Subjt:  SGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGN

Query:  VHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEK
        V SGMLPPWLEN EDSGFKV+IRPVVGGGVSSLNES+KSKKLNPKRVGAAWAEKRK+E+EMEKRGEIVQSY D+NWLPNFGRVWQSGSRKESRKEFEKEK
Subjt:  VHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEK

Query:  SKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS
        SKFLMVEN PE NV+IQPY+SKRMRRDRE+ DDTANH S
Subjt:  SKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS

XP_038894287.1 TITAN-like protein isoform X1 [Benincasa hispida]5.1e-23291.57Show/hide
Query:  MRNMNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLAS
        MRNM KKEKKSAYEYC VCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFL TP  LSPEY+SHNRFWCIFCDVQV+ENDSSFACSNAIKHLAS
Subjt:  MRNMNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLAS

Query:  ADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSY
        ADHLKNLKHF WK GGDV+RLD+YR+L+ADVAKWEKKCKVQS++ASSS G TNDI NQV+YGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSS+
Subjt:  ADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSY

Query:  SGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGN
        SGSS VSN VSFPHDT VSLHAGSCSGAHVWSSK+LTFSEDNKHY LDSGRTC ANGH SGQGMYE HQNERTVNE SHPEGFQTLTRISNIV GDSGGN
Subjt:  SGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGN

Query:  VHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEK
        VHSGMLPPWLENPEDSGFKV+I PVVGGGV SLNES+KSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQ YGDKNWLPNFGRVWQSGSRKESRKEFEKEK
Subjt:  VHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEK

Query:  SKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS
        SK LMVENSPETNVNIQPY+SKRMRRDRENEDDTANHTS
Subjt:  SKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS

XP_038894293.1 TITAN-like protein isoform X2 [Benincasa hispida]3.7e-23091.34Show/hide
Query:  MRNMNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLAS
        MRNM KKEKKSAYEYC VCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFL TP  LSPEY+SHNRFWCIFCDVQV+ENDSSFAC NAIKHLAS
Subjt:  MRNMNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLAS

Query:  ADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSY
        ADHLKNLKHF WK GGDV+RLD+YR+L+ADVAKWEKKCKVQS++ASSS G TNDI NQV+YGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSS+
Subjt:  ADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSY

Query:  SGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGN
        SGSS VSN VSFPHDT VSLHAGSCSGAHVWSSK+LTFSEDNKHY LDSGRTC ANGH SGQGMYE HQNERTVNE SHPEGFQTLTRISNIV GDSGGN
Subjt:  SGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGN

Query:  VHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEK
        VHSGMLPPWLENPEDSGFKV+I PVVGGGV SLNES+KSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQ YGDKNWLPNFGRVWQSGSRKESRKEFEKEK
Subjt:  VHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEK

Query:  SKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS
        SK LMVENSPETNVNIQPY+SKRMRRDRENEDDTANHTS
Subjt:  SKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS

TrEMBL top hitse value%identityAlignment
A0A0A0LUE2 Uncharacterized protein5.3e-22789.77Show/hide
Query:  MRNMNKKE-KKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLA
        M NM KKE KKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLS+FLSRFEIKLSDVRFFL TP  LSPE+ASHNRFWCIFCDVQVDENDSSFACSNAIKHLA
Subjt:  MRNMNKKE-KKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLA

Query:  SADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSS
        SADHLKNLKHF WKYGGDVERLD+YRILDADVAKWEKKCKVQSV+ASSSLG  NDI NQV+Y NFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSS
Subjt:  SADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSS

Query:  YSGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGG
        YSGSS VSN VSFPHDTTVSLH GSCSGAH+WSSK+LT SE NKHYQLD GRTC ANG  SGQGMY MHQNERT N ESHPEGFQTLTRISNIVSGDSGG
Subjt:  YSGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGG

Query:  NVHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKE
        N++SGMLPPWLE PEDSGF V+IRP+VGGGVSSL ES+KS KLNPKRVGAAWAEKRK ELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKE
Subjt:  NVHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKE

Query:  KSKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS
        KSK LMVENSPETNVNIQPY+SKRMRRD+ENE+D ANHTS
Subjt:  KSKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS

A0A1S3AZI0 TITAN-like protein isoform X16.7e-22288.56Show/hide
Query:  MNKKE-KKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLASAD
        M KKE KKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLS+FLSRFEIKLSDVRFFL TP RLSPE+ASHNRFWCIFCDVQVDE DSSFACSNAIKHLASAD
Subjt:  MNKKE-KKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLASAD

Query:  HLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSYSG
        HLKNLKHF WKYGGDVERLD+YRIL+ADVAKWEKKCKVQSV+ASSSLG  NDI NQV+Y NFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSYSG
Subjt:  HLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSYSG

Query:  SSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGNVH
        SS VSN VSFPHDTTVSLH GSCS AH+WSSK+LT SE NKHYQLDSGRTC ANG  SGQGMY  HQNE T N+ESHPEGFQTLTRIS+IV+GDSGGNVH
Subjt:  SSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGNVH

Query:  SGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEKSK
        SGMLPPWLE PEDSGF V+IRP+V GGV SL ES+KSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSY DKNWLPNFGRVWQSGSRKESRKEFEKEKSK
Subjt:  SGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEKSK

Query:  FLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS
         LMVENSPETNVNIQPY+SKRMRRD+EN++D AN+TS
Subjt:  FLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS

A0A5A7UJL1 TITAN-like protein isoform X16.7e-22288.56Show/hide
Query:  MNKKE-KKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLASAD
        M KKE KKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLS+FLSRFEIKLSDVRFFL TP RLSPE+ASHNRFWCIFCDVQVDE DSSFACSNAIKHLASAD
Subjt:  MNKKE-KKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLASAD

Query:  HLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSYSG
        HLKNLKHF WKYGGDVERLD+YRIL+ADVAKWEKKCKVQSV+ASSSLG  NDI NQV+Y NFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSYSG
Subjt:  HLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSYSG

Query:  SSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGNVH
        SS VSN VSFPHDTTVSLH GSCS AH+WSSK+LT SE NKHYQLDSGRTC ANG  SGQGMY  HQNE T N+ESHPEGFQTLTRIS+IV+GDSGGNVH
Subjt:  SSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGNVH

Query:  SGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEKSK
        SGMLPPWLE PEDSGF V+IRP+V GGV SL ES+KSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSY DKNWLPNFGRVWQSGSRKESRKEFEKEKSK
Subjt:  SGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEKSK

Query:  FLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS
         LMVENSPETNVNIQPY+SKRMRRD+EN++D AN+TS
Subjt:  FLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS

A0A6J1EBJ7 TITAN-like protein isoform X41.4e-22488.61Show/hide
Query:  MRNMNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLAS
        MRNM KKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTP RL+PEYASHNRFWCIFC+V+VDENDSSFACSNAIKHLAS
Subjt:  MRNMNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLAS

Query:  ADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSY
        ADHLKNLKHFLWKYGGD+ERL+NYRIL+AD AKWE KCKVQSVAASSSLG  NDI NQV+YG FDNFGNNNIHSVESSSSISVLPL SYTNEYQVSNSSY
Subjt:  ADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSY

Query:  SGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGN
        SGSS VSN VS PH+TT SLHAGSCSGAHVWS K+L  ++DNKHY L+SGRTC ANGHFSGQGM  MHQ+ER +NEESHPEGFQTLTRISNIVSGDSGGN
Subjt:  SGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGN

Query:  VHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEK
        V SGMLPPWLEN EDSGFKV+IRPVVGGGVSSLNES+KSKKLNPKRVGAAWAEKRK+E+EMEKRGEIVQSY D+NWLPNFGRVWQSGSRKESRKEFEKEK
Subjt:  VHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEK

Query:  SKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS
        SKFLMVEN  E NVNIQPY+SKRMRRDRE+ DDTANH S
Subjt:  SKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS

A0A6J1KEY9 TITAN-like protein8.0e-22386.8Show/hide
Query:  MRNMNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLAS
        MR M KKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTP  L+PEYASHNRFWCIFC+V+VDENDSSFACSNAIKHLAS
Subjt:  MRNMNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLAS

Query:  ADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLG--------ATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNE
        ADHLKNLKHFLWKYGGD+ERL+NYRIL+AD AKWEKKCKVQS+AA+SSLG          NDI NQV+YG FDNFGNNNIHSVESSSSISVLPLHSYTNE
Subjt:  ADHLKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLG--------ATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNE

Query:  YQVSNSSYSGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNI
        YQVSNSSYSGSS VSN VS PH+TT SLHAGSCSGAHVWS K+L  +EDNKHY L+SGRTC ANGHFSGQGM  MHQ+ERT+NEESH EGFQTLTRISNI
Subjt:  YQVSNSSYSGSSGVSNFVSFPHDTTVSLHAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNI

Query:  VSGDSGGNVHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKES
        VSGDSGGNV SGMLPPWLEN EDSGFKV+IRPVVGGGVSSLNES+KSKKLNPKRVGAAWAEKRK+E+EM+KRGEIVQSY D+NWLPNFGRVWQSGSRKES
Subjt:  VSGDSGGNVHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKES

Query:  RKEFEKEKSKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS
        RKEFEKEKSKFLMVEN PE NVNIQPY+SKRMRRDRE+ DDTANH S
Subjt:  RKEFEKEKSKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTS

SwissProt top hitse value%identityAlignment
F4JRR5 TITAN-like protein6.5e-8944.14Show/hide
Query:  MNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLASADH
        M K  KKS  E+C VC+ +HDQG RHKYFP HK SLS+ L RF  K++DVRFFL  P  L P+  S NR WC+FCD  + E  SSFACS AI H AS+DH
Subjt:  MNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLASADH

Query:  LKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKV---QSVAASSSL----GATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVS
        LKN+K FL K G  ++ +D +RI +ADVAKWEKKC+    +  +   S     G +NDI  ++ +   D       H + S  S  V+PL   TNEYQ+S
Subjt:  LKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKV---QSVAASSSL----GATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVS

Query:  NSSYSGSSGVSNFVSFPHDTTVSL--HAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVS
         S   G     ++++   D+   L   +G+  G H    +S  +S                NG++  Q  Y++ Q+++ ++   +P G   +T IS+  S
Subjt:  NSSYSGSSGVSNFVSFPHDTTVSL--HAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVS

Query:  GDSGGNVHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRK
         D+GGNVHSG  PPWL+  +     V++         +     K++KLNP RVGAAWAE+RK+E+EMEK G + +S  D +WLPNFGRVWQSG+RKESRK
Subjt:  GDSGGNVHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRK

Query:  EFEKEKSKFLMVEN-SPETN-VNIQPYLSKRMRRD
        EFEKEK K +  E+ S E+  V IQPY+SKR RR+
Subjt:  EFEKEKSKFLMVEN-SPETN-VNIQPYLSKRMRRD

Q4VA36 Centrosomal AT-AC splicing factor4.8e-0733.1Show/hide
Query:  NGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGNVHSGMLPPW-LENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEK
        NGH +       H   + V E    E  Q LT I +       GN+HSG  PPW ++  E S   + I P     +    E  K KKL P RVGA +   
Subjt:  NGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGNVHSGMLPPW-LENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEK

Query:  RKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKE
                       S     WLP+FGRVW +G R +SR +F+ E
Subjt:  RKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKE

Q86UT8 Centrosomal AT-AC splicing factor4.0e-0636.27Show/hide
Query:  GNVHSGMLPPW-LENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFE
        GN+HSG  PPW +++ E      EI P     +    E  K KKL P RVGA +                  S     WLP+FGRVW +G R +SR +F+
Subjt:  GNVHSGMLPPW-LENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFE

Query:  KE
         E
Subjt:  KE

Arabidopsis top hitse value%identityAlignment
AT4G24900.1 unknown protein4.6e-9044.14Show/hide
Query:  MNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLASADH
        M K  KKS  E+C VC+ +HDQG RHKYFP HK SLS+ L RF  K++DVRFFL  P  L P+  S NR WC+FCD  + E  SSFACS AI H AS+DH
Subjt:  MNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLASADH

Query:  LKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKV---QSVAASSSL----GATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVS
        LKN+K FL K G  ++ +D +RI +ADVAKWEKKC+    +  +   S     G +NDI  ++ +   D       H + S  S  V+PL   TNEYQ+S
Subjt:  LKNLKHFLWKYGGDVERLDNYRILDADVAKWEKKCKV---QSVAASSSL----GATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVS

Query:  NSSYSGSSGVSNFVSFPHDTTVSL--HAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVS
         S   G     ++++   D+   L   +G+  G H    +S  +S                NG++  Q  Y++ Q+++ ++   +P G   +T IS+  S
Subjt:  NSSYSGSSGVSNFVSFPHDTTVSL--HAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVS

Query:  GDSGGNVHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRK
         D+GGNVHSG  PPWL+  +     V++         +     K++KLNP RVGAAWAE+RK+E+EMEK G + +S  D +WLPNFGRVWQSG+RKESRK
Subjt:  GDSGGNVHSGMLPPWLENPEDSGFKVEIRPVVGGGVSSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRK

Query:  EFEKEKSKFLMVEN-SPETN-VNIQPYLSKRMRRD
        EFEKEK K +  E+ S E+  V IQPY+SKR RR+
Subjt:  EFEKEKSKFLMVEN-SPETN-VNIQPYLSKRMRRD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTAATATGAATAAAAAGGAGAAGAAGAGCGCATACGAATACTGCCTCGTCTGTAAACTAAACCACGACCAAGGGCAGCGCCACAAGTATTTTCCCAACCACAAGAA
ATCTCTTTCTGCTTTTCTATCTCGGTTCGAGATCAAGCTGTCCGACGTTCGCTTTTTTCTCAACACCCCTCTTCGCCTCTCGCCCGAGTATGCTTCTCACAATCGGTTCT
GGTGCATCTTTTGCGATGTTCAAGTCGATGAGAATGATAGTTCCTTCGCATGTAGCAATGCAATTAAACACCTGGCCAGTGCTGATCATCTGAAGAATTTGAAGCATTTC
TTATGGAAGTATGGTGGTGATGTGGAACGTCTGGATAATTACAGAATTTTGGACGCTGATGTAGCTAAGTGGGAGAAGAAGTGCAAAGTACAAAGCGTAGCTGCTTCATC
CAGCCTTGGAGCTACAAATGATATCCAAAATCAAGTCGAATATGGAAATTTTGATAATTTTGGGAATAATAATATCCACTCTGTTGAATCTAGTTCGTCAATTAGTGTTT
TGCCTTTACACAGTTATACAAATGAGTATCAGGTATCCAATTCATCCTATTCAGGATCCTCTGGTGTTTCAAATTTTGTCTCGTTTCCACATGATACCACTGTTTCTTTG
CATGCTGGCTCATGTTCTGGTGCACATGTATGGAGCTCAAAGAGTTTAACATTTAGCGAGGACAACAAGCATTACCAACTGGATAGTGGTAGAACATGCAATGCTAATGG
TCATTTCAGTGGTCAAGGGATGTACGAGATGCATCAGAATGAAAGAACTGTGAACGAAGAAAGCCATCCTGAGGGTTTTCAGACCCTCACTCGGATTTCTAATATTGTTT
CTGGAGATTCTGGAGGAAATGTTCATTCTGGGATGCTGCCTCCTTGGCTTGAAAACCCTGAAGATAGTGGGTTTAAGGTTGAAATAAGACCAGTGGTTGGGGGTGGTGTT
TCTTCTTTGAATGAATCTTCAAAGTCCAAGAAACTGAACCCCAAACGGGTAGGAGCTGCATGGGCAGAAAAAAGAAAGATGGAGCTGGAAATGGAGAAGAGAGGAGAAAT
TGTTCAAAGCTATGGTGACAAGAATTGGCTTCCTAATTTTGGGAGGGTATGGCAATCTGGTAGCCGTAAAGAATCTAGAAAAGAATTTGAGAAGGAGAAATCAAAGTTCC
TGATGGTTGAAAATTCACCTGAAACAAATGTCAATATTCAGCCATACCTTAGCAAACGGATGCGAAGAGATCGGGAGAATGAGGATGATACTGCCAATCACACGAGTACA
TAA
mRNA sequenceShow/hide mRNA sequence
ATAAAAATTAAAATCCTAATCGTGCAACAAGCAAAGCAACACACGGACTGTCTGCATCCCCGGTCGTCGTCTCACAGCCTTAGTTTCCCGCGAAAGCCAAATCAGTACCA
GTTCCTCCCGACAAATTACCGCCCATCCATGGAATCGCTACGAGCTTCCGAAGAGGCCGATTGAAATCCTTCTTTCAGAGTCGCCATTGTGAGACATGAACAACGATCTA
TGCGTAATATGAATAAAAAGGAGAAGAAGAGCGCATACGAATACTGCCTCGTCTGTAAACTAAACCACGACCAAGGGCAGCGCCACAAGTATTTTCCCAACCACAAGAAA
TCTCTTTCTGCTTTTCTATCTCGGTTCGAGATCAAGCTGTCCGACGTTCGCTTTTTTCTCAACACCCCTCTTCGCCTCTCGCCCGAGTATGCTTCTCACAATCGGTTCTG
GTGCATCTTTTGCGATGTTCAAGTCGATGAGAATGATAGTTCCTTCGCATGTAGCAATGCAATTAAACACCTGGCCAGTGCTGATCATCTGAAGAATTTGAAGCATTTCT
TATGGAAGTATGGTGGTGATGTGGAACGTCTGGATAATTACAGAATTTTGGACGCTGATGTAGCTAAGTGGGAGAAGAAGTGCAAAGTACAAAGCGTAGCTGCTTCATCC
AGCCTTGGAGCTACAAATGATATCCAAAATCAAGTCGAATATGGAAATTTTGATAATTTTGGGAATAATAATATCCACTCTGTTGAATCTAGTTCGTCAATTAGTGTTTT
GCCTTTACACAGTTATACAAATGAGTATCAGGTATCCAATTCATCCTATTCAGGATCCTCTGGTGTTTCAAATTTTGTCTCGTTTCCACATGATACCACTGTTTCTTTGC
ATGCTGGCTCATGTTCTGGTGCACATGTATGGAGCTCAAAGAGTTTAACATTTAGCGAGGACAACAAGCATTACCAACTGGATAGTGGTAGAACATGCAATGCTAATGGT
CATTTCAGTGGTCAAGGGATGTACGAGATGCATCAGAATGAAAGAACTGTGAACGAAGAAAGCCATCCTGAGGGTTTTCAGACCCTCACTCGGATTTCTAATATTGTTTC
TGGAGATTCTGGAGGAAATGTTCATTCTGGGATGCTGCCTCCTTGGCTTGAAAACCCTGAAGATAGTGGGTTTAAGGTTGAAATAAGACCAGTGGTTGGGGGTGGTGTTT
CTTCTTTGAATGAATCTTCAAAGTCCAAGAAACTGAACCCCAAACGGGTAGGAGCTGCATGGGCAGAAAAAAGAAAGATGGAGCTGGAAATGGAGAAGAGAGGAGAAATT
GTTCAAAGCTATGGTGACAAGAATTGGCTTCCTAATTTTGGGAGGGTATGGCAATCTGGTAGCCGTAAAGAATCTAGAAAAGAATTTGAGAAGGAGAAATCAAAGTTCCT
GATGGTTGAAAATTCACCTGAAACAAATGTCAATATTCAGCCATACCTTAGCAAACGGATGCGAAGAGATCGGGAGAATGAGGATGATACTGCCAATCACACGAGTACAT
AAGACAATGATTTTCAAAAGCTTCTCATCTAGAGAATTACTGTACATGCACAATCCTTGTCAAAGTTCGGTCTGGAGACAGAATTAGGCTTGGAGTTTGATAGAAAACTT
CAAGGGCAGACTGATATTCAACCTCTTCATTAGTTTATCGGCCATAGTTCAAATCTGTTGTAGGAAACATGACTTTAATCTTATAAAAAAAATTATCGAGTTTCAGTGGT
ACCTGAGAACAGGATTTTTTTTTAACCAACATTGTACAGAGGATGAACTTTTGTGAATTCACTCTCGGCATTCATTTTTTTAATTTAATTGCAAAACAGGGTTTGACCAC
ACTGTTAAATCATTGCTCCATGTGTTTCATTCCCTATTGCTATAGCACCAGCTCCTTATATCTGCTGCTACCAAAGTTGTAATCTGAAACAAGTTTATATTTTGTAAATA
TGTAGCTTGATGACAGCCCTCTGGTTTTCTTCAGCAATGCTGGTTGGCCAAACCGCGGTTGT
Protein sequenceShow/hide protein sequence
MRNMNKKEKKSAYEYCLVCKLNHDQGQRHKYFPNHKKSLSAFLSRFEIKLSDVRFFLNTPLRLSPEYASHNRFWCIFCDVQVDENDSSFACSNAIKHLASADHLKNLKHF
LWKYGGDVERLDNYRILDADVAKWEKKCKVQSVAASSSLGATNDIQNQVEYGNFDNFGNNNIHSVESSSSISVLPLHSYTNEYQVSNSSYSGSSGVSNFVSFPHDTTVSL
HAGSCSGAHVWSSKSLTFSEDNKHYQLDSGRTCNANGHFSGQGMYEMHQNERTVNEESHPEGFQTLTRISNIVSGDSGGNVHSGMLPPWLENPEDSGFKVEIRPVVGGGV
SSLNESSKSKKLNPKRVGAAWAEKRKMELEMEKRGEIVQSYGDKNWLPNFGRVWQSGSRKESRKEFEKEKSKFLMVENSPETNVNIQPYLSKRMRRDRENEDDTANHTST