; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040231 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040231
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF1262)
Genome locationchr13:3055773..3061660
RNA-Seq ExpressionLag0040231
SyntenyLag0040231
Gene Ontology termsGO:0006886 - intracellular protein transport (biological process)
GO:0008152 - metabolic process (biological process)
GO:0008422 - beta-glucosidase activity (molecular function)
GO:0031267 - small GTPase binding (molecular function)
GO:0052861 - glucan endo-1,3-beta-glucanase activity, C-3 substituted reducing group (molecular function)
GO:0052862 - glucan endo-1,4-beta-glucanase activity, C-3 substituted reducing group (molecular function)
InterPro domainsIPR001494 - Importin-beta, N-terminal domain
IPR010683 - Protein of unknown function DUF1262
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574010.1 hypothetical protein SDJN03_27897, partial [Cucurbita argyrosperma subsp. sororia]6.3e-16576.58Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI
        MYVTRPLSLYR SPSSLS APPEGP+SGILVIQDEE AE+RWCCGLFK KE V E+PFPQNKILRLTHAA+AGE EYS+SV AVLIPVLN PLSSNQYYI
Subjt:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI

Query:  INSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALGLD
        INS G+R GLACTSSKEE+ + R CCYSVPDPPPQLFDPKN YQQFQISDYIYCGG NG+I+KSMAPDGVPP+RLSRKG R Y  Q L NFEPTEALGL+
Subjt:  INSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALGLD

Query:  PSLRARLPEL-NSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSRNV-VDGVVWFG
        PSLR RLPE+ +S+PVVVGKWYCPFIFIREG+V  QMS+SPYYEMTL++ W EIFG E ++ +G NGVDVDV+V+REV SVAGG    R    DG VWF 
Subjt:  PSLRARLPEL-NSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSRNV-VDGVVWFG

Query:  PSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ
         + VGLS A+VER+KWEE R GF WVEE +EKK++V+RREE K +G W+RFGCYVL+ERFVLKRMDGS+VL WEFRHTHQ
Subjt:  PSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ

XP_004135417.1 uncharacterized protein LOC101216291 [Cucumis sativus]5.0e-14669.44Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQD-EEAAE---SRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSN
        MYVTRPLSLYR SP S+S+ PPEGP+SGILVIQD EE AE   SRW CGLFK KE V   PFPQNKIL+LTH+A+AGE+EYS+SVYAV+IPVLNQPLSSN
Subjt:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQD-EEAAE---SRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSN

Query:  QYYIINSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEA
        QYYIIN+RG+R GLACTSSK ++ +S  CCY+VPDPPPQLFDPKN YQQFQISDY+YCGG +GFI  S+A DGV P RLSR GWR YI    +  EPT A
Subjt:  QYYIINSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEA

Query:  LGLDPSLRARLPELN-------SEPVVVGKWYCPFIFIREGE--VGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGV---DVDVFVDREVVSVAGGT
         GL+  LRARLP+LN       S+PV VGKWY PFIFIR+G   VG QM++SPYYE+TL Q WVEIFG E       NGV   +VDVFV+REVVS   G 
Subjt:  LGLDPSLRARLPELN-------SEPVVVGKWYCPFIFIREGE--VGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGV---DVDVFVDREVVSVAGGT

Query:  PTSRNVVDGVVWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVK-VKRREEFKKMG-QWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ
         +S+NVVDG+VWF P KVGLSL VVERMKWEE R GF+WV+E +EKKV+ VK R + K+MG +W RFGCYVLVERFV+KRMDGSLVL WEFRHTHQ
Subjt:  PTSRNVVDGVVWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVK-VKRREEFKKMG-QWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ

XP_022149112.1 uncharacterized protein LOC111017603 [Momordica charantia]7.7e-14771.09Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQD-EEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYY
        MYVTRPLSLYR SPS LS  PPEGP+SGILV +D EE AESRW  G+FK K+ V   P PQN+ILRLTHAADAGEYEYSDS+YA+L+PVLNQPLSSNQYY
Subjt:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQD-EEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYY

Query:  IINSRGSRAGLACTSSKEEDNTSRGC--CYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFE-PTEA
        +I+SRG+  GLACTSSK EDNTSR     Y + D P QL DPKNTYQQFQIS+YIYCG  NGFISKS+APDGVPPE L RKGWR Y +   NN   PTEA
Subjt:  IINSRGSRAGLACTSSKEEDNTSRGC--CYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFE-PTEA

Query:  LGLDPSLRARLPEL--NSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSRNVVDGV
        LGLD +LRARLP+L     PVVVGKWYCPFIF+R+G V  QMS+SPYYEMTLQQ W EIFG       G+  VD DV V+REV+S+AG     RN  DGV
Subjt:  LGLDPSLRARLPEL--NSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSRNVVDGV

Query:  VWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ
        +WFG S VGLSLA+VER+KWEE+RAGFE+ +E ++K VKVKRREEFK+ G+W+RFGCYVLVERFVLKRMDGSLVL WEFRHTHQ
Subjt:  VWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ

XP_022945417.1 uncharacterized protein LOC111449654 [Cucurbita moschata]9.0e-16476.58Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI
        MYVTRPLSLYR SPSSLS APPEGP+SGILVIQDEE AE+RWCCGLFK KE V E+PFPQNKILRLTHAA+AGE EYS+SV AVLIPVLN PLSSNQYYI
Subjt:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI

Query:  INSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALGLD
        INS G+R GLACTSSKEE+ + R CCYSV DPPPQLFDPKN YQQFQISDYIYCGG NG+I+KSMAPDGVPP+RLSRKG R Y  Q L NFEPTEALGL+
Subjt:  INSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALGLD

Query:  PSLRARLPEL-NSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSRNV-VDGVVWFG
        PSLR RLPE+ +S+PVVVGKWYCPFIFIREG+V  QMS+SPYYEMTL++ W EIFG E ++ +G NGVDVDV+V+REV SVAGG    R    DG VWF 
Subjt:  PSLRARLPEL-NSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSRNV-VDGVVWFG

Query:  PSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ
         + VGLS A+VER+KWEE R GF WVEE +EKK++V+RREE K +G W+RFGCYVLVERFVLKRMDGS+VL WEFRHTHQ
Subjt:  PSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ

XP_038892872.1 uncharacterized protein LOC120081783 [Benincasa hispida]1.7e-14969.42Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDE----EAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSN
        MYVTRPLSLYR+SPSS+S+ PPEGP+SGILVIQD+    E   S+W CG+FK KE V  +PFPQNKILRLTH+ +AGE+EYS+SVYAVLIPVLN+PLSSN
Subjt:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDE----EAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSN

Query:  QYYIINSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNF-EPTE
        QYYIIN+RG+R GLACT+SKE++ +S  CCY+VPDPPPQ+FDPKN YQQFQISDYIYCGG +GF+SKS+APDGVPP RLSR GW+ Y  Q LNNF EPT+
Subjt:  QYYIINSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNF-EPTE

Query:  ALGLDPSLRARLPELN-------SEPVVVGKWYCPFIFIREGE--VGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPT
        ALGL+ SLRA LP+LN       S+ VVVGKWYCPFIFIREG   VG QM++S YYE+TL Q WVEIF  E       N   VD FV+REVVS+AG   T
Subjt:  ALGLDPSLRARLPELN-------SEPVVVGKWYCPFIFIREGE--VGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPT

Query:  ---SRNVVDGVVWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVK-VKRREEFK---KMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ
           SRNV DG+VWF P KVGLSL +VERMKWE++R GF WVEE +EKKV+ VK +EE K   K  +WKRFGCYVLVERFV+KRMDGSLVL WEFRHTHQ
Subjt:  ---SRNVVDGVVWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVK-VKRREEFK---KMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ

TrEMBL top hitse value%identityAlignment
A0A0A0KT31 Uncharacterized protein2.4e-14669.44Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQD-EEAAE---SRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSN
        MYVTRPLSLYR SP S+S+ PPEGP+SGILVIQD EE AE   SRW CGLFK KE V   PFPQNKIL+LTH+A+AGE+EYS+SVYAV+IPVLNQPLSSN
Subjt:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQD-EEAAE---SRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSN

Query:  QYYIINSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEA
        QYYIIN+RG+R GLACTSSK ++ +S  CCY+VPDPPPQLFDPKN YQQFQISDY+YCGG +GFI  S+A DGV P RLSR GWR YI    +  EPT A
Subjt:  QYYIINSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEA

Query:  LGLDPSLRARLPELN-------SEPVVVGKWYCPFIFIREGE--VGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGV---DVDVFVDREVVSVAGGT
         GL+  LRARLP+LN       S+PV VGKWY PFIFIR+G   VG QM++SPYYE+TL Q WVEIFG E       NGV   +VDVFV+REVVS   G 
Subjt:  LGLDPSLRARLPELN-------SEPVVVGKWYCPFIFIREGE--VGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGV---DVDVFVDREVVSVAGGT

Query:  PTSRNVVDGVVWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVK-VKRREEFKKMG-QWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ
         +S+NVVDG+VWF P KVGLSL VVERMKWEE R GF+WV+E +EKKV+ VK R + K+MG +W RFGCYVLVERFV+KRMDGSLVL WEFRHTHQ
Subjt:  PTSRNVVDGVVWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVK-VKRREEFKKMG-QWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ

A0A1S3BER1 uncharacterized protein LOC1034892238.4e-12361.24Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI
        MYVTRPLS+ R SPS+LS+APPEGP+SGILVI+D EAAES+W  G+ K  E V   PFPQNK + L+H    G     + +YA+LIPVLNQPLSSNQYYI
Subjt:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI

Query:  INSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEP-TEALGL
        I S GS  GLA   SKEE N SR     V D PPQ FDP N YQ+F+IS+ +Y G  NGF  KS+A +GV P  ++ K WR Y  +TL  F+P TEALGL
Subjt:  INSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEP-TEALGL

Query:  DPSLRARLPEL--------NSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSRNVV
        D SLRARLP L        +S  VVVGKWYCPFIF+REG+V  Q+ +SPYYEM LQQ WVE+FG    N  G  GV++DV V++EVVSV G      N+ 
Subjt:  DPSLRARLPEL--------NSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSRNVV

Query:  DGVVWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ
        DG  WFG S+VGLS+A+VER++WEE+R GFEWV E  EK VKV+RREEF  +G W+RFGCYVLVERF LKRMDGSLVL+WEFRHTHQ
Subjt:  DGVVWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ

A0A6J1D6Y4 uncharacterized protein LOC1110176033.7e-14771.09Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQD-EEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYY
        MYVTRPLSLYR SPS LS  PPEGP+SGILV +D EE AESRW  G+FK K+ V   P PQN+ILRLTHAADAGEYEYSDS+YA+L+PVLNQPLSSNQYY
Subjt:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQD-EEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYY

Query:  IINSRGSRAGLACTSSKEEDNTSRGC--CYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFE-PTEA
        +I+SRG+  GLACTSSK EDNTSR     Y + D P QL DPKNTYQQFQIS+YIYCG  NGFISKS+APDGVPPE L RKGWR Y +   NN   PTEA
Subjt:  IINSRGSRAGLACTSSKEEDNTSRGC--CYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFE-PTEA

Query:  LGLDPSLRARLPEL--NSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSRNVVDGV
        LGLD +LRARLP+L     PVVVGKWYCPFIF+R+G V  QMS+SPYYEMTLQQ W EIFG       G+  VD DV V+REV+S+AG     RN  DGV
Subjt:  LGLDPSLRARLPEL--NSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSRNVVDGV

Query:  VWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ
        +WFG S VGLSLA+VER+KWEE+RAGFE+ +E ++K VKVKRREEFK+ G+W+RFGCYVLVERFVLKRMDGSLVL WEFRHTHQ
Subjt:  VWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ

A0A6J1D7B1 uncharacterized protein LOC1110175847.5e-13261.79Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPP----EGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSN
        MYVTRPLS+YR   ++ +   P    EGP++G+LVI+D EAAESRW  GL K K  V   PFPQNKI+ L +  ++GE++++D  YA+LIPV+N+PLSSN
Subjt:  MYVTRPLSLYRKSPSSLSLAPP----EGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSN

Query:  QYYIINSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLN-NFEPTE
        +YY+I SRG   GLACTSSKE+D TS  CC+ +PD PPQLFDP N YQQFQIS+Y+ C G  GF++ S+APDGVPP  L R+GWR Y +   N N E T+
Subjt:  QYYIINSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLN-NFEPTE

Query:  ALGLDPSLRARLPELN-------SEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSR
        ALGLD +LRA LP LN       S+PVVVGKWYCPFIF+R+GEVG Q+S+SPYYEMTLQQ W EIFG       G+ GVD DV V++EV+ +AG     R
Subjt:  ALGLDPSLRARLPELN-------SEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSR

Query:  NVVDGVVWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ
         V DGVVWFG   VGLSLA+VER+KWEE+RAGFE+ +E ++K VKVKRREE+  +G+WKRFGCYVL+ERFVLKRMDGSLVL WEF+HTHQ
Subjt:  NVVDGVVWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ

A0A6J1G0V4 uncharacterized protein LOC1114496544.4e-16476.58Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI
        MYVTRPLSLYR SPSSLS APPEGP+SGILVIQDEE AE+RWCCGLFK KE V E+PFPQNKILRLTHAA+AGE EYS+SV AVLIPVLN PLSSNQYYI
Subjt:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI

Query:  INSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALGLD
        INS G+R GLACTSSKEE+ + R CCYSV DPPPQLFDPKN YQQFQISDYIYCGG NG+I+KSMAPDGVPP+RLSRKG R Y  Q L NFEPTEALGL+
Subjt:  INSRGSRAGLACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALGLD

Query:  PSLRARLPEL-NSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSRNV-VDGVVWFG
        PSLR RLPE+ +S+PVVVGKWYCPFIFIREG+V  QMS+SPYYEMTL++ W EIFG E ++ +G NGVDVDV+V+REV SVAGG    R    DG VWF 
Subjt:  PSLRARLPEL-NSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSRNV-VDGVVWFG

Query:  PSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ
         + VGLS A+VER+KWEE R GF WVEE +EKK++V+RREE K +G W+RFGCYVLVERFVLKRMDGS+VL WEFRHTHQ
Subjt:  PSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13480.1 Protein of unknown function (DUF1262)4.9e-7543.93Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI
        MYVTR LS Y+K PS L L PPEGP+SGI+VIQDEE+  +  CC     + R+  +PFPQN  L   + +  GE++ +     V IPVL+QPLSSN YY+
Subjt:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI

Query:  INSRGSRAGLACTSSKEEDNTSRGCCYS-VPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALGL
        +  RG   G A  S+ EE+  S   C+S +PD  PQ  DP + YQQF+I  +     +  + + S+A DGVPP  L RK W      + +     +A G+
Subjt:  INSRGSRAGLACTSSKEEDNTSRGCCYS-VPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALGL

Query:  DPSLRARLPELNS--EPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAG-GTPTSRNVVDGVVW
        +  LR    EL S  + + +GKWY PFIF+ EG+V  QM+ S +Y +TLQQ+W E+F  E    +    V VDV V+ E V + G  T    +  DGVVW
Subjt:  DPSLRARLPELNS--EPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAG-GTPTSRNVVDGVVW

Query:  FG-------PSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFK-KMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRH
        F          K+GL   VVERMKWEE+R  F W+ E  E+   +KR E F+     WK + CYVL+E F L RMDGSLVL +EFRH
Subjt:  FG-------PSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFK-KMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRH

AT1G13500.1 Protein of unknown function (DUF1262)7.6e-7643.73Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI
        MYVT+ LS Y+++PS L+L P EGP+SG+LVIQDEE+  +  CC +      +N +PFPQN   R+      G   Y D V    IPVL+QP SSN YY+
Subjt:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI

Query:  INSRGSRAGLACTSSKEEDNTSRGCCYS-VPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALGL
        I   G   G AC S+KE D  S   C++ +P+  P+  DP +  QQF+I  +        F + S+A DG+PP+ L+RKGW       + +    E  GL
Subjt:  INSRGSRAGLACTSSKEEDNTSRGCCYS-VPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALGL

Query:  --------DPSLRARLPELNSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAG---GTPTSR
                D  LR  LP+L++  VVVGKWY PF+F++EG+   QM  S YY MTLQQ++ E+F  E      +  V VDV V+ EVV + G      T  
Subjt:  --------DPSLRARLPELNSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAG---GTPTSR

Query:  NVVDGVVWF---GPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMG-QWKRFGCYVLVERFVLKRMDGSLVLAWEFRH
           DGVVWF   G  K+GL   V+ERMKWEE+R  F W+ +  E++  +KR E F+  G  WK + CYVLVE F LKR DGSLVL +EF+H
Subjt:  NVVDGVVWF---GPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMG-QWKRFGCYVLVERFVLKRMDGSLVLAWEFRH

AT1G13520.1 Protein of unknown function (DUF1262)7.1e-7442.23Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI
        MYVTR LS Y+++ S L+ + PEGP+SG+LVIQDEE+  +  CC        +  +PFPQN  L +T+    G    S     + IPVL+QP  SN+YY+
Subjt:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI

Query:  INSRGSRAGLACTSSKEEDNTSRGCCYS-VPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALGL
        I  RG ++G A  S+KEED      C+S VP+  PQ  DP + YQQF++       G   + + S+AP+G+PPE L RK W      + +     +A G+
Subjt:  INSRGSRAGLACTSSKEEDNTSRGCCYS-VPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALGL

Query:  DPSLRARLPELNSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNG-VDVDVFVDREVVSVAGGTPTSRNVV-DGVVWF
        + +LR++LP   +  VVVGKWY PFIF++E +   Q+ SSPYY MTL+Q+W E++     N   + G V VDV V+ +VV + G    SR     G VWF
Subjt:  DPSLRARLPELNSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNG-VDVDVFVDREVVSVAGGTPTSRNVV-DGVVWF

Query:  G-------PSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFK-KMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRH
                  K+GL   VVERMKWEE+R  F W+   +  +  +KR E F+     WK + C VL+E F LKRMDGSLVL +EF H
Subjt:  G-------PSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFK-KMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRH

AT1G13530.1 Protein of unknown function (DUF1262)2.0e-7644.79Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI
        MYVT+ LS Y+++PS L+  P EGP+SG+LVIQDEE+  +  CC        +N +PFPQN  + + +    G+           IPVL+QP SSN YY+
Subjt:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYI

Query:  INSRGSRAGLACTSSKEEDNTSRGCCYS-VPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALG-
        I  RG  AG AC S+KEED  S   C++ V +  P+L DP + YQQF+I  +        F + S+A DG+PP  L RKGW      + +     +A G 
Subjt:  INSRGSRAGLACTSSKEEDNTSRGCCYS-VPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALG-

Query:  LDPSLRARLPELNSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAG---GTPTSRNVVDGVV
        +D  LR  LP+   + VVVGKWY PF+F++EG+   QM  S YY MTL Q++ E+F  E  + K    V VDV V+ EVV + G   G  T     DGVV
Subjt:  LDPSLRARLPELNSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAG---GTPTSRNVVDGVV

Query:  WFGPS---KVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMG-QWKRFGCYVLVERFVLKRMDGSLVLAWEFRH
        WFG S   K+G+   V+ERMKWEE+R  F W ++  E K  +KR E+F+  G QWK + CYVLVE F LK+ DGSLVL +EFRH
Subjt:  WFGPS---KVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMG-QWKRFGCYVLVERFVLKRMDGSLVLAWEFRH

AT1G13540.1 Protein of unknown function (DUF1262)3.6e-7043.34Show/hide
Query:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCC--GLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQY
        MY+TR  S YR++P+ L+  PPEGP+SGILVIQD+ +      C      +   ++ +P PQN  L +T     G+    D V  V IPVL++PLSSN Y
Subjt:  MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCC--GLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQY

Query:  YIINSRGSRAGLACTSSKEEDNTSRGCCYS---VPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTE
        Y I   G  +G A  S+KE+D  S  CC+    VP+  P+  DP + YQQF+I        +  + + S+APDGVPP  L +K W     +  +     +
Subjt:  YIINSRGSRAGLACTSSKEEDNTSRGCCYS---VPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTE

Query:  ALGLDPSLRARLPELNSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQN-GVDVDVFVDREVVSVAGGTPTSRNVVDGV
        A GL   LR  LP L     VVGKWY PFIF++E EV  Q+ +S YY MTL+Q+W E+F  E  N K +N  V VDV V+ EVV + G         +G 
Subjt:  ALGLDPSLRARLPELNSEPVVVGKWYCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQN-GVDVDVFVDREVVSVAGGTPTSRNVVDGV

Query:  VWF--GPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRH
        VWF  G  K+GL   VVERMKWEE+R G+    +  E+ + VKR E+      WK + CYVL+E FVLKRMD SLVL +EF H
Subjt:  VWF--GPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEKKVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACGTGACGAGGCCGCTTTCTCTGTACAGAAAATCTCCGTCGTCGTTGTCTCTGGCGCCGCCGGAGGGTCCGAGCTCGGGCATTCTGGTGATCCAAGACGAAGAAGC
TGCGGAGTCGAGATGGTGCTGTGGATTGTTCAAGATGAAGGAGCGTGTGAACGAAATGCCCTTCCCTCAGAACAAGATACTGCGGCTTACGCACGCGGCCGATGCAGGGG
AGTATGAGTATTCCGATTCGGTATACGCAGTGCTTATTCCAGTTTTGAATCAGCCGCTCTCTTCCAATCAATATTATATCATCAATTCCCGTGGCTCTCGAGCAGGGCTA
GCATGCACAAGTTCAAAGGAAGAAGACAACACAAGCCGTGGCTGCTGCTACAGCGTTCCCGATCCACCACCACAGCTCTTCGACCCAAAAAACACATATCAACAATTCCA
AATCAGCGATTACATATACTGTGGAGGCAACAACGGCTTCATCTCCAAGTCCATGGCTCCGGACGGGGTTCCTCCGGAACGCCTGAGCCGCAAGGGCTGGAGACCTTACA
TCCAACAAACTCTCAATAACTTCGAGCCCACCGAAGCGCTCGGCCTCGACCCATCACTCCGGGCCCGCCTCCCAGAGCTCAACTCCGAGCCAGTGGTGGTTGGGAAATGG
TACTGCCCTTTCATTTTCATTCGGGAAGGAGAGGTTGGTCCTCAGATGAGCAGCTCGCCGTACTACGAAATGACGCTTCAGCAGAAATGGGTTGAGATATTTGGACGTGA
GAGGACGAACAAGAAAGGCCAAAATGGCGTGGATGTGGATGTCTTTGTGGACAGGGAAGTGGTTTCAGTTGCCGGTGGGACGCCCACGAGCAGAAATGTTGTTGATGGTG
TGGTGTGGTTTGGGCCGTCGAAGGTTGGGCTGAGCTTGGCGGTTGTGGAGAGAATGAAATGGGAGGAGCAGAGAGCTGGATTTGAATGGGTCGAAGAATGGCAAGAAAAG
AAAGTGAAGGTGAAGAGAAGAGAGGAGTTTAAGAAGATGGGTCAGTGGAAAAGATTTGGGTGTTATGTGTTGGTTGAGAGGTTTGTGCTTAAGAGAATGGATGGAAGTTT
GGTTCTTGCTTGGGAATTTAGGCACACTCATCAACAATATGTGCTTGATGCTTCAAGCAAGCTTTTTGCAGAACCCGCCTGGCAATCAATCATCATCCTACGCGCGCTTT
TTACAGCCCACACTTGGAAGCAAGCGAGTGAACGCTGCGGCAGTCGCAGAGCCAGAGCCGACCAGAGTGAACTCCGAAACGGCGCAACAAACTCCGAAGCCTTCGCTCGG
CGCTCGAAGATGGCGAATGTCATTGATCAGGATCAACAGTGGCTTCTCAATTGCTTGTCGGCCACTCTCGATCCCAACCAGGAGGTTCGCTCATTTGCAGAGGCCTCGCT
GAATCAGGCATCTCTCCAACCAGGTTTTGGAGTTGCATTATCCAAAGTTGCTGCCAATAGGGAGCTTCCTGTCGGGTTGCGCCAGATATCCTTTGTTGTCATTGCTATTT
AG
mRNA sequenceShow/hide mRNA sequence
ATGTACGTGACGAGGCCGCTTTCTCTGTACAGAAAATCTCCGTCGTCGTTGTCTCTGGCGCCGCCGGAGGGTCCGAGCTCGGGCATTCTGGTGATCCAAGACGAAGAAGC
TGCGGAGTCGAGATGGTGCTGTGGATTGTTCAAGATGAAGGAGCGTGTGAACGAAATGCCCTTCCCTCAGAACAAGATACTGCGGCTTACGCACGCGGCCGATGCAGGGG
AGTATGAGTATTCCGATTCGGTATACGCAGTGCTTATTCCAGTTTTGAATCAGCCGCTCTCTTCCAATCAATATTATATCATCAATTCCCGTGGCTCTCGAGCAGGGCTA
GCATGCACAAGTTCAAAGGAAGAAGACAACACAAGCCGTGGCTGCTGCTACAGCGTTCCCGATCCACCACCACAGCTCTTCGACCCAAAAAACACATATCAACAATTCCA
AATCAGCGATTACATATACTGTGGAGGCAACAACGGCTTCATCTCCAAGTCCATGGCTCCGGACGGGGTTCCTCCGGAACGCCTGAGCCGCAAGGGCTGGAGACCTTACA
TCCAACAAACTCTCAATAACTTCGAGCCCACCGAAGCGCTCGGCCTCGACCCATCACTCCGGGCCCGCCTCCCAGAGCTCAACTCCGAGCCAGTGGTGGTTGGGAAATGG
TACTGCCCTTTCATTTTCATTCGGGAAGGAGAGGTTGGTCCTCAGATGAGCAGCTCGCCGTACTACGAAATGACGCTTCAGCAGAAATGGGTTGAGATATTTGGACGTGA
GAGGACGAACAAGAAAGGCCAAAATGGCGTGGATGTGGATGTCTTTGTGGACAGGGAAGTGGTTTCAGTTGCCGGTGGGACGCCCACGAGCAGAAATGTTGTTGATGGTG
TGGTGTGGTTTGGGCCGTCGAAGGTTGGGCTGAGCTTGGCGGTTGTGGAGAGAATGAAATGGGAGGAGCAGAGAGCTGGATTTGAATGGGTCGAAGAATGGCAAGAAAAG
AAAGTGAAGGTGAAGAGAAGAGAGGAGTTTAAGAAGATGGGTCAGTGGAAAAGATTTGGGTGTTATGTGTTGGTTGAGAGGTTTGTGCTTAAGAGAATGGATGGAAGTTT
GGTTCTTGCTTGGGAATTTAGGCACACTCATCAACAATATGTGCTTGATGCTTCAAGCAAGCTTTTTGCAGAACCCGCCTGGCAATCAATCATCATCCTACGCGCGCTTT
TTACAGCCCACACTTGGAAGCAAGCGAGTGAACGCTGCGGCAGTCGCAGAGCCAGAGCCGACCAGAGTGAACTCCGAAACGGCGCAACAAACTCCGAAGCCTTCGCTCGG
CGCTCGAAGATGGCGAATGTCATTGATCAGGATCAACAGTGGCTTCTCAATTGCTTGTCGGCCACTCTCGATCCCAACCAGGAGGTTCGCTCATTTGCAGAGGCCTCGCT
GAATCAGGCATCTCTCCAACCAGGTTTTGGAGTTGCATTATCCAAAGTTGCTGCCAATAGGGAGCTTCCTGTCGGGTTGCGCCAGATATCCTTTGTTGTCATTGCTATTT
AG
Protein sequenceShow/hide protein sequence
MYVTRPLSLYRKSPSSLSLAPPEGPSSGILVIQDEEAAESRWCCGLFKMKERVNEMPFPQNKILRLTHAADAGEYEYSDSVYAVLIPVLNQPLSSNQYYIINSRGSRAGL
ACTSSKEEDNTSRGCCYSVPDPPPQLFDPKNTYQQFQISDYIYCGGNNGFISKSMAPDGVPPERLSRKGWRPYIQQTLNNFEPTEALGLDPSLRARLPELNSEPVVVGKW
YCPFIFIREGEVGPQMSSSPYYEMTLQQKWVEIFGRERTNKKGQNGVDVDVFVDREVVSVAGGTPTSRNVVDGVVWFGPSKVGLSLAVVERMKWEEQRAGFEWVEEWQEK
KVKVKRREEFKKMGQWKRFGCYVLVERFVLKRMDGSLVLAWEFRHTHQQYVLDASSKLFAEPAWQSIIILRALFTAHTWKQASERCGSRRARADQSELRNGATNSEAFAR
RSKMANVIDQDQQWLLNCLSATLDPNQEVRSFAEASLNQASLQPGFGVALSKVAANRELPVGLRQISFVVIAI