; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015942 (gene) of Snake gourd v1 genome

Gene IDTan0015942
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationLG01:25000275..25005429
RNA-Seq ExpressionTan0015942
SyntenyTan0015942
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650029.1 hypothetical protein Csa_011504 [Cucumis sativus]9.1e-12555.17Show/hide
Query:  FYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFK------------TKEGDIIDCININKQPALDHPLLKNHKVQIRPS-INPYNFLKNSSKSNYG-
        F  K ++AS+S LS+EEELE+E  LKLLNK  IKT+K            TKEGDIIDC++INKQPALDHPLLKNHKVQ  PS      F K+SS++N G 
Subjt:  FYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFK------------TKEGDIIDCININKQPALDHPLLKNHKVQIRPS-INPYNFLKNSSKSNYG-

Query:  -SLMSNNN--RESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKK-NGKYYGASGRFAVYNLSVTTEQ
         +L SNNN   E CP GFVPIRRTLKKDLIRL+SLSSN  K++  + N  +   DD+F + V+FP   NVVS S +K   KYYG     +VYN+S++ +Q
Subjt:  -SLMSNNN--RESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKK-NGKYYGASGRFAVYNLSVTTEQ

Query:  SSSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNV
        SSS+N+WIVGGP D+L V++ GW VNP VNGD +TR FVYWT DGG  TGCYNM+CQGFVQ++PS     PL PTSTY G QYDY FT+IQ+        
Subjt:  SSSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNV

Query:  DQSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPAPDRKSPPLGNGYKP-KTGNYYEACFIRSISYVVDGYKVES--PPRWDHTPTFVSNR
           GNWW+L+ ++ + LGYWP EL  NL +GADQ+AWGGIA+P+ D  SP LG+G+KP + G+Y E C+IR+I  ++ G    +   P WD+T ++ SN 
Subjt:  DQSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPAPDRKSPPLGNGYKP-KTGNYYEACFIRSISYVVDGYKVES--PPRWDHTPTFVSNR

Query:  DCYDLNASETCNNDELQYCFTYGGPGGDDCEAIIF
         CYDLN +  C  D ++YCFT+GGPGG +CEA IF
Subjt:  DCYDLNASETCNNDELQYCFTYGGPGGDDCEAIIF

TYK11502.1 neprosin 2 [Cucumis melo var. makuwa]2.2e-12356.64Show/hide
Query:  VLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNS--SKSNYGSLMSNNN
        + FVCF CK N AS+ NLS+EEELE+E QLKLLNK  IKT+KTKEGDIIDC++INKQPALDHPLLKNHKVQ  PS       K    S+SN G L SNNN
Subjt:  VLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNS--SKSNYGSLMSNNN

Query:  R-ESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYD-DYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTE-QSSSSNLWI
          E CP+GFVPIRRTLK+DLIRL+SLSSN +K   E+S +P+ D   D+  + VRFP D NVVS S  K   Y+GA  R AVYN+S++ E QSSS+N+W+
Subjt:  R-ESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYD-DYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTE-QSSSSNLWI

Query:  VGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVDQSGNWWL
        VGGP ++LNV++A      AV+GDSL R FVYWT D G  TGCYNM CQGFV ++P I     + P S Y+G QYDY F+++Q I          G+WW+
Subjt:  VGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVDQSGNWWL

Query:  LLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPA-PDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDCYDLNASET
         + D QV LGYWP+ELFPNL  GA+QVAWGG A+P+    +SPPLG+G+KP  G   EACF+R+I Y+   Y + S P  D+T  +VS+  CYDL ++E 
Subjt:  LLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPA-PDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDCYDLNASET

Query:  CNNDELQYCFTYGGPGGDDCEA
        C+ D  +YCFT+GGPGG DC A
Subjt:  CNNDELQYCFTYGGPGGDDCEA

XP_022145288.1 uncharacterized protein LOC111014777 [Momordica charantia]7.5e-10349.06Show/hide
Query:  MASKASAFCLVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKS
        MASKA  + L+IVL +   CKG+ A  SNLS+EEELELE QLKLLN+  I TF+T+EGDIIDC++INKQPALDHP LK+HK+Q RPS  P+   K+SS S
Subjt:  MASKASAFCLVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKS

Query:  NYGSLMSNNNRESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTEQS
           S ++NNNR +CP G+VPIRRT+KKDLIR+RSLSS K+ +  + S +   D          FP + +VVS + KK  KYYGASG  +VYNLSV  +QS
Subjt:  NYGSLMSNNNRESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTEQS

Query:  SSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVD
        SSSN+WI+GGP    NVILAGWQVNP +NGDSLTR+FVYWT                          D P                              
Subjt:  SSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVD

Query:  QSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPAPDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDCYD
         +GNWWL + +S   +GYWP ELF +L +G +QVAWGGIAKP+P+  SPPLGNG+KP    Y +AC+ R ++YV +  K + P   ++T  ++SN  CY 
Subjt:  QSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPAPDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDCYD

Query:  LNASETCNNDELQYCFTYGGPGGDDC
        L+  ETC  +   YC T+GGPGG++C
Subjt:  LNASETCNNDELQYCFTYGGPGGDDC

XP_031738648.1 uncharacterized protein LOC105435061 [Cucumis sativus]9.5e-9846.49Show/hide
Query:  NQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKSNYGSLMSNNNRESCPVGFVPIR
        ++AS+S LS+EEELE+E  LKLLNK  IKT+KTKEGDIIDC++INKQPALDHPLLKNHKVQ                                       
Subjt:  NQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKSNYGSLMSNNNRESCPVGFVPIR

Query:  RTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKK-NGKYYGASGRFAVYNLSVTTEQSSSSNLWIVGGPYDTLNVILAG
                                                       VVS S +K   KYYG     +VYN+S++ +QSSS+N+WIVGGP D+L V++ G
Subjt:  RTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKK-NGKYYGASGRFAVYNLSVTTEQSSSSNLWIVGGPYDTLNVILAG

Query:  WQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVDQSGNWWLLLADSQVALGYWPN
        W VNP VNGD +TR FVYWT DGG  TGCYNM+CQGFVQ++PS     PL PTSTY G QYDY FT+IQ+           GNWW+L+ ++ + LGYWP 
Subjt:  WQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVDQSGNWWLLLADSQVALGYWPN

Query:  ELFPNLRNGADQVAWGGIAKPAPDRKSPPLGNGYKP-KTGNYYEACFIRSISYVVDGYKVES--PPRWDHTPTFVSNRDCYDLNASETCNNDELQYCFTY
        EL  NL +GADQ+AWGGIA+P+ D  SP LG+G+KP + G+Y E C+IR+I  ++ G    +   P WD+T ++ SN  CYDLN +  C  D ++YCFT+
Subjt:  ELFPNLRNGADQVAWGGIAKPAPDRKSPPLGNGYKP-KTGNYYEACFIRSISYVVDGYKVES--PPRWDHTPTFVSNRDCYDLNASETCNNDELQYCFTY

Query:  GGPGGDDCEAIIF
        GGPGG +CEA IF
Subjt:  GGPGGDDCEAIIF

XP_031738649.1 uncharacterized protein LOC116402744 [Cucumis sativus]7.2e-13058.14Show/hide
Query:  ASAFCLVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPS-INPYNFLKNSSKSNYG
        + A CLVIV FVCF CK N AS+ NLS+EE+LE+E QLKLLNK  IKT+KTKEGDIIDC++INKQPALDHPLLKNHKVQ  PS      F ++SS+SN G
Subjt:  ASAFCLVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPS-INPYNFLKNSSKSNYG

Query:  SLMSNNNR-ESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWD--YDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTE-Q
         L SNNN  E CPVGFVPIRRTLK+DLIRL+SLSSN +    ++S  PE D    D F + VRFP   NVVS S  K   Y+GA  R AV+N+S++   Q
Subjt:  SLMSNNNR-ESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWD--YDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTE-Q

Query:  SSSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNV
        SSS+N+W++GG  D+LNV++AGWQVNPAVNGD+L R FVYWT D GV TGCYNM CQGFV ++P+I     + P S Y+G QYDY F+++Q I       
Subjt:  SSSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNV

Query:  DQSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPA-PDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDC
           G+WW+ + D+QV LGYWPNELFPNL  GADQVAWGG A+P     +SPPLG+G+KP  G   EA F+R+I Y+   Y + S P  ++T  +VSN  C
Subjt:  DQSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPA-PDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDC

Query:  YDLNASETCNNDELQYCFTYGGPGGDDCEA
        YDL ++E C+ D  +YCFT+GGPGG  CEA
Subjt:  YDLNASETCNNDELQYCFTYGGPGGDDCEA

TrEMBL top hitse value%identityAlignment
A0A059CIH2 Uncharacterized protein6.6e-8943.97Show/hide
Query:  MASKASAFCLVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKS
        MA K  AF LV++         +++  +N+SK+++++LE QLKLLNK PIKTF T+EGDIIDCI+I+KQPA+DHPLLKNHK+Q +P +   NF K SS +
Subjt:  MASKASAFCLVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKS

Query:  NYGSLMSNNNRESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTEQS
         Y   +   +R+ CP+G VPI+R  K+DLIR RS+      +  E  + P          +V    DH  +          YGASG  +VYN+S   +Q 
Subjt:  NYGSLMSNNNRESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTEQS

Query:  SSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVD
        SS N+WI  GP D +++I+AGW+V+P +N D LTRLF YWTGD G   GCYN FCQGFVQ+   ITP++PL+P STY GP Y+    V Q I        
Subjt:  SSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVD

Query:  QSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPAPDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDCYD
         +GNWWL + D  + +GYWP ELF NLRNG+   AWGG+AK   +   PP+GNG+ P      +A + R + ++      ES   + + P  V    CY+
Subjt:  QSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPAPDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDCYD

Query:  LNASETCNNDELQYCFTYGGPGG
        L  +     D   Y FT+GGPGG
Subjt:  LNASETCNNDELQYCFTYGGPGG

A0A5A7UEV4 Uncharacterized protein4.9e-8453.76Show/hide
Query:  DDYFGEKVRFPDDHNVVSFSSKKN-GKYYGASGRFAVYNLSVTTEQSSSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNM
        DD+F + V++PD+ NVVS S KK   KYYG     +VYN+S++  QSSSSN+WIVGGP ++L V++ GW VNP VNGD +TR FVYWT DGG  TGCYNM
Subjt:  DDYFGEKVRFPDDHNVVSFSSKKN-GKYYGASGRFAVYNLSVTTEQSSSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNM

Query:  FCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVDQSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPAPDRKSPPLGN
        +CQGFVQ++PS     PL PTSTY+G QYDY FT+IQ+          +GNWW+L+ ++ + LGYWP EL  NL +GA+Q+AWGGIAKP+ D  SP LG+
Subjt:  FCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVDQSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPAPDRKSPPLGN

Query:  GYKPK-TGNYYEACFIRSISYV----VDGYKVESPPRWDHTPTFVSNRDCYDLNASETCNNDELQYCFTYGGPGGDDCE
        G+KP   G+Y E C+IR+I  +     + YK+   P WD+T ++ SN  CYDLN +  C +D ++YCFT+GGPGG +CE
Subjt:  GYKPK-TGNYYEACFIRSISYV----VDGYKVESPPRWDHTPTFVSNRDCYDLNASETCNNDELQYCFTYGGPGGDDCE

A0A5D3CJM0 Neprosin 21.1e-12356.64Show/hide
Query:  VLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNS--SKSNYGSLMSNNN
        + FVCF CK N AS+ NLS+EEELE+E QLKLLNK  IKT+KTKEGDIIDC++INKQPALDHPLLKNHKVQ  PS       K    S+SN G L SNNN
Subjt:  VLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNS--SKSNYGSLMSNNN

Query:  R-ESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYD-DYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTE-QSSSSNLWI
          E CP+GFVPIRRTLK+DLIRL+SLSSN +K   E+S +P+ D   D+  + VRFP D NVVS S  K   Y+GA  R AVYN+S++ E QSSS+N+W+
Subjt:  R-ESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYD-DYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTE-QSSSSNLWI

Query:  VGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVDQSGNWWL
        VGGP ++LNV++A      AV+GDSL R FVYWT D G  TGCYNM CQGFV ++P I     + P S Y+G QYDY F+++Q I          G+WW+
Subjt:  VGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVDQSGNWWL

Query:  LLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPA-PDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDCYDLNASET
         + D QV LGYWP+ELFPNL  GA+QVAWGG A+P+    +SPPLG+G+KP  G   EACF+R+I Y+   Y + S P  D+T  +VS+  CYDL ++E 
Subjt:  LLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPA-PDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDCYDLNASET

Query:  CNNDELQYCFTYGGPGGDDCEA
        C+ D  +YCFT+GGPGG DC A
Subjt:  CNNDELQYCFTYGGPGGDDCEA

A0A6J1CVJ6 uncharacterized protein LOC1110147773.6e-10349.06Show/hide
Query:  MASKASAFCLVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKS
        MASKA  + L+IVL +   CKG+ A  SNLS+EEELELE QLKLLN+  I TF+T+EGDIIDC++INKQPALDHP LK+HK+Q RPS  P+   K+SS S
Subjt:  MASKASAFCLVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKS

Query:  NYGSLMSNNNRESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTEQS
           S ++NNNR +CP G+VPIRRT+KKDLIR+RSLSS K+ +  + S +   D          FP + +VVS + KK  KYYGASG  +VYNLSV  +QS
Subjt:  NYGSLMSNNNRESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTEQS

Query:  SSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVD
        SSSN+WI+GGP    NVILAGWQVNP +NGDSLTR+FVYWT                          D P                              
Subjt:  SSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVD

Query:  QSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPAPDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDCYD
         +GNWWL + +S   +GYWP ELF +L +G +QVAWGGIAKP+P+  SPPLGNG+KP    Y +AC+ R ++YV +  K + P   ++T  ++SN  CY 
Subjt:  QSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPAPDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDCYD

Query:  LNASETCNNDELQYCFTYGGPGGDDC
        L+  ETC  +   YC T+GGPGG++C
Subjt:  LNASETCNNDELQYCFTYGGPGGDDC

A0A6J1CW60 uncharacterized protein LOC1110147751.1e-8846.46Show/hide
Query:  MASKASAFCLVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKS
        MASKA  + L+IVL +   CKG+ A  SNLS EEELE E QLKLLNK  I TF+T+EGDIIDC++INKQPALDHPLLKNHKVQ                 
Subjt:  MASKASAFCLVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKS

Query:  NYGSLMSNNNRESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTEQS
                                                                             V S + K+  KYYG +   +VYNLSV  +QS
Subjt:  NYGSLMSNNNRESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTEQS

Query:  SSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVD
        SSSN+WI+GGP +  NVIL GWQVNP +NGDS TR+FVYWT DGG++TG YNMFC+ F+Q +PS  P+ PL P+STY+G QYDY FTV Q          
Subjt:  SSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVD

Query:  QSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPAPDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVE
         +G+WWL ++DSQ  +GYWP ELF +L +GA+QVAWGGIAKP+P+  SPPLGNG+KP  G + +AC+ R+++Y+ +  + E
Subjt:  QSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGGIAKPAPDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)1.2e-5833.26Show/hide
Query:  ASKASAFCLVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKSN
        A  A  F + + L+  F    + A+ S +SK ++ E++  L  LNK  +K+ ++ +GD+IDC+ I+KQPA DHP LK+HK+Q++P+ +P     +    N
Subjt:  ASKASAFCLVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKSN

Query:  YGSLMSNNNRE-----------SCPVGFVPIRRTLKKDLIRLRSLS--SNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRF
          S   +N +E            C  G +P+RRT + D++R  S+     K++         E D  +  G +          + +  +  KYYGA    
Subjt:  YGSLMSNNNRE-----------SCPVGFVPIRRTLKKDLIRLRSLS--SNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRF

Query:  AVYNLSVTTE-QSSSSNLWIVGGPY-DTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLF
         V+   +  + + S S +W++GG +   LN I AGWQV+P + GD+ TRLF YWT D    TGCYN+ C GF+QI+  I     +SP S YR  QYD   
Subjt:  AVYNLSVTTE-QSSSSNLWIVGGPY-DTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLF

Query:  TVIQVINFKVLNVDQSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGG-IAKPAPD--RKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESP
             I+  +    + G+WW+   +  V LGYWP+ LF  L   A  + WGG +     D    S  +G+G  P+ G + +A + R+I  VVDG      
Subjt:  TVIQVINFKVLNVDQSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGG-IAKPAPD--RKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESP

Query:  PRWDHTPTFVSNRDCYDLNASETCNNDELQYCFTYGGPG
        P+     TF    +CYD+   +T +ND+  + F YGGPG
Subjt:  PRWDHTPTFVSNRDCYDLNASETCNNDELQYCFTYGGPG

AT2G44240.1 Protein of Unknown Function (DUF239)2.2e-6034.97Show/hide
Query:  LVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINP-----YNFLKNSSKS----
        +V  L +     G +  H       E++++  LK LNK  +K+ K+++GDIIDC+ I  QPA DHPLLKNH +Q++PS  P       + K  +K+    
Subjt:  LVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINP-----YNFLKNSSKS----

Query:  -NYGSLMSNNNRESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTT-E
          YG          CP   +PIRRT K++++R +SL S  +K+              Y  E    P+ H+  +F   +NGK+YG      V+   V T  
Subjt:  -NYGSLMSNNNRESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTT-E

Query:  QSSSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLN
        + S S  WIV G   + N I AGWQV P + G++  RLFVYWT DG   TGCYN+ C GFVQ     T        S Y G Q         V+N  +  
Subjt:  QSSSSNLWIVGGPYDTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLN

Query:  VDQSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGG-IAKPAPDR-KSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNR
          ++GNWWL + D+ V +GYWP  LF +L +GA +V WGG I  P  DR  +  +G+G+  + G   +A ++++I  +VDG      P+  +  ++  NR
Subjt:  VDQSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGG-IAKPAPDR-KSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNR

Query:  DCYDL---NASETCNNDELQYCFTYGGPG
        +CY +   NA  +         F YGGPG
Subjt:  DCYDL---NASETCNNDELQYCFTYGGPG

AT3G13510.1 Protein of Unknown Function (DUF239)6.4e-6034.44Show/hide
Query:  FVCFY--CKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKSNYGSLMSNNNRE
        FVC +     + A+ S  S  ++ E++  L  LNK P+KT ++ +GDIIDCI I+KQPA DHP LK+HK+Q+RPS +P     ++  S        +  +
Subjt:  FVCFY--CKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKSNYGSLMSNNNRE

Query:  ------SCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSV-TTEQSSSSNL
               C  G +P+RRT + D++R  S+     K K  +   P+    D   +      + +  + +  +  KYYGA     V+   +  T + S S +
Subjt:  ------SCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSV-TTEQSSSSNL

Query:  WIVGGPY-DTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVDQSGN
        W++GG +   LN I AGWQV+P + GD+ TRLF YWT D    TGCYN+ C GF+QI+  I     +SP S YR  QYD        I+  +    + G+
Subjt:  WIVGGPY-DTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVDQSGN

Query:  WWLLLADSQVALGYWPNELFPNLRNGADQVAWGG---IAKPAPDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDCYDL
        WW+   +  V LGYWP+ LF  L   A  + WGG    ++         +G+G+ P+ G + +A + R+I  VVDG      P+     TF    +CYD+
Subjt:  WWLLLADSQVALGYWPNELFPNLRNGADQVAWGG---IAKPAPDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDCYDL

Query:  NASETCNNDELQYCFTYGGPG
           +T +ND+  + F YGGPG
Subjt:  NASETCNNDELQYCFTYGGPG

AT5G56530.1 Protein of Unknown Function (DUF239)1.1e-5934.03Show/hide
Query:  SKASAFCLVIVLFVCFYCKGNQASHSNLS-KEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKSN
        SK   F   +V F CF+   +      LS   +  E+   L  LNK  +K+ ++ +GDIIDC++I+KQPA DHP LK+HK+Q+ PS  P +    S  S 
Subjt:  SKASAFCLVIVLFVCFYCKGNQASHSNLS-KEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKSN

Query:  YGSLMSN------NNRESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSV
              N      +    C  G +P+RRT K+D++R  S+    +K           D D      +      + +++   + GK+YGA     V+   V
Subjt:  YGSLMSN------NNRESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSV

Query:  -TTEQSSSSNLWIVGGPY-DTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVIN
         ++ + S S LWI+GG +   LN I AGWQV+P + GD+ TRLF YWT D    TGCYN+ C GF+QI+  I     +SP S +  PQYD        I+
Subjt:  -TTEQSSSSNLWIVGGPY-DTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVIN

Query:  FKVLNVDQSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGG-IAKPAPD--RKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTP
          +    + G+WW+   D  V LGYWP+ LF  L + A  V WGG +     D    +  +G+G  P  G + +A + R+I  V     ++ P   +   
Subjt:  FKVLNVDQSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGG-IAKPAPD--RKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTP

Query:  TFVSNRDCYDLNASETCNNDELQYCFTYGGPG
        TF    +CYD+   E   ND+  + F YGGPG
Subjt:  TFVSNRDCYDLNASETCNNDELQYCFTYGGPG

AT5G56530.2 Protein of Unknown Function (DUF239)1.1e-5934.03Show/hide
Query:  SKASAFCLVIVLFVCFYCKGNQASHSNLS-KEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKSN
        SK   F   +V F CF+   +      LS   +  E+   L  LNK  +K+ ++ +GDIIDC++I+KQPA DHP LK+HK+Q+ PS  P +    S  S 
Subjt:  SKASAFCLVIVLFVCFYCKGNQASHSNLS-KEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKSN

Query:  YGSLMSN------NNRESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSV
              N      +    C  G +P+RRT K+D++R  S+    +K           D D      +      + +++   + GK+YGA     V+   V
Subjt:  YGSLMSN------NNRESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSV

Query:  -TTEQSSSSNLWIVGGPY-DTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVIN
         ++ + S S LWI+GG +   LN I AGWQV+P + GD+ TRLF YWT D    TGCYN+ C GF+QI+  I     +SP S +  PQYD        I+
Subjt:  -TTEQSSSSNLWIVGGPY-DTLNVILAGWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVIN

Query:  FKVLNVDQSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGG-IAKPAPD--RKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTP
          +    + G+WW+   D  V LGYWP+ LF  L + A  V WGG +     D    +  +G+G  P  G + +A + R+I  V     ++ P   +   
Subjt:  FKVLNVDQSGNWWLLLADSQVALGYWPNELFPNLRNGADQVAWGG-IAKPAPD--RKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTP

Query:  TFVSNRDCYDLNASETCNNDELQYCFTYGGPG
        TF    +CYD+   E   ND+  + F YGGPG
Subjt:  TFVSNRDCYDLNASETCNNDELQYCFTYGGPG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCAAAGCAAGTGCTTTTTGTTTAGTGATAGTGCTTTTTGTTTGCTTCTATTGCAAAGGCAATCAAGCCTCTCATTCAAACCTCTCAAAAGAAGAAGAATTAGA
GTTAGAAGCGCAGCTCAAACTTCTCAACAAGGCACCCATCAAAACATTTAAGACTAAGGAGGGTGATATCATTGATTGTATCAACATCAATAAACAACCCGCTCTTGATC
ATCCTTTACTGAAGAATCACAAAGTTCAGATTCGACCGAGTATAAATCCATATAATTTTCTCAAGAACTCATCTAAATCAAACTATGGCTCACTCATGAGTAACAACAAC
AGAGAAAGTTGTCCGGTTGGATTTGTTCCTATTCGAAGAACATTAAAAAAAGATTTAATTAGATTAAGATCTCTATCATCGAATAAGCAAAAATCAAAATTTGAAGCATC
AAATCAGCCCGAATGGGATTATGATGATTATTTTGGTGAGAAGGTCAGGTTTCCTGATGATCACAATGTGGTTTCTTTTTCTTCGAAAAAAAATGGGAAATATTATGGAG
CTTCTGGACGTTTTGCAGTATACAATTTGAGTGTGACTACTGAACAATCTTCTAGTTCTAACCTATGGATAGTTGGTGGCCCTTATGATACTCTAAATGTAATACTTGCA
GGCTGGCAGGTGAATCCAGCCGTAAATGGTGATAGTCTCACTAGGTTGTTCGTGTATTGGACGGGTGATGGAGGTGTTAACACGGGATGTTACAACATGTTCTGTCAAGG
TTTCGTACAAATACATCCAAGTATTACTCCAGACTTTCCTCTTTCTCCGACCTCTACCTATAGAGGACCACAATATGATTATCTATTTACTGTCATTCAGGTAATAAATT
TCAAGGTTTTAAATGTTGATCAATCAGGGAATTGGTGGCTTCTACTAGCAGACAGTCAAGTGGCATTGGGGTATTGGCCAAATGAGTTGTTTCCAAATCTAAGAAATGGG
GCAGATCAAGTTGCATGGGGAGGAATTGCAAAGCCTGCACCAGATAGAAAGAGCCCTCCATTGGGAAATGGGTACAAGCCAAAAACTGGTAATTATTATGAAGCTTGCTT
CATTAGGAGCATAAGTTACGTTGTGGATGGCTATAAAGTTGAATCACCACCCAGATGGGATCATACACCGACTTTTGTGAGTAATAGAGACTGTTATGATCTTAATGCTA
GTGAGACTTGTAATAATGATGAGTTGCAATATTGCTTCACCTATGGAGGACCAGGTGGAGATGATTGTGAAGCCATCATTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCAAAGCAAGTGCTTTTTGTTTAGTGATAGTGCTTTTTGTTTGCTTCTATTGCAAAGGCAATCAAGCCTCTCATTCAAACCTCTCAAAAGAAGAAGAATTAGA
GTTAGAAGCGCAGCTCAAACTTCTCAACAAGGCACCCATCAAAACATTTAAGACTAAGGAGGGTGATATCATTGATTGTATCAACATCAATAAACAACCCGCTCTTGATC
ATCCTTTACTGAAGAATCACAAAGTTCAGATTCGACCGAGTATAAATCCATATAATTTTCTCAAGAACTCATCTAAATCAAACTATGGCTCACTCATGAGTAACAACAAC
AGAGAAAGTTGTCCGGTTGGATTTGTTCCTATTCGAAGAACATTAAAAAAAGATTTAATTAGATTAAGATCTCTATCATCGAATAAGCAAAAATCAAAATTTGAAGCATC
AAATCAGCCCGAATGGGATTATGATGATTATTTTGGTGAGAAGGTCAGGTTTCCTGATGATCACAATGTGGTTTCTTTTTCTTCGAAAAAAAATGGGAAATATTATGGAG
CTTCTGGACGTTTTGCAGTATACAATTTGAGTGTGACTACTGAACAATCTTCTAGTTCTAACCTATGGATAGTTGGTGGCCCTTATGATACTCTAAATGTAATACTTGCA
GGCTGGCAGGTGAATCCAGCCGTAAATGGTGATAGTCTCACTAGGTTGTTCGTGTATTGGACGGGTGATGGAGGTGTTAACACGGGATGTTACAACATGTTCTGTCAAGG
TTTCGTACAAATACATCCAAGTATTACTCCAGACTTTCCTCTTTCTCCGACCTCTACCTATAGAGGACCACAATATGATTATCTATTTACTGTCATTCAGGTAATAAATT
TCAAGGTTTTAAATGTTGATCAATCAGGGAATTGGTGGCTTCTACTAGCAGACAGTCAAGTGGCATTGGGGTATTGGCCAAATGAGTTGTTTCCAAATCTAAGAAATGGG
GCAGATCAAGTTGCATGGGGAGGAATTGCAAAGCCTGCACCAGATAGAAAGAGCCCTCCATTGGGAAATGGGTACAAGCCAAAAACTGGTAATTATTATGAAGCTTGCTT
CATTAGGAGCATAAGTTACGTTGTGGATGGCTATAAAGTTGAATCACCACCCAGATGGGATCATACACCGACTTTTGTGAGTAATAGAGACTGTTATGATCTTAATGCTA
GTGAGACTTGTAATAATGATGAGTTGCAATATTGCTTCACCTATGGAGGACCAGGTGGAGATGATTGTGAAGCCATCATTTTTTAG
Protein sequenceShow/hide protein sequence
MASKASAFCLVIVLFVCFYCKGNQASHSNLSKEEELELEAQLKLLNKAPIKTFKTKEGDIIDCININKQPALDHPLLKNHKVQIRPSINPYNFLKNSSKSNYGSLMSNNN
RESCPVGFVPIRRTLKKDLIRLRSLSSNKQKSKFEASNQPEWDYDDYFGEKVRFPDDHNVVSFSSKKNGKYYGASGRFAVYNLSVTTEQSSSSNLWIVGGPYDTLNVILA
GWQVNPAVNGDSLTRLFVYWTGDGGVNTGCYNMFCQGFVQIHPSITPDFPLSPTSTYRGPQYDYLFTVIQVINFKVLNVDQSGNWWLLLADSQVALGYWPNELFPNLRNG
ADQVAWGGIAKPAPDRKSPPLGNGYKPKTGNYYEACFIRSISYVVDGYKVESPPRWDHTPTFVSNRDCYDLNASETCNNDELQYCFTYGGPGGDDCEAIIF