; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G012030 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G012030
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationchr07:17519146..17523550
RNA-Seq ExpressionLsi07G012030
SyntenyLsi07G012030
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB99729.1 hypothetical protein L484_023259 [Morus notabilis]1.3e-3240Show/hide
Query:  KPSINLEHKYLKHCPSGSVPIRRTQKEDFMRIKSLSN-------SSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAG
        K  + ++ +  + CP G+VPI+RT KE  +R +SL +       S   VVSL +      YG  G +S  +L +A DQ SS ++W+  GP   L+ I AG
Subjt:  KPSINLEHKYLKHCPSGSVPIRRTQKEDFMRIKSLSN-------SSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAG

Query:  W-QVKHGHLIRFLFYVNPAVNGGDDTRINGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQ
        W   K G+   F+   N  +  G   R    K VAWGGIAK GK G+SP +G+G+ P+GN+R ACY   + YVN++N  +PP      Q +   KCY L+
Subjt:  W-QVKHGHLIRFLFYVNPAVNGGDDTRINGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQ

Query:  DDRTCGGFDQMYYCFTYGGPGGKCG
        +D      D + Y FT+GGPGGKCG
Subjt:  DDRTCGGFDQMYYCFTYGGPGGKCG

OMP03992.1 hypothetical protein COLO4_10038 [Corchorus olitorius]7.8e-3038.6Show/hide
Query:  VVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPP---QQLSVILAGWQVKHGHLIRFLFYVNPAVNGGDDTRI------------------
        VV L M E   YYG+ G + AYNLSV   Q SS NLWV  GP     QL+VIL GW             V+P +NG   TR+                  
Subjt:  VVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPP---QQLSVILAGWQVKHGHLIRFLFYVNPAVNGGDDTRI------------------

Query:  ------------------NGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQ
                          NGA  VAWGGIA   K G SPP+GSG+ PN ++  +C+ + ++++N  +  + P      QYI  S CY L D + C G+ +
Subjt:  ------------------NGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQ

Query:  MYYCFTYGGPGGKCG
        M YCFT+GGPGGKCG
Subjt:  MYYCFTYGGPGGKCG

XP_022145286.1 uncharacterized protein LOC111014774 [Momordica charantia]6.2e-3542.08Show/hide
Query:  KSLSNSSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGWQVKHGHLIRFLFYVNPAVNG-------GDDTRI-----
        K  ++S  +VVSL +     YYGV G  S YNLSVAQDQ+SS+N+W++GGPP+ L+V      +    L R   Y      G          T I     
Subjt:  KSLSNSSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGWQVKHGHLIRFLFYVNPAVNG-------GDDTRI-----

Query:  -------NGAKQVAWGGIAKKGKNGMSPPLGSGNLP-NGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGP
               +GA+QVAWGGIAK   NGMSPPLG+G+ P NG +  ACY + I Y++  N G+ P    +  ++ +S CYGL D       D MY+CFT+GGP
Subjt:  -------NGAKQVAWGGIAKKGKNGMSPPLGSGNLP-NGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGP

Query:  GG
        GG
Subjt:  GG

XP_022145288.1 uncharacterized protein LOC111014777 [Momordica charantia]9.5e-4440.22Show/hide
Query:  ARKESSSKSKPSINLEHKYLKHCPSGSVPIRRTQKEDFMRIKSLSNS-----------------STHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASS
        ++  SSS+ K  IN  +   + CP+G VPIRRT K+D +RI+SLS+                  +  VVS++M +   YYG +G++S YNLSVAQDQ+SS
Subjt:  ARKESSSKSKPSINLEHKYLKHCPSGSVPIRRTQKEDFMRIKSLSNS-----------------STHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASS

Query:  TNLWVIGGPPQQLSVILAGWQVKHGHLIRFLFYVNPAVNGGDDTRI-----------------------------------NGAKQVAWGGIAKKGKNGM
        +N+W+IGGPPQ  +VILAGWQ            VNP +NG   TR+                                   +G +QVAWGGIAK   NGM
Subjt:  TNLWVIGGPPQQLSVILAGWQVKHGHLIRFLFYVNPAVNGGDDTRI-----------------------------------NGAKQVAWGGIAKKGKNGM

Query:  SPPLGSGNLPN-GNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG
        SPPLG+G+ PN   +  ACY R + YV++ N G  P       Y+ ++ CY L +  TCGG +  YYC T+GGPGG
Subjt:  SPPLGSGNLPN-GNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG

XP_024038072.1 uncharacterized protein LOC18039972 [Citrus clementina]1.2e-3533.74Show/hide
Query:  SARKESSSKSKPSINLEHKYLKH----CPSGSVPIRRTQKEDFMRIKSLSNSST---------------HVVSLSMNENQPYYGVTGTMSAYNLSVAQDQ
        SA+K S SK+         Y++H    CP G+VPIRRT KED ++  S+   S                H V++ M ++ PY+GV G +  +NL+VA+DQ
Subjt:  SARKESSSKSKPSINLEHKYLKH----CPSGSVPIRRTQKEDFMRIKSLSNSST---------------HVVSLSMNENQPYYGVTGTMSAYNLSVAQDQ

Query:  ASSTNLWVIGGPPQQLSVILAGWQVKHGHLIRFLFYVNPAVNGGDDTRI---------------------------------------------------
         S TN+W+  GPP QL+VILAGW             V+PA+NG   TR+                                                   
Subjt:  ASSTNLWVIGGPPQQLSVILAGWQVKHGHLIRFLFYVNPAVNGGDDTRI---------------------------------------------------

Query:  --------------------------------NGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKC
                                         GA+ VAWGGIA  GKNG+SPP+GSG L N +FR  CYIR I+YV+ QN    P    L+Q++  S C
Subjt:  --------------------------------NGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKC

Query:  YGLQDDRTCGGFDQMYYCFTYGGPGGKCG
        YGL+D + CG   +MYYC  +GG GG+CG
Subjt:  YGLQDDRTCGGFDQMYYCFTYGGPGGKCG

TrEMBL top hitse value%identityAlignment
A0A1R3KA80 Uncharacterized protein3.8e-3038.6Show/hide
Query:  VVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPP---QQLSVILAGWQVKHGHLIRFLFYVNPAVNGGDDTRI------------------
        VV L M E   YYG+ G + AYNLSV   Q SS NLWV  GP     QL+VIL GW             V+P +NG   TR+                  
Subjt:  VVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPP---QQLSVILAGWQVKHGHLIRFLFYVNPAVNGGDDTRI------------------

Query:  ------------------NGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQ
                          NGA  VAWGGIA   K G SPP+GSG+ PN ++  +C+ + ++++N  +  + P      QYI  S CY L D + C G+ +
Subjt:  ------------------NGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQ

Query:  MYYCFTYGGPGGKCG
        M YCFT+GGPGGKCG
Subjt:  MYYCFTYGGPGGKCG

A0A6J1CVJ6 uncharacterized protein LOC1110147774.6e-4440.22Show/hide
Query:  ARKESSSKSKPSINLEHKYLKHCPSGSVPIRRTQKEDFMRIKSLSNS-----------------STHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASS
        ++  SSS+ K  IN  +   + CP+G VPIRRT K+D +RI+SLS+                  +  VVS++M +   YYG +G++S YNLSVAQDQ+SS
Subjt:  ARKESSSKSKPSINLEHKYLKHCPSGSVPIRRTQKEDFMRIKSLSNS-----------------STHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASS

Query:  TNLWVIGGPPQQLSVILAGWQVKHGHLIRFLFYVNPAVNGGDDTRI-----------------------------------NGAKQVAWGGIAKKGKNGM
        +N+W+IGGPPQ  +VILAGWQ            VNP +NG   TR+                                   +G +QVAWGGIAK   NGM
Subjt:  TNLWVIGGPPQQLSVILAGWQVKHGHLIRFLFYVNPAVNGGDDTRI-----------------------------------NGAKQVAWGGIAKKGKNGM

Query:  SPPLGSGNLPN-GNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG
        SPPLG+G+ PN   +  ACY R + YV++ N G  P       Y+ ++ CY L +  TCGG +  YYC T+GGPGG
Subjt:  SPPLGSGNLPN-GNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG

A0A6J1CVW9 uncharacterized protein LOC1110147743.0e-3542.08Show/hide
Query:  KSLSNSSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGWQVKHGHLIRFLFYVNPAVNG-------GDDTRI-----
        K  ++S  +VVSL +     YYGV G  S YNLSVAQDQ+SS+N+W++GGPP+ L+V      +    L R   Y      G          T I     
Subjt:  KSLSNSSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGWQVKHGHLIRFLFYVNPAVNG-------GDDTRI-----

Query:  -------NGAKQVAWGGIAKKGKNGMSPPLGSGNLP-NGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGP
               +GA+QVAWGGIAK   NGMSPPLG+G+ P NG +  ACY + I Y++  N G+ P    +  ++ +S CYGL D       D MY+CFT+GGP
Subjt:  -------NGAKQVAWGGIAKKGKNGMSPPLGSGNLP-NGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGP

Query:  GG
        GG
Subjt:  GG

V4SWW2 Uncharacterized protein2.5e-2933.83Show/hide
Query:  LSNSSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGWQVKHGHLIRFLFYVNPAVNGGDDTRI--------------
        L N     V++ M ++ PY+GV G +  +NL+VA+DQ S TN+W+  GPP QL+VILAGW             V+PA+NG   TR+              
Subjt:  LSNSSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGWQVKHGHLIRFLFYVNPAVNGGDDTRI--------------

Query:  ---------------------------------------------------------------------NGAKQVAWGGIAKKGKNGMSPPLGSGNLPNG
                                                                              GA+ VAWGGIA  GKNG+SPP+GSG L N 
Subjt:  ---------------------------------------------------------------------NGAKQVAWGGIAKKGKNGMSPPLGSGNLPNG

Query:  NFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCG
        +FR  CYIR I+YV+ QN    P    L+Q++  S CYGL+D + CG   +MYYC  +GG GG+CG
Subjt:  NFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGGKCG

W9RN49 Neprosin domain-containing protein6.2e-3340Show/hide
Query:  KPSINLEHKYLKHCPSGSVPIRRTQKEDFMRIKSLSN-------SSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAG
        K  + ++ +  + CP G+VPI+RT KE  +R +SL +       S   VVSL +      YG  G +S  +L +A DQ SS ++W+  GP   L+ I AG
Subjt:  KPSINLEHKYLKHCPSGSVPIRRTQKEDFMRIKSLSN-------SSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAG

Query:  W-QVKHGHLIRFLFYVNPAVNGGDDTRINGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQ
        W   K G+   F+   N  +  G   R    K VAWGGIAK GK G+SP +G+G+ P+GN+R ACY   + YVN++N  +PP      Q +   KCY L+
Subjt:  W-QVKHGHLIRFLFYVNPAVNGGDDTRINGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQ

Query:  DDRTCGGFDQMYYCFTYGGPGGKCG
        +D      D + Y FT+GGPGGKCG
Subjt:  DDRTCGGFDQMYYCFTYGGPGGKCG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)5.8e-0732.65Show/hide
Query:  CPSGSVPIRRTQKEDFMRIKSLS-------------NSSTHVVSLSMNENQPYYGVTGTMSAYNLSVA-QDQASSTNLWVIGGP-PQQLSVILAGWQV
        CP G+VPIRRT++ED +R  S+S             +S+ H  ++     + YYG   +++ +   V  Q + S + +W+I G     L+ I AGWQV
Subjt:  CPSGSVPIRRTQKEDFMRIKSLS-------------NSSTHVVSLSMNENQPYYGVTGTMSAYNLSVA-QDQASSTNLWVIGGP-PQQLSVILAGWQV

AT4G23390.1 Protein of Unknown Function (DUF239)4.4e-0730.53Show/hide
Query:  NGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGI-PPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG
        +GA  V WGG         SP +GSG+ P   F+ A Y+ G++ + D    +  P  + L+ +     CY +Q     G F        +GGPGG
Subjt:  NGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGI-PPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFTYGGPGG

AT5G05030.1 Protein of Unknown Function (DUF239)4.0e-0827.03Show/hide
Query:  SSSKSKPSINLEHKYLKHCPSGSVPIRRTQK----------EDFMRIKSLSNSSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQ
        S  K++     E K +  CP+G+VPI R  K          E  +   ++ +  TH+  +   +  PY GV  ++S ++L++++DQAS  N+++  G   
Subjt:  SSSKSKPSINLEHKYLKHCPSGSVPIRRTQK----------EDFMRIKSLSNSSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQ

Query:  QLSVILAGWQV
        +++ I  GW +
Subjt:  QLSVILAGWQV

AT5G11660.1 Protein of Unknown Function (DUF239)3.1e-1624.12Show/hide
Query:  MKSARKESSSKSKPSINLEHKYLKHCPSGSVPIRRTQKEDFMRIK----------SLSNSSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLW
        MK +   S  KS+    +E K +  CP+G+VPI R  KE   R +          ++ +  TH   +    + PY+G+   MS ++L++++DQ S  +++
Subjt:  MKSARKESSSKSKPSINLEHKYLKHCPSGSVPIRRTQKEDFMRIK----------SLSNSSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLW

Query:  VIGGPPQQLSVILAGW-----------------------------------QVKHGHLIRFLF------------------------------YVNPAVN
        V  G  ++++ I  GW                                   QV    L+   F                              Y  P V+
Subjt:  VIGGPPQQLSVILAGW-----------------------------------QVKHGHLIRFLF------------------------------YVNPAVN

Query:  GG------DDTRINGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCF
         G       D   N    V   G  +   +G+SPP+G+G LP+ +   + +++G++ V+        K+ +L++ + D+KCYGL+D +    F +    F
Subjt:  GG------DDTRINGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCF

Query:  TYGGPGG-KCG
        TYGGPGG  CG
Subjt:  TYGGPGG-KCG

AT5G56530.1 Protein of Unknown Function (DUF239)7.6e-0729.14Show/hide
Query:  SSKSKPSIN----LEHKYLKHCPSGSVPIRRTQKEDFMRIKS---------------------LSNSSTHVVSLSMNENQPYYGVTGTMSAYNLSV-AQD
        S K K S+N    L H+    C  G++P+RRT+KED +R  S                     L N S H  +++  E   +YG   T++ +   V + +
Subjt:  SSKSKPSIN----LEHKYLKHCPSGSVPIRRTQKEDFMRIKS---------------------LSNSSTHVVSLSMNENQPYYGVTGTMSAYNLSV-AQD

Query:  QASSTNLWVIGGP-PQQLSVILAGWQVKHGHLIRFLFYVNPAVNGGDDTRI
        + S + LW++GG   Q L+ I AGWQ            V+P + G ++TR+
Subjt:  QASSTNLWVIGGP-PQQLSVILAGWQVKHGHLIRFLFYVNPAVNGGDDTRI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAGTGCAAGAAAGGAGAGTTCATCAAAATCAAAGCCTTCCATAAATCTTGAGCACAAATACTTGAAACATTGTCCAAGTGGATCAGTTCCTATTAGAAGAACTCA
AAAGGAGGATTTTATGAGAATTAAGTCTCTTTCAAATTCATCAACACATGTGGTGTCTCTTTCAATGAACGAAAATCAGCCATACTATGGAGTGACTGGAACAATGTCTG
CTTACAATTTAAGTGTTGCTCAAGACCAAGCTTCATCTACAAACTTATGGGTTATTGGTGGCCCTCCACAACAGCTTAGTGTCATTCTTGCTGGATGGCAGGTTAAACAT
GGTCATTTGATAAGGTTTTTGTTTTATGTAAATCCAGCTGTAAATGGTGGTGACGACACAAGAATCAATGGGGCAAAACAAGTAGCTTGGGGAGGCATTGCAAAGAAAGG
GAAGAATGGAATGAGCCCTCCATTAGGGAGTGGCAATTTACCTAATGGAAACTTTAGGGTTGCATGTTACATTAGAGGAATTAGATATGTGAATGATCAAAACTTGGGAA
TACCTCCAAAGCAAACTGAACTTCAACAATATATTGGGGACTCTAAATGTTATGGTTTGCAAGATGATAGAACTTGTGGGGGTTTTGATCAAATGTATTATTGCTTCACA
TATGGTGGACCAGGTGGGAAATGTGGGGCTGCTGTTAAAACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAGTGCAAGAAAGGAGAGTTCATCAAAATCAAAGCCTTCCATAAATCTTGAGCACAAATACTTGAAACATTGTCCAAGTGGATCAGTTCCTATTAGAAGAACTCA
AAAGGAGGATTTTATGAGAATTAAGTCTCTTTCAAATTCATCAACACATGTGGTGTCTCTTTCAATGAACGAAAATCAGCCATACTATGGAGTGACTGGAACAATGTCTG
CTTACAATTTAAGTGTTGCTCAAGACCAAGCTTCATCTACAAACTTATGGGTTATTGGTGGCCCTCCACAACAGCTTAGTGTCATTCTTGCTGGATGGCAGGTTAAACAT
GGTCATTTGATAAGGTTTTTGTTTTATGTAAATCCAGCTGTAAATGGTGGTGACGACACAAGAATCAATGGGGCAAAACAAGTAGCTTGGGGAGGCATTGCAAAGAAAGG
GAAGAATGGAATGAGCCCTCCATTAGGGAGTGGCAATTTACCTAATGGAAACTTTAGGGTTGCATGTTACATTAGAGGAATTAGATATGTGAATGATCAAAACTTGGGAA
TACCTCCAAAGCAAACTGAACTTCAACAATATATTGGGGACTCTAAATGTTATGGTTTGCAAGATGATAGAACTTGTGGGGGTTTTGATCAAATGTATTATTGCTTCACA
TATGGTGGACCAGGTGGGAAATGTGGGGCTGCTGTTAAAACTTAA
Protein sequenceShow/hide protein sequence
MKSARKESSSKSKPSINLEHKYLKHCPSGSVPIRRTQKEDFMRIKSLSNSSTHVVSLSMNENQPYYGVTGTMSAYNLSVAQDQASSTNLWVIGGPPQQLSVILAGWQVKH
GHLIRFLFYVNPAVNGGDDTRINGAKQVAWGGIAKKGKNGMSPPLGSGNLPNGNFRVACYIRGIRYVNDQNLGIPPKQTELQQYIGDSKCYGLQDDRTCGGFDQMYYCFT
YGGPGGKCGAAVKT