; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002280 (gene) of Snake gourd v1 genome

Gene IDTan0002280
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG01:110022459..110023578
RNA-Seq ExpressionTan0002280
SyntenyTan0002280
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR040411 - Uncharacterized protein At5g23160-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607843.1 hypothetical protein SDJN03_01185, partial [Cucurbita argyrosperma subsp. sororia]5.0e-8970.63Show/hide
Query:  MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKRK
        MDS  ATKS KKK FPCFR+ AS GP+RT+R  DA DEQV P M VEERNGMMLHNV+  DG  + SG +K+ GGGGALSRALKAVLFGTSLAKKIRKRK
Subjt:  MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKRK

Query:  EKQKQNSIKENQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGLV
         KQK+NS +ENQR Q LSSISSRS   SD N+RN ST SSRTSAPFSS+SF SSSP S EI+++SFRF+PTASNRL RQINLRK    WF+LL+ LL LV
Subjt:  EKQKQNSIKENQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGLV

Query:  LWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE
        LW K GA +CTS+WI CF  YRR IGFRSPDDK SAAAMSS EY +  ++EGFLNRDRS   +SI HI+
Subjt:  LWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE

XP_022941268.1 uncharacterized protein LOC111446618 isoform X1 [Cucurbita moschata]2.2e-8970.26Show/hide
Query:  MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKRK
        MDS  ATKS KKK FPCFR+ AS GP+RT+R  +A DEQV P M VEERNGMMLHNV+  DG  + SG +K+ GGGGALSRALKAVLFGTSLAKKIRK+K
Subjt:  MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKRK

Query:  EKQKQNSIKENQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGLV
         KQK+NS +ENQR + LSSISSRS   SD N+RN ST SSRTSAPFSS+SFCSSSP S EI ++SFRF+PTASNRL RQINLRK    WF+LL+ LL LV
Subjt:  EKQKQNSIKENQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGLV

Query:  LWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE
        LW K GA +CTS+WI CF  YRR IGFRSPDDK SAAAMSS EY +  ++EGFLNRDRS   NSI HI+
Subjt:  LWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE

XP_022981581.1 uncharacterized protein LOC111480657 isoform X1 [Cucurbita maxima]2.4e-9171.38Show/hide
Query:  MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKRK
        MDS  ATKS KKK FPCFR+ AS  P+RT+R  DA DEQV P M VEERNGMMLHNV+  DG  D SG +K+ GGGGALSRALKAVLFGTSLAKKIRKRK
Subjt:  MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKRK

Query:  EKQKQNSIKENQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGLV
         KQK+NS +ENQR Q LSSISSRS   SD N+RN ST SSR SAPFSS+SFCSSSP SSEI+++SFRF+PTASNRL RQINLR     WF+LL+CLL LV
Subjt:  EKQKQNSIKENQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGLV

Query:  LWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE
        LW K GA +CTS+WI CF  YRR IGFRSPDDK SAAAMSS EY +  ++EGFLNRDRS   NSI HI+
Subjt:  LWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE

XP_023525890.1 uncharacterized protein LOC111789371 [Cucurbita pepo subsp. pepo]2.0e-9070.63Show/hide
Query:  MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKRK
        MDS  A KS KKK FPCFR+ AS GP+RT+R  DA DEQV P M VEERNGMMLHNV+  DG  + SG +K+ GGGGALSRALKAVLFGTSLAKKIRKRK
Subjt:  MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKRK

Query:  EKQKQNSIKENQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGLV
         KQK+NS +ENQR Q LSSISSRS   SD N+RNYST SSR SAPFSS+SFCSSSP S EI+++SFRF+PTASNRL RQINLRK    WF+LL+ LL L+
Subjt:  EKQKQNSIKENQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGLV

Query:  LWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE
        LW K GA +CTS+WI CF  YRR IGFRSPDDK SAAAMSS EY +  ++EGFLNRDRS   NSI HI+
Subjt:  LWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE

XP_038900051.1 uncharacterized protein LOC120087211 [Benincasa hispida]5.4e-9173.9Show/hide
Query:  MDSAVATKSKKK-KLFPCFRAAASGGPLRTMRCK-DAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRK
        MDS  A KSKKK KLFPCFRAAASGGP+ T RCK DA DE + P + V++        VRS DG D DSGHRK+K GGGALSRA+KAVLFGTSLAKKIRK
Subjt:  MDSAVATKSKKK-KLFPCFRAAASGGPLRTMRCK-DAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRK

Query:  RKEKQKQNSIKENQRHQALSSIS-SRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLL
        RK K+KQNS  ENQRHQA  SIS +RSRI SDLNY N ST SSRTSAPFSSSSFCSSSP+SSE+SD+SFRFYPTASNRL RQIN  KI SGWFLLL+CLL
Subjt:  RKEKQKQNSIKENQRHQALSSIS-SRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLL

Query:  GLVLWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGE-YSEGIVMEGFLNRDRSGAHNSILHI
         LVLWGK GAI+CTSVWI   CLYRRRIG +S DDK SA AMSSGE Y   +VMEGFL RD SGA NSIL I
Subjt:  GLVLWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGE-YSEGIVMEGFLNRDRSGAHNSILHI

TrEMBL top hitse value%identityAlignment
A0A5A7SZR0 Uncharacterized protein9.9e-7568Show/hide
Query:  MDSAVATKSKKK-KLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKR
        MDS    KSKKK KLFPCFRAAASG     +R KD   E V P + V+E       NVR L G D DSGHRK+KG  GALSRA KAVLFGTSLAKKIRKR
Subjt:  MDSAVATKSKKK-KLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKR

Query:  KEKQKQNSIKE-NQRHQALSSISSRSRITSD-LN-YRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCL
        K K+K+NS  E NQ HQALSSI +RS   SD LN Y N ST SSRTSAPFSSSSFCSSSPASSE+S++SFRFYP  SNRLLRQINLRKI SGWF+LL+CL
Subjt:  KEKQKQNSIKE-NQRHQALSSISSRSRITSD-LN-YRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCL

Query:  LGLVLWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGE-YSEGIVMEGFLNRDR-SGAHNSILHIE
        L L+LWGK GAIMCTSVWI   CLYRRR+G +    K SA AMSSGE Y   I MEGFL R+R S A NSIL I+
Subjt:  LGLVLWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGE-YSEGIVMEGFLNRDR-SGAHNSILHIE

A0A6J1CDH5 uncharacterized protein LOC1110105433.2e-8164.96Show/hide
Query:  MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSL----DGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKI
        MDS  A   KK KLFPCFR+ AS   +RT RCKD  DE+V P M VEE +G M H+VRS+    D DD+DSG RKRKG GGALSRA+KAVLFGT+LAKK+
Subjt:  MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSL----DGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKI

Query:  RKRKEKQKQNSIKENQ-RHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLC
        RK+K KQKQNS KENQ RH ++SSIS RSRI SD  Y NYS  SSRTS PFSSSSFCSSSP+SS+IS+ SF  YPTA  RL  QINLR+ICSGW + L+C
Subjt:  RKRKEKQKQNSIKENQ-RHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLC

Query:  LLGLVLWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE
        +L L+LWGK  AI+CTSVWI CF    RR GF+SP+ K SAAA+ SGE+++ IV+EG L RDRS A NS L I+
Subjt:  LLGLVLWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE

A0A6J1F8B9 uncharacterized protein LOC1114430633.0e-7163.7Show/hide
Query:  MDSAVATKSKKK-KLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKR
        M+S V TKSKK+ KLFPCFRAAASG P+       A +EQV P M V +       NV  +D  D+DS   K+KGG GA SRA++AV+FGTSLAKKI KR
Subjt:  MDSAVATKSKKK-KLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKR

Query:  KEKQKQNSIKENQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGL
        K K  QNS KE+QRH A S  SSRSR  SDLNYRNYST   R+S PFSS SF SSSP+S+E SD SFR YPTASNRL  QIN RKI SGWF+LL+CLL L
Subjt:  KEKQKQNSIKENQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGL

Query:  VLWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE
        VLWGKTGAI+CTSVW+   CLYR R  FRSPDDK S  AMSSGEY++  +ME FL RDR  A NS L I+
Subjt:  VLWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE

A0A6J1FRN0 uncharacterized protein LOC111446618 isoform X11.1e-8970.26Show/hide
Query:  MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKRK
        MDS  ATKS KKK FPCFR+ AS GP+RT+R  +A DEQV P M VEERNGMMLHNV+  DG  + SG +K+ GGGGALSRALKAVLFGTSLAKKIRK+K
Subjt:  MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKRK

Query:  EKQKQNSIKENQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGLV
         KQK+NS +ENQR + LSSISSRS   SD N+RN ST SSRTSAPFSS+SFCSSSP S EI ++SFRF+PTASNRL RQINLRK    WF+LL+ LL LV
Subjt:  EKQKQNSIKENQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGLV

Query:  LWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE
        LW K GA +CTS+WI CF  YRR IGFRSPDDK SAAAMSS EY +  ++EGFLNRDRS   NSI HI+
Subjt:  LWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE

A0A6J1IWY5 uncharacterized protein LOC111480657 isoform X11.2e-9171.38Show/hide
Query:  MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKRK
        MDS  ATKS KKK FPCFR+ AS  P+RT+R  DA DEQV P M VEERNGMMLHNV+  DG  D SG +K+ GGGGALSRALKAVLFGTSLAKKIRKRK
Subjt:  MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKRK

Query:  EKQKQNSIKENQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGLV
         KQK+NS +ENQR Q LSSISSRS   SD N+RN ST SSR SAPFSS+SFCSSSP SSEI+++SFRF+PTASNRL RQINLR     WF+LL+CLL LV
Subjt:  EKQKQNSIKENQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGLV

Query:  LWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE
        LW K GA +CTS+WI CF  YRR IGFRSPDDK SAAAMSS EY +  ++EGFLNRDRS   NSI HI+
Subjt:  LWGKTGAIMCTSVWIFCFCLYRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTCCGCTGTTGCAACCAAATCGAAGAAGAAGAAGTTGTTTCCTTGTTTCCGAGCGGCGGCCTCCGGTGGCCCTCTCAGGACGATGCGATGTAAGGACGCTCAAGA
CGAGCAGGTTGTTCCGTCCATGGGGGTGGAAGAGAGAAACGGTATGATGCTTCACAATGTGCGGTCGTTGGATGGAGACGATGACGATTCCGGTCACCGGAAAAGGAAGG
GCGGTGGCGGTGCTTTGTCGCGTGCGCTTAAGGCCGTCTTATTCGGAACTTCATTGGCGAAGAAGATCAGAAAAAGGAAAGAAAAACAAAAGCAAAATTCGATCAAGGAG
AATCAGAGGCATCAAGCTCTGTCTTCAATTAGCAGCCGATCAAGAATTACTTCAGATCTAAATTACCGCAACTATTCTACCAGTTCTTCGCGTACTTCTGCGCCATTTTC
ATCCTCCTCGTTCTGCAGTTCATCTCCTGCCTCCTCGGAAATCAGCGACGTATCATTTCGATTTTATCCAACTGCCTCGAATCGATTGCTCAGACAGATTAATCTCAGAA
AAATCTGCAGCGGTTGGTTTCTGCTACTATTATGTCTTCTAGGGTTGGTTTTATGGGGGAAAACCGGAGCTATTATGTGTACTTCGGTTTGGATCTTCTGTTTCTGTTTG
TACCGCCGTAGAATCGGATTCAGGTCGCCGGACGACAAGACCAGTGCGGCGGCGATGAGTTCCGGCGAATACAGTGAGGGAATTGTCATGGAAGGATTTCTGAACAGAGA
CCGTTCGGGTGCTCATAATTCAATCTTACACATTGAATGA
mRNA sequenceShow/hide mRNA sequence
CTCGTCTTCTCCATGAACGCACCGCCAACTGTTCTTCAAGTTCAGGTTCTCCTTTTTCTCATCGAAGATTCTCATTCTCCAGTTCCTTCAGATTTTCTAGCCTCCCAGAC
GCCTTGCTAAATCCTCGCAAACTGCACAAGAAACACCTTCCATTGTTCGAGTTTTATGTCCAATGGACTCCGCTGTTGCAACCAAATCGAAGAAGAAGAAGTTGTTTCCT
TGTTTCCGAGCGGCGGCCTCCGGTGGCCCTCTCAGGACGATGCGATGTAAGGACGCTCAAGACGAGCAGGTTGTTCCGTCCATGGGGGTGGAAGAGAGAAACGGTATGAT
GCTTCACAATGTGCGGTCGTTGGATGGAGACGATGACGATTCCGGTCACCGGAAAAGGAAGGGCGGTGGCGGTGCTTTGTCGCGTGCGCTTAAGGCCGTCTTATTCGGAA
CTTCATTGGCGAAGAAGATCAGAAAAAGGAAAGAAAAACAAAAGCAAAATTCGATCAAGGAGAATCAGAGGCATCAAGCTCTGTCTTCAATTAGCAGCCGATCAAGAATT
ACTTCAGATCTAAATTACCGCAACTATTCTACCAGTTCTTCGCGTACTTCTGCGCCATTTTCATCCTCCTCGTTCTGCAGTTCATCTCCTGCCTCCTCGGAAATCAGCGA
CGTATCATTTCGATTTTATCCAACTGCCTCGAATCGATTGCTCAGACAGATTAATCTCAGAAAAATCTGCAGCGGTTGGTTTCTGCTACTATTATGTCTTCTAGGGTTGG
TTTTATGGGGGAAAACCGGAGCTATTATGTGTACTTCGGTTTGGATCTTCTGTTTCTGTTTGTACCGCCGTAGAATCGGATTCAGGTCGCCGGACGACAAGACCAGTGCG
GCGGCGATGAGTTCCGGCGAATACAGTGAGGGAATTGTCATGGAAGGATTTCTGAACAGAGACCGTTCGGGTGCTCATAATTCAATCTTACACATTGAATGATCTAACTG
CCAAAGG
Protein sequenceShow/hide protein sequence
MDSAVATKSKKKKLFPCFRAAASGGPLRTMRCKDAQDEQVVPSMGVEERNGMMLHNVRSLDGDDDDSGHRKRKGGGGALSRALKAVLFGTSLAKKIRKRKEKQKQNSIKE
NQRHQALSSISSRSRITSDLNYRNYSTSSSRTSAPFSSSSFCSSSPASSEISDVSFRFYPTASNRLLRQINLRKICSGWFLLLLCLLGLVLWGKTGAIMCTSVWIFCFCL
YRRRIGFRSPDDKTSAAAMSSGEYSEGIVMEGFLNRDRSGAHNSILHIE