; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC04G078300 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC04G078300
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
Descriptionvitellogenin-like
Genome locationCmU531Chr04:24794477..24796603
RNA-Seq ExpressionCmUC04G078300
SyntenyCmUC04G078300
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8652542.1 hypothetical protein Csa_013076 [Cucumis sativus]6.9e-8580Show/hide
Query:  MDQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHPSLPPLPPPTENSKQITLV----------NSSSPDYHLKHRVSES
        MDQ VEYRRRD++LLDS +DFEFNISIEHES CADE+FSNGIILPIKIQSHKQSHPSLPPLPP  E+SK+ITLV          +SSS D+ L+ RVSES
Subjt:  MDQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHPSLPPLPPPTENSKQITLV----------NSSSPDYHLKHRVSES

Query:  KSFWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNY
        KSFWGFKRS+S NNFETK +SLCPIPLLSRSNS GS SNSKSKK KDSQKQ SQ+ NSSSMRK SSPS     PPSSLNQYPTILKPQMCKNPGGVYG Y
Subjt:  KSFWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNY

Query:  HYIGPVLNVPPKFFGFGSLL
        HYIGPVLNVPPKFFGFGSLL
Subjt:  HYIGPVLNVPPKFFGFGSLL

KAG6584354.1 hypothetical protein SDJN03_20286, partial [Cucurbita argyrosperma subsp. sororia]3.9e-7273.39Show/hide
Query:  DQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHP-------SLPPLPPPTENSKQITLVNSSSPDYHLKHRVSESKSFW
        DQ  E+ RRD+TLLDSNLDFEFNISI+HES  ADELFSNG+I+P KI+SHKQSHP       SLPPL PPTENSKQ  +VNS+S    L+ R SESKSFW
Subjt:  DQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHP-------SLPPLPPPTENSKQITLVNSSSPDYHLKHRVSESKSFW

Query:  GFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSS--PSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNYHY
        GFKRSSS  NFE+KR SLCP+PLLSRSNS GS  NSKSKKCKDSQKQISQ+ +S+S RK SS  PSSL   P SSLNQYPT LKPQMC+NPGG YG YH 
Subjt:  GFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSS--PSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNYHY

Query:  IGPVLNVPPKFFGFGSLL
        IGPVLNVPPKFFG GS+L
Subjt:  IGPVLNVPPKFFGFGSLL

XP_008458924.1 PREDICTED: putative protein TPRXL [Cucumis melo]2.6e-8477.68Show/hide
Query:  MDQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHPSLPPLPPPTENSKQITLVN--------------SSSPDYHLKHR
        MDQ VEYRRRD++LLDS +DFEFNISIEHES CADE+FSNGIILPIKIQSHKQSHPSLPPLPP  ++SK+ITLVN              SSS D+ L+ R
Subjt:  MDQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHPSLPPLPPPTENSKQITLVN--------------SSSPDYHLKHR

Query:  VSESKSFWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGV
        VSESKSFWGFKRS S NNFETK +SLCPIPLLSRSNS GS SNSKSKK KDSQKQ SQ+ NSSSMRKSS         PSSLNQYPTILKPQMCKNPGGV
Subjt:  VSESKSFWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGV

Query:  YGNYHYIGPVLNVPPKFFGFGSLL
        YGNYHYIGPVLNVPPKFFGFGSLL
Subjt:  YGNYHYIGPVLNVPPKFFGFGSLL

XP_011650562.2 putative protein TPRXL [Cucumis sativus]6.9e-8580Show/hide
Query:  MDQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHPSLPPLPPPTENSKQITLV----------NSSSPDYHLKHRVSES
        MDQ VEYRRRD++LLDS +DFEFNISIEHES CADE+FSNGIILPIKIQSHKQSHPSLPPLPP  E+SK+ITLV          +SSS D+ L+ RVSES
Subjt:  MDQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHPSLPPLPPPTENSKQITLV----------NSSSPDYHLKHRVSES

Query:  KSFWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNY
        KSFWGFKRS+S NNFETK +SLCPIPLLSRSNS GS SNSKSKK KDSQKQ SQ+ NSSSMRK SSPS     PPSSLNQYPTILKPQMCKNPGGVYG Y
Subjt:  KSFWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNY

Query:  HYIGPVLNVPPKFFGFGSLL
        HYIGPVLNVPPKFFGFGSLL
Subjt:  HYIGPVLNVPPKFFGFGSLL

XP_038895586.1 uncharacterized protein LOC120083787 [Benincasa hispida]3.9e-8881.74Show/hide
Query:  MDQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHP--SLPPLPPPTENSKQITLVN------SSSPDYHLKHRVSESKS
        +DQ VEYRRRD++LLDSNLDFEFNISIE ES CADELFSNGIILPIKIQSHKQSHP  SLPPLPPPT+NSKQITLVN      SSS D+ L+ RVSESKS
Subjt:  MDQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHP--SLPPLPPPTENSKQITLVN------SSSPDYHLKHRVSESKS

Query:  FWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNYHY
        FWGFKRS+S NNFE++RISLCPIPLLSRSNS GS SNSKSKK KDSQKQI Q+ NSSSM+KSSSPS     PPS LNQYPTIL+ QMCKNPGGVYGNYHY
Subjt:  FWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNYHY

Query:  IGPVLNVPPKFFGFGSLLL
        IGPVLNVPPKFFGFGSLL+
Subjt:  IGPVLNVPPKFFGFGSLLL

TrEMBL top hitse value%identityAlignment
A0A0A0LQT0 Uncharacterized protein2.5e-8580.37Show/hide
Query:  MDQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHPSLPPLPPPTENSKQITLV---------NSSSPDYHLKHRVSESK
        MDQ VEYRRRD++LLDS +DFEFNISIEHES CADE+FSNGIILPIKIQSHKQSHPSLPPLPP  E+SK+ITLV         +SSS D+ L+ RVSESK
Subjt:  MDQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHPSLPPLPPPTENSKQITLV---------NSSSPDYHLKHRVSESK

Query:  SFWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNYH
        SFWGFKRS+S NNFETK +SLCPIPLLSRSNS GS SNSKSKK KDSQKQ SQ+ NSSSMRK SSPS     PPSSLNQYPTILKPQMCKNPGGVYG YH
Subjt:  SFWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNYH

Query:  YIGPVLNVPPKFFGFGSLL
        YIGPVLNVPPKFFGFGSLL
Subjt:  YIGPVLNVPPKFFGFGSLL

A0A1S3C907 Uncharacterized protein1.3e-8477.68Show/hide
Query:  MDQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHPSLPPLPPPTENSKQITLVN--------------SSSPDYHLKHR
        MDQ VEYRRRD++LLDS +DFEFNISIEHES CADE+FSNGIILPIKIQSHKQSHPSLPPLPP  ++SK+ITLVN              SSS D+ L+ R
Subjt:  MDQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHPSLPPLPPPTENSKQITLVN--------------SSSPDYHLKHR

Query:  VSESKSFWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGV
        VSESKSFWGFKRS S NNFETK +SLCPIPLLSRSNS GS SNSKSKK KDSQKQ SQ+ NSSSMRKSS         PSSLNQYPTILKPQMCKNPGGV
Subjt:  VSESKSFWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGV

Query:  YGNYHYIGPVLNVPPKFFGFGSLL
        YGNYHYIGPVLNVPPKFFGFGSLL
Subjt:  YGNYHYIGPVLNVPPKFFGFGSLL

A0A6J1E7V7 uncharacterized protein LOC111431596 isoform X15.5e-7273.71Show/hide
Query:  EYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHP-------SLPPLPPPTENSKQITLVNSSSPDYHLKHRVSESKSFWGFKR
        E+ RRD+TLLDSNLDFEFNISI+HES  ADELFSNG+I+P KI+SHKQSHP       SLPPL PPTENSKQ  +VNS+S    L+ R SESKSFWGFKR
Subjt:  EYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHP-------SLPPLPPPTENSKQITLVNSSSPDYHLKHRVSESKSFWGFKR

Query:  SSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPS-SLSLPPPSSLNQYPTILKPQMCKNPGGVYGNYHYIGPVL
        SSS  NFE+KR SLCP+PLLSRSNS GS  NSKSKKCKDSQKQISQ+ +S+S RK SSPS S S  P SSLNQYPT LKPQMC+NPGG YG YH IGPVL
Subjt:  SSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPS-SLSLPPPSSLNQYPTILKPQMCKNPGGVYGNYHYIGPVL

Query:  NVPPKFFGFGSLL
        NVPP+FFG GS+L
Subjt:  NVPPKFFGFGSLL

A0A6J1E831 uncharacterized protein LOC111431596 isoform X24.4e-6970.75Show/hide
Query:  EYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHP-------SLPPLPPPTENSKQITLVNSSSPDYHLKHRVSESKSFWGFKR
        E+ RRD+TLLDSNLDFEFNISI+HES  ADELFSNG+I+P KI+SHKQSHP       SLPPL PPTENSKQ  +VNS+S    L+ R SESKSFWGFKR
Subjt:  EYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHP-------SLPPLPPPTENSKQITLVNSSSPDYHLKHRVSESKSFWGFKR

Query:  SSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNYHYIGPVLN
        SSS  NFE+KR SLCP+PLLSRSNS GS  NSKSKKCKDSQKQISQ+ +S+S RK            SSLNQYPT LKPQMC+NPGG YG YH IGPVLN
Subjt:  SSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNYHYIGPVLN

Query:  VPPKFFGFGSLL
        VPP+FFG GS+L
Subjt:  VPPKFFGFGSLL

A0A6J1GV05 uncharacterized protein LOC1114571728.8e-7070.51Show/hide
Query:  VEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHP-------SLPPLPPPTENS---KQITLVNSSSPDYHLKHRVSESKSFW
        +++R  DITLLDSNLDF+F+ISIEHES  ADELF NGIILPIK +SHKQSHP       SLPPLPP   ++   K+I +VN SS  + L+ R +ESKSFW
Subjt:  VEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHP-------SLPPLPPPTENS---KQITLVNSSSPDYHLKHRVSESKSFW

Query:  GFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNYHYIG
        G KRSSS  NFE KR SLCP+PLLSRS S GS SN KSKK KDSQKQISQ+  S+SMRKSSSP     PPP SLNQYPT ++PQMCKNPGGVYGNYHYIG
Subjt:  GFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNYHYIG

Query:  PVLNVPPKFFGFGSLLL
         VLNVPPKFFGFGSLLL
Subjt:  PVLNVPPKFFGFGSLLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G48780.1 unknown protein3.6e-1536.77Show/hide
Query:  RRDITLLD-SNLDFEFNISIEH---ESCCADELFSNGIILPIKIQS--------HKQSHP---------SLPPLPPPTENSKQITLVNSSSPDYHLKHRV
        RRD TLLD SN DFEF+IS      +S  ADE+F++G+ILP  + +        +K   P          L P P PT++S++ T   +S  +   +   
Subjt:  RRDITLLD-SNLDFEFNISIEH---ESCCADELFSNGIILPIKIQS--------HKQSHP---------SLPPLPPPTENSKQITLVNSSSPDYHLKHRV

Query:  SESKSFWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVY
        S SKSFW FKRSSS  N + K+  +C  P L+RSNS GS +NSK    +D                +  PSS S    S  N Y    +PQ      G  
Subjt:  SESKSFWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVY

Query:  GNYHYIGPVLNVPPKFFGFGSLL
        G    + PVLN  P  FG GS+L
Subjt:  GNYHYIGPVLNVPPKFFGFGSLL

AT1G67050.1 unknown protein5.9e-1836.57Show/hide
Query:  LDSNLDFEFNI--------SIEHESCCADELFSNGIILPIKIQSHKQ---SHPSLPPLPPPTENSKQITLVNSSSPDYHL---KHRVSESKSFWGFKRSS
        L+S++DF+F I        S +  S  ADELFSNG ILP +I+   +     P   P+    ++ KQ    N    +  +       + +KSFWGFKRSS
Subjt:  LDSNLDFEFNI--------SIEHESCCADELFSNGIILPIKIQSHKQ---SHPSLPPLPPPTENSKQITLVNSSSPDYHL---KHRVSESKSFWGFKRSS

Query:  SFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGV-YGNYH----YIGP
        S N   T   SLCP+PLL+RSNS GS S+         QKQ S R ++  ++   S S  S    SS        KP + K+ GG  YG++      + P
Subjt:  SFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGV-YGNYH----YIGP

Query:  VLNVPP--KFFGFGSL
        V+NV P    FGFGS+
Subjt:  VLNVPP--KFFGFGSL

AT3G18300.1 unknown protein6.5e-1735.59Show/hide
Query:  RRDITLLD-SNLDFEFNISIEH---ESCCADELFSNGIILPI--------KIQSHKQSHPSLPPLPPPTENSKQITLVNSSSPDYHLKHRVSE-------
        RRD TLLD SN DFEF+IS      +S  ADE+F++G+ILP+             +     LPP+      S  +  +    P++  K+ V E       
Subjt:  RRDITLLD-SNLDFEFNISIEH---ESCCADELFSNGIILPI--------KIQSHKQSHPSLPPLPPPTENSKQITLVNSSSPDYHLKHRVSE-------

Query:  --------------SKSFWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPS-SLNQYPTI
                      SKSFW FKRSSS  N + K+  +C  P L+RSNS GS + SK +  +D  K  SQRH     R   +PSS   PP S   + Y   
Subjt:  --------------SKSFWGFKRSSSFNNFETKRISLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPS-SLNQYPTI

Query:  LKPQMCKNPGGVYGNYHYIGPVLNVPPKFFGFGSLL
         +    KN GG  G++ +I PV+  P   FG GS+L
Subjt:  LKPQMCKNPGGVYGNYHYIGPVLNVPPKFFGFGSLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCAGTGCGTCGAGTATAGGCGGCGGGACATAACATTGTTGGATTCAAATCTAGACTTTGAGTTCAATATCAGCATTGAACATGAATCTTGTTGTGCTGATGAGCT
GTTTAGCAATGGAATCATTCTTCCCATCAAAATTCAATCTCACAAACAATCTCATCCTTCACTTCCTCCTCTTCCTCCCCCCACAGAGAATTCAAAGCAAATCACATTGG
TGAATTCATCTTCTCCAGATTATCACTTAAAACACAGAGTTTCAGAGTCCAAATCTTTTTGGGGATTCAAAAGAAGTAGCAGTTTCAACAACTTTGAAACTAAAAGAATT
TCTCTTTGCCCAATTCCTCTTTTGTCACGTAGCAATTCAATTGGGTCATTTTCAAATTCAAAGTCCAAAAAGTGTAAAGATTCACAAAAGCAAATTTCTCAGAGACATAA
CTCATCATCAATGAGAAAGTCGTCGTCACCGTCATCGTTGTCGTTGCCGCCGCCGTCATCCTTGAATCAATATCCAACAATTCTAAAGCCTCAGATGTGCAAGAACCCAG
GTGGGGTTTATGGAAATTATCATTATATTGGCCCTGTGTTGAATGTTCCTCCAAAGTTCTTTGGTTTTGGTTCACTTCTTCTGGCTTTGCCATTGTGTAATGGTTTTGTT
ATAAGACACAACATAGGAGGCTTCAATTGTAAGATGGGTACAAATGTATTGGATGAGTTTATATTCACTTCCTCGTCTAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACCAGTGCGTCGAGTATAGGCGGCGGGACATAACATTGTTGGATTCAAATCTAGACTTTGAGTTCAATATCAGCATTGAACATGAATCTTGTTGTGCTGATGAGCT
GTTTAGCAATGGAATCATTCTTCCCATCAAAATTCAATCTCACAAACAATCTCATCCTTCACTTCCTCCTCTTCCTCCCCCCACAGAGAATTCAAAGCAAATCACATTGG
TGAATTCATCTTCTCCAGATTATCACTTAAAACACAGAGTTTCAGAGTCCAAATCTTTTTGGGGATTCAAAAGAAGTAGCAGTTTCAACAACTTTGAAACTAAAAGAATT
TCTCTTTGCCCAATTCCTCTTTTGTCACGTAGCAATTCAATTGGGTCATTTTCAAATTCAAAGTCCAAAAAGTGTAAAGATTCACAAAAGCAAATTTCTCAGAGACATAA
CTCATCATCAATGAGAAAGTCGTCGTCACCGTCATCGTTGTCGTTGCCGCCGCCGTCATCCTTGAATCAATATCCAACAATTCTAAAGCCTCAGATGTGCAAGAACCCAG
GTGGGGTTTATGGAAATTATCATTATATTGGCCCTGTGTTGAATGTTCCTCCAAAGTTCTTTGGTTTTGGTTCACTTCTTCTGGCTTTGCCATTGTGTAATGGTTTTGTT
ATAAGACACAACATAGGAGGCTTCAATTGTAAGATGGGTACAAATGTATTGGATGAGTTTATATTCACTTCCTCGTCTAATTGA
Protein sequenceShow/hide protein sequence
MDQCVEYRRRDITLLDSNLDFEFNISIEHESCCADELFSNGIILPIKIQSHKQSHPSLPPLPPPTENSKQITLVNSSSPDYHLKHRVSESKSFWGFKRSSSFNNFETKRI
SLCPIPLLSRSNSIGSFSNSKSKKCKDSQKQISQRHNSSSMRKSSSPSSLSLPPPSSLNQYPTILKPQMCKNPGGVYGNYHYIGPVLNVPPKFFGFGSLLLALPLCNGFV
IRHNIGGFNCKMGTNVLDEFIFTSSSN