; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007729 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007729
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr9:3926961..3930100
RNA-Seq ExpressionLag0007729
SyntenyLag0007729
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EDO30234.1 predicted protein, partial [Nematostella vectensis]3.0e-0628.62Show/hide
Query:  QVRRFSRVS----LQFFPHSSKVLTRFTAVPSPQVRRFSR-------RFAEFLPPSLKVLTSLRFAHALRCVLPPSSKVLTRFALQFLPPSSKVLTRFVQ
        Q+ RF R +     +F   +S+ +TRF    SPQ+ RF R       RFA    P +      RF  +       +S  +TRF       +S  +TRF +
Subjt:  QVRRFSRVS----LQFFPHSSKVLTRFTAVPSPQVRRFSR-------RFAEFLPPSLKVLTSLRFAHALRCVLPPSSKVLTRFALQFLPPSSKVLTRFVQ

Query:  FLPPNSKVLTRFAAFPSPQIRRFSRASLQFLPQVRRFSRAA----LQFLPQSSKVLTRFVAVLS---SKFEGSHTLRCSSF-----PQVRRFSRASLQFL
           P    +TRF    SPQI RF R++    PQ+ RF+R+A     +F   +S  +TRF    S   ++F  S + + + F     PQ+ RF R++    
Subjt:  FLPPNSKVLTRFAAFPSPQIRRFSRASLQFLPQVRRFSRAA----LQFLPQSSKVLTRFVAVLS---SKFEGSHTLRCSSF-----PQVRRFSRASLQFL

Query:  SPSLKFLPPKFEGSHVASLRSCASLQFLPPSLKVLTSLRCDPSSKFEGSHALRSAIPSPSSKVLTRFAAVPSLQVQRFSHRFAAILPPSSKVLTRFALQF
        SP +           +      AS Q            R   S+  + +   RSA P      +TRFA   S Q+ RF+         +S  +TRF    
Subjt:  SPSLKFLPPKFEGSHVASLRSCASLQFLPPSLKVLTSLRCDPSSKFEGSHALRSAIPSPSSKVLTRFAAVPSLQVQRFSHRFAAILPPSSKVLTRFALQF

Query:  LPQVRRFSCASCSSFLQIRRFSRAS
         PQ+ RF     S+  QI RF R++
Subjt:  LPQVRRFSCASCSSFLQIRRFSRAS

XP_009061288.1 hypothetical protein LOTGIDRAFT_126799, partial [Lottia gigantea]4.7e-0730.32Show/hide
Query:  RCSSFPPSSKVLTSLRCD---PSSKFEGSHALRSAIPSPSSKVLMRF-VQFLPPNSKVLTRFVAFP-SPQIRRFSRASLQFLPPSSKVLTRFALQFLPQV
        +  S PPSSK+L + + +   PS+K   ++ +      PSSK+L+ + ++ LPP++K+L  +   P  P  +      ++ LPPSSK+L  + ++ LP  
Subjt:  RCSSFPPSSKVLTSLRCD---PSSKFEGSHALRSAIPSPSSKVLMRF-VQFLPPNSKVLTRFVAFP-SPQIRRFSRASLQFLPPSSKVLTRFALQFLPQV

Query:  RRFSRA-SLQFLPPSSKVLTSLRCD---PSSKFEGSHALRSAIPS-------------PSSKVLMRF-VQFLPPNSKVLTRFVAFPSP
         +      +  LPPSSK+L + + +   PSSK   ++ + S  PS              SSK+L+ + +  LPP+SK+L  +   P P
Subjt:  RRFSRA-SLQFLPPSSKVLTSLRCD---PSSKFEGSHALRSAIPS-------------PSSKVLMRF-VQFLPPNSKVLTRFVAFPSP

TrEMBL top hitse value%identityAlignment
A0A803JTN0 Uncharacterized protein3.6e-0532.64Show/hide
Query:  ASLQFLSP---SLKFLPPKFEGSHVASLRSCASLQFLPPSLKVLTSLRCDPSS-KF--EGSHALRSAIPSPSSKVLTRFA--AVPSLQVQRFSHRFAAIL
        +SLQFL P   SL+FL P    S  +   + +SLQFL P+   L  L+  PSS +F      +L+   P+PSS    + A  ++ SLQ    S  F    
Subjt:  ASLQFLSP---SLKFLPPKFEGSHVASLRSCASLQFLPPSLKVLTSLRCDPSS-KF--EGSHALRSAIPSPSSKVLTRFA--AVPSLQVQRFSHRFAAIL

Query:  PPSSKVL--TRFALQFL---PQVRRFSCASCSSFLQIRRFSRASLHFLPPKFEGSHALRCS-SFLQVRRFSRASLCNSFPKFEGSHALRCSSFPPSSKVL
        P S   L     +LQFL   P   +F   + SS LQ    +  SLHFL P      +L+ S S LQ  R + +SL    P     H L+     P+   L
Subjt:  PPSSKVL--TRFALQFL---PQVRRFSCASCSSFLQIRRFSRASLHFLPPKFEGSHALRCS-SFLQVRRFSRASLCNSFPKFEGSHALRCSSFPPSSKVL

Query:  TSLRCDPSSKFEGSHALRSAIPSPSSKVLMR----FVQFLPPNSKVLTRFVAFPSP-QIRRFSRASLQFLPPSSKVL-----TRFALQFL---PQVRRF-
         SL+  PSS       L+S  P+PSS   +R     ++FL P    L      PS  Q  + + +SLQFL P+   L     T  +LQFL   P    F 
Subjt:  TSLRCDPSSKFEGSHALRSAIPSPSSKVLMR----FVQFLPPNSKVLTRFVAFPSP-QIRRFSRASLQFLPPSSKVL-----TRFALQFL---PQVRRF-

Query:  --SRASLQFLPPSSKVLTSLRCDPSSKFEGSHALRSAIPSPSSKVLMR----FVQFLPPNSKVLTRFVAFPSPKFEGSHALR-CSSFLQVRRFSRASLCN
          + +SLQFL P+   L  L+  PSS       L+   P+PSS   +R     +QFL P    L     F  P     H L+   S L   R + +SL  
Subjt:  --SRASLQFLPPSSKVLTSLRCDPSSKFEGSHALRSAIPSPSSKVLMR----FVQFLPPNSKVLTRFVAFPSPKFEGSHALR-CSSFLQVRRFSRASLCN

Query:  SFPKVRRFSRASCSSFLQIRRFSRASLHFLPP
               F R + SS LQ  + + + L FL P
Subjt:  SFPKVRRFSRASCSSFLQIRRFSRASLHFLPP

A7T1D9 Predicted protein (Fragment)1.5e-0628.62Show/hide
Query:  QVRRFSRVS----LQFFPHSSKVLTRFTAVPSPQVRRFSR-------RFAEFLPPSLKVLTSLRFAHALRCVLPPSSKVLTRFALQFLPPSSKVLTRFVQ
        Q+ RF R +     +F   +S+ +TRF    SPQ+ RF R       RFA    P +      RF  +       +S  +TRF       +S  +TRF +
Subjt:  QVRRFSRVS----LQFFPHSSKVLTRFTAVPSPQVRRFSR-------RFAEFLPPSLKVLTSLRFAHALRCVLPPSSKVLTRFALQFLPPSSKVLTRFVQ

Query:  FLPPNSKVLTRFAAFPSPQIRRFSRASLQFLPQVRRFSRAA----LQFLPQSSKVLTRFVAVLS---SKFEGSHTLRCSSF-----PQVRRFSRASLQFL
           P    +TRF    SPQI RF R++    PQ+ RF+R+A     +F   +S  +TRF    S   ++F  S + + + F     PQ+ RF R++    
Subjt:  FLPPNSKVLTRFAAFPSPQIRRFSRASLQFLPQVRRFSRAA----LQFLPQSSKVLTRFVAVLS---SKFEGSHTLRCSSF-----PQVRRFSRASLQFL

Query:  SPSLKFLPPKFEGSHVASLRSCASLQFLPPSLKVLTSLRCDPSSKFEGSHALRSAIPSPSSKVLTRFAAVPSLQVQRFSHRFAAILPPSSKVLTRFALQF
        SP +           +      AS Q            R   S+  + +   RSA P      +TRFA   S Q+ RF+         +S  +TRF    
Subjt:  SPSLKFLPPKFEGSHVASLRSCASLQFLPPSLKVLTSLRCDPSSKFEGSHALRSAIPSPSSKVLTRFAAVPSLQVQRFSHRFAAILPPSSKVLTRFALQF

Query:  LPQVRRFSCASCSSFLQIRRFSRAS
         PQ+ RF     S+  QI RF R++
Subjt:  LPQVRRFSCASCSSFLQIRRFSRAS

V3ZAI0 Uncharacterized protein (Fragment)2.3e-0730.32Show/hide
Query:  RCSSFPPSSKVLTSLRCD---PSSKFEGSHALRSAIPSPSSKVLMRF-VQFLPPNSKVLTRFVAFP-SPQIRRFSRASLQFLPPSSKVLTRFALQFLPQV
        +  S PPSSK+L + + +   PS+K   ++ +      PSSK+L+ + ++ LPP++K+L  +   P  P  +      ++ LPPSSK+L  + ++ LP  
Subjt:  RCSSFPPSSKVLTSLRCD---PSSKFEGSHALRSAIPSPSSKVLMRF-VQFLPPNSKVLTRFVAFP-SPQIRRFSRASLQFLPPSSKVLTRFALQFLPQV

Query:  RRFSRA-SLQFLPPSSKVLTSLRCD---PSSKFEGSHALRSAIPS-------------PSSKVLMRF-VQFLPPNSKVLTRFVAFPSP
         +      +  LPPSSK+L + + +   PSSK   ++ + S  PS              SSK+L+ + +  LPP+SK+L  +   P P
Subjt:  RRFSRA-SLQFLPPSSKVLTSLRCD---PSSKFEGSHALRSAIPS-------------PSSKVLMRF-VQFLPPNSKVLTRFVAFPSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCAGTTCCTTCCTCACAGTT
CGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCAC
TGCAGTTCTTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCTCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGAGTTCCTTCCTCCAAGTTTG
AAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTGCGTTCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGT
TCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCC
TTCCTCAAGTTCGAAGGTTCTCACGCGCTGCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCAC
ACGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTTCTCCAAGTTTGAAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGT
CGCTTCGCTGCGCTCATGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTC
GCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCTCCAAGTTCAAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGT
TCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTT
GCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCG
AAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCTCCAAGTTCAAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCT
GCAATTCCTTCCCCAAGTTCGAAGGTTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATTCGAAG
GTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGT
TCCTTCCTCCAAGTTCAAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTT
CTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTT
CCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCAC
GCGCTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCC
CCAAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCCAAATTCGAAGGTTGTCACGCG
CTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCACTCCAGCGCTACT
TCCTAAAGTCCAAAGACGTCCATTGTCCTCACGCTGCGCTGCTTCCTTCTCCAAGTTCGAGGGTCCTCATGCTACGCTCGGCTACATTGCTGCGCTACTTCCTAAAGTCC
AAAGACGTCAATTGTCCCTGCACTCATGCTGAAAAGGGCATGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCCAACTCAAGGAACATGTTCG
TGCACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCC
AAGGAACATGCCCCAACTCAAGGAACATGTCCGTGCACTCGTACTGGAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACT
CGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGG
AACATGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGCGTGGCGGCG
ACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGACCATGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACG
TCCTTGCACTCGTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCAGTTCCTTCCTCACAGTT
CGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCAC
TGCAGTTCTTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCTCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGAGTTCCTTCCTCCAAGTTTG
AAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTGCGTTCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGT
TCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCC
TTCCTCAAGTTCGAAGGTTCTCACGCGCTGCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCAC
ACGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTTCTCCAAGTTTGAAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGT
CGCTTCGCTGCGCTCATGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTC
GCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCTCCAAGTTCAAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGT
TCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTT
GCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCG
AAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCTCCAAGTTCAAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCT
GCAATTCCTTCCCCAAGTTCGAAGGTTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATTCGAAG
GTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGT
TCCTTCCTCCAAGTTCAAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTT
CTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTT
CCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCAC
GCGCTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCC
CCAAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCCAAATTCGAAGGTTGTCACGCG
CTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCACTCCAGCGCTACT
TCCTAAAGTCCAAAGACGTCCATTGTCCTCACGCTGCGCTGCTTCCTTCTCCAAGTTCGAGGGTCCTCATGCTACGCTCGGCTACATTGCTGCGCTACTTCCTAAAGTCC
AAAGACGTCAATTGTCCCTGCACTCATGCTGAAAAGGGCATGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCCAACTCAAGGAACATGTTCG
TGCACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCC
AAGGAACATGCCCCAACTCAAGGAACATGTCCGTGCACTCGTACTGGAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACT
CGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGG
AACATGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGCGTGGCGGCG
ACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGACCATGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACG
TCCTTGCACTCGTGCTGA
Protein sequenceShow/hide protein sequence
MEVASSKFEGSHALRCSSFPPNSKVLTRFAAVPSSQFEGSHALRCSSFPPSSKVLTSLRCSSFLQVRRFSRVSLQFFPHSSKVLTRFTAVPSPQVRRFSRRFAEFLPPSL
KVLTSLRFAHALRCVLPPSSKVLTRFALQFLPPSSKVLTRFVQFLPPNSKVLTRFAAFPSPQIRRFSRASLQFLPQVRRFSRAALQFLPQSSKVLTRFVAVLSSKFEGSH
TLRCSSFPQVRRFSRASLQFLSPSLKFLPPKFEGSHVASLRSCASLQFLPPSLKVLTSLRCDPSSKFEGSHALRSAIPSPSSKVLTRFAAVPSLQVQRFSHRFAAILPPS
SKVLTRFALQFLPQVRRFSCASCSSFLQIRRFSRASLHFLPPKFEGSHALRCSSFLQVRRFSRASLCNSFPKFEGSHALRCSSFPPSSKVLTSLRCDPSSKFEGSHALRS
AIPSPSSKVLMRFVQFLPPNSKVLTRFVAFPSPQIRRFSRASLQFLPPSSKVLTRFALQFLPQVRRFSRASLQFLPPSSKVLTSLRCDPSSKFEGSHALRSAIPSPSSKV
LMRFVQFLPPNSKVLTRFVAFPSPKFEGSHALRCSSFLQVRRFSRASLCNSFPKVRRFSRASCSSFLQIRRFSRASLHFLPPNSKVLTRFAAVPSSKFEGSHALRSAIPS
PKFEGSHALRAVPSSKFEGSHALRCSSFPPNSKVVTRFAAVPSPQVRRFSRTSLQFLPPSSKVLTRFAALQRYFLKSKDVHCPHAALLPSPSSRVLMLRSATLLRYFLKS
KDVNCPCTHAEKGMAATQVQGTCPNSRNMSQLKEHVRALVLKGVAAAQVQGTCPNSRNTSLQLVLKGVAATQVQGTCPNSRNMSVHSYWKARRRHKSKEHVPTQGTCPCT
RAERRGGGTSPRNMSQLKEHVLAWRRHKSKEHVPTQGTCPCTRAERRGGDTSPRNMSQLKEHVLATRAERRGGDTSPRNMSQLKEHDHALVLKGVAATQVQGTCPNSRNT
SLHSC