CuGenDBv2

Lag0014480 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0014480
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotransposon gag protein
Genome location: chr12:1233867..1237336
RNA-Seq Expression: Lag0014480
Synteny: Lag0014480
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0016020 - membrane (cellular component)
GO:0008233 - peptidase activity (molecular function)
InterPro domains: NA
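
The genome location above is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span of the locus could be computed as follows. The function name `parse_location` is hypothetical and is not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr12:1233867..1237336")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = end - start + 1
print(chrom, span)
```

Under that assumption the locus covers 3470 bp on chr12.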


Homology
GenBank top hits (e value, % identity, alignment)
CAE6523816.1 Tir4p [Saccharomyces cerevisiae PE-2] (e value: 4.4e-11; % identity: 30.49)
Query:  SSLSQEPKLHDAPSPHELKSEWKVLPPSSKVPTRFAAVPSPKFEGSHALRCSSFPPSSKVLTSLRCSSFLQVRRFSRRFAVVP-SSKFEGSHIASLRSRA
        SS S+      APS  E+ S   V P SS+V +   A  S     S  +  S  P SS+V++S   SS  +V   S    V P SS+   S +AS  S  
Subjt:  SSLSQEPKLHDAPSPHELKSEWKVLPPSSKVPTRFAAVPSPKFEGSHALRCSSFPPSSKVLTSLRCSSFLQVRRFSRRFAVVP-SSKFEGSHIASLRSRA

Query:  SLRSFLQVRRFSRASLCNSFPKFEGSHALRAVPSSNSKVLTRFGEVPSSKSEGFHALRYSSFPQVRRFSRASLQFLPHSSKVLTCFAAVPSPKFEGSHVA
        +  S   V   S   + +S           +V  S+S+V++    V  S SE   +   SS  +V     AS    P SS+V++   A  S     S VA
Subjt:  SLRSFLQVRRFSRASLCNSFPKFEGSHALRAVPSSNSKVLTRFGEVPSSKSEGFHALRYSSFPQVRRFSRASLQFLPHSSKVLTCFAAVPSPKFEGSHVA

Query:  SLQFLPPNLKVLTRFDTVPSSR---SSKFLPPSSKVLAAQFLPPSSKVLMRFVATFLQVRRFSHALLQLLPPSSKVPSRASLAPSPSSKALLSTASSPSS
        S    P + +V++     PSS    SS   P SS+V+++   P SS+V+   VA+                 SS   + +S+APS S     S ASS S 
Subjt:  SLQFLPPNLKVLTRFDTVPSSR---SSKFLPPSSKVLAAQFLPPSSKVLMRFVATFLQVRRFSHALLQLLPPSSKVPSRASLAPSPSSKALLSTASSPSS

Query:  KALLSVATFLQVRRFSHALLQFLPPSSKVPSRASLAPTPSSKALLSTAPSPSSKALLSVATFLQVRRFSHALLQFLPPSSKVLSRVAAVPSSKFKGSLTR
         A  SVA        S  +   + PSS     +S+AP+ S     S APS S     SVA+         A     P SS+V+S  ++V SS  + + + 
Subjt:  KALLSVATFLQVRRFSHALLQFLPPSSKVPSRASLAPTPSSKALLSTAPSPSSKALLSVATFLQVRRFSHALLQFLPPSSKVLSRVAAVPSSKFKGSLTR

Query:  FTRSFSKFEGASLRCYLPPSSKVLSRVLQFLPPSSKVPSRASLAPSPSSKALLSVA-TFLQVRRFSHALLQFLPPNLKVPSR----ASLAPSPSSKALLS
           S S+   +S+    P SS+V+S      P SS+V S +S+APS S     SVA +  +V   S A       +  V S     AS + +PSS  ++S
Subjt:  FTRSFSKFEGASLRCYLPPSSKVLSRVLQFLPPSSKVPSRASLAPSPSSKALLSVA-TFLQVRRFSHALLQFLPPNLKVPSR----ASLAPSPSSKALLS

Query:  TAPSPSSKALLSTATSPSSKALLSTAPSPSSKALLSATPSPSSKALLSAAPSPSSKALLSTAPSPSSKALLSTAPSPSSKVLLSTPI
        ++ +PSS  ++S++ +PSS  ++S++ +PSS  ++S++ +PSS  ++S++ +PSS  ++S++ +PSS  ++S++ +PSS  ++S+ +
Subjt:  TAPSPSSKALLSTATSPSSKALLSTAPSPSSKALLSATPSPSSKALLSAAPSPSSKALLSTAPSPSSKALLSTAPSPSSKVLLSTPI

KAA0050345.1 gag protease polyprotein [Cucumis melo var. makuwa] (e value: 1.7e-07; % identity: 65)
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS
        ST TR SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS
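
On this page the `Subjt:` line repeats the query text, so the match information sits in the middle line of each alignment. As a rough sketch, assuming the usual BLAST-style convention that a letter in the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch, the percent identity for the alignment above could be recomputed like this. The function name `percent_identity` is illustrative only.

```python
def percent_identity(midline, aligned_length):
    """Percentage of aligned columns whose midline character is a letter."""
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Query and middle line copied from the KAA0050345.1 alignment.
query = "STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS"
midline = "ST TR SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS"

print(round(percent_identity(midline, len(query)), 2))
```

For this alignment the middle line has 39 letters over 60 columns, which agrees with the identity of 65 reported for the hit.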

KAA0056218.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 1.8e-12; % identity: 46.09)
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSGSRHDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSE
        ST TR SAF+RLS+STSKK R ST VFDRLK+TNDQ +R+M  L+ K F E N D K+H+       +  ++++         +EPKLH APSP ELKS 
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSGSRHDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSE

Query:  WKVLPPSSKVPTRFAAVPSPKFEGSHAL
          +  P  +   +FA+  SP  EG+ +L
Subjt:  WKVLPPSSKVPTRFAAVPSPKFEGSHAL

KAA0065966.1 hypothetical protein E6C27_scaffold62G00430 [Cucumis melo var. makuwa] (e value: 5.8e-11; % identity: 44.12)
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSGSRHDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSE
        ST TR SAF+RLS+STSKK R STS FDR K+TN+Q +R++ +L+ KLF E N D K+HS       +  ++++        +++PKLH APSP ELKS 
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSGSRHDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSE

Query:  WKVLPPSSKVPTRFAAVPSPKFEGSHALRCSSFPPS
            PP +     F+   SPK      L+  S PPS
Subjt:  WKVLPPSSKVPTRFAAVPSPKFEGSHALRCSSFPPS

TYK26726.1 hypothetical protein E5676_scaffold124G00100 [Cucumis melo var. makuwa] (e value: 6.0e-08; % identity: 40.94)
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKL-FDEVNSDKKLHSGSRHDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKS
        STS + SAF+RLS+ST KK R STS FD LK+ +D+ +R+M  L+ KL ++E N D K+HS     +   M +  +F      ++EPKLH APSP ELK 
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKL-FDEVNSDKKLHSGSRHDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKS

Query:  EWKVLPPSSKVPTRFAAVPSPKFEGSH
               S+K+ +R   +   + +G H
Subjt:  EWKVLPPSSKVPTRFAAVPSPKFEGSH

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U9R6 Gag protease polyprotein (e value: 8.4e-08; % identity: 65)
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS
        ST TR SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS

A0A5A7UM99 Ty3-gypsy retrotransposon protein (e value: 8.7e-13; % identity: 46.09)
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSGSRHDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSE
        ST TR SAF+RLS+STSKK R ST VFDRLK+TNDQ +R+M  L+ K F E N D K+H+       +  ++++         +EPKLH APSP ELKS 
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSGSRHDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSE

Query:  WKVLPPSSKVPTRFAAVPSPKFEGSHAL
          +  P  +   +FA+  SP  EG+ +L
Subjt:  WKVLPPSSKVPTRFAAVPSPKFEGSHAL

A0A5A7VHY3 Uncharacterized protein (e value: 2.8e-11; % identity: 44.12)
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSGSRHDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSE
        ST TR SAF+RLS+STSKK R STS FDR K+TN+Q +R++ +L+ KLF E N D K+HS       +  ++++        +++PKLH APSP ELKS 
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSGSRHDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSE

Query:  WKVLPPSSKVPTRFAAVPSPKFEGSHALRCSSFPPS
            PP +     F+   SPK      L+  S PPS
Subjt:  WKVLPPSSKVPTRFAAVPSPKFEGSHALRCSSFPPS

A0A5D3DSN6 Uncharacterized protein (e value: 2.9e-08; % identity: 40.94)
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKL-FDEVNSDKKLHSGSRHDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKS
        STS + SAF+RLS+ST KK R STS FD LK+ +D+ +R+M  L+ KL ++E N D K+HS     +   M +  +F      ++EPKLH APSP ELK 
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKL-FDEVNSDKKLHSGSRHDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKS

Query:  EWKVLPPSSKVPTRFAAVPSPKFEGSH
               S+K+ +R   +   + +G H
Subjt:  EWKVLPPSSKVPTRFAAVPSPKFEGSH

A0A5D3DZF3 Retrotransposon gag protein (e value: 8.4e-08; % identity: 64.91)
Query:  TRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS
        TR SAF+RLS+STSKK R STS FDRLK+ NDQ +R+M +L+ KLF E N D K+HS
Subjt:  TRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCAAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCA
ACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTGGATCTCGTCACGATCTGATCAAGACCATGACAAAGA
TAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGCGAATGGAAGGTGCTTCCTCCAAGTTCGAAG
GTTCCCACGCGCTTCGCTGCAGTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAG
TTCCTTCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGTAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGGTCACGCGCTTCGCTGCGTTCCT
TCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAATTCGAAGGTTCTCACG
CGCTTCGGTGAAGTTCCTTCCTCCAAGTCTGAAGGTTTTCACGCACTTCGCTACAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCA
CAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAATTCCTTCCTCCAAATTTGAAGGTTCTCACGCGCT
TCGATACAGTTCCTTCCTCCCGAAGTTCGAAGTTCCTTCCTCCGAGTTCGAAGGTTCTCGCTGCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCT
ACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCT
TCTCTCCACTGCTTCTTCTCCAAGTTCGAAGGCGCTTCTCTCTGTTGCTACCTTCCTCCAAGTGCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGA
AGGTTCCCTCACGCGCTTCGCTCGCTCCTACTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAA
GTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGTTGCTGCAGTTCCTTCCTCCAAGTTTAAAGGTTCCCTCACGCGCTT
CACTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTC
CCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCA
AATTTGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTACTTC
TCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTACTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTGCTCCTTCTC
CAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTATTTTGAAG
TTGACGGCGTCCGCTTCGCTTCATCTTCAAAAATTGACTGTTGATAACTTCACTTCATATTCAAAAGTTGACGGAAACTACAGTCATCAAAGTGGCTGGTCTAGACAGGT
GGTGAAGTCACTGCAATTGAATCTGATGACGATCGTTGAAGGCGAGTCGGGTCTGGTGACCACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAG
CAGGAGTGCATGAAGGCGAATCTGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGCGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTC
AGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCC
CTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATCGCTGTAG
mRNA sequence
ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCAAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCA
ACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTGGATCTCGTCACGATCTGATCAAGACCATGACAAAGA
TAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGCGAATGGAAGGTGCTTCCTCCAAGTTCGAAG
GTTCCCACGCGCTTCGCTGCAGTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAG
TTCCTTCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGTAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGGTCACGCGCTTCGCTGCGTTCCT
TCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAATTCGAAGGTTCTCACG
CGCTTCGGTGAAGTTCCTTCCTCCAAGTCTGAAGGTTTTCACGCACTTCGCTACAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCA
CAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAATTCCTTCCTCCAAATTTGAAGGTTCTCACGCGCT
TCGATACAGTTCCTTCCTCCCGAAGTTCGAAGTTCCTTCCTCCGAGTTCGAAGGTTCTCGCTGCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCT
ACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCT
TCTCTCCACTGCTTCTTCTCCAAGTTCGAAGGCGCTTCTCTCTGTTGCTACCTTCCTCCAAGTGCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGA
AGGTTCCCTCACGCGCTTCGCTCGCTCCTACTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAA
GTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGTTGCTGCAGTTCCTTCCTCCAAGTTTAAAGGTTCCCTCACGCGCTT
CACTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTC
CCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCA
AATTTGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTACTTC
TCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTACTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTGCTCCTTCTC
CAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTATTTTGAAG
TTGACGGCGTCCGCTTCGCTTCATCTTCAAAAATTGACTGTTGATAACTTCACTTCATATTCAAAAGTTGACGGAAACTACAGTCATCAAAGTGGCTGGTCTAGACAGGT
GGTGAAGTCACTGCAATTGAATCTGATGACGATCGTTGAAGGCGAGTCGGGTCTGGTGACCACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAG
CAGGAGTGCATGAAGGCGAATCTGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGCGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTC
AGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCC
CTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATCGCTGTAG
Protein sequence
MSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSGSRHDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSEWKVLPPSSK
VPTRFAAVPSPKFEGSHALRCSSFPPSSKVLTSLRCSSFLQVRRFSRRFAVVPSSKFEGSHIASLRSRASLRSFLQVRRFSRASLCNSFPKFEGSHALRAVPSSNSKVLT
RFGEVPSSKSEGFHALRYSSFPQVRRFSRASLQFLPHSSKVLTCFAAVPSPKFEGSHVASLQFLPPNLKVLTRFDTVPSSRSSKFLPPSSKVLAAQFLPPSSKVLMRFVA
TFLQVRRFSHALLQLLPPSSKVPSRASLAPSPSSKALLSTASSPSSKALLSVATFLQVRRFSHALLQFLPPSSKVPSRASLAPTPSSKALLSTAPSPSSKALLSVATFLQ
VRRFSHALLQFLPPSSKVLSRVAAVPSSKFKGSLTRFTRSFSKFEGASLRCYLPPSSKVLSRVLQFLPPSSKVPSRASLAPSPSSKALLSVATFLQVRRFSHALLQFLPP
NLKVPSRASLAPSPSSKALLSTAPSPSSKALLSTATSPSSKALLSTAPSPSSKALLSATPSPSSKALLSAAPSPSSKALLSTAPSPSSKALLSTAPSPSSKVLLSTPILK
LTASASLHLQKLTVDNFTSYSKVDGNYSHQSGWSRQVVKSLQLNLMTIVEGESGLVTTPAGYSDHPIKWGLGLAGVHEGESGYSDHPIKWGLGLAGAHEGESGDYPCRLL
RSPNKMGTGSSRSYSDHPIKWGLGLAGVHEGESGDYPCRLLRSPNKMGTGSSRSASL
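
The protein sequence should follow from the CDS sequence by codon-wise translation. The sketch below, assuming the standard nuclear genetic code (the page does not name the translation table it used), translates the first 36 and the last 12 nucleotides of the CDS listed above so the result can be compared with the start and end of the protein sequence. The names `CODON_TABLE` and `translate` are introduced here for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nt and last 12 nt of the CDS sequence shown above.
print(translate("ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGG"))
print(translate("GCATCGCTGTAG"))
```

The two printed fragments, `MSTSTRPSAFQR` and `ASL`, match the beginning and the end of the listed protein sequence. Passing the full CDS, with line breaks removed, would be expected to reproduce the whole protein in the same way.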