; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019466 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019466
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr5:42553041..42554390
RNA-Seq ExpressionLag0019466
SyntenyLag0019466
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]2.1e-3138.6Show/hide
Query:  PSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNLPNF--PYVVFNSMEIAPSEEQLHSALVTCAVEGASWK
        P++++ VI + GW+  CQ P   ++PLVREFYAN+ D       V+   V ++   IN  + L      YV F S     ++EQL   L   A+EGA+W+
Subjt:  PSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNLPNF--PYVVFNSMEIAPSEEQLHSALVTCAVEGASWK

Query:  MARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEIVACS-KRKVGRLFFPNLITALCLRAQVEVDENEEI
        ++     T +   LK  A +W+ F  +R +P+TH  TV+K+RVLL++SIL  +S+++ ++  KEI ACS  RK G L+FP+LIT L L+A V   ++E I
Subjt:  MARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEIVACS-KRKVGRLFFPNLITALCLRAQVEVDENEEI

Query:  LMDKGIIDSASITRL
        + + G I + SI+R+
Subjt:  LMDKGIIDSASITRL

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]4.4e-3740.48Show/hide
Query:  KFVNAAAKKKFEVMLQRDPL-PERGFEAD----FEKLPSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNL
        KF   AA+ ++E  +Q  PL  E+GF  D      +LP +++ VI +  W+  C  PE  ++PLVREFYAN+ D       VRG  V WS   IN  + L
Subjt:  KFVNAAAKKKFEVMLQRDPL-PERGFEAD----FEKLPSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNL

Query:  PNFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEI
         + P    +      +E  L + L T AV GA W ++     T + + L P A VW  F +S LLPTTH  TVSK+R+LL+ S+L   SI+VG+++  EI
Subjt:  PNFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEI

Query:  VACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKGIIDSASITRLHGE
         AC+ RK G LFFP+LIT LC  A+     NEE L + G ID+ ++ R+  E
Subjt:  VACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKGIIDSASITRLHGE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]6.3e-4436.15Show/hide
Query:  KFVNAAAKKKFEVMLQRDPL-PERGFEAD----FEKLPSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNL
        KF   AA  ++E  +Q  PL  E+GF  D      +LP +++ VI +  W+  C  PE  ++PLVREFYAN+ D E     VRG  V WS   IN  + L
Subjt:  KFVNAAAKKKFEVMLQRDPL-PERGFEAD----FEKLPSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNL

Query:  PNFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEI
         + P    +      +++ L + L T A  GA W ++     T + + L P A VW+ F +SRLLPTTH  TVSK+R+LL+ S+L   SI+VG+++  EI
Subjt:  PNFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEI

Query:  VACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKGIIDSASITRLHGE------------KKGRSSTNMVCG--------VEEILRQQR----RLM
         AC+ RK G LFFP+LIT LC  A+     NEE L + G ID+ ++ R+  E            +   +S+N   G        +E+ L QQ      +M
Subjt:  VACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKGIIDSASITRLHGE------------KKGRSSTNMVCG--------VEEILRQQR----RLM

Query:  RQMEHSESQQKTYWQYAHLRDSAMEKTFEFGFEELPQPFPHFP
          ++H+  QQ+ +W Y+  RD+A++K  +  F      FP FP
Subjt:  RQMEHSESQQKTYWQYAHLRDSAMEKTFEFGFEELPQPFPHFP

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]2.0e-3439.08Show/hide
Query:  KFVNAAAKKKFEVMLQRDPLP-ERGFEADFEK---LPSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNLP
        KF + AA+ ++E  +Q  PL  E+ F  D  K    P +++ VI++  WQL C  PE  ++PLVREFY N+ + +     +RG  V  S   IN  ++L 
Subjt:  KFVNAAAKKKFEVMLQRDPLP-ERGFEADFEK---LPSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNLP

Query:  NFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEIV
        + P    +      ++ +L   L T A+ GA W ++     T L + L P A VW+ F +SRLLPTTH  TVSKE V L++S+L   SI+VG+++ +EI 
Subjt:  NFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEIV

Query:  ACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKG
        AC+ RK G LFFP+LIT++C   +     NEE L + G
Subjt:  ACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKG

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]2.0e-3437.73Show/hide
Query:  IPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNLPNFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFAR
        +PLVREFYAN+ D E     VRG  V WS   IN  + L + P    +      +E +L + L T A  GA W ++     T + + L P A VW+ F +
Subjt:  IPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNLPNFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFAR

Query:  SRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEIVACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKGIIDSASITRLHGE---------
        SRLLPTTH   VSK+R+LL+ S+L   SI+VG+++  EI AC+ +K G LFFP+LIT LC  A   V  NEE L + G ID+ ++ R+  E         
Subjt:  SRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEIVACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKGIIDSASITRLHGE---------

Query:  ---KKGRSSTNMVCGVEEILRQQRRL---MRQMEHSESQQKTYWQYAHLRDSAMEKTFEFGFEELPQPFPHFP
           +   +S++   G  ++L+Q + L   + Q EH+  QQ+ +W Y+  RD+A++K  +  F      FP FP
Subjt:  ---KKGRSSTNMVCGVEEILRQQRRL---MRQMEHSESQQKTYWQYAHLRDSAMEKTFEFGFEELPQPFPHFP

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.1e-3740.48Show/hide
Query:  KFVNAAAKKKFEVMLQRDPL-PERGFEAD----FEKLPSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNL
        KF   AA+ ++E  +Q  PL  E+GF  D      +LP +++ VI +  W+  C  PE  ++PLVREFYAN+ D       VRG  V WS   IN  + L
Subjt:  KFVNAAAKKKFEVMLQRDPL-PERGFEAD----FEKLPSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNL

Query:  PNFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEI
         + P    +      +E  L + L T AV GA W ++     T + + L P A VW  F +S LLPTTH  TVSK+R+LL+ S+L   SI+VG+++  EI
Subjt:  PNFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEI

Query:  VACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKGIIDSASITRLHGE
         AC+ RK G LFFP+LIT LC  A+     NEE L + G ID+ ++ R+  E
Subjt:  VACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKGIIDSASITRLHGE

A0A2P5BCG4 Uncharacterized protein (Fragment)3.0e-4436.15Show/hide
Query:  KFVNAAAKKKFEVMLQRDPL-PERGFEAD----FEKLPSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNL
        KF   AA  ++E  +Q  PL  E+GF  D      +LP +++ VI +  W+  C  PE  ++PLVREFYAN+ D E     VRG  V WS   IN  + L
Subjt:  KFVNAAAKKKFEVMLQRDPL-PERGFEAD----FEKLPSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNL

Query:  PNFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEI
         + P    +      +++ L + L T A  GA W ++     T + + L P A VW+ F +SRLLPTTH  TVSK+R+LL+ S+L   SI+VG+++  EI
Subjt:  PNFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEI

Query:  VACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKGIIDSASITRLHGE------------KKGRSSTNMVCG--------VEEILRQQR----RLM
         AC+ RK G LFFP+LIT LC  A+     NEE L + G ID+ ++ R+  E            +   +S+N   G        +E+ L QQ      +M
Subjt:  VACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKGIIDSASITRLHGE------------KKGRSSTNMVCG--------VEEILRQQR----RLM

Query:  RQMEHSESQQKTYWQYAHLRDSAMEKTFEFGFEELPQPFPHFP
          ++H+  QQ+ +W Y+  RD+A++K  +  F      FP FP
Subjt:  RQMEHSESQQKTYWQYAHLRDSAMEKTFEFGFEELPQPFPHFP

A0A2P5DAQ2 Uncharacterized protein9.8e-3539.08Show/hide
Query:  KFVNAAAKKKFEVMLQRDPLP-ERGFEADFEK---LPSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNLP
        KF + AA+ ++E  +Q  PL  E+ F  D  K    P +++ VI++  WQL C  PE  ++PLVREFY N+ + +     +RG  V  S   IN  ++L 
Subjt:  KFVNAAAKKKFEVMLQRDPLP-ERGFEADFEK---LPSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNLP

Query:  NFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEIV
        + P    +      ++ +L   L T A+ GA W ++     T L + L P A VW+ F +SRLLPTTH  TVSKE V L++S+L   SI+VG+++ +EI 
Subjt:  NFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEIV

Query:  ACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKG
        AC+ RK G LFFP+LIT++C   +     NEE L + G
Subjt:  ACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKG

A0A2P5DXM3 Uncharacterized protein9.8e-3537.73Show/hide
Query:  IPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNLPNFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFAR
        +PLVREFYAN+ D E     VRG  V WS   IN  + L + P    +      +E +L + L T A  GA W ++     T + + L P A VW+ F +
Subjt:  IPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNLPNFPYVVFNSMEIAPSEEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFAR

Query:  SRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEIVACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKGIIDSASITRLHGE---------
        SRLLPTTH   VSK+R+LL+ S+L   SI+VG+++  EI AC+ +K G LFFP+LIT LC  A   V  NEE L + G ID+ ++ R+  E         
Subjt:  SRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEIVACSKRKVGRLFFPNLITALCLRAQVEVDENEEILMDKGIIDSASITRLHGE---------

Query:  ---KKGRSSTNMVCGVEEILRQQRRL---MRQMEHSESQQKTYWQYAHLRDSAMEKTFEFGFEELPQPFPHFP
           +   +S++   G  ++L+Q + L   + Q EH+  QQ+ +W Y+  RD+A++K  +  F      FP FP
Subjt:  ---KKGRSSTNMVCGVEEILRQQRRL---MRQMEHSESQQKTYWQYAHLRDSAMEKTFEFGFEELPQPFPHFP

W9QTD9 Uncharacterized protein1.0e-3138.6Show/hide
Query:  PSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNLPNF--PYVVFNSMEIAPSEEQLHSALVTCAVEGASWK
        P++++ VI + GW+  CQ P   ++PLVREFYAN+ D       V+   V ++   IN  + L      YV F S     ++EQL   L   A+EGA+W+
Subjt:  PSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNLPNF--PYVVFNSMEIAPSEEQLHSALVTCAVEGASWK

Query:  MARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEIVACS-KRKVGRLFFPNLITALCLRAQVEVDENEEI
        ++     T +   LK  A +W+ F  +R +P+TH  TV+K+RVLL++SIL  +S+++ ++  KEI ACS  RK G L+FP+LIT L L+A V   ++E I
Subjt:  MARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEIVACS-KRKVGRLFFPNLITALCLRAQVEVDENEEI

Query:  LMDKGIIDSASITRL
        + + G I + SI+R+
Subjt:  LMDKGIIDSASITRL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCCAAAGTTCTAAAAAAGCTATTCAGGCCCAAAGTTGCTGAATCTTCAGCACCATCAAGCAAACCTCAAGATAAAGAAATAGTTGTTTTAGGTGAAGTTGGGCA
ATCTAAGGCCATAAGAAGGAGTCCTAGAAACAAAGGAAAGGGAAAACAAGTGGTGGAGCAAGAAGAACTTCATGAAGAAGTAATTGTGACTCAAGTTCAAGAGCAAGAAG
TTGAGGAAGAAATTGCACCTCCCCCAAGAAGTCCTCCAACAAGAAAGAAGAGCCTAAAGAAAAAGGGAAAAGAGCCAATGCTGGAATTTGATGAGAGCCCTATGGAAAAG
TTTGTTAATGCTGCCGCTAAGAAAAAATTTGAAGTCATGCTCCAAAGAGATCCTCTTCCTGAAAGAGGATTTGAGGCAGATTTTGAAAAGCTTCCCTCATATGTTTCAGG
AGTGATCGTCAAGCTTGGATGGCAATTACTTTGCCAAAAGCCGGAACCAGCTGTTATTCCCTTGGTGAGGGAATTCTATGCAAATGTCCAAGACAACGAGCTTTTCAGAA
CCAAAGTAAGGGGAAAATGGGTGGACTGGTCGCCATTAGTCATCAACGAATTCTACAACCTTCCTAATTTCCCCTATGTTGTTTTCAACTCAATGGAGATTGCTCCATCT
GAAGAGCAGCTCCATTCAGCTCTTGTCACTTGTGCTGTTGAAGGAGCAAGCTGGAAGATGGCAAGAAACTCAGTTCGAACACTATTGGCTGCATACCTTAAGCCCGAAGC
AAACGTTTGGCACACCTTTGCAAGGAGTAGGTTGCTCCCCACTACTCATGACACCACGGTGTCTAAAGAAAGAGTTCTCTTAATTTTTTCCATCTTAAAAATCCTGAGCA
TTGACGTGGGAAAGCTCTTGGCAAAAGAGATTGTGGCATGTTCAAAAAGGAAGGTAGGAAGACTGTTTTTTCCAAATCTGATTACAGCATTATGTTTGAGGGCTCAAGTC
GAGGTGGATGAGAATGAAGAAATTCTAATGGACAAAGGAATCATAGATTCAGCATCTATAACAAGATTACATGGGGAGAAGAAGGGAAGGTCAAGCACGAATATGGTTTG
TGGGGTTGAGGAGATTTTAAGGCAGCAGAGAAGGTTGATGAGGCAAATGGAACACAGTGAGAGTCAGCAAAAGACTTATTGGCAATATGCTCATCTTAGGGACTCTGCCA
TGGAGAAAACATTCGAATTCGGCTTTGAGGAGCTTCCTCAGCCATTTCCTCATTTCCCAACAGGTTTATTTGACCCGTGGTGCCCTTCCCCATCTCCCAGCGGGAATGAA
AATGATGCTGATGATGATCAAGAGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCCAAAGTTCTAAAAAAGCTATTCAGGCCCAAAGTTGCTGAATCTTCAGCACCATCAAGCAAACCTCAAGATAAAGAAATAGTTGTTTTAGGTGAAGTTGGGCA
ATCTAAGGCCATAAGAAGGAGTCCTAGAAACAAAGGAAAGGGAAAACAAGTGGTGGAGCAAGAAGAACTTCATGAAGAAGTAATTGTGACTCAAGTTCAAGAGCAAGAAG
TTGAGGAAGAAATTGCACCTCCCCCAAGAAGTCCTCCAACAAGAAAGAAGAGCCTAAAGAAAAAGGGAAAAGAGCCAATGCTGGAATTTGATGAGAGCCCTATGGAAAAG
TTTGTTAATGCTGCCGCTAAGAAAAAATTTGAAGTCATGCTCCAAAGAGATCCTCTTCCTGAAAGAGGATTTGAGGCAGATTTTGAAAAGCTTCCCTCATATGTTTCAGG
AGTGATCGTCAAGCTTGGATGGCAATTACTTTGCCAAAAGCCGGAACCAGCTGTTATTCCCTTGGTGAGGGAATTCTATGCAAATGTCCAAGACAACGAGCTTTTCAGAA
CCAAAGTAAGGGGAAAATGGGTGGACTGGTCGCCATTAGTCATCAACGAATTCTACAACCTTCCTAATTTCCCCTATGTTGTTTTCAACTCAATGGAGATTGCTCCATCT
GAAGAGCAGCTCCATTCAGCTCTTGTCACTTGTGCTGTTGAAGGAGCAAGCTGGAAGATGGCAAGAAACTCAGTTCGAACACTATTGGCTGCATACCTTAAGCCCGAAGC
AAACGTTTGGCACACCTTTGCAAGGAGTAGGTTGCTCCCCACTACTCATGACACCACGGTGTCTAAAGAAAGAGTTCTCTTAATTTTTTCCATCTTAAAAATCCTGAGCA
TTGACGTGGGAAAGCTCTTGGCAAAAGAGATTGTGGCATGTTCAAAAAGGAAGGTAGGAAGACTGTTTTTTCCAAATCTGATTACAGCATTATGTTTGAGGGCTCAAGTC
GAGGTGGATGAGAATGAAGAAATTCTAATGGACAAAGGAATCATAGATTCAGCATCTATAACAAGATTACATGGGGAGAAGAAGGGAAGGTCAAGCACGAATATGGTTTG
TGGGGTTGAGGAGATTTTAAGGCAGCAGAGAAGGTTGATGAGGCAAATGGAACACAGTGAGAGTCAGCAAAAGACTTATTGGCAATATGCTCATCTTAGGGACTCTGCCA
TGGAGAAAACATTCGAATTCGGCTTTGAGGAGCTTCCTCAGCCATTTCCTCATTTCCCAACAGGTTTATTTGACCCGTGGTGCCCTTCCCCATCTCCCAGCGGGAATGAA
AATGATGCTGATGATGATCAAGAGGATTGA
Protein sequenceShow/hide protein sequence
MAPKVLKKLFRPKVAESSAPSSKPQDKEIVVLGEVGQSKAIRRSPRNKGKGKQVVEQEELHEEVIVTQVQEQEVEEEIAPPPRSPPTRKKSLKKKGKEPMLEFDESPMEK
FVNAAAKKKFEVMLQRDPLPERGFEADFEKLPSYVSGVIVKLGWQLLCQKPEPAVIPLVREFYANVQDNELFRTKVRGKWVDWSPLVINEFYNLPNFPYVVFNSMEIAPS
EEQLHSALVTCAVEGASWKMARNSVRTLLAAYLKPEANVWHTFARSRLLPTTHDTTVSKERVLLIFSILKILSIDVGKLLAKEIVACSKRKVGRLFFPNLITALCLRAQV
EVDENEEILMDKGIIDSASITRLHGEKKGRSSTNMVCGVEEILRQQRRLMRQMEHSESQQKTYWQYAHLRDSAMEKTFEFGFEELPQPFPHFPTGLFDPWCPSPSPSGNE
NDADDDQED