; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010551 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010551
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF4283 domain-containing protein
Genome locationchr1:1053795..1059478
RNA-Seq ExpressionLag0010551
SyntenyLag0010551
Gene Ontology termsGO:0043666 - regulation of phosphoprotein phosphatase activity (biological process)
GO:0019903 - protein phosphatase binding (molecular function)
InterPro domainsIPR007587 - SIT4 phosphatase-associated protein family
IPR025558 - Domain of unknown function DUF4283


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0062494.1 uncharacterized protein E6C27_scaffold130G00900 [Cucumis melo var. makuwa]4.3e-1943.31Show/hide
Query:  HSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKIDNKCEFSLKFGDIKALDDRNLNFVS
        HS+   +K Y GW+ IKNLPLD W R +FE IG +L GL+ I+ +TLNL +CS A I+V+KN CGF+ +   +    +    L FGD + L   N    S
Subjt:  HSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKIDNKCEFSLKFGDIKALDDRNLNFVS

Query:  SRRLSSDEFSNSLDVIRIRQAVLDDDL
           +  D+F NS+D ++IR  +LD+DL
Subjt:  SRRLSSDEFSNSLDVIRIRQAVLDDDL

KAA0063414.1 uncharacterized protein E6C27_scaffold508G00510 [Cucumis melo var. makuwa]4.6e-2140.49Show/hide
Query:  PSVNGKWKLFGNLHLKLEFWSSDFHSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKID
        P+  GK   +  +H   E    DF S    +K YGGWI+IKNLPLD W  D ++AIG   GG  SIS KT+NL++CS+A I+V +N CGF+PA + ++  
Subjt:  PSVNGKWKLFGNLHLKLEFWSSDFHSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKID

Query:  NKCEFSLKFGDIKALDDRNLNFVSSRRLSSDEFSNSLDVIRIRQAVLDDDLILGNEEERMNEL
         +    L FGDIK L+      V    L     +N +D++RI Q +LD+    G E   M +L
Subjt:  NKCEFSLKFGDIKALDDRNLNFVSSRRLSSDEFSNSLDVIRIRQAVLDDDLILGNEEERMNEL

TYJ98837.1 putative 3,4-dihydroxy-2-butanone kinase isoform X5 [Cucumis melo var. makuwa]5.5e-2249.56Show/hide
Query:  GKWKLFGNLHLKLEFWSSDFHSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKIDNKC-
        GKW+ +GN HLK+E W +  HS P   K YGGW+ IKNLPLD W R + E IG +  GL  I+ +TLNL + S+A I+V+KN CGF+P+ I +  D KC 
Subjt:  GKWKLFGNLHLKLEFWSSDFHSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKIDNKC-

Query:  EFSLKFGDIKALD
           L FGD + L+
Subjt:  EFSLKFGDIKALD

XP_031738083.1 uncharacterized protein LOC116402658 [Cucumis sativus]2.3e-2043.31Show/hide
Query:  HSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKIDNKCEFSLKFGDIKALDDRNLNFVS
        HS+P  +K YGGW+ +KNLPLDLW R  FEAIG + GG + I+++TLNL +CS+A I+V++N CGF+ +   +    +    L FG+ + L         
Subjt:  HSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKIDNKCEFSLKFGDIKALDDRNLNFVS

Query:  SRRLSS--DEFSNSLDVIRIRQAVLDD
        SR+  S  D + NSLD +RIR+A+ D+
Subjt:  SRRLSS--DEFSNSLDVIRIRQAVLDD

XP_038904899.1 uncharacterized protein LOC120091119 isoform X2 [Benincasa hispida]3.5e-2938.92Show/hide
Query:  DDKALIQVADYSLDPSVN--GKWKLFGNLHLKLEFWSSDFHSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVE
        D+   + +   SLD  VN  GKW+ FG+ HLK E W++  H +P +++ YGGWI+IKNLPLD W + +FEAIGK  GGL SI+ + LNL+    A I+V+
Subjt:  DDKALIQVADYSLDPSVN--GKWKLFGNLHLKLEFWSSDFHSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVE

Query:  KNYCGFIPADIHVKIDNKCEFSLKFGDIKALDDRNLNFVSSRRLSSDEFSNSLDVIRIRQAVLDDDLILGNEEERMNELPSCSRS
        +N CGF+PA I V  + +    L FGDI      N        L   +F+N +D+IR+ +    + +       + N L S +R+
Subjt:  KNYCGFIPADIHVKIDNKCEFSLKFGDIKALDDRNLNFVSSRRLSSDEFSNSLDVIRIRQAVLDDDLILGNEEERMNELPSCSRS

TrEMBL top hitse value%identityAlignment
A0A5A7U6A9 Uncharacterized protein5.2e-1841.67Show/hide
Query:  KSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKIDNKCEFSLKFGDIKALDDRNLNFVSSRRLSSD
        + YGGW+ IKNLPLD W R + E I  + GGL   +S+TLNL + ++A+I+V+KN CGF+P  I +    +    L FGD + L+    + V S     D
Subjt:  KSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKIDNKCEFSLKFGDIKALDDRNLNFVSSRRLSSD

Query:  EFSNSLDVIRIRQAVLDDDL
        +F+NS+D +R+R  +LD+D+
Subjt:  EFSNSLDVIRIRQAVLDDDL

A0A5A7V878 DUF4283 domain-containing protein2.2e-2140.49Show/hide
Query:  PSVNGKWKLFGNLHLKLEFWSSDFHSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKID
        P+  GK   +  +H   E    DF S    +K YGGWI+IKNLPLD W  D ++AIG   GG  SIS KT+NL++CS+A I+V +N CGF+PA + ++  
Subjt:  PSVNGKWKLFGNLHLKLEFWSSDFHSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKID

Query:  NKCEFSLKFGDIKALDDRNLNFVSSRRLSSDEFSNSLDVIRIRQAVLDDDLILGNEEERMNEL
         +    L FGDIK L+      V    L     +N +D++RI Q +LD+    G E   M +L
Subjt:  NKCEFSLKFGDIKALDDRNLNFVSSRRLSSDEFSNSLDVIRIRQAVLDDDLILGNEEERMNEL

A0A5D3BI91 Putative 3,4-dihydroxy-2-butanone kinase isoform X52.6e-2249.56Show/hide
Query:  GKWKLFGNLHLKLEFWSSDFHSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKIDNKC-
        GKW+ +GN HLK+E W +  HS P   K YGGW+ IKNLPLD W R + E IG +  GL  I+ +TLNL + S+A I+V+KN CGF+P+ I +  D KC 
Subjt:  GKWKLFGNLHLKLEFWSSDFHSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKIDNKC-

Query:  EFSLKFGDIKALD
           L FGD + L+
Subjt:  EFSLKFGDIKALD

A0A5D3DKV0 DUF4283 domain-containing protein1.1e-1725.94Show/hide
Query:  NGKWKLFGNLHLKLEFWSSDFHSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKIDNKC
        N  W   G   +K E WSS +H+ PK I SYGGW   + +PL LW+  +F+ +GK  GGL+ ++ +T +  +  KA I+V  NY GF+PA++ +  +   
Subjt:  NGKWKLFGNLHLKLEFWSSDFHSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKIDNKC

Query:  EFSLK---FGDIKALDDRNL----NFVSSRRLSSDEFSNSLDVIRIR--QAVLDDDLILGNEEERMN---ELPSCSRSQGKINEVLGSPKGALLHEEGIN
        +FS++     + K L +RN+     F      + DEF+   +       +A+  D L   ++  + N   + P+      K + V  SP  + L+EE +N
Subjt:  EFSLK---FGDIKALDDRNL----NFVSSRRLSSDEFSNSLDVIRIR--QAVLDDDLILGNEEERMN---ELPSCSRSQGKINEVLGSPKGALLHEEGIN

Query:  NIGCMG-FNDSTQEMDPILSPFNVINDNISTRPKEFQQPLLIDYSPKNINASIGKVG-NQPTTSVSIINSIENDYSPTNINASIGKVGNQPAASVSIINS
        +       N S  E+ P +S     ND +  + K+     L   S  N++ S  KV  N P    +I N    D +P N + S+     +      +  S
Subjt:  NIGCMG-FNDSTQEMDPILSPFNVINDNISTRPKEFQQPLLIDYSPKNINASIGKVG-NQPTTSVSIINSIENDYSPTNINASIGKVGNQPAASVSIINS

Query:  IENEYIQQAALKTYSRKKGSRLVKQFNANVSEISEALTEGELHESQNLLFTPI------HDPPLVLKNCNEDGLEDKERIVSKALKKQYESFPLYYSRRK
           + IQ  A    + KKG  L      ++  +    +  + H S N     I       + P +    NE+     E   +   K ++     YY R+K
Subjt:  IENEYIQQAALKTYSRKKGSRLVKQFNANVSEISEALTEGELHESQNLLFTPI------HDPPLVLKNCNEDGLEDKERIVSKALKKQYESFPLYYSRRK

Query:  NEKTTILDSIPINSNYNPDVIEESYDDSVVSISS--AEAENQFLNDENNEL
         EK    DS          + E     S V+ SS    + N  +N  N+ L
Subjt:  NEKTTILDSIPINSNYNPDVIEESYDDSVVSISS--AEAENQFLNDENNEL

A0A5D3DVS9 Uncharacterized protein2.1e-1943.31Show/hide
Query:  HSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKIDNKCEFSLKFGDIKALDDRNLNFVS
        HS+   +K Y GW+ IKNLPLD W R +FE IG +L GL+ I+ +TLNL +CS A I+V+KN CGF+ +   +    +    L FGD + L   N    S
Subjt:  HSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADIHVKIDNKCEFSLKFGDIKALDDRNLNFVS

Query:  SRRLSSDEFSNSLDVIRIRQAVLDDDL
           +  D+F NS+D ++IR  +LD+DL
Subjt:  SRRLSSDEFSNSLDVIRIRQAVLDDDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G07990.1 SIT4 phosphatase-associated family protein5.3e-0746.05Show/hide
Query:  DCKVLEMC---CSASACSEVHSNAAGLLCAISRFSPLGLLAKISSTSFVGSLVRHALEDSHPTFVLINSLSVVRSL
        D  +LEM       S+  EV +NAA  LCAISR +P  L  ++SS  +V  +  HALEDSH    L++SLSV  SL
Subjt:  DCKVLEMC---CSASACSEVHSNAAGLLCAISRFSPLGLLAKISSTSFVGSLVRHALEDSHPTFVLINSLSVVRSL

AT1G30470.1 SIT4 phosphatase-associated family protein3.5e-1151.32Show/hide
Query:  DCKVLEMCC---SASACSEVHSNAAGLLCAISRFSPLGLLAKISSTSFVGSLVRHALEDSHPTFVLINSLSVVRSL
        D  VLEM      +S   EVH+NAA +LC ++R++P GL  K+SS S  G L++H LEDS P  VL+NSLSV  SL
Subjt:  DCKVLEMCC---SASACSEVHSNAAGLLCAISRFSPLGLLAKISSTSFVGSLVRHALEDSHPTFVLINSLSVVRSL

AT1G30470.2 SIT4 phosphatase-associated family protein3.5e-1151.32Show/hide
Query:  DCKVLEMCC---SASACSEVHSNAAGLLCAISRFSPLGLLAKISSTSFVGSLVRHALEDSHPTFVLINSLSVVRSL
        D  VLEM      +S   EVH+NAA +LC ++R++P GL  K+SS S  G L++H LEDS P  VL+NSLSV  SL
Subjt:  DCKVLEMCC---SASACSEVHSNAAGLLCAISRFSPLGLLAKISSTSFVGSLVRHALEDSHPTFVLINSLSVVRSL

AT1G30470.3 SIT4 phosphatase-associated family protein3.5e-1151.32Show/hide
Query:  DCKVLEMCC---SASACSEVHSNAAGLLCAISRFSPLGLLAKISSTSFVGSLVRHALEDSHPTFVLINSLSVVRSL
        D  VLEM      +S   EVH+NAA +LC ++R++P GL  K+SS S  G L++H LEDS P  VL+NSLSV  SL
Subjt:  DCKVLEMCC---SASACSEVHSNAAGLLCAISRFSPLGLLAKISSTSFVGSLVRHALEDSHPTFVLINSLSVVRSL

AT2G28360.1 SIT4 phosphatase-associated family protein1.5e-0644.74Show/hide
Query:  DCKVLEMC---CSASACSEVHSNAAGLLCAISRFSPLGLLAKISSTSFVGSLVRHALEDSHPTFVLINSLSVVRSL
        D  +LEM     + S+  EV +NAA  LCAI+R +P  L  K+SS  FV  +  HA+EDSH    L++SL+V  SL
Subjt:  DCKVLEMC---CSASACSEVHSNAAGLLCAISRFSPLGLLAKISSTSFVGSLVRHALEDSHPTFVLINSLSVVRSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGATAAAGCTTTGATTCAAGTGGCTGATTATAGTTTGGATCCTTCTGTGAATGGTAAGTGGAAACTATTCGGGAACCTTCATTTGAAATTGGAATTTTGGTCCTC
TGATTTTCATTCCCAGCCAAAATTTATAAAAAGTTATGGAGGATGGATTGCAATAAAAAATTTACCTTTGGATTTGTGGCATCGGGACTCCTTTGAAGCTATTGGAAAAA
ACCTTGGTGGGTTGGTTAGTATTTCTTCCAAGACTCTCAATTTATTGGACTGTTCGAAAGCTTTTATTGAAGTAGAAAAAAATTATTGTGGATTTATCCCTGCTGATATT
CATGTTAAGATTGACAATAAATGTGAATTCTCTTTAAAATTTGGGGATATTAAGGCACTAGATGATAGAAATTTGAACTTTGTTTCAAGTAGAAGGTTATCTTCTGATGA
ATTTTCAAATTCCTTAGATGTTATTAGGATCAGGCAAGCTGTTTTGGATGATGATTTGATTCTGGGCAATGAAGAGGAAAGGATGAATGAATTGCCTTCTTGTTCTCGGT
CTCAGGGAAAAATTAATGAGGTGTTGGGTTCTCCAAAGGGTGCTTTGTTGCATGAAGAGGGCATTAATAACATTGGTTGTATGGGCTTTAATGATAGCACTCAAGAAATG
GATCCTATTCTCTCTCCTTTTAATGTTATTAATGATAACATTTCCACCCGCCCTAAGGAATTCCAGCAGCCACTGCTGATTGATTATTCTCCTAAAAATATTAATGCTAG
CATTGGAAAAGTGGGTAATCAGCCAACAACTTCCGTAAGTATTATTAATAGCATTGAAAATGATTATTCTCCTACAAATATTAATGCTAGCATTGGAAAAGTGGGTAATC
AACCAGCAGCTTCTGTAAGTATTATTAATAGCATTGAAAATGAGTATATCCAGCAGGCAGCTTTAAAGACTTACTCTCGGAAAAAGGGGTCTCGATTAGTGAAGCAGTTT
AATGCCAATGTTTCAGAAATTAGTGAAGCATTAACTGAAGGAGAATTGCATGAGTCCCAGAATTTATTATTCACGCCTATTCATGATCCACCTTTGGTTTTGAAGAATTG
TAATGAAGATGGTTTGGAAGATAAGGAACGGATTGTTTCTAAGGCTCTAAAGAAACAATATGAATCTTTTCCTCTTTATTATTCTCGTCGGAAAAATGAAAAGACAACAA
TTTTGGATTCAATTCCTATTAATTCCAATTATAACCCCGACGTGATTGAAGAATCTTATGATGATTCAGTGGTGAGTATTAGTAGTGCCGAGGCTGAAAATCAGTTTTTG
AATGATGAAAACAATGAATTATTGAAGGAATACTCTTTTGCATTGGCTTTAAATCGGATTTTCCAGAACAATGAAGCTGTTTTTGAAGTTCAGATGAATGAGTCTAAGGT
GCCATTTAATTCTAGTCATTGGGAGGATGTTAATGGGGACTTGGTTATCTCAAAGGACACCTCGGTGCATGAAGAAAGAATTAATTGTGATGGTTGTAATGATCCTCCAC
CCAAGATGATTAATGATGATAGTTGTAAGGTGATTAATGAATTTCAGCAGATTTCTAGTGAGAAAGATCAAGTTAATGAGGCGTTGGGGTTTCCAAAAGATGCTTCATTG
CATGATAAGAGTTTTAATCATGTTGATTGTAAAGTGCTGGAAATGTGTTGTTCAGCTTCAGCTTGTTCAGAAGTTCATTCTAATGCAGCAGGATTACTTTGTGCTATTTC
TCGATTTTCTCCTCTTGGTCTTTTGGCTAAAATTTCCAGCACAAGTTTTGTAGGAAGTTTGGTTCGCCATGCCTTAGAAGATTCTCATCCAACGTTTGTTTTAATAAACT
CATTATCAGTGGTTAGATCCTTGGAGGCTATCTACTGGATGATCTTTTGTATACTCCGGCCAAATTGCCCTTGGATATTCTGTTTCAGCAATTATGGAGGCAGTGGATGG
CAGGTTAGAGAGTTTAAGTTGGTTGCTCAAGCTTCTGGATGTTTCTTCAGTTACTACATTTGGAAAGTCACAACCACCTCTTGGAAACACCGTCTGAAGCTCGTCTCAAA
TCTTTGTTATGGAGTCCGTTGTCCAAGATTTATTCAGCATTCTCAATTCAGAATATTTGCCTTTTTTGGGATGCTTTCATTTCTCCATCCTAAGTCTTCATTAGAAGTCT
TTAATCATGAATTTGTGGAAGAAAAACGCACTAAATTTTTCGATCTATCAGTTAAGGTTTCCACCCAGCGGAGTAGTGCTCTAAATGAGGAGATTTTCGAGCTCAGCTCT
CAATCTTTGGTCCTCAATTCATCTCCTCCTTTTGTTGATAATCATCTTTCAAACGGTTTTATTAATTCAACAAATGGGACAATCAACAAGGCACATTTTTTGGAGAATGA
AGTGGAGACAGACAAGGATTCCATGGTGAGTATTAGTAGTGTTGACACGGAGTTCAATTTTCCATTTGTCATTGAGCAAGAGAAGATTTCTTCTGAAGTTGCTATTGAAG
GGGCCCTAAATAATCTTTTCCAAGAACCTGATCCTCCTTCCAAGAAGATTGAGAAAGAAATATGCAAGTTGGAAACTCATGGACACTTTACCAGTAGCTCCCTCATTGGT
GGTCTCTCCCGCACAAACGAAGATGAATCGCCACAGCTATTTGAATTGATTTGTAAAGGCCAGTTAAGGAAGAAGGATACCTTACACATTGGTATCAGAACACGTCAAAT
CCTGGGGAAAACACGCGCAATGGCTCAAAAACAGACAGAGGAATGCCTGGAGAACCAGGAAAGGCAGATGGGAGAGTTTAAGGAGACCATACAGCTGTTCGGGAAACACC
TTGAACAGTTGACGCTCGAGATGAAGGAGAATCAGCGGGCGATTATGGCCCTGATGGCAGGACAGAATGTGGGACGAGAGCAGTCGTCTTCGGAAGTTACTGAGTCCTCG
AGTAAGCAACTTAAGGCCTTAGAGAAGGAACCAGAGAATAAGGAAGAGAAAACAGAGGCCATCGAGGTTAACCGCGTGCAAAGGGAAATACAAGGGGAGACTGTTGCAAG
TTTAAGAAGCTCGAGATGCCGGTTTTTGCAGGGGAGAAGCCGGATTCGTGGCTCTTTCGTGCGGAGCGATATTTCGAGATCCACCAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGATAAAGCTTTGATTCAAGTGGCTGATTATAGTTTGGATCCTTCTGTGAATGGTAAGTGGAAACTATTCGGGAACCTTCATTTGAAATTGGAATTTTGGTCCTC
TGATTTTCATTCCCAGCCAAAATTTATAAAAAGTTATGGAGGATGGATTGCAATAAAAAATTTACCTTTGGATTTGTGGCATCGGGACTCCTTTGAAGCTATTGGAAAAA
ACCTTGGTGGGTTGGTTAGTATTTCTTCCAAGACTCTCAATTTATTGGACTGTTCGAAAGCTTTTATTGAAGTAGAAAAAAATTATTGTGGATTTATCCCTGCTGATATT
CATGTTAAGATTGACAATAAATGTGAATTCTCTTTAAAATTTGGGGATATTAAGGCACTAGATGATAGAAATTTGAACTTTGTTTCAAGTAGAAGGTTATCTTCTGATGA
ATTTTCAAATTCCTTAGATGTTATTAGGATCAGGCAAGCTGTTTTGGATGATGATTTGATTCTGGGCAATGAAGAGGAAAGGATGAATGAATTGCCTTCTTGTTCTCGGT
CTCAGGGAAAAATTAATGAGGTGTTGGGTTCTCCAAAGGGTGCTTTGTTGCATGAAGAGGGCATTAATAACATTGGTTGTATGGGCTTTAATGATAGCACTCAAGAAATG
GATCCTATTCTCTCTCCTTTTAATGTTATTAATGATAACATTTCCACCCGCCCTAAGGAATTCCAGCAGCCACTGCTGATTGATTATTCTCCTAAAAATATTAATGCTAG
CATTGGAAAAGTGGGTAATCAGCCAACAACTTCCGTAAGTATTATTAATAGCATTGAAAATGATTATTCTCCTACAAATATTAATGCTAGCATTGGAAAAGTGGGTAATC
AACCAGCAGCTTCTGTAAGTATTATTAATAGCATTGAAAATGAGTATATCCAGCAGGCAGCTTTAAAGACTTACTCTCGGAAAAAGGGGTCTCGATTAGTGAAGCAGTTT
AATGCCAATGTTTCAGAAATTAGTGAAGCATTAACTGAAGGAGAATTGCATGAGTCCCAGAATTTATTATTCACGCCTATTCATGATCCACCTTTGGTTTTGAAGAATTG
TAATGAAGATGGTTTGGAAGATAAGGAACGGATTGTTTCTAAGGCTCTAAAGAAACAATATGAATCTTTTCCTCTTTATTATTCTCGTCGGAAAAATGAAAAGACAACAA
TTTTGGATTCAATTCCTATTAATTCCAATTATAACCCCGACGTGATTGAAGAATCTTATGATGATTCAGTGGTGAGTATTAGTAGTGCCGAGGCTGAAAATCAGTTTTTG
AATGATGAAAACAATGAATTATTGAAGGAATACTCTTTTGCATTGGCTTTAAATCGGATTTTCCAGAACAATGAAGCTGTTTTTGAAGTTCAGATGAATGAGTCTAAGGT
GCCATTTAATTCTAGTCATTGGGAGGATGTTAATGGGGACTTGGTTATCTCAAAGGACACCTCGGTGCATGAAGAAAGAATTAATTGTGATGGTTGTAATGATCCTCCAC
CCAAGATGATTAATGATGATAGTTGTAAGGTGATTAATGAATTTCAGCAGATTTCTAGTGAGAAAGATCAAGTTAATGAGGCGTTGGGGTTTCCAAAAGATGCTTCATTG
CATGATAAGAGTTTTAATCATGTTGATTGTAAAGTGCTGGAAATGTGTTGTTCAGCTTCAGCTTGTTCAGAAGTTCATTCTAATGCAGCAGGATTACTTTGTGCTATTTC
TCGATTTTCTCCTCTTGGTCTTTTGGCTAAAATTTCCAGCACAAGTTTTGTAGGAAGTTTGGTTCGCCATGCCTTAGAAGATTCTCATCCAACGTTTGTTTTAATAAACT
CATTATCAGTGGTTAGATCCTTGGAGGCTATCTACTGGATGATCTTTTGTATACTCCGGCCAAATTGCCCTTGGATATTCTGTTTCAGCAATTATGGAGGCAGTGGATGG
CAGGTTAGAGAGTTTAAGTTGGTTGCTCAAGCTTCTGGATGTTTCTTCAGTTACTACATTTGGAAAGTCACAACCACCTCTTGGAAACACCGTCTGAAGCTCGTCTCAAA
TCTTTGTTATGGAGTCCGTTGTCCAAGATTTATTCAGCATTCTCAATTCAGAATATTTGCCTTTTTTGGGATGCTTTCATTTCTCCATCCTAAGTCTTCATTAGAAGTCT
TTAATCATGAATTTGTGGAAGAAAAACGCACTAAATTTTTCGATCTATCAGTTAAGGTTTCCACCCAGCGGAGTAGTGCTCTAAATGAGGAGATTTTCGAGCTCAGCTCT
CAATCTTTGGTCCTCAATTCATCTCCTCCTTTTGTTGATAATCATCTTTCAAACGGTTTTATTAATTCAACAAATGGGACAATCAACAAGGCACATTTTTTGGAGAATGA
AGTGGAGACAGACAAGGATTCCATGGTGAGTATTAGTAGTGTTGACACGGAGTTCAATTTTCCATTTGTCATTGAGCAAGAGAAGATTTCTTCTGAAGTTGCTATTGAAG
GGGCCCTAAATAATCTTTTCCAAGAACCTGATCCTCCTTCCAAGAAGATTGAGAAAGAAATATGCAAGTTGGAAACTCATGGACACTTTACCAGTAGCTCCCTCATTGGT
GGTCTCTCCCGCACAAACGAAGATGAATCGCCACAGCTATTTGAATTGATTTGTAAAGGCCAGTTAAGGAAGAAGGATACCTTACACATTGGTATCAGAACACGTCAAAT
CCTGGGGAAAACACGCGCAATGGCTCAAAAACAGACAGAGGAATGCCTGGAGAACCAGGAAAGGCAGATGGGAGAGTTTAAGGAGACCATACAGCTGTTCGGGAAACACC
TTGAACAGTTGACGCTCGAGATGAAGGAGAATCAGCGGGCGATTATGGCCCTGATGGCAGGACAGAATGTGGGACGAGAGCAGTCGTCTTCGGAAGTTACTGAGTCCTCG
AGTAAGCAACTTAAGGCCTTAGAGAAGGAACCAGAGAATAAGGAAGAGAAAACAGAGGCCATCGAGGTTAACCGCGTGCAAAGGGAAATACAAGGGGAGACTGTTGCAAG
TTTAAGAAGCTCGAGATGCCGGTTTTTGCAGGGGAGAAGCCGGATTCGTGGCTCTTTCGTGCGGAGCGATATTTCGAGATCCACCAACTAA
Protein sequenceShow/hide protein sequence
MDDKALIQVADYSLDPSVNGKWKLFGNLHLKLEFWSSDFHSQPKFIKSYGGWIAIKNLPLDLWHRDSFEAIGKNLGGLVSISSKTLNLLDCSKAFIEVEKNYCGFIPADI
HVKIDNKCEFSLKFGDIKALDDRNLNFVSSRRLSSDEFSNSLDVIRIRQAVLDDDLILGNEEERMNELPSCSRSQGKINEVLGSPKGALLHEEGINNIGCMGFNDSTQEM
DPILSPFNVINDNISTRPKEFQQPLLIDYSPKNINASIGKVGNQPTTSVSIINSIENDYSPTNINASIGKVGNQPAASVSIINSIENEYIQQAALKTYSRKKGSRLVKQF
NANVSEISEALTEGELHESQNLLFTPIHDPPLVLKNCNEDGLEDKERIVSKALKKQYESFPLYYSRRKNEKTTILDSIPINSNYNPDVIEESYDDSVVSISSAEAENQFL
NDENNELLKEYSFALALNRIFQNNEAVFEVQMNESKVPFNSSHWEDVNGDLVISKDTSVHEERINCDGCNDPPPKMINDDSCKVINEFQQISSEKDQVNEALGFPKDASL
HDKSFNHVDCKVLEMCCSASACSEVHSNAAGLLCAISRFSPLGLLAKISSTSFVGSLVRHALEDSHPTFVLINSLSVVRSLEAIYWMIFCILRPNCPWIFCFSNYGGSGW
QVREFKLVAQASGCFFSYYIWKVTTTSWKHRLKLVSNLCYGVRCPRFIQHSQFRIFAFFGMLSFLHPKSSLEVFNHEFVEEKRTKFFDLSVKVSTQRSSALNEEIFELSS
QSLVLNSSPPFVDNHLSNGFINSTNGTINKAHFLENEVETDKDSMVSISSVDTEFNFPFVIEQEKISSEVAIEGALNNLFQEPDPPSKKIEKEICKLETHGHFTSSSLIG
GLSRTNEDESPQLFELICKGQLRKKDTLHIGIRTRQILGKTRAMAQKQTEECLENQERQMGEFKETIQLFGKHLEQLTLEMKENQRAIMALMAGQNVGREQSSSEVTESS
SKQLKALEKEPENKEEKTEAIEVNRVQREIQGETVASLRSSRCRFLQGRSRIRGSFVRSDISRSTN