; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg032519 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg032519
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold2:30926755..30929270
RNA-Seq ExpressionSpg032519
SyntenySpg032519
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]8.3e-2230.36Show/hide
Query:  FCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLKPE
        FC HP + +VPLV+EFYA L+ +  +   V    ++F+S  IN V  I  + +    ++I +   +Q+KE LK +A  G QW        +    +L+P 
Subjt:  FCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLKPE

Query:  STVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKL---
        + V  HF+ +R + +TH  TIS +R +LLY ++ G  INVG +I D+I AC  K  G L+F SLI++LC +  +     E R      +DL  I ++   
Subjt:  STVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKL---

Query:  ---------------QQNRPSPSLEALAIA---------------------YHQLDQIRENLKTYWVYAKERDEAITEFY
                       + +RPS S    A A                     +  L Q +E L  +WVY+++RD A+ + +
Subjt:  ---------------QQNRPSPSLEALAIA---------------------YHQLDQIRENLKTYWVYAKERDEAITEFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]4.5e-2835.89Show/hide
Query:  EEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLK
        ++FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +   ++   ++ I N +   +   L+ VA  G +W        + + S L 
Subjt:  EEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLK

Query:  PESTVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQ
        P + V  HF+K+  +PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF SLIT+LC+  +     +EE+      ID   + ++ 
Subjt:  PESTVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQ

Query:  QNRPSPSLE
        Q  P+ S +
Subjt:  QNRPSPSLE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.7e-2831.45Show/hide
Query:  EEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLK
        ++FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +   ++   ++ I+N + + +   L+ VA  G +W        + + S L 
Subjt:  EEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLK

Query:  PESTVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQ
        P + V  HF+K+R +PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF SLIT+LC+  +     +EE+      ID   + ++ 
Subjt:  PESTVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQ

Query:  QNRPSPS------------------------LEAL--------AIAYHQLDQIRENLK---TYWVYAKERDEAITEFYLSIAPSIDPVFQDFPQSLLPQE
        Q  P+ S                        L+AL           YH +  ++   K    +W Y+KERD A+ +   +      P F  FPQ +L   
Subjt:  QNRPSPS------------------------LEAL--------AIAYHQLDQIRENLK---TYWVYAKERDEAITEFYLSIAPSIDPVFQDFPQSLLPQE

Query:  EDSDEEEDEENNDEDDEE
        +D D E + E++ +   E
Subjt:  EDSDEEEDEENNDEDDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.2e-2334.62Show/hide
Query:  FCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLKPE
        FCAHP++ +VPLVREFY  +         +RG  V  S   IN ++ +   ++   ++ + + +  ++   L+ VA  G +W        + + S L P 
Subjt:  FCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLKPE

Query:  STVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER
        + V  HF+K+R +PTTH  T+S + V LLY ++ G  INVG +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  STVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.5e-2330.74Show/hide
Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLKPESTVLLHFIK
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +   ++   ++ I N +  ++   L+ VA  G +W        + + S L P + V  HF+K
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLKPESTVLLHFIK

Query:  NRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQQNRPSPS----
        +R +PTTH   +S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF SLIT+LC+    +   +EE+      ID   + ++ Q  P+ S    
Subjt:  NRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQQNRPSPS----

Query:  --------------------LEALAIAYHQLDQIRENLKTYWVYAKERDEAITEFYLSIAPSIDPVFQDFPQSLLPQEEDSDEEEDEENNDEDDEE
                            L+AL     Q +   +  + +W Y+KERD A+ +   +      P F  FPQ +L   +D D E + E++ +   E
Subjt:  --------------------LEALAIAYHQLDQIRENLKTYWVYAKERDEAITEFYLSIAPSIDPVFQDFPQSLLPQEEDSDEEEDEENNDEDDEE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.2e-2835.89Show/hide
Query:  EEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLK
        ++FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +   ++   ++ I N +   +   L+ VA  G +W        + + S L 
Subjt:  EEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLK

Query:  PESTVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQ
        P + V  HF+K+  +PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF SLIT+LC+  +     +EE+      ID   + ++ 
Subjt:  PESTVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQ

Query:  QNRPSPSLE
        Q  P+ S +
Subjt:  QNRPSPSLE

A0A2P5BCG4 Uncharacterized protein (Fragment)1.3e-2831.45Show/hide
Query:  EEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLK
        ++FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +   ++   ++ I+N + + +   L+ VA  G +W        + + S L 
Subjt:  EEFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLK

Query:  PESTVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQ
        P + V  HF+K+R +PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF SLIT+LC+  +     +EE+      ID   + ++ 
Subjt:  PESTVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQ

Query:  QNRPSPS------------------------LEAL--------AIAYHQLDQIRENLK---TYWVYAKERDEAITEFYLSIAPSIDPVFQDFPQSLLPQE
        Q  P+ S                        L+AL           YH +  ++   K    +W Y+KERD A+ +   +      P F  FPQ +L   
Subjt:  QNRPSPS------------------------LEAL--------AIAYHQLDQIRENLK---TYWVYAKERDEAITEFYLSIAPSIDPVFQDFPQSLLPQE

Query:  EDSDEEEDEENNDEDDEE
        +D D E + E++ +   E
Subjt:  EDSDEEEDEENNDEDDEE

A0A2P5DAQ2 Uncharacterized protein5.6e-2434.62Show/hide
Query:  FCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLKPE
        FCAHP++ +VPLVREFY  +         +RG  V  S   IN ++ +   ++   ++ + + +  ++   L+ VA  G +W        + + S L P 
Subjt:  FCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLKPE

Query:  STVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER
        + V  HF+K+R +PTTH  T+S + V LLY ++ G  INVG +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  STVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER

A0A2P5DXM3 Uncharacterized protein7.3e-2430.74Show/hide
Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLKPESTVLLHFIK
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +   ++   ++ I N +  ++   L+ VA  G +W        + + S L P + V  HF+K
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLKPESTVLLHFIK

Query:  NRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQQNRPSPS----
        +R +PTTH   +S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF SLIT+LC+    +   +EE+      ID   + ++ Q  P+ S    
Subjt:  NRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQQNRPSPS----

Query:  --------------------LEALAIAYHQLDQIRENLKTYWVYAKERDEAITEFYLSIAPSIDPVFQDFPQSLLPQEEDSDEEEDEENNDEDDEE
                            L+AL     Q +   +  + +W Y+KERD A+ +   +      P F  FPQ +L   +D D E + E++ +   E
Subjt:  --------------------LEALAIAYHQLDQIRENLKTYWVYAKERDEAITEFYLSIAPSIDPVFQDFPQSLLPQEEDSDEEEDEENNDEDDEE

W9RBS1 Uncharacterized protein4.0e-2230.36Show/hide
Query:  FCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLKPE
        FC HP + +VPLV+EFYA L+ +  +   V    ++F+S  IN V  I  + +    ++I +   +Q+KE LK +A  G QW        +    +L+P 
Subjt:  FCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLKPE

Query:  STVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKL---
        + V  HF+ +R + +TH  TIS +R +LLY ++ G  INVG +I D+I AC  K  G L+F SLI++LC +  +     E R      +DL  I ++   
Subjt:  STVLLHFIKNRFMPTTHDSTISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKL---

Query:  ---------------QQNRPSPSLEALAIA---------------------YHQLDQIRENLKTYWVYAKERDEAITEFY
                       + +RPS S    A A                     +  L Q +E L  +WVY+++RD A+ + +
Subjt:  ---------------QQNRPSPSLEALAIA---------------------YHQLDQIRENLKTYWVYAKERDEAITEFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGGCACACGAAGAACGAGACCCACAGGATTCTCGCGGGCGGTCGTGAACCAAGCATCCAACGCTCCAACTCCATCTTCTTCAACAATGCCGGCCATGTCGAGGGA
GATGCCGAGTTCGTCTACATCGAGACGGTTCACGCGTGCCACTACTGTCTGCCAAACCCAAAAGCCCGCCACTCAACAGTTTAGAAAACGTTCGCGGGAGTGGTTTGCGA
TGAGCTTTGAGATGGGTGCTCAGAGATGTGCTGCCCTTGAAGAAGAAGGGAATTGGCAAGATGAAAAAGAAGCCGCCAAGGCAGCTGAAAGCTCTCGACAAGGAGAAACT
TCAATGGCCCCTCAAAACCGCAAGGATGCCTCCCGACGTATTCGAAGAATAATTCGCCAAGCAGTGGCAAAGGCTCTTGCGATTGCTGAAGGGTACAAAGCTGAACAGGA
TGCTTTGAAAGAGATTGAGGAGGAGAGAGAGATGGAAAATCAAAAAATGGCTGAGGAAGACAATTTTGCAAAGGAAAGAGATGAGGACGAAGAGAAAAGGAGAGAAGAAG
AACAAGAGGCCGAGAGGATCTTAGAAGCTGAAGAAGAAAGAAAGTATGAGGAAAACCTCAGGAGGGCAGCCATGGATTTGCAACTCCTTGAGGAAGAGAAAAAGAGAAGG
GAAGAAATAAAAGAAGATGAAAAAAGAATGAAGGAAGCTGAATACTTCCTTGCAGCTTTTGAGCCACTTCACAAGGCTCAAAGTGAGGCTGAATTGCTGCAAAGGAGGGT
AGAAGAGGCCCAACAGGGGCCAACTGAAGAAATTTTAGAAAAAGAAAAAGAAAGAGAAGTGGAAAATGAAAGCCAGAATGCGACCGCATCTGGGCCGCATTTTGAAGAAG
GCCTAGCCAAGGCCAATAAAGAGCAGCCTGCTGAAGAGGTTTTTGAGCCTCTATTCACAAATGACCCACCAGCAGCTGATAGCACCTCTTCGGGAGAGAAGAGGGATGAA
GAGGAAAAGGAAGACGAGGAGGCCGAGACCTCCAGTGATTCTGACTCTGACACAGAATCTGATTCAGAGATTAGGGAGCTAGATGGCGACCAAGTCCCTATCTCTACAGC
ATTAAGAAGAAAGAGGAAGAGAGAGATTAAAGCTGAGAGGAGGACAAAGAACAAGAATGACCCCATATTTGCCAAGAGGCCGAGGACTAGGTCCATGGACGCCTCTCCTA
CAGTTCCTCCTACCATCTCACCCGCCAAGTCAAAGGGCAAATCACCCAAGGCTGCATCTCCCAGAAATCCATTCCCTGAGGAGTTCTGTGCTCACCCTCAGGAGGCTGTT
GTGCCTTTAGTGCGAGAGTTTTACGCCGGCCTGAGGGAGGAGAGCATTAGCATGGCGGTGGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATTAATAGGGTGTA
CAGGATCAAGGCATCCTTGAACCCAAGAGGGAATGATGTGATAAGGAACCCTTCGACCAAGCAAATGAAGGAAGCTCTGAAACTTGTGGCCAATAAAGGGGTCCAATGGA
AAGAATTGCAGTCAAAAGTGAAGTCGTTAGTGCCAAGCGACCTAAAGCCAGAATCGACAGTTTTGCTTCACTTCATCAAGAACCGCTTCATGCCAACCACCCACGACAGC
ACGATTTCAATAGATAGAGTGATGCTACTCTATTGCCTTATGAAGGGGTTGGAGATCAACGTGGGGAGCATTATCAGGGATGAGATTTTAGCCTGTGGGAGAAAGCGAGC
AGGCAAACTTTTCTTTGGATCACTCATCACCCAACTTTGCCAAAGGGTGAAGATCGTTCCGGGCAAGGACGAGGAGCGTCACTTCTTCAAGCCGACCATCGACCTGTCCT
TGATCCGGAAGCTCCAGCAGAACAGGCCTTCGCCATCGTTAGAAGCCCTAGCCATTGCCTACCATCAGCTAGATCAAATCAGGGAGAACCTGAAGACGTATTGGGTATAT
GCAAAGGAGAGGGATGAAGCTATTACAGAGTTCTATCTCTCTATTGCCCCGAGTATCGATCCAGTCTTTCAAGATTTCCCTCAATCGCTGCTGCCCCAAGAAGAGGATTC
TGATGAAGAGGAAGATGAAGAGAATAATGATGAAGATGATGAAGAGAAAGAGAGTTCCTCGGACGAGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAGGCACACGAAGAACGAGACCCACAGGATTCTCGCGGGCGGTCGTGAACCAAGCATCCAACGCTCCAACTCCATCTTCTTCAACAATGCCGGCCATGTCGAGGGA
GATGCCGAGTTCGTCTACATCGAGACGGTTCACGCGTGCCACTACTGTCTGCCAAACCCAAAAGCCCGCCACTCAACAGTTTAGAAAACGTTCGCGGGAGTGGTTTGCGA
TGAGCTTTGAGATGGGTGCTCAGAGATGTGCTGCCCTTGAAGAAGAAGGGAATTGGCAAGATGAAAAAGAAGCCGCCAAGGCAGCTGAAAGCTCTCGACAAGGAGAAACT
TCAATGGCCCCTCAAAACCGCAAGGATGCCTCCCGACGTATTCGAAGAATAATTCGCCAAGCAGTGGCAAAGGCTCTTGCGATTGCTGAAGGGTACAAAGCTGAACAGGA
TGCTTTGAAAGAGATTGAGGAGGAGAGAGAGATGGAAAATCAAAAAATGGCTGAGGAAGACAATTTTGCAAAGGAAAGAGATGAGGACGAAGAGAAAAGGAGAGAAGAAG
AACAAGAGGCCGAGAGGATCTTAGAAGCTGAAGAAGAAAGAAAGTATGAGGAAAACCTCAGGAGGGCAGCCATGGATTTGCAACTCCTTGAGGAAGAGAAAAAGAGAAGG
GAAGAAATAAAAGAAGATGAAAAAAGAATGAAGGAAGCTGAATACTTCCTTGCAGCTTTTGAGCCACTTCACAAGGCTCAAAGTGAGGCTGAATTGCTGCAAAGGAGGGT
AGAAGAGGCCCAACAGGGGCCAACTGAAGAAATTTTAGAAAAAGAAAAAGAAAGAGAAGTGGAAAATGAAAGCCAGAATGCGACCGCATCTGGGCCGCATTTTGAAGAAG
GCCTAGCCAAGGCCAATAAAGAGCAGCCTGCTGAAGAGGTTTTTGAGCCTCTATTCACAAATGACCCACCAGCAGCTGATAGCACCTCTTCGGGAGAGAAGAGGGATGAA
GAGGAAAAGGAAGACGAGGAGGCCGAGACCTCCAGTGATTCTGACTCTGACACAGAATCTGATTCAGAGATTAGGGAGCTAGATGGCGACCAAGTCCCTATCTCTACAGC
ATTAAGAAGAAAGAGGAAGAGAGAGATTAAAGCTGAGAGGAGGACAAAGAACAAGAATGACCCCATATTTGCCAAGAGGCCGAGGACTAGGTCCATGGACGCCTCTCCTA
CAGTTCCTCCTACCATCTCACCCGCCAAGTCAAAGGGCAAATCACCCAAGGCTGCATCTCCCAGAAATCCATTCCCTGAGGAGTTCTGTGCTCACCCTCAGGAGGCTGTT
GTGCCTTTAGTGCGAGAGTTTTACGCCGGCCTGAGGGAGGAGAGCATTAGCATGGCGGTGGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATTAATAGGGTGTA
CAGGATCAAGGCATCCTTGAACCCAAGAGGGAATGATGTGATAAGGAACCCTTCGACCAAGCAAATGAAGGAAGCTCTGAAACTTGTGGCCAATAAAGGGGTCCAATGGA
AAGAATTGCAGTCAAAAGTGAAGTCGTTAGTGCCAAGCGACCTAAAGCCAGAATCGACAGTTTTGCTTCACTTCATCAAGAACCGCTTCATGCCAACCACCCACGACAGC
ACGATTTCAATAGATAGAGTGATGCTACTCTATTGCCTTATGAAGGGGTTGGAGATCAACGTGGGGAGCATTATCAGGGATGAGATTTTAGCCTGTGGGAGAAAGCGAGC
AGGCAAACTTTTCTTTGGATCACTCATCACCCAACTTTGCCAAAGGGTGAAGATCGTTCCGGGCAAGGACGAGGAGCGTCACTTCTTCAAGCCGACCATCGACCTGTCCT
TGATCCGGAAGCTCCAGCAGAACAGGCCTTCGCCATCGTTAGAAGCCCTAGCCATTGCCTACCATCAGCTAGATCAAATCAGGGAGAACCTGAAGACGTATTGGGTATAT
GCAAAGGAGAGGGATGAAGCTATTACAGAGTTCTATCTCTCTATTGCCCCGAGTATCGATCCAGTCTTTCAAGATTTCCCTCAATCGCTGCTGCCCCAAGAAGAGGATTC
TGATGAAGAGGAAGATGAAGAGAATAATGATGAAGATGATGAAGAGAAAGAGAGTTCCTCGGACGAGGAATAG
Protein sequenceShow/hide protein sequence
MQGTRRTRPTGFSRAVVNQASNAPTPSSSTMPAMSREMPSSSTSRRFTRATTVCQTQKPATQQFRKRSREWFAMSFEMGAQRCAALEEEGNWQDEKEAAKAAESSRQGET
SMAPQNRKDASRRIRRIIRQAVAKALAIAEGYKAEQDALKEIEEEREMENQKMAEEDNFAKERDEDEEKRREEEQEAERILEAEEERKYEENLRRAAMDLQLLEEEKKRR
EEIKEDEKRMKEAEYFLAAFEPLHKAQSEAELLQRRVEEAQQGPTEEILEKEKEREVENESQNATASGPHFEEGLAKANKEQPAEEVFEPLFTNDPPAADSTSSGEKRDE
EEKEDEEAETSSDSDSDTESDSEIRELDGDQVPISTALRRKRKREIKAERRTKNKNDPIFAKRPRTRSMDASPTVPPTISPAKSKGKSPKAASPRNPFPEEFCAHPQEAV
VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKASLNPRGNDVIRNPSTKQMKEALKLVANKGVQWKELQSKVKSLVPSDLKPESTVLLHFIKNRFMPTTHDS
TISIDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIRKLQQNRPSPSLEALAIAYHQLDQIRENLKTYWVY
AKERDEAITEFYLSIAPSIDPVFQDFPQSLLPQEEDSDEEEDEENNDEDDEEKESSSDEE