; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020887 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020887
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein MNN4-like
Genome locationscaffold9:3547649..3550161
RNA-Seq ExpressionSpg020887
SyntenySpg020887
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]5.8e-3239.9Show/hide
Query:  MKKRDFLNEKGF----SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRRNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W++FCAHP++ +VPLVREFYA+L +   +   VRG  VS+S   IN V+ +  P++   ++ I N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRRNDVIRN

Query:  PSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLF
         +   +   L+ VA  G +W  S     + + S L P + VW HF+K+HL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LF
Subjt:  PSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLF

PON41193.1 LOW QUALITY PROTEIN: hypothetical protein PanWU01x14_291710 [Parasponia andersonii]7.1e-2233.14Show/hide
Query:  PEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRRNDVIRNPSAKQMKEALKLVANKGIQWKES
        P F + +I+Q+ W+ FCA+P++ ++PLV EFYA++ +       +  ++  F    IN ++ ++ P++   ++ +   +  ++   L+ VA    +W  S
Subjt:  PEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRRNDVIRNPSAKQMKEALKLVANKGIQWKES

Query:  QTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLF
             + + S L   + +W HF+K++L+PTTH  T+S DR +LLY ++ G  INVG II  EI AC  K+ G LF
Subjt:  QTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLF

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]4.4e-3230.77Show/hide
Query:  MKKRDFLNEKGF----SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRRNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W++FCAHP++ +VPLVREFYA+L +   +   VRG  VS+S   IN V+ +  P++   ++ I+N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRRNDVIRN

Query:  PSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLF--
         + + +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LF  
Subjt:  PSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLF--

Query:  -----LAHSSPSSNNIQRKDKASTSQATPQSGSNVASSSQHTPFTGPSPSSEALAIAYR-------QIDRLRDDL---------------------RTYW
             L  ++ +   +  +   +T +    + + +A          PS S  A A + R       Q+  L   L                     + +W
Subjt:  -----LAHSSPSSNNIQRKDKASTSQATPQSGSNVASSSQHTPFTGPSPSSEALAIAYR-------QIDRLRDDL---------------------RTYW

Query:  AYAKERDEAIREFYLSIAPSISPVFPNFPQSLLPQEEEDSDEEEDEENDEE
        AY+KERD A+++   +      P FP FPQ +L   + + + E D++   E
Subjt:  AYAKERDEAIREFYLSIAPSISPVFPNFPQSLLPQEEEDSDEEEDEENDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.3e-2633.33Show/hide
Query:  ASPKNPFPKVFKDINFQERMEIMKKRDFLNEKGF---SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVD
        AS    F     +I ++E ++    R    EK F   +++    P F++ +I Q+ WQ FCAHP++ +VPLVREFY ++         +RG  V  S   
Subjt:  ASPKNPFPKVFKDINFQERMEIMKKRDFLNEKGF---SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVD

Query:  INRVYRIKAPLNPRRNDVIRNPSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S + V LLY ++ G  INVG
Subjt:  INRVYRIKAPLNPRRNDVIRNPSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVG

Query:  SIIRDEILACGRKRAGKLF
         +I  EI AC  +++G LF
Subjt:  SIIRDEILACGRKRAGKLF

TYH88163.1 hypothetical protein ES332_D01G168900v1 [Gossypium tomentosum]2.7e-2134.25Show/hide
Query:  SPKNPFPKVFKDINFQERME-IMKKRDFLNEKGF---SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVD
        +PKNP   +  D   +ER + I K +  + EKGF   SN     P  + K I+ +KW+ FC     +   LVREFYASL  +  +  +VR K V  +S  
Subjt:  SPKNPFPKVFKDINFQERME-IMKKRDFLNEKGF---SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVD

Query:  INRVYRIKAPLNPRRNDVIRNPSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVG
        IN ++ +          +I N +   +++ L +V N G QW   +    S     LKP + VW +F++   MP +H  TIS++R++LLY ++    INVG
Subjt:  INRVYRIKAPLNPRRNDVIRNPSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVG

Query:  SIIRDEILACGRKRAGKLF
         II  EI  C +K+AG ++
Subjt:  SIIRDEILACGRKRAGKLF

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.8e-3239.9Show/hide
Query:  MKKRDFLNEKGF----SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRRNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W++FCAHP++ +VPLVREFYA+L +   +   VRG  VS+S   IN V+ +  P++   ++ I N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRRNDVIRN

Query:  PSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLF
         +   +   L+ VA  G +W  S     + + S L P + VW HF+K+HL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LF
Subjt:  PSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLF

A0A2P5AXB5 Uncharacterized protein3.4e-2233.14Show/hide
Query:  PEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRRNDVIRNPSAKQMKEALKLVANKGIQWKES
        P F + +I+Q+ W+ FCA+P++ ++PLV EFYA++ +       +  ++  F    IN ++ ++ P++   ++ +   +  ++   L+ VA    +W  S
Subjt:  PEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRRNDVIRNPSAKQMKEALKLVANKGIQWKES

Query:  QTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLF
             + + S L   + +W HF+K++L+PTTH  T+S DR +LLY ++ G  INVG II  EI AC  K+ G LF
Subjt:  QTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLF

A0A2P5BCG4 Uncharacterized protein (Fragment)2.1e-3230.77Show/hide
Query:  MKKRDFLNEKGF----SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRRNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W++FCAHP++ +VPLVREFYA+L +   +   VRG  VS+S   IN V+ +  P++   ++ I+N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRRNDVIRN

Query:  PSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLF--
         + + +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LF  
Subjt:  PSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLF--

Query:  -----LAHSSPSSNNIQRKDKASTSQATPQSGSNVASSSQHTPFTGPSPSSEALAIAYR-------QIDRLRDDL---------------------RTYW
             L  ++ +   +  +   +T +    + + +A          PS S  A A + R       Q+  L   L                     + +W
Subjt:  -----LAHSSPSSNNIQRKDKASTSQATPQSGSNVASSSQHTPFTGPSPSSEALAIAYR-------QIDRLRDDL---------------------RTYW

Query:  AYAKERDEAIREFYLSIAPSISPVFPNFPQSLLPQEEEDSDEEEDEENDEE
        AY+KERD A+++   +      P FP FPQ +L   + + + E D++   E
Subjt:  AYAKERDEAIREFYLSIAPSISPVFPNFPQSLLPQEEEDSDEEEDEENDEE

A0A2P5DAQ2 Uncharacterized protein6.1e-2733.33Show/hide
Query:  ASPKNPFPKVFKDINFQERMEIMKKRDFLNEKGF---SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVD
        AS    F     +I ++E ++    R    EK F   +++    P F++ +I Q+ WQ FCAHP++ +VPLVREFY ++         +RG  V  S   
Subjt:  ASPKNPFPKVFKDINFQERMEIMKKRDFLNEKGF---SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVD

Query:  INRVYRIKAPLNPRRNDVIRNPSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S + V LLY ++ G  INVG
Subjt:  INRVYRIKAPLNPRRNDVIRNPSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVG

Query:  SIIRDEILACGRKRAGKLF
         +I  EI AC  +++G LF
Subjt:  SIIRDEILACGRKRAGKLF

A0A5D2MA47 Uncharacterized protein1.3e-2134.25Show/hide
Query:  SPKNPFPKVFKDINFQERME-IMKKRDFLNEKGF---SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVD
        +PKNP   +  D   +ER + I K +  + EKGF   SN     P  + K I+ +KW+ FC     +   LVREFYASL  +  +  +VR K V  +S  
Subjt:  SPKNPFPKVFKDINFQERME-IMKKRDFLNEKGF---SNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVVRGKMVSFSSVD

Query:  INRVYRIKAPLNPRRNDVIRNPSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVG
        IN ++ +          +I N +   +++ L +V N G QW   +    S     LKP + VW +F++   MP +H  TIS++R++LLY ++    INVG
Subjt:  INRVYRIKAPLNPRRNDVIRNPSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINVG

Query:  SIIRDEILACGRKRAGKLF
         II  EI  C +K+AG ++
Subjt:  SIIRDEILACGRKRAGKLF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGGCACACGAAGAACGAGACCCACGGGATTCTCGCCGGCGGTTGTGAACCAAACGCCCAACGCTCAAGCTCCATCCTCTTCGGCAATTTCGGCCACGTCGAGGGA
GATGCCGAGTTCGTCTTCACCAAGACGGTTCACCCGCGCCACTGCTGTTCGTCAAACCCAAAAACCCGCCACTCAACAGTACAGAAAACGTTCGCGGGAGTGGTTTGAAA
TGATCCGTGAGATGGGTTCCAAGAGACGAGCTGCCCTTGAAGAAGAAGGGAATCGGCAAGATGAAGACAAAGCCGCCAAGGCAGCTGAAAGCTCTCGGCAAGGAGAAACT
TCATTGGGTAAGGTTTCTGAACCTTCAACTAACCCTTCTCTCTTTTGCAGGATCAAGCTCGTTGTTACTTACAGCGCAAGAAAGAGGAGCTTGAAGAAGGTTGAGTCTGA
AAAGCCGCTTGAAATGGAGTCCCTCAAAACCGCAAGGATGCCTCCGGACGTATTCGAAGGAATAATTCGCCAAGCAGTGGCAAAGGCCCTTGAGATTGCAGAGGGGTACA
AGGCTGAACAAGATGCTTTGAAAGAAGTTGAAGCGGAGAGAGAGATGGAAAATAAGAAAATGGCTGAGGAAGATGAGTTTGCAAAGGAAAGAGATGAGGGGAATGAGAAA
AGAAAAAGAGAAGAAGAGCAAGAGGCCGAGAGGGCCTTAGAAGCTGAAGAAGAGAAAAGATTAGGTGAAAGCCTCAGGAGGGCAGCCATTGATTTGCAACTCCTTGAGGA
AGAGAAAAAGAGAAGGGAAAAAATAAAAGAAGATGAAAGGCGAAGAAAAGAAGCCGAAGACTTCCTTGCAGCTTTTGAGCCACTCCACAAGGCTCAAAGTGAGGCTGAAG
CACTACAAGGAAGGGTAGAAGAAGAGGCCCAACAGGGGCCAAATGAAGAAATTTTTGAAAAAGAAAAAGGAAGAGAAATGGAGAATGAAGGCCAGAATGTGACCGCATTT
GGACCGCATACTGAGGAAGGCCTAGCCGAGGCCACCATTGATCAGCCAGCTGAAAAGGTTTTTGAGCCTCTATTCACACATGACCCACCAGCAGCTAATAGCACCTCTTC
GGGAGAGAAGAGGGATGATGAGAAAAAAGAAGACGAGGAGGCCGAGACCTCCAGTGATTCTGATTCTGATTTAGAATCTGATTCAGAGATTAGGGAGCTAGATGGCGATC
AAGCCACTATCTCTGCAGCGTTGAGAAGAAAGAGGAAGAGAGAGATAAAGGCTGAGAGGAGGACAAAGAACAAAAATGACCCGATATTTGCCAAGAGGCCGAGGACAAGG
TCCATGGACTCCTCTCCTACAGTCCCTCCGACCGTCTCACCCACCAAGCCTAAGGGCAAGTCACCGAAGGCCGCATCACCTAAAAATCCATTCCCCAAGGTATTTAAAGA
TATTAATTTTCAAGAACGGATGGAGATCATGAAGAAGAGAGATTTCCTCAATGAGAAAGGATTCTCTAACAGAGCAGGAGCACTGCCAGAGTTCGTGAGCAAGATCATAT
CTCAATACAAATGGCAGGAGTTCTGTGCTCACCCTCAGGAGGCTGTTGTGCCTCTAGTGCGAGAATTTTACGCCAGCCTGAGGGAGGAGAGCATTAGCATGGCGGTGGTG
AGGGGGAAGATGGTCAGTTTCTCCTCTGTCGACATTAACAGGGTGTACAGGATCAAAGCACCTTTGAACCCAAGAAGGAATGATGTGATCAGGAACCCTTCGGCCAAGCA
AATGAAAGAAGCATTGAAACTTGTGGCCAACAAGGGGATCCAGTGGAAAGAATCACAGACAAAAGTGAAGTCTCTAGTGTCAAGCGACCTAAAGCCAGAATCGGCAGTTT
GGCTTCACTTCATAAAAAACCACTTGATGCCAACCACCCACGACAGCACGATTTCAGTGGATAGAGTGATGCTACTCTATTGTCTTATGAAGGGGTTGGAAATCAACGTA
GGGAGCATTATCAGGGATGAAATCTTAGCCTGTGGACGGAAAAGGGCAGGCAAGCTTTTCTTGGCTCACTCATCACCCAGCTCTAACAACATCCAGAGGAAGGATAAAGC
CTCCACATCACAAGCCACTCCTCAATCAGGGTCGAATGTAGCTTCTTCATCCCAGCACACTCCTTTCACAGGGCCATCACCGTCATCAGAGGCCCTAGCTATTGCCTACC
GCCAGATAGATCGACTCAGGGACGACCTAAGGACATACTGGGCATATGCAAAGGAGCGGGATGAAGCCATTAGAGAGTTTTATCTCTCTATAGCCCCGAGTATTAGTCCG
GTCTTTCCTAATTTCCCTCAGTCGCTGCTGCCCCAAGAAGAAGAGGATTCTGATGAAGAGGAAGATGAAGAGAATGATGAAGAGAAAGAGAGTTCCTCAGACGAGGAATA
G
mRNA sequenceShow/hide mRNA sequence
ATGCAAGGCACACGAAGAACGAGACCCACGGGATTCTCGCCGGCGGTTGTGAACCAAACGCCCAACGCTCAAGCTCCATCCTCTTCGGCAATTTCGGCCACGTCGAGGGA
GATGCCGAGTTCGTCTTCACCAAGACGGTTCACCCGCGCCACTGCTGTTCGTCAAACCCAAAAACCCGCCACTCAACAGTACAGAAAACGTTCGCGGGAGTGGTTTGAAA
TGATCCGTGAGATGGGTTCCAAGAGACGAGCTGCCCTTGAAGAAGAAGGGAATCGGCAAGATGAAGACAAAGCCGCCAAGGCAGCTGAAAGCTCTCGGCAAGGAGAAACT
TCATTGGGTAAGGTTTCTGAACCTTCAACTAACCCTTCTCTCTTTTGCAGGATCAAGCTCGTTGTTACTTACAGCGCAAGAAAGAGGAGCTTGAAGAAGGTTGAGTCTGA
AAAGCCGCTTGAAATGGAGTCCCTCAAAACCGCAAGGATGCCTCCGGACGTATTCGAAGGAATAATTCGCCAAGCAGTGGCAAAGGCCCTTGAGATTGCAGAGGGGTACA
AGGCTGAACAAGATGCTTTGAAAGAAGTTGAAGCGGAGAGAGAGATGGAAAATAAGAAAATGGCTGAGGAAGATGAGTTTGCAAAGGAAAGAGATGAGGGGAATGAGAAA
AGAAAAAGAGAAGAAGAGCAAGAGGCCGAGAGGGCCTTAGAAGCTGAAGAAGAGAAAAGATTAGGTGAAAGCCTCAGGAGGGCAGCCATTGATTTGCAACTCCTTGAGGA
AGAGAAAAAGAGAAGGGAAAAAATAAAAGAAGATGAAAGGCGAAGAAAAGAAGCCGAAGACTTCCTTGCAGCTTTTGAGCCACTCCACAAGGCTCAAAGTGAGGCTGAAG
CACTACAAGGAAGGGTAGAAGAAGAGGCCCAACAGGGGCCAAATGAAGAAATTTTTGAAAAAGAAAAAGGAAGAGAAATGGAGAATGAAGGCCAGAATGTGACCGCATTT
GGACCGCATACTGAGGAAGGCCTAGCCGAGGCCACCATTGATCAGCCAGCTGAAAAGGTTTTTGAGCCTCTATTCACACATGACCCACCAGCAGCTAATAGCACCTCTTC
GGGAGAGAAGAGGGATGATGAGAAAAAAGAAGACGAGGAGGCCGAGACCTCCAGTGATTCTGATTCTGATTTAGAATCTGATTCAGAGATTAGGGAGCTAGATGGCGATC
AAGCCACTATCTCTGCAGCGTTGAGAAGAAAGAGGAAGAGAGAGATAAAGGCTGAGAGGAGGACAAAGAACAAAAATGACCCGATATTTGCCAAGAGGCCGAGGACAAGG
TCCATGGACTCCTCTCCTACAGTCCCTCCGACCGTCTCACCCACCAAGCCTAAGGGCAAGTCACCGAAGGCCGCATCACCTAAAAATCCATTCCCCAAGGTATTTAAAGA
TATTAATTTTCAAGAACGGATGGAGATCATGAAGAAGAGAGATTTCCTCAATGAGAAAGGATTCTCTAACAGAGCAGGAGCACTGCCAGAGTTCGTGAGCAAGATCATAT
CTCAATACAAATGGCAGGAGTTCTGTGCTCACCCTCAGGAGGCTGTTGTGCCTCTAGTGCGAGAATTTTACGCCAGCCTGAGGGAGGAGAGCATTAGCATGGCGGTGGTG
AGGGGGAAGATGGTCAGTTTCTCCTCTGTCGACATTAACAGGGTGTACAGGATCAAAGCACCTTTGAACCCAAGAAGGAATGATGTGATCAGGAACCCTTCGGCCAAGCA
AATGAAAGAAGCATTGAAACTTGTGGCCAACAAGGGGATCCAGTGGAAAGAATCACAGACAAAAGTGAAGTCTCTAGTGTCAAGCGACCTAAAGCCAGAATCGGCAGTTT
GGCTTCACTTCATAAAAAACCACTTGATGCCAACCACCCACGACAGCACGATTTCAGTGGATAGAGTGATGCTACTCTATTGTCTTATGAAGGGGTTGGAAATCAACGTA
GGGAGCATTATCAGGGATGAAATCTTAGCCTGTGGACGGAAAAGGGCAGGCAAGCTTTTCTTGGCTCACTCATCACCCAGCTCTAACAACATCCAGAGGAAGGATAAAGC
CTCCACATCACAAGCCACTCCTCAATCAGGGTCGAATGTAGCTTCTTCATCCCAGCACACTCCTTTCACAGGGCCATCACCGTCATCAGAGGCCCTAGCTATTGCCTACC
GCCAGATAGATCGACTCAGGGACGACCTAAGGACATACTGGGCATATGCAAAGGAGCGGGATGAAGCCATTAGAGAGTTTTATCTCTCTATAGCCCCGAGTATTAGTCCG
GTCTTTCCTAATTTCCCTCAGTCGCTGCTGCCCCAAGAAGAAGAGGATTCTGATGAAGAGGAAGATGAAGAGAATGATGAAGAGAAAGAGAGTTCCTCAGACGAGGAATA
G
Protein sequenceShow/hide protein sequence
MQGTRRTRPTGFSPAVVNQTPNAQAPSSSAISATSREMPSSSSPRRFTRATAVRQTQKPATQQYRKRSREWFEMIREMGSKRRAALEEEGNRQDEDKAAKAAESSRQGET
SLGKVSEPSTNPSLFCRIKLVVTYSARKRSLKKVESEKPLEMESLKTARMPPDVFEGIIRQAVAKALEIAEGYKAEQDALKEVEAEREMENKKMAEEDEFAKERDEGNEK
RKREEEQEAERALEAEEEKRLGESLRRAAIDLQLLEEEKKRREKIKEDERRRKEAEDFLAAFEPLHKAQSEAEALQGRVEEEAQQGPNEEIFEKEKGREMENEGQNVTAF
GPHTEEGLAEATIDQPAEKVFEPLFTHDPPAANSTSSGEKRDDEKKEDEEAETSSDSDSDLESDSEIRELDGDQATISAALRRKRKREIKAERRTKNKNDPIFAKRPRTR
SMDSSPTVPPTVSPTKPKGKSPKAASPKNPFPKVFKDINFQERMEIMKKRDFLNEKGFSNRAGALPEFVSKIISQYKWQEFCAHPQEAVVPLVREFYASLREESISMAVV
RGKMVSFSSVDINRVYRIKAPLNPRRNDVIRNPSAKQMKEALKLVANKGIQWKESQTKVKSLVSSDLKPESAVWLHFIKNHLMPTTHDSTISVDRVMLLYCLMKGLEINV
GSIIRDEILACGRKRAGKLFLAHSSPSSNNIQRKDKASTSQATPQSGSNVASSSQHTPFTGPSPSSEALAIAYRQIDRLRDDLRTYWAYAKERDEAIREFYLSIAPSISP
VFPNFPQSLLPQEEEDSDEEEDEENDEEKESSSDEE