CuGenDBv2

Spg024627 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg024627
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Protein MNN4-like
Genome location: scaffold12:16333483..16336122
RNA-Seq Expression: Spg024627
Synteny: Spg024627
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits    e value    % identity    Alignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]    1.9e-27    29.26
Query:  PFPEVFRDVNFQERMEIMKKRDFLNEKG--FSNRAGA-LPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVY
        P P  F D   +E  + +K R    E G  FS    A L   V  +++++KWQ F  HP  V   +V+EFY+ + E +    +VRG  + F+   INR +
Subjt:  PFPEVFRDVNFQERMEIMKKRDFLNEKG--FSNRAGA-LPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVY

Query:  RIQAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISY
        ++Q   +       +    +  +  L+ +   G +W   Q K K++    L P   +W HF+K++LMPT+H+ T+S  R++LL+ ++ G  I++G II  
Subjt:  RIQAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISY

Query:  EILACRRKRAGKLFFGSLITQLCQRVKIRKDK--------ASTSQATPP--IGPNVASPSQHTSFTGRSPSS-----------EALVIAYRQIDQLRENL
            C +++A  L F +LIT LC++ K+R++         A  ++A  P  +G   A   +H + T R  SS           +A+   ++ + QL + L
Subjt:  EILACRRKRAGKLFFGSLITQLCQRVKIRKDK--------ASTSQATPP--IGPNVASPSQHTSFTGRSPSS-----------EALVIAYRQIDQLRENL

Query:  KTYWVCAKERD
          Y+  AK RD
Subjt:  KTYWVCAKERD

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]    4.3e-35    40.28
Query:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIQAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIQAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISYEILACRRKRAGKLFFG
         +   +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISYEILACRRKRAGKLFFG

Query:  SLITQLCQRVK
        SLIT+LC+  +
Subjt:  SLITQLCQRVK

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]    5.4e-38    31.64
Query:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIQAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I+N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIQAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISYEILACRRKRAGKLFFG
         + + +   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISYEILACRRKRAGKLFFG

Query:  SLITQLCQRVK----IRKDKASTSQATPPI--------GPNVASPSQHTSFTGRSPSSEALVIAYRQIDQLRENL---------------------KTYW
        SLIT+LC+  +    + ++K   +     I        GP  ++    +S    + S+       +Q+  L + L                     + +W
Subjt:  SLITQLCQRVK----IRKDKASTSQATPPI--------GPNVASPSQHTSFTGRSPSSEALVIAYRQIDQLRENL---------------------KTYW

Query:  VCAKERDEAIRDFYLSIARSIAPVFPNFPQSLLPEEEKDSDEEEDEENDDEEKE
          +KERD A++    +      P FP FPQ +L + + + + E D++  +E  E
Subjt:  VCAKERDEAIRDFYLSIARSIAPVFPNFPQSLLPEEEKDSDEEEDEENDDEEKE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]    3.7e-31    34.48
Query:  KAASPKNPFPEVFRDVNFQER-MEIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVD
        KA   ++   E+  + N Q R + + K+  + N K         P F++ +I Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   
Subjt:  KAASPKNPFPEVFRDVNFQER-MEIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVD

Query:  INRVYRIQAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG
Subjt:  INRVYRIQAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVG

Query:  SIISYEILACRRKRAGKLFFGSLITQLCQRVK
         +I  EI AC  +++G LFF SLIT +C+  +
Subjt:  SIISYEILACRRKRAGKLFFGSLITQLCQRVK

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]    1.1e-27    31.63
Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIQAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIK
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I N +  ++   L+ VA  G +W  S     + + S L P + VW HF+K
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIQAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIK

Query:  NRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISYEILACRRKRAGKLFFGSLITQLCQRVK--IRKDKASTSQATPPI--------GPNVASPSQHT
        +RL+PTTH   +S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF SLIT+LC+     + ++K   +     I        GP  ++    +
Subjt:  NRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISYEILACRRKRAGKLFFGSLITQLCQRVK--IRKDKASTSQATPPI--------GPNVASPSQHT

Query:  SFTGRSPSSEALVIAYRQIDQLRENL----------KTYWVCAKERDEAIRDFYLSIARSIAPVFPNFPQSLLPEEEKDSDEEEDEENDDEEKE
        S    + SS       +Q+  L + L          + +W  +KERD A++    +      P FP FPQ +L + + + + E D++  +E  E
Subjt:  SFTGRSPSSEALVIAYRQIDQLRENL----------KTYWVCAKERDEAIRDFYLSIARSIAPVFPNFPQSLLPEEEKDSDEEEDEENDDEEKE

TrEMBL top hits    e value    % identity    Alignment
A0A2P5AGA5 Uncharacterized protein (Fragment)    2.1e-35    40.28
Query:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIQAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIQAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISYEILACRRKRAGKLFFG
         +   +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISYEILACRRKRAGKLFFG

Query:  SLITQLCQRVK
        SLIT+LC+  +
Subjt:  SLITQLCQRVK

A0A2P5BCG4 Uncharacterized protein (Fragment)    2.6e-38    31.64
Query:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIQAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I+N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIQAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISYEILACRRKRAGKLFFG
         + + +   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISYEILACRRKRAGKLFFG

Query:  SLITQLCQRVK----IRKDKASTSQATPPI--------GPNVASPSQHTSFTGRSPSSEALVIAYRQIDQLRENL---------------------KTYW
        SLIT+LC+  +    + ++K   +     I        GP  ++    +S    + S+       +Q+  L + L                     + +W
Subjt:  SLITQLCQRVK----IRKDKASTSQATPPI--------GPNVASPSQHTSFTGRSPSSEALVIAYRQIDQLRENL---------------------KTYW

Query:  VCAKERDEAIRDFYLSIARSIAPVFPNFPQSLLPEEEKDSDEEEDEENDDEEKE
          +KERD A++    +      P FP FPQ +L + + + + E D++  +E  E
Subjt:  VCAKERDEAIRDFYLSIARSIAPVFPNFPQSLLPEEEKDSDEEEDEENDDEEKE

A0A2P5DAQ2 Uncharacterized protein    1.8e-31    34.48
Query:  KAASPKNPFPEVFRDVNFQER-MEIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVD
        KA   ++   E+  + N Q R + + K+  + N K         P F++ +I Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   
Subjt:  KAASPKNPFPEVFRDVNFQER-MEIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVD

Query:  INRVYRIQAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG
Subjt:  INRVYRIQAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVG

Query:  SIISYEILACRRKRAGKLFFGSLITQLCQRVK
         +I  EI AC  +++G LFF SLIT +C+  +
Subjt:  SIISYEILACRRKRAGKLFFGSLITQLCQRVK

A0A2P5DXM3 Uncharacterized protein    5.5e-28    31.63
Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIQAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIK
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I N +  ++   L+ VA  G +W  S     + + S L P + VW HF+K
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIQAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIK

Query:  NRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISYEILACRRKRAGKLFFGSLITQLCQRVK--IRKDKASTSQATPPI--------GPNVASPSQHT
        +RL+PTTH   +S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF SLIT+LC+     + ++K   +     I        GP  ++    +
Subjt:  NRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISYEILACRRKRAGKLFFGSLITQLCQRVK--IRKDKASTSQATPPI--------GPNVASPSQHT

Query:  SFTGRSPSSEALVIAYRQIDQLRENL----------KTYWVCAKERDEAIRDFYLSIARSIAPVFPNFPQSLLPEEEKDSDEEEDEENDDEEKE
        S    + SS       +Q+  L + L          + +W  +KERD A++    +      P FP FPQ +L + + + + E D++  +E  E
Subjt:  SFTGRSPSSEALVIAYRQIDQLRENL----------KTYWVCAKERDEAIRDFYLSIARSIAPVFPNFPQSLLPEEEKDSDEEEDEENDDEEKE

A0A6A2ZUE4 Uncharacterized protein    9.3e-28    29.26
Query:  PFPEVFRDVNFQERMEIMKKRDFLNEKG--FSNRAGA-LPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVY
        P P  F D   +E  + +K R    E G  FS    A L   V  +++++KWQ F  HP  V   +V+EFY+ + E +    +VRG  + F+   INR +
Subjt:  PFPEVFRDVNFQERMEIMKKRDFLNEKG--FSNRAGA-LPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVY

Query:  RIQAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISY
        ++Q   +       +    +  +  L+ +   G +W   Q K K++    L P   +W HF+K++LMPT+H+ T+S  R++LL+ ++ G  I++G II  
Subjt:  RIQAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISY

Query:  EILACRRKRAGKLFFGSLITQLCQRVKIRKDK--------ASTSQATPP--IGPNVASPSQHTSFTGRSPSS-----------EALVIAYRQIDQLRENL
            C +++A  L F +LIT LC++ K+R++         A  ++A  P  +G   A   +H + T R  SS           +A+   ++ + QL + L
Subjt:  EILACRRKRAGKLFFGSLITQLCQRVKIRKDK--------ASTSQATPP--IGPNVASPSQHTSFTGRSPSS-----------EALVIAYRQIDQLRENL

Query:  KTYWVCAKERD
          Y+  AK RD
Subjt:  KTYWVCAKERD

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGAAGAACACTCCAAAACCCTCATCATCACGCAAGAACACTCGATCTCAGAGTGCTCAAGCAACCCACGAAGCTGAAGCAAACGTGCGACGGCAAGAAGAGAACCCCGA
AACGCCCATGCAAGGCACAAGAAGAATGAGACCCACGAAGCTGAACCAAGCGTCCAACGCTCCAACTCCATCTTCTTCGACAACGTCGGCCAGTTCGAGGGAGATGCCAA
GTTCATCTACGCCAAGTAGGTTCATGCGCGCCACTGCTGTCCGCCAAACCCAAAAGCCCGCCACTCAACAGTTCAGAAAACGTTCGCAGGAGTGGTTTGCAATGATCCGA
GAGATGGGTGCTCAGAGACGTACTGCCCTGAAAGAGAAAGGGAATCAGCAAGATGAAAAAGAAGCCGCCAAGGCAGTTGGAAGCTCTCGGCAAGGAGAAGCTTCAATGGA
TAGGGTTTCCAAACCTTCAACTAACCCTTCTCTATCTTGCAGGACCAAACCCGTTGTTACTTACAGTGCAAGAAAGAGGAGCGTGAAGATGGTTGTGTCTGAAAGGCCGC
TAAAAATTGAGCCCCTCAAAATCGCAAGGATTCCTCCGGATGTATTCGAAGGAATAATTCGCCAAGCAGTGGCAAAGGCTCTTGCGATTGCTGAAGGGTATAAGGCTGAA
CAGGATGCTTTGAAAGAGATTGAGACTGAGAGAGAGATGAAAAATCAGAAAATGGTTAAGGAAGACGAGCTTGCAAAGGGAAGAGACAGTGAGGAAGAGAAAAGAATGAG
AGAAGAAGAACAAGAGGTCGAGAAAGCCTTAGAAGCTGAGGAAGTAAGAAAGTATGAAGAAAACCTCAGGAGGGCAGCTATGGATTTGCAACTCTTTGAGGATGAGAAAA
AGAGAAGAGAAGAGCTGAAACAAGATGAAAAAAGAAGGAAGGAAGCTGAAGACCTCCTTACAGCCTTTGAGCCACTCCACAAGGCTCAAAGCCTAGCTGAGGGCACCATT
GATCAGCCAGCTGAAGAGGTTTTTGAACCTCTATTCACGAATGACCCACTAGCTGCTGATAACACCTCTTTGGGAGAGAAGAGGGACGAAGAGGAAAAGGAAGATGAGGA
GGCCGAGATCTCCACTAACTCTGATACAGAATCTGATTCAGAGATAAGGGAGCTGGATGATGACCAAGTTCCTATCTTTGCAGCGTTAAGAAGAAAGAGAAAGAGAGAGA
TTAAGGTCGAGAGGAGGACAAAGAACAAGAATGACCGGATATTTGCCAAGAGGCCGAGGACGAGGTCCATGGACGCCTCTCCTGCAGTTCCTCCCTCCATCTCACCCGCC
AAGCCAAAGGGAAAATCACCTAAGGCTGCATCTCCCAAAAATCCGTTCCCTGAGGTATTTAGAGATGTTAATTTTCAGGAACGAATGGAGATCATGAAGAAAAGAGATTT
CCTCAATGAAAAGGGATTCTCTAACAGAGCTGGAGCACTGCCAGAGTTTGTGAGCAGGATCATATCTCAATACAAATGGCAGGACTTCTGTGCTCACCCTCAGGAGGTTG
TTGTGCCTTTAGTTCGTGAGTTTTACGCTGGCCTGAGGGAGGAAAGTATTAGCATGGCGGTTGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTTGACATTAATAGGGTG
TACAGGATCCAGGCACCCCTGAATCCGAGAGGTAATGATGTGATAAGGAACCCTTCGGCCAAGCAAATGAAGGAAGCTCTGAAACTTGTGGCCAACAAAGGGGTCCAATG
GAAAGAATCACAGACGAAAGTGAAGTCTTTAGTGCCAAGCGACCTAAAGCCAGAATCTGCAGTTTGGCTTCACTTCATCAAGAACCGCTTGATGCCAACCACCCACGACA
ACACAATTTCAGTGGATAGAGTGATGCTACTCTATTGCCTTATGAAGGGGTTGGAAATCAACGTAGGGAGCATTATCAGTTATGAGATTTTAGCCTGTAGGAGAAAGCGA
GCAGGCAAGCTTTTCTTTGGATCACTCATCACCCAACTTTGCCAAAGGGTGAAGATCAGGAAAGACAAAGCCTCTACATCTCAGGCTACTCCACCTATAGGGCCGAATGT
AGCTTCTCCATCCCAGCACACTTCTTTCACAGGGCGTTCGCCATCATCTGAAGCCCTAGTCATTGCCTACCGCCAGATAGATCAACTCAGGGAGAACCTGAAGACGTATT
GGGTATGTGCAAAGGAGAGGGATGAAGCTATTAGAGATTTCTATCTCTCGATCGCCCGAAGTATTGCTCCAGTCTTTCCAAATTTCCCTCAGTCGCTGCTGCCTGAAGAA
GAGAAGGATTCTGATGAAGAGGAAGATGAAGAGAATGATGATGAAGAGAAAGAGAGTTCCTCGGACGAGGAATATGGGAGTTTTCTGATCCCCTTTGACTGA
mRNA sequence
ATGAAGAACACTCCAAAACCCTCATCATCACGCAAGAACACTCGATCTCAGAGTGCTCAAGCAACCCACGAAGCTGAAGCAAACGTGCGACGGCAAGAAGAGAACCCCGA
AACGCCCATGCAAGGCACAAGAAGAATGAGACCCACGAAGCTGAACCAAGCGTCCAACGCTCCAACTCCATCTTCTTCGACAACGTCGGCCAGTTCGAGGGAGATGCCAA
GTTCATCTACGCCAAGTAGGTTCATGCGCGCCACTGCTGTCCGCCAAACCCAAAAGCCCGCCACTCAACAGTTCAGAAAACGTTCGCAGGAGTGGTTTGCAATGATCCGA
GAGATGGGTGCTCAGAGACGTACTGCCCTGAAAGAGAAAGGGAATCAGCAAGATGAAAAAGAAGCCGCCAAGGCAGTTGGAAGCTCTCGGCAAGGAGAAGCTTCAATGGA
TAGGGTTTCCAAACCTTCAACTAACCCTTCTCTATCTTGCAGGACCAAACCCGTTGTTACTTACAGTGCAAGAAAGAGGAGCGTGAAGATGGTTGTGTCTGAAAGGCCGC
TAAAAATTGAGCCCCTCAAAATCGCAAGGATTCCTCCGGATGTATTCGAAGGAATAATTCGCCAAGCAGTGGCAAAGGCTCTTGCGATTGCTGAAGGGTATAAGGCTGAA
CAGGATGCTTTGAAAGAGATTGAGACTGAGAGAGAGATGAAAAATCAGAAAATGGTTAAGGAAGACGAGCTTGCAAAGGGAAGAGACAGTGAGGAAGAGAAAAGAATGAG
AGAAGAAGAACAAGAGGTCGAGAAAGCCTTAGAAGCTGAGGAAGTAAGAAAGTATGAAGAAAACCTCAGGAGGGCAGCTATGGATTTGCAACTCTTTGAGGATGAGAAAA
AGAGAAGAGAAGAGCTGAAACAAGATGAAAAAAGAAGGAAGGAAGCTGAAGACCTCCTTACAGCCTTTGAGCCACTCCACAAGGCTCAAAGCCTAGCTGAGGGCACCATT
GATCAGCCAGCTGAAGAGGTTTTTGAACCTCTATTCACGAATGACCCACTAGCTGCTGATAACACCTCTTTGGGAGAGAAGAGGGACGAAGAGGAAAAGGAAGATGAGGA
GGCCGAGATCTCCACTAACTCTGATACAGAATCTGATTCAGAGATAAGGGAGCTGGATGATGACCAAGTTCCTATCTTTGCAGCGTTAAGAAGAAAGAGAAAGAGAGAGA
TTAAGGTCGAGAGGAGGACAAAGAACAAGAATGACCGGATATTTGCCAAGAGGCCGAGGACGAGGTCCATGGACGCCTCTCCTGCAGTTCCTCCCTCCATCTCACCCGCC
AAGCCAAAGGGAAAATCACCTAAGGCTGCATCTCCCAAAAATCCGTTCCCTGAGGTATTTAGAGATGTTAATTTTCAGGAACGAATGGAGATCATGAAGAAAAGAGATTT
CCTCAATGAAAAGGGATTCTCTAACAGAGCTGGAGCACTGCCAGAGTTTGTGAGCAGGATCATATCTCAATACAAATGGCAGGACTTCTGTGCTCACCCTCAGGAGGTTG
TTGTGCCTTTAGTTCGTGAGTTTTACGCTGGCCTGAGGGAGGAAAGTATTAGCATGGCGGTTGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTTGACATTAATAGGGTG
TACAGGATCCAGGCACCCCTGAATCCGAGAGGTAATGATGTGATAAGGAACCCTTCGGCCAAGCAAATGAAGGAAGCTCTGAAACTTGTGGCCAACAAAGGGGTCCAATG
GAAAGAATCACAGACGAAAGTGAAGTCTTTAGTGCCAAGCGACCTAAAGCCAGAATCTGCAGTTTGGCTTCACTTCATCAAGAACCGCTTGATGCCAACCACCCACGACA
ACACAATTTCAGTGGATAGAGTGATGCTACTCTATTGCCTTATGAAGGGGTTGGAAATCAACGTAGGGAGCATTATCAGTTATGAGATTTTAGCCTGTAGGAGAAAGCGA
GCAGGCAAGCTTTTCTTTGGATCACTCATCACCCAACTTTGCCAAAGGGTGAAGATCAGGAAAGACAAAGCCTCTACATCTCAGGCTACTCCACCTATAGGGCCGAATGT
AGCTTCTCCATCCCAGCACACTTCTTTCACAGGGCGTTCGCCATCATCTGAAGCCCTAGTCATTGCCTACCGCCAGATAGATCAACTCAGGGAGAACCTGAAGACGTATT
GGGTATGTGCAAAGGAGAGGGATGAAGCTATTAGAGATTTCTATCTCTCGATCGCCCGAAGTATTGCTCCAGTCTTTCCAAATTTCCCTCAGTCGCTGCTGCCTGAAGAA
GAGAAGGATTCTGATGAAGAGGAAGATGAAGAGAATGATGATGAAGAGAAAGAGAGTTCCTCGGACGAGGAATATGGGAGTTTTCTGATCCCCTTTGACTGA
Protein sequence
MKNTPKPSSSRKNTRSQSAQATHEAEANVRRQEENPETPMQGTRRMRPTKLNQASNAPTPSSSTTSASSREMPSSSTPSRFMRATAVRQTQKPATQQFRKRSQEWFAMIR
EMGAQRRTALKEKGNQQDEKEAAKAVGSSRQGEASMDRVSKPSTNPSLSCRTKPVVTYSARKRSVKMVVSERPLKIEPLKIARIPPDVFEGIIRQAVAKALAIAEGYKAE
QDALKEIETEREMKNQKMVKEDELAKGRDSEEEKRMREEEQEVEKALEAEEVRKYEENLRRAAMDLQLFEDEKKRREELKQDEKRRKEAEDLLTAFEPLHKAQSLAEGTI
DQPAEEVFEPLFTNDPLAADNTSLGEKRDEEEKEDEEAEISTNSDTESDSEIRELDDDQVPIFAALRRKRKREIKVERRTKNKNDRIFAKRPRTRSMDASPAVPPSISPA
KPKGKSPKAASPKNPFPEVFRDVNFQERMEIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEVVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRV
YRIQAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIISYEILACRRKR
AGKLFFGSLITQLCQRVKIRKDKASTSQATPPIGPNVASPSQHTSFTGRSPSSEALVIAYRQIDQLRENLKTYWVCAKERDEAIRDFYLSIARSIAPVFPNFPQSLLPEE
EKDSDEEEDEENDDEEKESSSDEEYGSFLIPFD