CuGenDBv2

Spg011021 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg011021
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Protein MNN4-like
Genome location: scaffold4:24839304..24841938
RNA-Seq Expression: Spg011021
Synteny: Spg011021
Gene Ontology terms: NA
InterPro domains: NA
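
The genome location above follows a `scaffold:start..end` pattern. The following is a minimal Python sketch of how such a string could be split and the locus span computed. It assumes the coordinates are 1-based and inclusive, which the page does not state, and `parse_location` is a hypothetical helper, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split 'scaffold:start..end' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

scaffold, start, end = parse_location("scaffold4:24839304..24841938")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(scaffold, start, end, span)
```

Under a 0-based, half-open convention the span would instead be `end - start`, so the coordinate convention is worth confirming before using the number.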


Homology
GenBank top hits | e value | %identity
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis] | 5.7e-11 | 31.11
Query:  PEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKES
        P F++R+I Q+ W+ FC HP   ++PLVREFY  L + +     V    V F+A  IN ++ ++  ++    D     + +Q++  L  VA +G  W+ S
Subjt:  PEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKES

Query:  QTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRT
             + +   LK  + +  HF+  R MP+TH +T
Subjt:  QTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRT

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] | 1.4e-17 | 35.44
Query:  MKKRDFLKEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ ++PLVREFY  L +   +   V G  VS+S   IN V+ +  P++   ++ I N
Subjt:  MKKRDFLKEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRT
         +   +   L+ VA  G +W  S     + + S L P + V  HF+K+ L+PTTH +T
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRT

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | 2.2e-18 | 26.82
Query:  MKKRDFLKEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ ++PLVREFY  L +   +   V G  VS+S   IN V+ +  P++   ++ I+N
Subjt:  MKKRDFLKEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRTHFSGYCDATLLPYEGIVPGKD-----------------EERH
         + + +   L+ VA  G +W  S     + + S L P + V  HF+K+RL+PTTH +T      D  LL +  ++ GK                  +   
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRTHFSGYCDATLLPYEGIVPGKD-----------------EERH

Query:  FFKPTIDLSLIGKLQQNSIQRKDKASTSQAIPPIG-SNVASPSQHTPFTGPSPSSEALAIAYRQLDQIRENLKT--------------------------
         F P++   L    +   +  ++K   +  I  I  + +A          PS S  A A + R    I + LK                           
Subjt:  FFKPTIDLSLIGKLQQNSIQRKDKASTSQAIPPIG-SNVASPSQHTPFTGPSPSSEALAIAYRQLDQIRENLKT--------------------------

Query:  --YWAYATERDEAIREFYLSIAPSIAPVFPNFPQSLLPEEEEDSDEEEDEENDDEDDE
          +WAY+ ERD A+++   +      P FP FPQ +L + + + + E D++  +E  E
Subjt:  --YWAYATERDEAIREFYLSIAPSIAPVFPNFPQSLLPEEEEDSDEEEDEENDDEDDE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] | 8.5e-15 | 28.65
Query:  KAASSKNLIPEVFRDVNFQERMEIMKKRDFLKEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDI
        KA   ++   E+  + N Q R  +  +++F+ +   +++    P F++ +I Q+ WQ FCAHP++ ++PLVREFY  +         + G  V  S   I
Subjt:  KAASSKNLIPEVFRDVNFQERMEIMKKRDFLKEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDI

Query:  NRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRT
        N ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + V  HF+K+RL+PTTH +T
Subjt:  NRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRT

XP_038904385.1 uncharacterized protein LOC120090747 [Benincasa hispida] | 2.3e-12 | 25.48
Query:  EEGLAEATVDQPAEEVFEPLFTNDPPAADSTSSGEK--RDEEEKED--EEAKTSTDSDTESDLEIRELDDDQVHISAALRRKRKREIKAERRKNKNDPIF
        +E +   + + P   V E +     P    T+ G+K  + +E++ D  EEA+   +   +   E RE            RR+ KR  K E+RK       
Subjt:  EEGLAEATVDQPAEEVFEPLFTNDPPAADSTSSGEK--RDEEEKED--EEAKTSTDSDTESDLEIRELDDDQVHISAALRRKRKREIKAERRKNKNDPIF

Query:  AKRPRTRSMDASPAVPSTISPAKPKGKSPKAASS--KNLIPEVFRDVNFQERMEIMKKR--------------------DFLKEKGFSNRAGALPEFVSR
        A+ P       + +V   +SP + K   P+ AS   + ++ E   D +    M   K+                     D + E GF   +  LP+F S 
Subjt:  AKRPRTRSMDASPAVPSTISPAKPKGKSPKAASS--KNLIPEVFRDVNFQERMEIMKKR--------------------DFLKEKGFSNRAGALPEFVSR

Query:  IISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKS
        ++ ++ W+ F       +  +VR FY G    +    ++ G +V FSA DIN +Y++K   +  GN +I +P  ++M++AL+ +   G QW  S   +K+
Subjt:  IISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKS

Query:  LVPSYLKPESAVCLHFIKNRLMPTTHDRT-----HFSGYCDATLLPYEGIVPGKDEERHFFKPTIDLS-LIGKLQQNSIQRKDKASTS--------QAIP
        L  S L PE+ + ++ +K R++PT+HD+T       + YC A      GI+             ID+S LI    + + QRK +   S          + 
Subjt:  LVPSYLKPESAVCLHFIKNRLMPTTHDRT-----HFSGYCDATLLPYEGIVPGKDEERHFFKPTIDLS-LIGKLQQNSIQRKDKASTS--------QAIP

Query:  PIGSNV--ASPSQHTPFTGP
        P+  N   + P+  TPF  P
Subjt:  PIGSNV--ASPSQHTPFTGP
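
The %identity column can be related to the middle line of each alignment. The sketch below assumes the usual BLAST convention that a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap; the page itself does not say how the value is computed. `identity_stats` is a hypothetical helper, and the input is only the final segment of the EXB53755.1 alignment, so the result is a per-segment figure rather than the full-alignment value in the table.

```python
def identity_stats(query_segment, midline_segment):
    """Return (identical_columns, total_columns) for one alignment segment."""
    # Assumption: letters on the middle line mark identical residues.
    identical = sum(1 for ch in midline_segment if ch.isalpha())
    # The query line includes gap characters, so its length is the column count.
    total = len(query_segment)
    return identical, total

query = "QTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRT"
midline = "     + +   LK  + +  HF+  R MP+TH +T"

identical, total = identity_stats(query, midline)
percent = 100.0 * identical / total
print(identical, total, round(percent, 2))
```

To reproduce a table value, the counts from every segment of one hit would be summed before dividing.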

TrEMBL top hits | e value | %identity
A0A2P5AGA5 Uncharacterized protein (Fragment) | 6.8e-18 | 35.44
Query:  MKKRDFLKEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ ++PLVREFY  L +   +   V G  VS+S   IN V+ +  P++   ++ I N
Subjt:  MKKRDFLKEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRT
         +   +   L+ VA  G +W  S     + + S L P + V  HF+K+ L+PTTH +T
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRT

A0A2P5BCG4 Uncharacterized protein (Fragment) | 1.0e-18 | 26.82
Query:  MKKRDFLKEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ ++PLVREFY  L +   +   V G  VS+S   IN V+ +  P++   ++ I+N
Subjt:  MKKRDFLKEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRTHFSGYCDATLLPYEGIVPGKD-----------------EERH
         + + +   L+ VA  G +W  S     + + S L P + V  HF+K+RL+PTTH +T      D  LL +  ++ GK                  +   
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRTHFSGYCDATLLPYEGIVPGKD-----------------EERH

Query:  FFKPTIDLSLIGKLQQNSIQRKDKASTSQAIPPIG-SNVASPSQHTPFTGPSPSSEALAIAYRQLDQIRENLKT--------------------------
         F P++   L    +   +  ++K   +  I  I  + +A          PS S  A A + R    I + LK                           
Subjt:  FFKPTIDLSLIGKLQQNSIQRKDKASTSQAIPPIG-SNVASPSQHTPFTGPSPSSEALAIAYRQLDQIRENLKT--------------------------

Query:  --YWAYATERDEAIREFYLSIAPSIAPVFPNFPQSLLPEEEEDSDEEEDEENDDEDDE
          +WAY+ ERD A+++   +      P FP FPQ +L + + + + E D++  +E  E
Subjt:  --YWAYATERDEAIREFYLSIAPSIAPVFPNFPQSLLPEEEEDSDEEEDEENDDEDDE

A0A2P5DAQ2 Uncharacterized protein | 4.1e-15 | 28.65
Query:  KAASSKNLIPEVFRDVNFQERMEIMKKRDFLKEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDI
        KA   ++   E+  + N Q R  +  +++F+ +   +++    P F++ +I Q+ WQ FCAHP++ ++PLVREFY  +         + G  V  S   I
Subjt:  KAASSKNLIPEVFRDVNFQERMEIMKKRDFLKEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDI

Query:  NRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRT
        N ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + V  HF+K+RL+PTTH +T
Subjt:  NRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRT

A0A5A7TZE0 Protein MNN4-like | 6.6e-05 | 27.64
Query:  SSGEKRDEEEKEDEEAKTSTDSDTESDLEIRELD--DDQVHISAALRRKRKREIK-AERRKNKNDPIFAKRPRTRSMDASPAVPSTISPAKPKGKSPKAA
        + G K   EEK++ + KT+ +   E + E+ EL   +D+V +    ++KR  E + A +R+ KN                     T+   +   KS +  
Subjt:  SSGEKRDEEEKEDEEAKTSTDSDTESDLEIRELD--DDQVHISAALRRKRKREIK-AERRKNKNDPIFAKRPRTRSMDASPAVPSTISPAKPKGKSPKAA

Query:  SSKNLIPEVFRDVNFQERMEIMKKRDFLKEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRV
                    V F E +     + F+ EKG     G LP F++  I   KW+ F          ++  FY G        A+V GKMV+F    +N +
Subjt:  SSKNLIPEVFRDVNFQERMEIMKKRDFLKEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRV

Query:  YRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRT
        Y ++         + + PS   M+ AL+ VA  G++W  +  K   L P  LK  ++V L FIK  LMPT HD T
Subjt:  YRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRT

W9RBS1 Uncharacterized protein | 4.0e-10 | 29.3
Query:  FAKRPRTRSMDASPAVPSTISPAKPKGKSPKAASSKNLIPEVFRDVNFQERMEIMKKRDFLKEKGF---SNRAGALPEFVSRIISQYKWQDFCAHPQEAV
        FAKRP + S    PA+    + A P   + + ++++  +     +  ++E    +  R+ +KEKGF    +     P F+S +I    WQ FC HP + +
Subjt:  FAKRPRTRSMDASPAVPSTISPAKPKGKSPKAASSKNLIPEVFRDVNFQERMEIMKKRDFLKEKGF---SNRAGALPEFVSRIISQYKWQDFCAHPQEAV

Query:  LPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCL
        +PLV+EFY  L+ +  +   V    ++F++  IN V  I     P  +D    +I +   +Q+KE LK +A  G QW  S     +     L+P + V  
Subjt:  LPLVREFYVGLREKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCL

Query:  HFIKNRLMPTTHDRT
        HF+ +RL+ +TH +T
Subjt:  HFIKNRLMPTTHDRT

SwissProt top hits | e value | %identity
No hits found
Arabidopsis top hits | e value | %identity
No hits found

Sequences
CDS sequence
ATGAAGAACACTCCAAAACCCTCATCATCACGCAAGAACACTCGTTCTCAGAGTGCTCGAGCAACCCACGAAGCTGAAGCAAACGTGCGACGACAAGAGGAGAACCCCGA
AACGCCCATGCAAGGCACACGAAGGACGAGACCCACGGGATTCTCACCGGCGGTCGTGAACCAAGCGTCCAACGCTCCAACTCCATCTTCTTCGACAATGCCGACTAGTT
CGAGGGAGATGCTGAGTTTGTCTACGCCAAGGAGGTTCATGCGTGCTACTGTCTTCCGCCAAACCCAAAAGCCCGCCACTCAACAGTTCAAAAAACGTTCGCGGGAGTGG
TTTGCAATGATCCGGGAGATGGGTGCTCAGAGACGTGCTACCCTTGAAGAGGAAGGGAATCGACAAGATGAAAAAGAAGCCGCCAAGGCAGCTGGAAGCTCTCGGCAAGG
AGAAGCTTCAATGGGTAAGGTTTCCGAACCTTCAACTAATCCTTCTCTATCTTGCACGATCAAGCCCGTTGTTACCTATAACGCAAGAAAGAGGAGCCCGAAGAAGGTTG
TGTCTGAAAAACCGCTTGAGGCCAAACCCCTCAAAACCGCAAGGATGCCTCCGGATGTATTCGAAGGAATAATTTGCCAAGCAGTGGCAAAGGCTCTTGCGATTGCTGAA
GGGTATAAGGCTGAACAGGATGCTTTGAAAGAGATTGAGACTGAGAAAGAGATAGAAAATCAGAAAATGGTTGAGGAAGACAAGCTTGCAAAAGGAAGAGACCGTGAAGA
AGAGAAAAGAAGAAGAGAAGAAGAGCAAGAGGCCGAGAGGGCCTTAGAAGTTGAGGAAGAAAGAAAATATGAGGAAAACCTCAGGAGAGCAGCCATGGATTTGCAACTCT
CTAGGGAAGAGAAAAAGAGAAGGGAAGAAATAAAAGAAAATGAAAAAAAAAGGAAGGAAGCGGAAGACTTCCTTGCAGTCTTTGTGCCACTCCACAAAGCTCAAAGTGAG
GCTGAAGCACTGCAAGGGAAGAATGCGACCGCATTTGGGCCGCATTCTGAAGAAGGCCTAGCCGAGGCCACCGTTGATCAGCCAGCTGAAGAGGTTTTTGAACCCCTATT
TACAAATGACCCACCAGCTGCTGATAGCACCTCTTCAGGAGAGAAGAGGGACGAAGAAGAAAAGGAAGATGAGGAGGCCAAGACCTCCACTGACTCTGATACAGAATCTG
ATTTAGAGATAAGGGAGCTGGATGATGACCAAGTCCATATCTCTGCAGCGTTAAGAAGAAAGAGAAAGAGAGAGATTAAGGCTGAGAGGAGGAAGAACAAGAATGACCCA
ATATTTGCCAAGAGGCCGAGGACAAGGTCCATGGACGCCTCTCCTGCAGTCCCTTCGACCATCTCACCCGCCAAGCCAAAGGGCAAATCACCGAAGGCTGCATCTTCCAA
AAATTTGATCCCTGAGGTATTTAGAGATGTTAATTTCCAGGAACGAATGGAGATCATGAAGAAAAGAGATTTCCTCAAGGAAAAAGGATTCTCTAACAGAGCTGGAGCAC
TGCCAGAGTTCGTGAGCAGGATCATATCTCAATACAAGTGGCAGGACTTCTGTGCTCACCCTCAGGAGGCTGTTCTGCCTTTAGTTCGTGAATTCTACGTCGGCCTGAGG
GAGAAGAGTATCAGCATGGCGGTTGTGACGGGGAAGATGGTCAGTTTCTCCGCAGTCGACATTAATAGGGTGTACAGGATCAAGGCACCCCTGAACCCGAGAGGGAATGA
TGTAATCAGGAACCCTTCGGCTAAACAAATGAAGGAAGCATTGAAACTTGTGGCCAACAAGGGGGTCCAATGGAAAGAATCACAGACGAAAGTGAAGTCTTTAGTGCCAA
GCTACTTAAAGCCAGAATCGGCAGTTTGTCTTCACTTCATCAAGAACCGCTTGATGCCAACCACCCACGACAGAACACATTTCAGTGGATATTGTGATGCTACTCTATTG
CCTTATGAAGGGATCGTTCCAGGCAAGGACGAGGAGCGTCACTTCTTCAAGCCGACCATCGACCTGTCCTTGATCGGGAAGCTACAGCAGAACAGCATCCAGAGAAAAGA
CAAAGCCTCTACATCTCAGGCTATTCCACCTATAGGGTCGAATGTAGCTTCTCCATCCCAGCACACTCCTTTCACAGGGCCTTCGCCATCATCGGAAGCACTAGCTATTG
CCTACCGTCAGCTCGATCAAATCAGGGAGAACCTGAAGACATATTGGGCGTATGCAACGGAGCGGGATGAAGCCATTAGAGAGTTTTATCTCTCTATTGCCCCAAGTATT
GCTCCGGTCTTTCCAAATTTCCCTCAGTCGCTGCTGCCAGAAGAAGAAGAGGATTCTGATGAAGAGGAAGATGAAGAGAATGATGATGAAGATGATGAAGAGAAAGAGAG
TTCCTCAGACGAGGAATAG
mRNA sequence
ATGAAGAACACTCCAAAACCCTCATCATCACGCAAGAACACTCGTTCTCAGAGTGCTCGAGCAACCCACGAAGCTGAAGCAAACGTGCGACGACAAGAGGAGAACCCCGA
AACGCCCATGCAAGGCACACGAAGGACGAGACCCACGGGATTCTCACCGGCGGTCGTGAACCAAGCGTCCAACGCTCCAACTCCATCTTCTTCGACAATGCCGACTAGTT
CGAGGGAGATGCTGAGTTTGTCTACGCCAAGGAGGTTCATGCGTGCTACTGTCTTCCGCCAAACCCAAAAGCCCGCCACTCAACAGTTCAAAAAACGTTCGCGGGAGTGG
TTTGCAATGATCCGGGAGATGGGTGCTCAGAGACGTGCTACCCTTGAAGAGGAAGGGAATCGACAAGATGAAAAAGAAGCCGCCAAGGCAGCTGGAAGCTCTCGGCAAGG
AGAAGCTTCAATGGGTAAGGTTTCCGAACCTTCAACTAATCCTTCTCTATCTTGCACGATCAAGCCCGTTGTTACCTATAACGCAAGAAAGAGGAGCCCGAAGAAGGTTG
TGTCTGAAAAACCGCTTGAGGCCAAACCCCTCAAAACCGCAAGGATGCCTCCGGATGTATTCGAAGGAATAATTTGCCAAGCAGTGGCAAAGGCTCTTGCGATTGCTGAA
GGGTATAAGGCTGAACAGGATGCTTTGAAAGAGATTGAGACTGAGAAAGAGATAGAAAATCAGAAAATGGTTGAGGAAGACAAGCTTGCAAAAGGAAGAGACCGTGAAGA
AGAGAAAAGAAGAAGAGAAGAAGAGCAAGAGGCCGAGAGGGCCTTAGAAGTTGAGGAAGAAAGAAAATATGAGGAAAACCTCAGGAGAGCAGCCATGGATTTGCAACTCT
CTAGGGAAGAGAAAAAGAGAAGGGAAGAAATAAAAGAAAATGAAAAAAAAAGGAAGGAAGCGGAAGACTTCCTTGCAGTCTTTGTGCCACTCCACAAAGCTCAAAGTGAG
GCTGAAGCACTGCAAGGGAAGAATGCGACCGCATTTGGGCCGCATTCTGAAGAAGGCCTAGCCGAGGCCACCGTTGATCAGCCAGCTGAAGAGGTTTTTGAACCCCTATT
TACAAATGACCCACCAGCTGCTGATAGCACCTCTTCAGGAGAGAAGAGGGACGAAGAAGAAAAGGAAGATGAGGAGGCCAAGACCTCCACTGACTCTGATACAGAATCTG
ATTTAGAGATAAGGGAGCTGGATGATGACCAAGTCCATATCTCTGCAGCGTTAAGAAGAAAGAGAAAGAGAGAGATTAAGGCTGAGAGGAGGAAGAACAAGAATGACCCA
ATATTTGCCAAGAGGCCGAGGACAAGGTCCATGGACGCCTCTCCTGCAGTCCCTTCGACCATCTCACCCGCCAAGCCAAAGGGCAAATCACCGAAGGCTGCATCTTCCAA
AAATTTGATCCCTGAGGTATTTAGAGATGTTAATTTCCAGGAACGAATGGAGATCATGAAGAAAAGAGATTTCCTCAAGGAAAAAGGATTCTCTAACAGAGCTGGAGCAC
TGCCAGAGTTCGTGAGCAGGATCATATCTCAATACAAGTGGCAGGACTTCTGTGCTCACCCTCAGGAGGCTGTTCTGCCTTTAGTTCGTGAATTCTACGTCGGCCTGAGG
GAGAAGAGTATCAGCATGGCGGTTGTGACGGGGAAGATGGTCAGTTTCTCCGCAGTCGACATTAATAGGGTGTACAGGATCAAGGCACCCCTGAACCCGAGAGGGAATGA
TGTAATCAGGAACCCTTCGGCTAAACAAATGAAGGAAGCATTGAAACTTGTGGCCAACAAGGGGGTCCAATGGAAAGAATCACAGACGAAAGTGAAGTCTTTAGTGCCAA
GCTACTTAAAGCCAGAATCGGCAGTTTGTCTTCACTTCATCAAGAACCGCTTGATGCCAACCACCCACGACAGAACACATTTCAGTGGATATTGTGATGCTACTCTATTG
CCTTATGAAGGGATCGTTCCAGGCAAGGACGAGGAGCGTCACTTCTTCAAGCCGACCATCGACCTGTCCTTGATCGGGAAGCTACAGCAGAACAGCATCCAGAGAAAAGA
CAAAGCCTCTACATCTCAGGCTATTCCACCTATAGGGTCGAATGTAGCTTCTCCATCCCAGCACACTCCTTTCACAGGGCCTTCGCCATCATCGGAAGCACTAGCTATTG
CCTACCGTCAGCTCGATCAAATCAGGGAGAACCTGAAGACATATTGGGCGTATGCAACGGAGCGGGATGAAGCCATTAGAGAGTTTTATCTCTCTATTGCCCCAAGTATT
GCTCCGGTCTTTCCAAATTTCCCTCAGTCGCTGCTGCCAGAAGAAGAAGAGGATTCTGATGAAGAGGAAGATGAAGAGAATGATGATGAAGATGATGAAGAGAAAGAGAG
TTCCTCAGACGAGGAATAG
Protein sequence
MKNTPKPSSSRKNTRSQSARATHEAEANVRRQEENPETPMQGTRRTRPTGFSPAVVNQASNAPTPSSSTMPTSSREMLSLSTPRRFMRATVFRQTQKPATQQFKKRSREW
FAMIREMGAQRRATLEEEGNRQDEKEAAKAAGSSRQGEASMGKVSEPSTNPSLSCTIKPVVTYNARKRSPKKVVSEKPLEAKPLKTARMPPDVFEGIICQAVAKALAIAE
GYKAEQDALKEIETEKEIENQKMVEEDKLAKGRDREEEKRRREEEQEAERALEVEEERKYEENLRRAAMDLQLSREEKKRREEIKENEKKRKEAEDFLAVFVPLHKAQSE
AEALQGKNATAFGPHSEEGLAEATVDQPAEEVFEPLFTNDPPAADSTSSGEKRDEEEKEDEEAKTSTDSDTESDLEIRELDDDQVHISAALRRKRKREIKAERRKNKNDP
IFAKRPRTRSMDASPAVPSTISPAKPKGKSPKAASSKNLIPEVFRDVNFQERMEIMKKRDFLKEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVLPLVREFYVGLR
EKSISMAVVTGKMVSFSAVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSYLKPESAVCLHFIKNRLMPTTHDRTHFSGYCDATLL
PYEGIVPGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQAIPPIGSNVASPSQHTPFTGPSPSSEALAIAYRQLDQIRENLKTYWAYATERDEAIREFYLSIAPSI
APVFPNFPQSLLPEEEEDSDEEEDEENDDEDDEEKESSSDEE
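
The protein sequence corresponds to a codon-by-codon translation of the CDS. The following Python sketch illustrates that relationship on the first 42 bases of the CDS, assuming the standard nuclear genetic code (the page does not name the translation table used). `translate` and the table-building code are illustrative, not the database's own pipeline.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 42 bases of the CDS sequence listed above.
cds_start = "ATGAAGAACACTCCAAAACCCTCATCATCACGCAAGAACACT"
print(translate(cds_start))
```

Translating the full CDS requires joining its lines into one string first, since codons can span line breaks in the listing.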