; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018485 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018485
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold3:5278741..5288691
RNA-Seq ExpressionSpg018485
SyntenySpg018485
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]3.0e-3035.81Show/hide
Query:  PHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLS
        P F+   I  HGW +FC  P +    +VREFYAN+         V+ ++V ++  AIN+++ L+      Y + A   ++EQL   + EV IEGA WQ+S
Subjt:  PHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLS

Query:  KTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSVDVGKIIADEISGC-WKKKVGKLFFPNTITMLCKRAGVPENEGDVILF
             T     LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +SV++ +I   EI  C   +K G L+FP+ IT L  +A VP ++ + I+ 
Subjt:  KTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSVDVGKIIADEISGC-WKKKVGKLFFPNTITMLCKRAGVPENEGDVILF

Query:  DKGIIYTPNLARLQR
        + G I T +++R+ +
Subjt:  DKGIIYTPNLARLQR

KAE8680640.1 hypothetical protein F3Y22_tig00111372pilonHSYRG00020 [Hibiscus syriacus]1.7e-2833.33Show/hide
Query:  YDRFVNNLAIAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNL
        + +F ++ A A++    K+   FE GF       G     +   +    W++F   P SVNA VV+EFYANI K       VRG ++ ++PSAI   ++L
Subjt:  YDRFVNNLAIAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNL

Query:  QNF--PHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSVDVGKIIAD
        Q+    HA + E A + + +++   + ++  E  +W   +T + +     L+  A  W  F+K +L+PT++++TVS  R+LL  +I  S  +DVG+II  
Subjt:  QNF--PHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSVDVGKIIAD

Query:  EISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVIL
        +++ C  KK   L FPN IT LC++  V EN  D IL
Subjt:  EISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVIL

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.6e-3134.67Show/hide
Query:  KRVATASKEPDEIEESQLPYDRFVNNLAIAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLA
        KRVA  + +  +  E++    R+ NN+          R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+        
Subjt:  KRVATASKEPDEIEESQLPYDRFVNNLAIAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLA

Query:  IVRGIEVDWSPSAINALYNLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLL
         VRG++V WS  AINA++ L + P   ++E     +   L   +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS++R+LL
Subjt:  IVRGIEVDWSPSAINALYNLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLL

Query:  AFAILRSLSVDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIYTPNLARLQRTQE
          ++L   S++VG++I  EI  C  +K G LFFP+ IT LC+ A  P    +  L + G I    +AR+  TQE
Subjt:  AFAILRSLSVDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIYTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]3.2e-3234.94Show/hide
Query:  KRVATASKEPDEIEESQLPYDRFVNNLAIAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLA
        KRVA  + +  +  E++    R+ NN+          R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+   E    
Subjt:  KRVATASKEPDEIEESQLPYDRFVNNLAIAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLA

Query:  IVRGIEVDWSPSAINALYNLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLL
         VRG++V WS  AINA++ L + P   ++E     + + L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++R+LL
Subjt:  IVRGIEVDWSPSAINALYNLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLL

Query:  AFAILRSLSVDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIYTPNLARL
          ++L   S++VG++I  EI  C  +K G LFFP+ IT LC+ A  P    +  L + G I    +AR+
Subjt:  AFAILRSLSVDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIYTPNLARL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]4.4e-2931.88Show/hide
Query:  EIEESQLPYDRFVNNLAIAKYAELLKRDFLFERGFSGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPSAINALY
        E + +++ Y+  + N  ++     ++++F+++     + P F+   I  H W+ FC+ PE     +VREFY N+   +     +RG++V  S  AIN ++
Subjt:  EIEESQLPYDRFVNNLAIAKYAELLKRDFLFERGFSGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPSAINALY

Query:  NLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSVDVGKIIAD
        +L + P   ++E     +  +L   +  V I GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS+E V L +++L   S++VG++I  
Subjt:  NLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSVDVGKIIAD

Query:  EISGCWKKKVGKLFFPNTITMLCKRAGVP
        EI  C  +K G LFFP+ IT +C+    P
Subjt:  EISGCWKKKVGKLFFPNTITMLCKRAGVP

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)7.8e-3234.67Show/hide
Query:  KRVATASKEPDEIEESQLPYDRFVNNLAIAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLA
        KRVA  + +  +  E++    R+ NN+          R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+        
Subjt:  KRVATASKEPDEIEESQLPYDRFVNNLAIAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLA

Query:  IVRGIEVDWSPSAINALYNLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLL
         VRG++V WS  AINA++ L + P   ++E     +   L   +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS++R+LL
Subjt:  IVRGIEVDWSPSAINALYNLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLL

Query:  AFAILRSLSVDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIYTPNLARLQRTQE
          ++L   S++VG++I  EI  C  +K G LFFP+ IT LC+ A  P    +  L + G I    +AR+  TQE
Subjt:  AFAILRSLSVDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIYTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)1.6e-3234.94Show/hide
Query:  KRVATASKEPDEIEESQLPYDRFVNNLAIAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLA
        KRVA  + +  +  E++    R+ NN+          R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+   E    
Subjt:  KRVATASKEPDEIEESQLPYDRFVNNLAIAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLA

Query:  IVRGIEVDWSPSAINALYNLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLL
         VRG++V WS  AINA++ L + P   ++E     + + L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++R+LL
Subjt:  IVRGIEVDWSPSAINALYNLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLL

Query:  AFAILRSLSVDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIYTPNLARL
          ++L   S++VG++I  EI  C  +K G LFFP+ IT LC+ A  P    +  L + G I    +AR+
Subjt:  AFAILRSLSVDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIYTPNLARL

A0A2P5DAQ2 Uncharacterized protein2.1e-2931.88Show/hide
Query:  EIEESQLPYDRFVNNLAIAKYAELLKRDFLFERGFSGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPSAINALY
        E + +++ Y+  + N  ++     ++++F+++     + P F+   I  H W+ FC+ PE     +VREFY N+   +     +RG++V  S  AIN ++
Subjt:  EIEESQLPYDRFVNNLAIAKYAELLKRDFLFERGFSGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPSAINALY

Query:  NLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSVDVGKIIAD
        +L + P   ++E     +  +L   +  V I GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS+E V L +++L   S++VG++I  
Subjt:  NLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSVDVGKIIAD

Query:  EISGCWKKKVGKLFFPNTITMLCKRAGVP
        EI  C  +K G LFFP+ IT +C+    P
Subjt:  EISGCWKKKVGKLFFPNTITMLCKRAGVP

A0A6A2YMQ9 Uncharacterized protein8.1e-2933.33Show/hide
Query:  YDRFVNNLAIAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNL
        + +F ++ A A++    K+   FE GF       G     +   +    W++F   P SVNA VV+EFYANI K       VRG ++ ++PSAI   ++L
Subjt:  YDRFVNNLAIAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNL

Query:  QNF--PHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSVDVGKIIAD
        Q+    HA + E A + + +++   + ++  E  +W   +T + +     L+  A  W  F+K +L+PT++++TVS  R+LL  +I  S  +DVG+II  
Subjt:  QNF--PHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSVDVGKIIAD

Query:  EISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVIL
        +++ C  KK   L FPN IT LC++  V EN  D IL
Subjt:  EISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVIL

W9QTD9 Uncharacterized protein1.5e-3035.81Show/hide
Query:  PHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLS
        P F+   I  HGW +FC  P +    +VREFYAN+         V+ ++V ++  AIN+++ L+      Y + A   ++EQL   + EV IEGA WQ+S
Subjt:  PHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQNFPHAAYNEIAVAPSNEQLSDAVREVGIEGAQWQLS

Query:  KTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSVDVGKIIADEISGC-WKKKVGKLFFPNTITMLCKRAGVPENEGDVILF
             T     LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +SV++ +I   EI  C   +K G L+FP+ IT L  +A VP ++ + I+ 
Subjt:  KTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSVDVGKIIADEISGC-WKKKVGKLFFPNTITMLCKRAGVPENEGDVILF

Query:  DKGIIYTPNLARLQR
        + G I T +++R+ +
Subjt:  DKGIIYTPNLARLQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAAAACGAGAGCAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTACCGGTGACCCCCGAAGCATCGAAAGTAAAGGCAAAGAAGAAGAAGACACCAGAAGAAAA
AGAAGCTAAAAGAAGAAGAAAACAGCAGAGGACTGAAGATCAAGAAGTTGCTCAGAAAGCGGTGGAGGATGTTATTTCGGAAGAAGATCCGAAAGAACCAGAAGGACAGA
ATCAAGAGCAATCTGAGCCAGGAGTTGCGGATACAGAGGAAGTTCGAGAAGAAAATACAGAGGAAGTTCGAGAAGAAAATACAGAGGAAGTTCGAGAAGAAAATACAGAG
GAAGTTCAAGAAAAGCAGGCCGAGGATTTGCAAAAAGAACAGGCAGAGGTTGCGCCTGAAGAAGTTAGTGAGCAAGAACAGGAGGCTCGTGTGGAGGTGATCATGCCGGA
AGTGCCCAAACGCCGCCGTATAAAGCGAAAAGCGGGTCATGTTAAGAAAGAAGAGCGTGAGATGAAGGAGGCTGAGGATAAAGCAAGAGAGGAAGCAGAGAAAAAAGCTG
AAGAAGAAAGTTTGCGCAAGCAAAGGGCAGACAGGGGCAAGCGTGTTGCTACGGCATCGAAGGAACCTGATGAAATAGAAGAGTCACAATTGCCGTATGATCGCTTTGTC
AACAATCTTGCCATAGCAAAATATGCAGAGTTGCTGAAAAGAGACTTCCTGTTTGAGAGGGGATTTAGTGGTGATCTTCCACATTTTCTGAGGACTGGTATTGCAGACCA
CGGTTGGGAACGGTTTTGTTCAAAGCCTGAATCTGTGAACGCGCAGGTGGTGCGCGAGTTTTATGCAAATATTGACAAAGAAGAAGGTTTTCTAGCGATTGTTCGAGGTA
TTGAGGTCGACTGGAGTCCTAGTGCTATTAATGCACTGTATAACCTTCAAAATTTCCCCCATGCAGCATATAATGAGATAGCTGTAGCGCCATCCAATGAGCAGCTGAGT
GACGCTGTGAGGGAAGTTGGTATTGAAGGGGCGCAGTGGCAGCTTTCGAAAACAGAGAAAAGGACGTTCCAGTCAACCTATTTAAAGAGGGAAGCAAATACTTGGATGGG
ATTTATCAAACAAAGGTTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGAGTGCTTCTGGCTTTCGCTATTTTGCGGTCTCTCAGTGTTGATGTGGGAAAAA
TTATTGCTGATGAAATATCTGGTTGTTGGAAGAAGAAAGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTGCAAGCGAGCAGGGGTTCCAGAGAATGAAGGA
GATGTGATATTATTTGACAAGGGAATCATTTACACGCCTAACTTGGCGCGGCTTCAGCGTACGCAAGAGGCACGACAGGGTGGGCTGGTCTACGACATCAACACGATTTT
AGAACAACTCGCACTTTCGGCCAGCAGGCAGGACTCGGACTGGCTAAGCTTAATTAGATCAAGAGCTAGGCTGTGGCGAGTTCTTAGAATTGAGTTAAAAGTGGTGATTA
TTTGTCCATGTTGGAAGAATTATTTTGCTGAAGCAGAGCTTGGTTTTGCAGAATGCTCAGAATATGTAGCTGGGCGACTTGAGGGAGCAAACTCTGTGCTGGAGCAAAGC
TGGGAGCGAAAACTGCCACGTCACAGCTCATTAAATTCACCAACTGCTTGTGATATAATCACGAACTACTGCGAACACCACCACTATGGCTACACCAGTATAACTCTGAG
ATTTCTAGAGGCAGGAGACTGGTGGGAGTCTAGAAGGAACCTTCAGGGGAATTCTTTGAGACCCTCATTTGGTCTTGCCCTTGAAATGGATACCCCCACCCGCATGTCTC
CTACATGGATGCTTTGGATCATTGCATCTGTATCGAATACAAGGTGGGTTGTATCACATAGTGTCACCAGGATAAGACTTCGTGAGGCATTTTATTTTTCACGTGCAGCA
TCTGATTTCGTTTGTGGGTCGTCTGGTTGCATGCGGTTGCTGAAGCGTTCGGTTTATGAGCAGTTGGGCGTGCATATTTTTGGGCTGGTTCAAGGTGATTTTGAGCTGGT
TCAGTGCGGTTCAGCCCAATTTTTCGTCGGTTCGAAGAGTTTGGACGCGGTTCGAGGCTGTTTTGGGCTTATTCACGAAAAGATAGGATCGCCTATGCCGGCTGGAGTTA
TCTCCTACATGGGAACTGACCTGATCCATGCTCGACTTCGTGTCGCCTTGGACGCGGCCTCCCTTCGGAAAGTGTTTGCATGGGTCAAGTTAAATGTGTTAACTGATCCT
TGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAAAACGAGAGCAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTACCGGTGACCCCCGAAGCATCGAAAGTAAAGGCAAAGAAGAAGAAGACACCAGAAGAAAA
AGAAGCTAAAAGAAGAAGAAAACAGCAGAGGACTGAAGATCAAGAAGTTGCTCAGAAAGCGGTGGAGGATGTTATTTCGGAAGAAGATCCGAAAGAACCAGAAGGACAGA
ATCAAGAGCAATCTGAGCCAGGAGTTGCGGATACAGAGGAAGTTCGAGAAGAAAATACAGAGGAAGTTCGAGAAGAAAATACAGAGGAAGTTCGAGAAGAAAATACAGAG
GAAGTTCAAGAAAAGCAGGCCGAGGATTTGCAAAAAGAACAGGCAGAGGTTGCGCCTGAAGAAGTTAGTGAGCAAGAACAGGAGGCTCGTGTGGAGGTGATCATGCCGGA
AGTGCCCAAACGCCGCCGTATAAAGCGAAAAGCGGGTCATGTTAAGAAAGAAGAGCGTGAGATGAAGGAGGCTGAGGATAAAGCAAGAGAGGAAGCAGAGAAAAAAGCTG
AAGAAGAAAGTTTGCGCAAGCAAAGGGCAGACAGGGGCAAGCGTGTTGCTACGGCATCGAAGGAACCTGATGAAATAGAAGAGTCACAATTGCCGTATGATCGCTTTGTC
AACAATCTTGCCATAGCAAAATATGCAGAGTTGCTGAAAAGAGACTTCCTGTTTGAGAGGGGATTTAGTGGTGATCTTCCACATTTTCTGAGGACTGGTATTGCAGACCA
CGGTTGGGAACGGTTTTGTTCAAAGCCTGAATCTGTGAACGCGCAGGTGGTGCGCGAGTTTTATGCAAATATTGACAAAGAAGAAGGTTTTCTAGCGATTGTTCGAGGTA
TTGAGGTCGACTGGAGTCCTAGTGCTATTAATGCACTGTATAACCTTCAAAATTTCCCCCATGCAGCATATAATGAGATAGCTGTAGCGCCATCCAATGAGCAGCTGAGT
GACGCTGTGAGGGAAGTTGGTATTGAAGGGGCGCAGTGGCAGCTTTCGAAAACAGAGAAAAGGACGTTCCAGTCAACCTATTTAAAGAGGGAAGCAAATACTTGGATGGG
ATTTATCAAACAAAGGTTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGAGTGCTTCTGGCTTTCGCTATTTTGCGGTCTCTCAGTGTTGATGTGGGAAAAA
TTATTGCTGATGAAATATCTGGTTGTTGGAAGAAGAAAGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTGCAAGCGAGCAGGGGTTCCAGAGAATGAAGGA
GATGTGATATTATTTGACAAGGGAATCATTTACACGCCTAACTTGGCGCGGCTTCAGCGTACGCAAGAGGCACGACAGGGTGGGCTGGTCTACGACATCAACACGATTTT
AGAACAACTCGCACTTTCGGCCAGCAGGCAGGACTCGGACTGGCTAAGCTTAATTAGATCAAGAGCTAGGCTGTGGCGAGTTCTTAGAATTGAGTTAAAAGTGGTGATTA
TTTGTCCATGTTGGAAGAATTATTTTGCTGAAGCAGAGCTTGGTTTTGCAGAATGCTCAGAATATGTAGCTGGGCGACTTGAGGGAGCAAACTCTGTGCTGGAGCAAAGC
TGGGAGCGAAAACTGCCACGTCACAGCTCATTAAATTCACCAACTGCTTGTGATATAATCACGAACTACTGCGAACACCACCACTATGGCTACACCAGTATAACTCTGAG
ATTTCTAGAGGCAGGAGACTGGTGGGAGTCTAGAAGGAACCTTCAGGGGAATTCTTTGAGACCCTCATTTGGTCTTGCCCTTGAAATGGATACCCCCACCCGCATGTCTC
CTACATGGATGCTTTGGATCATTGCATCTGTATCGAATACAAGGTGGGTTGTATCACATAGTGTCACCAGGATAAGACTTCGTGAGGCATTTTATTTTTCACGTGCAGCA
TCTGATTTCGTTTGTGGGTCGTCTGGTTGCATGCGGTTGCTGAAGCGTTCGGTTTATGAGCAGTTGGGCGTGCATATTTTTGGGCTGGTTCAAGGTGATTTTGAGCTGGT
TCAGTGCGGTTCAGCCCAATTTTTCGTCGGTTCGAAGAGTTTGGACGCGGTTCGAGGCTGTTTTGGGCTTATTCACGAAAAGATAGGATCGCCTATGCCGGCTGGAGTTA
TCTCCTACATGGGAACTGACCTGATCCATGCTCGACTTCGTGTCGCCTTGGACGCGGCCTCCCTTCGGAAAGTGTTTGCATGGGTCAAGTTAAATGTGTTAACTGATCCT
TGA
Protein sequenceShow/hide protein sequence
MAKTRARKERDNEEEEVPVTPEASKVKAKKKKTPEEKEAKRRRKQQRTEDQEVAQKAVEDVISEEDPKEPEGQNQEQSEPGVADTEEVREENTEEVREENTEEVREENTE
EVQEKQAEDLQKEQAEVAPEEVSEQEQEARVEVIMPEVPKRRRIKRKAGHVKKEEREMKEAEDKAREEAEKKAEEESLRKQRADRGKRVATASKEPDEIEESQLPYDRFV
NNLAIAKYAELLKRDFLFERGFSGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPSAINALYNLQNFPHAAYNEIAVAPSNEQLS
DAVREVGIEGAQWQLSKTEKRTFQSTYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSVDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEG
DVILFDKGIIYTPNLARLQRTQEARQGGLVYDINTILEQLALSASRQDSDWLSLIRSRARLWRVLRIELKVVIICPCWKNYFAEAELGFAECSEYVAGRLEGANSVLEQS
WERKLPRHSSLNSPTACDIITNYCEHHHYGYTSITLRFLEAGDWWESRRNLQGNSLRPSFGLALEMDTPTRMSPTWMLWIIASVSNTRWVVSHSVTRIRLREAFYFSRAA
SDFVCGSSGCMRLLKRSVYEQLGVHIFGLVQGDFELVQCGSAQFFVGSKSLDAVRGCFGLIHEKIGSPMPAGVISYMGTDLIHARLRVALDAASLRKVFAWVKLNVLTDP