; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg004667 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg004667
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold5:22231750..22233464
RNA-Seq ExpressionSpg004667
SyntenySpg004667
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]3.4e-2728.37Show/hide
Query:  LPYERFVNHFARAKYLEMLKRDFLFERGF------SGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLY
        + +++F N  A+A++     R+  FE GF       G     +   +    W  F   P SVNA +V EFYANI +     + VRG  + ++  AIN  +
Subjt:  LPYERFVNHFARAKYLEMLKRDFLFERGF------SGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLY

Query:  NLQDF--PHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVL
        +LQ+    HA + E                    A SN + +  +E++  E  +W   +T + +     L+  A  W  F+K +L+PT+H++TVS  R+L
Subjt:  NLQDF--PHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVL

Query:  LAFAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTPNLARL----------------------QQQLALSA
        L  +++ S  IDVG+II  ++  C  KK   L F N IT LC++  V ENA D IL     I    L  L                        ++ L A
Subjt:  LAFAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTPNLARL----------------------QQQLALSA

Query:  SRQEFAERQSQ---------TFWNYVKRRDANLKMALQENLSKPYPALPVFPDDLL
          +   + Q+Q          F+ YVK RD  ++   QE +       P FPD++L
Subjt:  SRQEFAERQSQ---------TFWNYVKRRDANLKMALQENLSKPYPALPVFPDDLL

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]5.6e-3033.1Show/hide
Query:  RFVNHFARAKYLEMLK-RDFLFERGF-------SGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLYNL
        +F    A  +Y   ++ R    E+GF        G LP F+   IT H W+ FC+ PE     +V EFYAN+ +     V VRGV V WS  AIN+++ L
Subjt:  RFVNHFARAKYLEMLK-RDFLFERGF-------SGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLYNL

Query:  QD--FPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLA
         D    H+E+ E +                    +   L   +E V V GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS++R+LL 
Subjt:  QD--FPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLA

Query:  FAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTPNLARLQQQLALSASRQEFAER
         ++L   +I+VG++I +EI  C  +K G LFF + IT LC+    P    +  L + G ID   +AR+ Q+    +++Q  + R
Subjt:  FAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTPNLARLQQQLALSASRQEFAER

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]8.1e-3730.91Show/hide
Query:  RFVNHFARAKYLEMLK-RDFLFERGF-------SGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLYNL
        +F    A  +Y   ++ R    E+GF        G LP F+   IT H W+ FC+ PE     +V EFYAN+ + E   V VRGV V WS  AIN+++ L
Subjt:  RFVNHFARAKYLEMLK-RDFLFERGF-------SGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLYNL

Query:  QDFPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFA
         D P  E++E +        N   ++          L   +E V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++R+LL  +
Subjt:  QDFPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFA

Query:  ILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTPNLARLQQQLALSASRQ-----------------------
        +L   +I+VG++I +EI  C  +K G LFF + IT LC+    P    +  L + G ID   +AR+ Q+    +++Q                       
Subjt:  ILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTPNLARLQQQLALSASRQ-----------------------

Query:  --------------------EFAERQSQTFWNYVKRRDANLKMALQENLSKPYPALPVFPDDLLNPWIPLPPVERDEEEENEQGQ
                            +   +Q Q FW Y K RD  LK ALQ N ++P P  P FP ++L         E D++  NE  +
Subjt:  --------------------EFAERQSQTFWNYVKRRDANLKMALQENLSKPYPALPVFPDDLLNPWIPLPPVERDEEEENEQGQ

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]2.9e-2630.04Show/hide
Query:  EKSTRAEEEAEVAEPEEGRLPYERFVNHFARAKYLEMLKRDFLFERGFSGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGV
        E+  R   +A   E +   + YE  + +   +     ++++F+++     + P F+   I  H W+LFC+ PE     +V EFY N+   +   V +RGV
Subjt:  EKSTRAEEEAEVAEPEEGRLPYERFVNHFARAKYLEMLKRDFLFERGFSGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGV

Query:  AVDWSPGAINSLYNLQD--FPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLP
         V  S  AIN++++L D    H+E+ E +  P                    +L   +E V + GA+W +S     T   + L   A  W  F+K RLLP
Subjt:  AVDWSPGAINSLYNLQD--FPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLP

Query:  TTHDSTVSRERVLLAFAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKG
        TTH  TVS+E V L +++L   +I+VG++I  EI  C  +K G LFF + IT +C+    P    +  L + G
Subjt:  TTHDSTVSRERVLLAFAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKG

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.8e-3132.91Show/hide
Query:  VVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLYNLQD--FPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTF
        +V EFYAN+ + E   + VRGV V WS  AIN+++ L D    H+E+ E +  P                    +L   +E V   GA+W +S     T 
Subjt:  VVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLYNLQD--FPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTF

Query:  QAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTP
          + L   A  W  F+K RLLPTTH   VS++R+LL  ++L   +I+VG++I +EI  C  +K G LFF + IT LC+    P    +  L + G ID  
Subjt:  QAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTP

Query:  NLARLQQQ--------------LALSASR------------------QEFAERQSQTFWNYVKRRDANLKMALQENLSKPYPALPVFPDDLLNPWIPLPP
         +AR+ Q+               A S+SR                  QE   +Q Q FW Y K RD  LK ALQ N ++P P  P FP ++L        
Subjt:  NLARLQQQ--------------LALSASR------------------QEFAERQSQTFWNYVKRRDANLKMALQENLSKPYPALPVFPDDLLNPWIPLPP

Query:  VERDEEEENEQGQ
         E D++  NE  +
Subjt:  VERDEEEENEQGQ

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.7e-3033.1Show/hide
Query:  RFVNHFARAKYLEMLK-RDFLFERGF-------SGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLYNL
        +F    A  +Y   ++ R    E+GF        G LP F+   IT H W+ FC+ PE     +V EFYAN+ +     V VRGV V WS  AIN+++ L
Subjt:  RFVNHFARAKYLEMLK-RDFLFERGF-------SGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLYNL

Query:  QD--FPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLA
         D    H+E+ E +                    +   L   +E V V GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS++R+LL 
Subjt:  QD--FPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLA

Query:  FAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTPNLARLQQQLALSASRQEFAER
         ++L   +I+VG++I +EI  C  +K G LFF + IT LC+    P    +  L + G ID   +AR+ Q+    +++Q  + R
Subjt:  FAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTPNLARLQQQLALSASRQEFAER

A0A2P5BCG4 Uncharacterized protein (Fragment)3.9e-3730.91Show/hide
Query:  RFVNHFARAKYLEMLK-RDFLFERGF-------SGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLYNL
        +F    A  +Y   ++ R    E+GF        G LP F+   IT H W+ FC+ PE     +V EFYAN+ + E   V VRGV V WS  AIN+++ L
Subjt:  RFVNHFARAKYLEMLK-RDFLFERGF-------SGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLYNL

Query:  QDFPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFA
         D P  E++E +        N   ++          L   +E V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++R+LL  +
Subjt:  QDFPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFA

Query:  ILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTPNLARLQQQLALSASRQ-----------------------
        +L   +I+VG++I +EI  C  +K G LFF + IT LC+    P    +  L + G ID   +AR+ Q+    +++Q                       
Subjt:  ILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTPNLARLQQQLALSASRQ-----------------------

Query:  --------------------EFAERQSQTFWNYVKRRDANLKMALQENLSKPYPALPVFPDDLLNPWIPLPPVERDEEEENEQGQ
                            +   +Q Q FW Y K RD  LK ALQ N ++P P  P FP ++L         E D++  NE  +
Subjt:  --------------------EFAERQSQTFWNYVKRRDANLKMALQENLSKPYPALPVFPDDLLNPWIPLPPVERDEEEENEQGQ

A0A2P5DAQ2 Uncharacterized protein1.4e-2630.04Show/hide
Query:  EKSTRAEEEAEVAEPEEGRLPYERFVNHFARAKYLEMLKRDFLFERGFSGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGV
        E+  R   +A   E +   + YE  + +   +     ++++F+++     + P F+   I  H W+LFC+ PE     +V EFY N+   +   V +RGV
Subjt:  EKSTRAEEEAEVAEPEEGRLPYERFVNHFARAKYLEMLKRDFLFERGFSGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGV

Query:  AVDWSPGAINSLYNLQD--FPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLP
         V  S  AIN++++L D    H+E+ E +  P                    +L   +E V + GA+W +S     T   + L   A  W  F+K RLLP
Subjt:  AVDWSPGAINSLYNLQD--FPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLP

Query:  TTHDSTVSRERVLLAFAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKG
        TTH  TVS+E V L +++L   +I+VG++I  EI  C  +K G LFF + IT +C+    P    +  L + G
Subjt:  TTHDSTVSRERVLLAFAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKG

A0A2P5DXM3 Uncharacterized protein8.5e-3232.91Show/hide
Query:  VVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLYNLQD--FPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTF
        +V EFYAN+ + E   + VRGV V WS  AIN+++ L D    H+E+ E +  P                    +L   +E V   GA+W +S     T 
Subjt:  VVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLYNLQD--FPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTF

Query:  QAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTP
          + L   A  W  F+K RLLPTTH   VS++R+LL  ++L   +I+VG++I +EI  C  +K G LFF + IT LC+    P    +  L + G ID  
Subjt:  QAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTP

Query:  NLARLQQQ--------------LALSASR------------------QEFAERQSQTFWNYVKRRDANLKMALQENLSKPYPALPVFPDDLLNPWIPLPP
         +AR+ Q+               A S+SR                  QE   +Q Q FW Y K RD  LK ALQ N ++P P  P FP ++L        
Subjt:  NLARLQQQ--------------LALSASR------------------QEFAERQSQTFWNYVKRRDANLKMALQENLSKPYPALPVFPDDLLNPWIPLPP

Query:  VERDEEEENEQGQ
         E D++  NE  +
Subjt:  VERDEEEENEQGQ

A0A6A3BU96 Uncharacterized protein1.7e-2728.37Show/hide
Query:  LPYERFVNHFARAKYLEMLKRDFLFERGF------SGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLY
        + +++F N  A+A++     R+  FE GF       G     +   +    W  F   P SVNA +V EFYANI +     + VRG  + ++  AIN  +
Subjt:  LPYERFVNHFARAKYLEMLKRDFLFERGF------SGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANIDEEEGFQVIVRGVAVDWSPGAINSLY

Query:  NLQDF--PHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVL
        +LQ+    HA + E                    A SN + +  +E++  E  +W   +T + +     L+  A  W  F+K +L+PT+H++TVS  R+L
Subjt:  NLQDF--PHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVL

Query:  LAFAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTPNLARL----------------------QQQLALSA
        L  +++ S  IDVG+II  ++  C  KK   L F N IT LC++  V ENA D IL     I    L  L                        ++ L A
Subjt:  LAFAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTPNLARL----------------------QQQLALSA

Query:  SRQEFAERQSQ---------TFWNYVKRRDANLKMALQENLSKPYPALPVFPDDLL
          +   + Q+Q          F+ YVK RD  ++   QE +       P FPD++L
Subjt:  SRQEFAERQSQ---------TFWNYVKRRDANLKMALQENLSKPYPALPVFPDDLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAGACAAGAGGAAGAAAAGAAAGGGACACCGAGGAAGAGGATGTGGCTGTAACCCCTGAGGCTCCGAAGAAAACTAAGAAGAAGAAAACGCCAGAGGAAAAGGA
AGCGAAGAGAAGGAGGCGGCAGCAGAAGGATAAAGTTGAGGAAGTGGTGCGAAAGACTGCCCCAATAGTCGAGGATACGCAAGAAGTTCAAGAAAACCAGACAGAGGATA
TGCAAGCGGAACAGACGGAGGTTGTGCCAGAAAAAGGTGATGAGCAAGAGGTGCAGGAGGCTCGCGTTGAAGTCGTTATGATGGAAGTTCCACGTCGTAGACGGAGGAAA
CAGAAGGCTGGTCGAGTGAAAATAATCCGTTCGGATACTCCGACACCTACTACATCAGATTCTGAGAAAGAAAAATCTACTAGAGCAGAAGAAGAAGCAGAAGTGGCGGA
GCCAGAGGAGGGAAGATTACCATATGAGCGATTTGTCAATCATTTTGCTAGAGCAAAATACTTGGAGATGTTGAAGAGGGACTTCTTGTTTGAGAGAGGATTTAGTGGTG
ATCTTCCACATTTTCTGCGGGCCAGCATTACGAACCACGGTTGGGAGTTATTCTGCTCCAAGCCTGAATCTGTGAACGCACATGTAGTGTGCGAGTTTTATGCAAACATT
GATGAGGAAGAGGGTTTCCAAGTTATCGTTCGAGGAGTAGCAGTTGACTGGAGTCCTGGTGCCATTAACTCCCTGTACAACCTTCAGGATTTCCCCCACGCAGAATACAA
TGAGATGGTTGTGGCGCCATCTAATGGGCAATTAAACGCTGCTGTTCGGGAAGTTGGTGTTGTGGCGCCATCTAATTGGCAATTAAACGCTGCTGTTGAGGAAGTTGGTG
TTGAAGGGGCACAGTGGAGACTTTCGAAAACAGAGAAAAGGACATTTCAGGCAGCCTATCTAAAGAAGGAAGCAAATACATGGATGGGATTCATCAAACAAAGGTTGCTT
CCAACGACTCATGACTCGACGGTTTCTAGGGAACGTGTTCTTCTGGCTTTCGCGATTTTAAGGTCTCTCAATATTGATGTGGGCAAGATTATTGCGAATGAGATATTTGG
ATGCTGGCGGAAGAAGGTTGGGAAGTTGTTTTTCTCGAATACAATTACAATGCTTTGCAAGCGAGTTGGGGTTCCGGAGAATGCAAAAGATGTCATATTATTCGACAAGG
GAATCATTGATACACCTAACTTGGCACGACTTCAGCAACAACTGGCACTGTCGGCCAGCAGGCAAGAGTTTGCCGAGAGGCAGTCTCAAACTTTCTGGAACTATGTTAAA
CGTCGTGATGCCAATCTGAAGATGGCGCTACAGGAAAATTTGTCCAAACCATATCCAGCCCTTCCAGTATTCCCTGATGATTTATTGAACCCATGGATTCCACTGCCGCC
TGTAGAGAGAGATGAGGAGGAAGAAAATGAGCAGGGTCAGGAGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAGACAAGAGGAAGAAAAGAAAGGGACACCGAGGAAGAGGATGTGGCTGTAACCCCTGAGGCTCCGAAGAAAACTAAGAAGAAGAAAACGCCAGAGGAAAAGGA
AGCGAAGAGAAGGAGGCGGCAGCAGAAGGATAAAGTTGAGGAAGTGGTGCGAAAGACTGCCCCAATAGTCGAGGATACGCAAGAAGTTCAAGAAAACCAGACAGAGGATA
TGCAAGCGGAACAGACGGAGGTTGTGCCAGAAAAAGGTGATGAGCAAGAGGTGCAGGAGGCTCGCGTTGAAGTCGTTATGATGGAAGTTCCACGTCGTAGACGGAGGAAA
CAGAAGGCTGGTCGAGTGAAAATAATCCGTTCGGATACTCCGACACCTACTACATCAGATTCTGAGAAAGAAAAATCTACTAGAGCAGAAGAAGAAGCAGAAGTGGCGGA
GCCAGAGGAGGGAAGATTACCATATGAGCGATTTGTCAATCATTTTGCTAGAGCAAAATACTTGGAGATGTTGAAGAGGGACTTCTTGTTTGAGAGAGGATTTAGTGGTG
ATCTTCCACATTTTCTGCGGGCCAGCATTACGAACCACGGTTGGGAGTTATTCTGCTCCAAGCCTGAATCTGTGAACGCACATGTAGTGTGCGAGTTTTATGCAAACATT
GATGAGGAAGAGGGTTTCCAAGTTATCGTTCGAGGAGTAGCAGTTGACTGGAGTCCTGGTGCCATTAACTCCCTGTACAACCTTCAGGATTTCCCCCACGCAGAATACAA
TGAGATGGTTGTGGCGCCATCTAATGGGCAATTAAACGCTGCTGTTCGGGAAGTTGGTGTTGTGGCGCCATCTAATTGGCAATTAAACGCTGCTGTTGAGGAAGTTGGTG
TTGAAGGGGCACAGTGGAGACTTTCGAAAACAGAGAAAAGGACATTTCAGGCAGCCTATCTAAAGAAGGAAGCAAATACATGGATGGGATTCATCAAACAAAGGTTGCTT
CCAACGACTCATGACTCGACGGTTTCTAGGGAACGTGTTCTTCTGGCTTTCGCGATTTTAAGGTCTCTCAATATTGATGTGGGCAAGATTATTGCGAATGAGATATTTGG
ATGCTGGCGGAAGAAGGTTGGGAAGTTGTTTTTCTCGAATACAATTACAATGCTTTGCAAGCGAGTTGGGGTTCCGGAGAATGCAAAAGATGTCATATTATTCGACAAGG
GAATCATTGATACACCTAACTTGGCACGACTTCAGCAACAACTGGCACTGTCGGCCAGCAGGCAAGAGTTTGCCGAGAGGCAGTCTCAAACTTTCTGGAACTATGTTAAA
CGTCGTGATGCCAATCTGAAGATGGCGCTACAGGAAAATTTGTCCAAACCATATCCAGCCCTTCCAGTATTCCCTGATGATTTATTGAACCCATGGATTCCACTGCCGCC
TGTAGAGAGAGATGAGGAGGAAGAAAATGAGCAGGGTCAGGAGGACTGA
Protein sequenceShow/hide protein sequence
MAKTRGRKERDTEEEDVAVTPEAPKKTKKKKTPEEKEAKRRRRQQKDKVEEVVRKTAPIVEDTQEVQENQTEDMQAEQTEVVPEKGDEQEVQEARVEVVMMEVPRRRRRK
QKAGRVKIIRSDTPTPTTSDSEKEKSTRAEEEAEVAEPEEGRLPYERFVNHFARAKYLEMLKRDFLFERGFSGDLPHFLRASITNHGWELFCSKPESVNAHVVCEFYANI
DEEEGFQVIVRGVAVDWSPGAINSLYNLQDFPHAEYNEMVVAPSNGQLNAAVREVGVVAPSNWQLNAAVEEVGVEGAQWRLSKTEKRTFQAAYLKKEANTWMGFIKQRLL
PTTHDSTVSRERVLLAFAILRSLNIDVGKIIANEIFGCWRKKVGKLFFSNTITMLCKRVGVPENAKDVILFDKGIIDTPNLARLQQQLALSASRQEFAERQSQTFWNYVK
RRDANLKMALQENLSKPYPALPVFPDDLLNPWIPLPPVERDEEEENEQGQED