; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008160 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008160
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr9:13550066..13552197
RNA-Seq ExpressionLag0008160
SyntenyLag0008160
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8661093.1 hypothetical protein F3Y22_tig00116939pilonHSYRG00213 [Hibiscus syriacus]1.6e-3133.33Show/hide
Query:  YDRFVNNFAKAKYTELLKRDFLFERGF------SGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL
        + +F N+ AKA++     R   FE GF       G   P +   +    W+ F   P +VNA +V+EFYANI K   + + VRG +I ++ +AIN  +HL
Subjt:  YDRFVNNFAKAKYTELLKRDFLFERGF------SGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL

Query:  QNF--PHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICN
        Q+    HATF E A      +    + ++  E  +W   +T + +     L+  AK W  F++ +L+PT+H++TVS  R+LL  +I+ S  IDVG II  
Subjt:  QNF--PHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICN

Query:  EIACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRVQEVRQGGLIH
        ++  C  KK   L FPN IT LCR+  V  +  D IL     I+   L  L  ++  +    +H
Subjt:  EIACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRVQEVRQGGLIH

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]1.1e-3230.36Show/hide
Query:  VPYDRFVNNFAKAKYTELLKRDFLFERGF------SGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALY
        + + +F N+ AKA++     R+  FE GF       G   P +   +    W  F   P +VNA +V+EFYANI K     + VRG +I ++  AIN  +
Subjt:  VPYDRFVNNFAKAKYTELLKRDFLFERGF------SGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALY

Query:  HLQNF--PHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKII
        HLQ     HA F E A      +    + ++  E  +W   +T + +     L+  AK W  F++ +L+PT+H++TVS  R+LL  +++ S  IDVG+II
Subjt:  HLQNF--PHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKII

Query:  CNEIACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRVQEVRQGGLIH----GINSILEQLALSASQQEFAERQT-------
          ++  C  KK   L FPN IT LCR+  V  +  D IL     I    L  L  ++  +    +H    G      ++ L A ++   + Q        
Subjt:  CNEIACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRVQEVRQGGLIH----GINSILEQLALSASQQEFAERQT-------

Query:  --LTFWNYVKNRDASLRRALQENFSKPYPALPMVGF
            F+ YVK+RD  +    QE    PY      GF
Subjt:  --LTFWNYVKNRDASLRRALQENFSKPYPALPMVGF

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.8e-3337.35Show/hide
Query:  RFVNNFAKAKY-TELLKRDFLFERGF-------SGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL
        +F    A+ +Y   +  R    E+GF        G L PF+   I  H W+ FCA PE     +VREFYAN+       V VRGV++ WS  AINA++ L
Subjt:  RFVNNFAKAKY-TELLKRDFLFERGF-------SGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL

Query:  QNFPHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEI
         + P    +E     +E  L   +  V + GA+W +S     T   + L   AK W  F++  LLPTTH  TVS++R+LL  ++L   SI+VG++I +EI
Subjt:  QNFPHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEI

Query:  ACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL
          C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  ACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]5.4e-4332.95Show/hide
Query:  ERQVPYDRFVNNFAKAKYTELLKRDFLFERGFSGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQ
        E +    R+ NN          ++ F+ +   +    PF+   I  H W+ FCA PE     +VREFYAN+   E   V VRGV++ WS  AINA++ L 
Subjt:  ERQVPYDRFVNNFAKAKYTELLKRDFLFERGFSGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQ

Query:  NFPHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEIA
        + P    +E     +++ L   +  V   GA+W +S     T   + L   AK W  F++ RLLPTTH  TVS++R+LL  ++L   SI+VG++I +EI 
Subjt:  NFPHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEIA

Query:  CCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQR---VQEVRQ---------------GGLIHGINSILEQLALSASQQ-----
         C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+ +    +  +Q               G ++  + ++ ++L+    QQ     
Subjt:  CCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQR---VQEVRQ---------------GGLIHGINSILEQLALSASQQ-----

Query:  --EFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPMVGFREILYDLTH
          +   +Q   FW Y K RD +L++ALQ NF++P P  P    +EIL DL +
Subjt:  --EFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPMVGFREILYDLTH

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]2.2e-3636.4Show/hide
Query:  VVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQNFPHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQR
        +VREFYAN+   E   + VRGV++ WS  AINA++ L + P    +E     +E +L   +  V   GA+W +S     T   + L   AK W  F++ R
Subjt:  VVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQNFPHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQR

Query:  LLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEIACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQR---VQEVRQ---
        LLPTTH   VS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT LCR A   V+E    L + G ID   +AR+ +    +  +Q   
Subjt:  LLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEIACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQR---VQEVRQ---

Query:  ------------GGLIHGINSILEQLALSASQQEFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPMVGFREILYDLTH
                    G ++  + ++ ++L    SQQE   +Q   FW Y K RD +L++ALQ NF++P P  P    +EIL DL +
Subjt:  ------------GGLIHGINSILEQLALSASQQEFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPMVGFREILYDLTH

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)8.5e-3437.35Show/hide
Query:  RFVNNFAKAKY-TELLKRDFLFERGF-------SGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL
        +F    A+ +Y   +  R    E+GF        G L PF+   I  H W+ FCA PE     +VREFYAN+       V VRGV++ WS  AINA++ L
Subjt:  RFVNNFAKAKY-TELLKRDFLFERGF-------SGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL

Query:  QNFPHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEI
         + P    +E     +E  L   +  V + GA+W +S     T   + L   AK W  F++  LLPTTH  TVS++R+LL  ++L   SI+VG++I +EI
Subjt:  QNFPHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEI

Query:  ACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL
          C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  ACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment)2.6e-4332.95Show/hide
Query:  ERQVPYDRFVNNFAKAKYTELLKRDFLFERGFSGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQ
        E +    R+ NN          ++ F+ +   +    PF+   I  H W+ FCA PE     +VREFYAN+   E   V VRGV++ WS  AINA++ L 
Subjt:  ERQVPYDRFVNNFAKAKYTELLKRDFLFERGFSGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQ

Query:  NFPHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEIA
        + P    +E     +++ L   +  V   GA+W +S     T   + L   AK W  F++ RLLPTTH  TVS++R+LL  ++L   SI+VG++I +EI 
Subjt:  NFPHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEIA

Query:  CCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQR---VQEVRQ---------------GGLIHGINSILEQLALSASQQ-----
         C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+ +    +  +Q               G ++  + ++ ++L+    QQ     
Subjt:  CCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQR---VQEVRQ---------------GGLIHGINSILEQLALSASQQ-----

Query:  --EFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPMVGFREILYDLTH
          +   +Q   FW Y K RD +L++ALQ NF++P P  P    +EIL DL +
Subjt:  --EFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPMVGFREILYDLTH

A0A2P5DXM3 Uncharacterized protein1.1e-3636.4Show/hide
Query:  VVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQNFPHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQR
        +VREFYAN+   E   + VRGV++ WS  AINA++ L + P    +E     +E +L   +  V   GA+W +S     T   + L   AK W  F++ R
Subjt:  VVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQNFPHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQR

Query:  LLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEIACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQR---VQEVRQ---
        LLPTTH   VS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT LCR A   V+E    L + G ID   +AR+ +    +  +Q   
Subjt:  LLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEIACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQR---VQEVRQ---

Query:  ------------GGLIHGINSILEQLALSASQQEFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPMVGFREILYDLTH
                    G ++  + ++ ++L    SQQE   +Q   FW Y K RD +L++ALQ NF++P P  P    +EIL DL +
Subjt:  ------------GGLIHGINSILEQLALSASQQEFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPMVGFREILYDLTH

A0A6A2WM54 Uncharacterized protein7.9e-3233.33Show/hide
Query:  YDRFVNNFAKAKYTELLKRDFLFERGF------SGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL
        + +F N+ AKA++     R   FE GF       G   P +   +    W+ F   P +VNA +V+EFYANI K   + + VRG +I ++ +AIN  +HL
Subjt:  YDRFVNNFAKAKYTELLKRDFLFERGF------SGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL

Query:  QNF--PHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICN
        Q+    HATF E A      +    + ++  E  +W   +T + +     L+  AK W  F++ +L+PT+H++TVS  R+LL  +I+ S  IDVG II  
Subjt:  QNF--PHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICN

Query:  EIACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRVQEVRQGGLIH
        ++  C  KK   L FPN IT LCR+  V  +  D IL     I+   L  L  ++  +    +H
Subjt:  EIACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRVQEVRQGGLIH

A0A6A3BU96 Uncharacterized protein5.5e-3330.36Show/hide
Query:  VPYDRFVNNFAKAKYTELLKRDFLFERGF------SGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALY
        + + +F N+ AKA++     R+  FE GF       G   P +   +    W  F   P +VNA +V+EFYANI K     + VRG +I ++  AIN  +
Subjt:  VPYDRFVNNFAKAKYTELLKRDFLFERGF------SGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALY

Query:  HLQNF--PHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKII
        HLQ     HA F E A      +    + ++  E  +W   +T + +     L+  AK W  F++ +L+PT+H++TVS  R+LL  +++ S  IDVG+II
Subjt:  HLQNF--PHATFNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKII

Query:  CNEIACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRVQEVRQGGLIH----GINSILEQLALSASQQEFAERQT-------
          ++  C  KK   L FPN IT LCR+  V  +  D IL     I    L  L  ++  +    +H    G      ++ L A ++   + Q        
Subjt:  CNEIACCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRVQEVRQGGLIH----GINSILEQLALSASQQEFAERQT-------

Query:  --LTFWNYVKNRDASLRRALQENFSKPYPALPMVGF
            F+ YVK+RD  +    QE    PY      GF
Subjt:  --LTFWNYVKNRDASLRRALQENFSKPYPALPMVGF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAAAACAAGAGCAAGGAAGGTAAGAGAGAGGGAGAGTGAGGAGGAAGAGATACCCGTGACACCAGAAGTCCAGAAAGGAAAAACTAAGAAAAAGAGAACG
CCAGAAGAGAAGGAGGCTAAGCGAAGAAGACGACAACAGAGGGTTGAAGAACAAGAAAGAGCGAGAGAGAATGAGGTTGTTACAGAAGAAGAAGAGGACCCACAA
GAATCTGGCAAAACGAATCAAGAAGAGGATGGACAGGAAGTGGCAACTATAGAAGAGGCTCGCGAGGAAATTCAGGCGAAACAGAGAGAGGTTATGCAAACGGAG
GCTGTAGAGGAAGAAGACAGAGAGCCAGTTCAGGAGGCTCGTGTGGAGGTTATCATACCTGAACCTCCGAAACGACGCCACATTAAGCGGAAGGCTGGGCGCATA
CCAGTGATCAGGACTGATACCCCATCGCCTCCATCATCAGATTCCGAGAGAGAGAAGACCGAGCAAGAAGAACGAGAGAAAAAAGAGGCTGAGGAAAGAAGGCGA
GAAGAGGCAGAGAAGCAGGCAGAGGAGGAACAGTTGCTGAAGCGGAGAGAAGAGAAGGGCAAAAAGATTGCTGAAGCATCAGAGGAGCACGATGAAATAGCAGAG
CGACAGGTGCCATACGATCGCTTCGTCAATAATTTTGCTAAAGCAAAATACACTGAGCTCCTGAAAAGGGATTTTCTGTTCGAAAGAGGTTTCAGCGGTGATCTT
CCGCCATTTCTAAGGACCGGCATAGCTGACCACGGCTGGGAGCTGTTTTGTGCTAAGCCGGAGGCTGTGAACGCACAGGTGGTGCGTGAATTCTATGCCAACATT
GATAAAGAAGAGGGTTTCCTGGTAATTGTCAGAGGAGTCGAGATAGATTGGAGTCCAAGTGCGATCAACGCACTGTACCATCTTCAAAACTTCCCCCACGCGACA
TTCAATGAAATGGCAGTCGCGCCATCTGAGGAGCAGTTAAGTAATGCTGTGAGGGAGGTAGGAATCGAGGGGGCGCAGTGGCAACTATCTAAGACTCAGAAACGG
ACATTCCAGTCGGCTTATTTGAAAAAGGAAGCGAAGACATGGATGGGTTTCATCAGGCAGAGGTTGCTTCCGACAACGCACGACTCGACAGTTTCCAGGGAACGA
ATTCTTTTGGCTTTTGCTATCTTAAGGTCTCTCAGTATTGACGTAGGAAAAATTATTTGTAATGAAATTGCTTGCTGCTGGAAGAAAAAGGTGGGGAAACTATTC
TTTCCGAACACAATTACTATGCTATGCAGAAGAGCAGGGGTTCCGGTAGATGAGGGAGATGTGATCCTGTTTGATAAGGGGATCATAGACACGCCCAATTTGGCA
CGGCTCCAGCGCGTGCAGGAGGTACGTCAAGGTGGGCTTATCCATGGCATCAACTCGATCCTAGAACAACTAGCACTTTCGGCCAGTCAGCAGGAGTTTGCTGAG
AGGCAAACTTTAACCTTCTGGAACTATGTTAAGAATCGGGATGCCAGCTTAAGAAGGGCACTGCAAGAGAATTTTTCCAAACCTTACCCTGCCCTTCCCATGGTT
GGTTTTAGAGAGATATTATATGATTTAACCCATTTGGAAGAATGGAAGCATTGGAAATTATTTTGTGCAGATTATGCTGCTGAGCGACTGGAGGGAGCAAATTCT
ATGCTGCAGCAAAACTGGGAGCAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATGAACCGACTTCTGTTGAGTTATTTTCGTGATAAAGGATCAAGGAGA
GCCTTACACGTGTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCAAAACAAGAGCAAGGAAGGTAAGAGAGAGGGAGAGTGAGGAGGAAGAGATACCCGTGACACCAGAAGTCCAGAAAGGAAAAACTAAGAAAAAGAGAACG
CCAGAAGAGAAGGAGGCTAAGCGAAGAAGACGACAACAGAGGGTTGAAGAACAAGAAAGAGCGAGAGAGAATGAGGTTGTTACAGAAGAAGAAGAGGACCCACAA
GAATCTGGCAAAACGAATCAAGAAGAGGATGGACAGGAAGTGGCAACTATAGAAGAGGCTCGCGAGGAAATTCAGGCGAAACAGAGAGAGGTTATGCAAACGGAG
GCTGTAGAGGAAGAAGACAGAGAGCCAGTTCAGGAGGCTCGTGTGGAGGTTATCATACCTGAACCTCCGAAACGACGCCACATTAAGCGGAAGGCTGGGCGCATA
CCAGTGATCAGGACTGATACCCCATCGCCTCCATCATCAGATTCCGAGAGAGAGAAGACCGAGCAAGAAGAACGAGAGAAAAAAGAGGCTGAGGAAAGAAGGCGA
GAAGAGGCAGAGAAGCAGGCAGAGGAGGAACAGTTGCTGAAGCGGAGAGAAGAGAAGGGCAAAAAGATTGCTGAAGCATCAGAGGAGCACGATGAAATAGCAGAG
CGACAGGTGCCATACGATCGCTTCGTCAATAATTTTGCTAAAGCAAAATACACTGAGCTCCTGAAAAGGGATTTTCTGTTCGAAAGAGGTTTCAGCGGTGATCTT
CCGCCATTTCTAAGGACCGGCATAGCTGACCACGGCTGGGAGCTGTTTTGTGCTAAGCCGGAGGCTGTGAACGCACAGGTGGTGCGTGAATTCTATGCCAACATT
GATAAAGAAGAGGGTTTCCTGGTAATTGTCAGAGGAGTCGAGATAGATTGGAGTCCAAGTGCGATCAACGCACTGTACCATCTTCAAAACTTCCCCCACGCGACA
TTCAATGAAATGGCAGTCGCGCCATCTGAGGAGCAGTTAAGTAATGCTGTGAGGGAGGTAGGAATCGAGGGGGCGCAGTGGCAACTATCTAAGACTCAGAAACGG
ACATTCCAGTCGGCTTATTTGAAAAAGGAAGCGAAGACATGGATGGGTTTCATCAGGCAGAGGTTGCTTCCGACAACGCACGACTCGACAGTTTCCAGGGAACGA
ATTCTTTTGGCTTTTGCTATCTTAAGGTCTCTCAGTATTGACGTAGGAAAAATTATTTGTAATGAAATTGCTTGCTGCTGGAAGAAAAAGGTGGGGAAACTATTC
TTTCCGAACACAATTACTATGCTATGCAGAAGAGCAGGGGTTCCGGTAGATGAGGGAGATGTGATCCTGTTTGATAAGGGGATCATAGACACGCCCAATTTGGCA
CGGCTCCAGCGCGTGCAGGAGGTACGTCAAGGTGGGCTTATCCATGGCATCAACTCGATCCTAGAACAACTAGCACTTTCGGCCAGTCAGCAGGAGTTTGCTGAG
AGGCAAACTTTAACCTTCTGGAACTATGTTAAGAATCGGGATGCCAGCTTAAGAAGGGCACTGCAAGAGAATTTTTCCAAACCTTACCCTGCCCTTCCCATGGTT
GGTTTTAGAGAGATATTATATGATTTAACCCATTTGGAAGAATGGAAGCATTGGAAATTATTTTGTGCAGATTATGCTGCTGAGCGACTGGAGGGAGCAAATTCT
ATGCTGCAGCAAAACTGGGAGCAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATGAACCGACTTCTGTTGAGTTATTTTCGTGATAAAGGATCAAGGAGA
GCCTTACACGTGTCCTAG
Protein sequenceShow/hide protein sequence
MAKTRARKVRERESEEEEIPVTPEVQKGKTKKKRTPEEKEAKRRRRQQRVEEQERARENEVVTEEEEDPQESGKTNQEEDGQEVATIEEAREEIQAKQREVMQTE
AVEEEDREPVQEARVEVIIPEPPKRRHIKRKAGRIPVIRTDTPSPPSSDSEREKTEQEEREKKEAEERRREEAEKQAEEEQLLKRREEKGKKIAEASEEHDEIAE
RQVPYDRFVNNFAKAKYTELLKRDFLFERGFSGDLPPFLRTGIADHGWELFCAKPEAVNAQVVREFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQNFPHAT
FNEMAVAPSEEQLSNAVREVGIEGAQWQLSKTQKRTFQSAYLKKEAKTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEIACCWKKKVGKLF
FPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRVQEVRQGGLIHGINSILEQLALSASQQEFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPMV
GFREILYDLTHLEEWKHWKLFCADYAAERLEGANSMLQQNWEQKLPHHSSLANFMNRLLLSYFRDKGSRRALHVS