CuGenDBv2

Spg009183 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg009183
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold7:38847680..38849686
RNA-Seq Expression: Spg009183
Synteny: Spg009183
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, %identity, alignment)
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus] | e value: 8.3e-14 | %identity: 23.12
Query:  FVNNLARAKYDELLKRDFLFERGF------SGDLPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVIVRGVAVDWSPGAINALYNLQDF
        FV+  A+  Y  +  R   FE GF      + +L   +   +T H W+ F   P  VNA +V++FY+NI E     V+VRG+++ ++P AIN  + LQ  
Subjt:  FVNNLARAKYDELLKRDFLFERGF------SGDLPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVIVRGVAVDWSPGAINALYNLQDF

Query:  PHVGYNEMLVAPSNEQLRDAVLEGTCSSGV------------------------------------------CD--------LRSLSIDVGRIIASEISG
            +N       + +    +LE  C  G                                           C         L   +ID+G+II      
Subjt:  PHVGYNEMLVAPSNEQLRDAVLEGTCSSGV------------------------------------------CD--------LRSLSIDVGRIIASEISG

Query:  CWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARLQSTQEARQGGLVYGIHKILEQLALSASRQEFAERQSQT-------------FW
        C +++   L FPN IT LC++  V E   D IL     ++   +  L   +EA+         ++     + AS  +  +   +T             ++
Subjt:  CWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARLQSTQEARQGGLVYGIHKILEQLALSASRQEFAERQSQT-------------FW

Query:  NYVKRRDGNLKKVLQENFSK
         Y KRRD  L   L E+  +
Subjt:  NYVKRRDGNLKKVLQENFSK

KAE8707640.1 hypothetical protein F3Y22_tig00110378pilonHSYRG00039 [Hibiscus syriacus] | e value: 3.7e-14 | %identity: 24.37
Query:  LAEASPEPKDSEAEEPRLPYNRFVNNLARAKYDELLKRDFLFERGFSGDLPHFLRAGITNHGW--------ELFCYKPEFVNAQVVRKFYANIDEEEGFQ
        +  + P P D+ A        +F N+ A+ ++  +  R   FE G       F+    T+ G+        + F   P  VNA +V++FYANI +     
Subjt:  LAEASPEPKDSEAEEPRLPYNRFVNNLARAKYDELLKRDFLFERGFSGDLPHFLRAGITNHGW--------ELFCYKPEFVNAQVVRKFYANIDEEEGFQ

Query:  VIVRGVAVDWSPGAINALYNLQDFPHVGYNEMLVAPSNEQLRDAVLEGTCSSGV----------------------------------------------
        + VRG  + ++  AIN  ++LQ+   V  + ML   ++    D VLE  C                                                  
Subjt:  VIVRGVAVDWSPGAINALYNLQDFPHVGYNEMLVAPSNEQLRDAVLEGTCSSGV----------------------------------------------

Query:  ----CDLRSLSIDVGRIIASEISGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARLQSTQEARQGGLVY----GIHKILEQLALS
              + S  IDVGRII  ++  C  KK   L FPN IT LC +  V EN  D IL     I    L  L   +  +    V+    G  +   +  L 
Subjt:  ----CDLRSLSIDVGRIIASEISGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARLQSTQEARQGGLVY----GIHKILEQLALS

Query:  ASRQEFAERQSQ---------TFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLL
        A  ++  + Q+Q          F+ YVK RD  ++ + QE           FPD++L
Subjt:  ASRQEFAERQSQ---------TFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLL

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus] | e value: 1.5e-15 | %identity: 24.23
Query:  LAEASPEPKDSEAEEPRLPYNRFVNNLARAKYDELLKRDFLFERGF------SGDLPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVI
        +  + P P D+      + + +F N+ A+A++     R+  FE GF       G     +   +    W  F + P  VNA +V++FYANI +     + 
Subjt:  LAEASPEPKDSEAEEPRLPYNRFVNNLARAKYDELLKRDFLFERGF------SGDLPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVI

Query:  VRGVAVDWSPGAINALYNLQDFPHVGYNEMLVAPSNEQLRDAVLEGTCSSGV------------------------------------------------
        VRG  + ++  AIN  ++LQ+   V  + M    ++    D VLE  C                                                    
Subjt:  VRGVAVDWSPGAINALYNLQDFPHVGYNEMLVAPSNEQLRDAVLEGTCSSGV------------------------------------------------

Query:  --CDLRSLSIDVGRIIASEISGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARLQSTQEARQGGLVY----GIHKILEQLALSAS
            + S  IDVGRII  ++  C  KK   L FPN IT LC++  V EN  D IL     I    L  L   +  +    V+    G  +   ++ L A 
Subjt:  --CDLRSLSIDVGRIIASEISGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARLQSTQEARQGGLVY----GIHKILEQLALSAS

Query:  RQEFAERQSQ---------TFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLL
         +   + Q+Q          F+ YVK RD  ++ + QE           FPD++L
Subjt:  RQEFAERQSQ---------TFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | e value: 4.7e-25 | %identity: 29.65
Query:  RFVNNLARAKYD-ELLKRDFLFERGF-------SGDLPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVIVRGVAVDWSPGAINALYNL
        +F    A  +Y+  +  R    E+GF        G LP F+   IT H W+ FC  PE     +VR+FYAN+ + E   V VRGV V WS  AINA++ L
Subjt:  RFVNNLARAKYD-ELLKRDFLFERGF-------SGDLPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVIVRGVAVDWSPGAINALYNL

Query:  QDFPHVGYNEML--------------VAPSNEQ--------------------------LRDAVLEGTCSSGVCDLRSL---------SIDVGRIIASEI
         D P   ++E +              VA +  +                          L+  +L  T    V   R L         SI+VGR+I SEI
Subjt:  QDFPHVGYNEML--------------VAPSNEQ--------------------------LRDAVLEGTCSSGVCDLRSL---------SIDVGRIIASEI

Query:  SGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARL------QSTQEA-----------RQGGLVYGIHKILEQLALSASRQEF---
          C  +K G LFFP+ IT LC+    P    +  L + G ID + +AR+      +STQ+            R  G +    K LEQ       Q++   
Subjt:  SGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARL------QSTQEA-----------RQGGLVYGIHKILEQLALSASRQEF---

Query:  -----AERQSQTFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLLNPWIPPPPMEREKEDDENEQGQED
               +Q Q FW Y K RD  LKK LQ NF++P  T   FP ++L        ++ E E + ++ G  +
Subjt:  -----AERQSQTFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLLNPWIPPPPMEREKEDDENEQGQED

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] | e value: 8.5e-19 | %identity: 30.56
Query:  VVRKFYANIDEEEGFQVIVRGVAVDWSPGAINALYNLQDFPHVGYNEMLVAPSNEQLRDAVLEGTCSSGV------------------------------
        +VR+FYAN+ + E   + VRGV V WS  AINA++ L D P   ++E +   +  +L   VLE   ++G                               
Subjt:  VVRKFYANIDEEEGFQVIVRGVAVDWSPGAINALYNLQDFPHVGYNEMLVAPSNEQLRDAVLEGTCSSGV------------------------------

Query:  --------------------CDLRSLSIDVGRIIASEISGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARL------QSTQE--
                              L   SI+VGR+I SEI  C  +K G LFFP+ IT LC+      NE    L + G ID + +AR+      +STQ+  
Subjt:  --------------------CDLRSLSIDVGRIIASEISGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARL------QSTQE--

Query:  ---------ARQGGLVYGIHKILEQLALSASRQEFAERQSQTFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLLNPWIPPPPMEREKEDDENEQGQE
                 +R  G V    K LEQ     S+QE   +Q Q FW Y K RD  LKK LQ NF++P  T   FP ++L        ++ E E + ++ G  
Subjt:  ---------ARQGGLVYGIHKILEQLALSASRQEFAERQSQTFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLLNPWIPPPPMEREKEDDENEQGQE

Query:  D
        +
Subjt:  D

TrEMBL top hits (e value, %identity, alignment)
A0A1S2Z475 uncharacterized protein LOC101493401 isoform X3 | e value: 8.9e-14 | %identity: 23.24
Query:  YNRFVNNLARAKYDELLK-RDFLFERGFSGD-------LPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVIVRGVAVDWSPGAINALY
        + +F+N   + K+D L+K R+F  E GFS +       LP  L + I  H W+ F        A +VR+FY+ I E +   V+VRGV V ++P  +N  +
Subjt:  YNRFVNNLARAKYDELLK-RDFLFERGFSGD-------LPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVIVRGVAVDWSPGAINALY

Query:  NL------QDFPHVGYNEMLVAPSNEQLRDAVLEGTCSSGV---------------------------------------------------CDLRSLSI
        NL       D   V   + L    +++  +++++     G                                                    C +   SI
Subjt:  NL------QDFPHVGYNEMLVAPSNEQLRDAVLEGTCSSGV---------------------------------------------------CDLRSLSI

Query:  DVGRIIASEISGC--WRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARL--QSTQEARQGGLV-------YGIHK-----ILEQLALS
        +VG+II  EI  C   +KK  +L FP+ I+ LC R GV   + D ++ ++  I   +L R       ++++ G V        G  +       E+  + 
Subjt:  DVGRIIASEISGC--WRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARL--QSTQEARQGGLV-------YGIHK-----ILEQLALS

Query:  ASRQEFA--------------ERQSQTFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLLNPWIPPPPMEREKEDDENEQG
          +++F                +Q++ FW + K      +K+ + NF K       FPD++L P++  P  E+ K  D  E G
Subjt:  ASRQEFA--------------ERQSQTFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLLNPWIPPPPMEREKEDDENEQG

A0A2P5BCG4 Uncharacterized protein (Fragment) | e value: 2.3e-25 | %identity: 29.65
Query:  RFVNNLARAKYD-ELLKRDFLFERGF-------SGDLPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVIVRGVAVDWSPGAINALYNL
        +F    A  +Y+  +  R    E+GF        G LP F+   IT H W+ FC  PE     +VR+FYAN+ + E   V VRGV V WS  AINA++ L
Subjt:  RFVNNLARAKYD-ELLKRDFLFERGF-------SGDLPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVIVRGVAVDWSPGAINALYNL

Query:  QDFPHVGYNEML--------------VAPSNEQ--------------------------LRDAVLEGTCSSGVCDLRSL---------SIDVGRIIASEI
         D P   ++E +              VA +  +                          L+  +L  T    V   R L         SI+VGR+I SEI
Subjt:  QDFPHVGYNEML--------------VAPSNEQ--------------------------LRDAVLEGTCSSGVCDLRSL---------SIDVGRIIASEI

Query:  SGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARL------QSTQEA-----------RQGGLVYGIHKILEQLALSASRQEF---
          C  +K G LFFP+ IT LC+    P    +  L + G ID + +AR+      +STQ+            R  G +    K LEQ       Q++   
Subjt:  SGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARL------QSTQEA-----------RQGGLVYGIHKILEQLALSASRQEF---

Query:  -----AERQSQTFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLLNPWIPPPPMEREKEDDENEQGQED
               +Q Q FW Y K RD  LKK LQ NF++P  T   FP ++L        ++ E E + ++ G  +
Subjt:  -----AERQSQTFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLLNPWIPPPPMEREKEDDENEQGQED

A0A2P5DXM3 Uncharacterized protein | e value: 4.1e-19 | %identity: 30.56
Query:  VVRKFYANIDEEEGFQVIVRGVAVDWSPGAINALYNLQDFPHVGYNEMLVAPSNEQLRDAVLEGTCSSGV------------------------------
        +VR+FYAN+ + E   + VRGV V WS  AINA++ L D P   ++E +   +  +L   VLE   ++G                               
Subjt:  VVRKFYANIDEEEGFQVIVRGVAVDWSPGAINALYNLQDFPHVGYNEMLVAPSNEQLRDAVLEGTCSSGV------------------------------

Query:  --------------------CDLRSLSIDVGRIIASEISGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARL------QSTQE--
                              L   SI+VGR+I SEI  C  +K G LFFP+ IT LC+      NE    L + G ID + +AR+      +STQ+  
Subjt:  --------------------CDLRSLSIDVGRIIASEISGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARL------QSTQE--

Query:  ---------ARQGGLVYGIHKILEQLALSASRQEFAERQSQTFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLLNPWIPPPPMEREKEDDENEQGQE
                 +R  G V    K LEQ     S+QE   +Q Q FW Y K RD  LKK LQ NF++P  T   FP ++L        ++ E E + ++ G  
Subjt:  ---------ARQGGLVYGIHKILEQLALSASRQEFAERQSQTFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLLNPWIPPPPMEREKEDDENEQGQE

Query:  D
        +
Subjt:  D

A0A6A2ZUE4 Uncharacterized protein | e value: 4.0e-14 | %identity: 23.12
Query:  FVNNLARAKYDELLKRDFLFERGF------SGDLPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVIVRGVAVDWSPGAINALYNLQDF
        FV+  A+  Y  +  R   FE GF      + +L   +   +T H W+ F   P  VNA +V++FY+NI E     V+VRG+++ ++P AIN  + LQ  
Subjt:  FVNNLARAKYDELLKRDFLFERGF------SGDLPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVIVRGVAVDWSPGAINALYNLQDF

Query:  PHVGYNEMLVAPSNEQLRDAVLEGTCSSGV------------------------------------------CD--------LRSLSIDVGRIIASEISG
            +N       + +    +LE  C  G                                           C         L   +ID+G+II      
Subjt:  PHVGYNEMLVAPSNEQLRDAVLEGTCSSGV------------------------------------------CD--------LRSLSIDVGRIIASEISG

Query:  CWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARLQSTQEARQGGLVYGIHKILEQLALSASRQEFAERQSQT-------------FW
        C +++   L FPN IT LC++  V E   D IL     ++   +  L   +EA+         ++     + AS  +  +   +T             ++
Subjt:  CWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARLQSTQEARQGGLVYGIHKILEQLALSASRQEFAERQSQT-------------FW

Query:  NYVKRRDGNLKKVLQENFSK
         Y KRRD  L   L E+  +
Subjt:  NYVKRRDGNLKKVLQENFSK

A0A6A3BU96 Uncharacterized protein | e value: 7.3e-16 | %identity: 24.23
Query:  LAEASPEPKDSEAEEPRLPYNRFVNNLARAKYDELLKRDFLFERGF------SGDLPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVI
        +  + P P D+      + + +F N+ A+A++     R+  FE GF       G     +   +    W  F + P  VNA +V++FYANI +     + 
Subjt:  LAEASPEPKDSEAEEPRLPYNRFVNNLARAKYDELLKRDFLFERGF------SGDLPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVI

Query:  VRGVAVDWSPGAINALYNLQDFPHVGYNEMLVAPSNEQLRDAVLEGTCSSGV------------------------------------------------
        VRG  + ++  AIN  ++LQ+   V  + M    ++    D VLE  C                                                    
Subjt:  VRGVAVDWSPGAINALYNLQDFPHVGYNEMLVAPSNEQLRDAVLEGTCSSGV------------------------------------------------

Query:  --CDLRSLSIDVGRIIASEISGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARLQSTQEARQGGLVY----GIHKILEQLALSAS
            + S  IDVGRII  ++  C  KK   L FPN IT LC++  V EN  D IL     I    L  L   +  +    V+    G  +   ++ L A 
Subjt:  --CDLRSLSIDVGRIIASEISGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARLQSTQEARQGGLVY----GIHKILEQLALSAS

Query:  RQEFAERQSQ---------TFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLL
         +   + Q+Q          F+ YVK RD  ++ + QE           FPD++L
Subjt:  RQEFAERQSQ---------TFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLL

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
No hits found

Sequences
CDS sequence
ATGGATGCTGCCATGTGTCGCGCAGGGACTGAGCCGATAGTTGCGAATACGGAGGAAGTCCGAGAAGAAAACACAGAGGAAGTTCAAGAAAAGCAGACTGAGGATGCACG
AGAAGAACGGACAGAGGTTGCGCCTGAAAGAGGTAATGAGCAGGAGGTACAAGAGGCTCGAGTGGAGGTTATCATGCCGGAAGTACCAAGACGTCGCCACCGGAAGCAAA
AAGCTAGTCGCGTCAAGAAAAAGGAGGCCGAAGAAAAAACAAGAGAGAAAGAGGAGAAAAAGGCTGAAGAAGAAAGTTTGCTCAAGCAAAGGGCAGACAAGGGCAAAGGT
CTTGCTGAGGCATCGCCGGAACCAAAAGATAGCGAAGCAGAGGAGCCACGGTTACCGTACAATCGCTTTGTCAACAATCTTGCCAGAGCAAAGTATGATGAGCTACTGAA
GAGAGACTTCCTGTTTGAGAGAGGATTTAGTGGTGATCTTCCACATTTTCTGAGAGCCGGCATTACGAACCATGGTTGGGAGTTGTTTTGTTATAAGCCTGAATTTGTGA
ACGCGCAAGTAGTGCGCAAGTTTTATGCAAATATCGACGAAGAAGAGGGTTTCCAAGTTATCGTTCGAGGAGTAGCAGTTGACTGGAGTCCTGGTGCCATTAACGCCCTG
TATAACCTTCAAGATTTCCCCCACGTAGGATACAATGAGATGCTTGTGGCGCCATCTAATGAGCAATTGAGAGATGCTGTGCTGGAAGGAACGTGTTCTTCTGGCGTTTG
CGATTTAAGGTCTCTTAGTATTGATGTGGGAAGGATTATTGCGAGTGAGATATCTGGATGCTGGAGGAAGAAAGTGGGGAAGTTGTTTTTCCCGAATACAATTACAATGC
TTTGCAAGCGAGTAGGGGTTCCGGAGAATGAAGGAGATGCCATATTATTTGACAAGGGAATCATTGATACGCTTAACTTGGCACGACTTCAGAGTACACAAGAGGCACGC
CAGGGTGGGCTTGTTTATGGCATTCACAAGATTTTAGAACAACTTGCATTGTCGGCCAGCAGGCAAGAGTTTGCCGAGAGGCAATCTCAAACTTTCTGGAACTATGTTAA
ACGTCGTGATGGCAATCTGAAGAAGGTGCTACAGGAAAATTTTTCCAAACCATATTCAACCCTTCTAGTGTTCCCTGATGATTTGTTGAACCCCTGGATTCCGCCCCCAC
CAATGGAAAGAGAAAAAGAGGATGATGAAAATGAGCAGGGTCAGGAGGACTGA
mRNA sequence
ATGGATGCTGCCATGTGTCGCGCAGGGACTGAGCCGATAGTTGCGAATACGGAGGAAGTCCGAGAAGAAAACACAGAGGAAGTTCAAGAAAAGCAGACTGAGGATGCACG
AGAAGAACGGACAGAGGTTGCGCCTGAAAGAGGTAATGAGCAGGAGGTACAAGAGGCTCGAGTGGAGGTTATCATGCCGGAAGTACCAAGACGTCGCCACCGGAAGCAAA
AAGCTAGTCGCGTCAAGAAAAAGGAGGCCGAAGAAAAAACAAGAGAGAAAGAGGAGAAAAAGGCTGAAGAAGAAAGTTTGCTCAAGCAAAGGGCAGACAAGGGCAAAGGT
CTTGCTGAGGCATCGCCGGAACCAAAAGATAGCGAAGCAGAGGAGCCACGGTTACCGTACAATCGCTTTGTCAACAATCTTGCCAGAGCAAAGTATGATGAGCTACTGAA
GAGAGACTTCCTGTTTGAGAGAGGATTTAGTGGTGATCTTCCACATTTTCTGAGAGCCGGCATTACGAACCATGGTTGGGAGTTGTTTTGTTATAAGCCTGAATTTGTGA
ACGCGCAAGTAGTGCGCAAGTTTTATGCAAATATCGACGAAGAAGAGGGTTTCCAAGTTATCGTTCGAGGAGTAGCAGTTGACTGGAGTCCTGGTGCCATTAACGCCCTG
TATAACCTTCAAGATTTCCCCCACGTAGGATACAATGAGATGCTTGTGGCGCCATCTAATGAGCAATTGAGAGATGCTGTGCTGGAAGGAACGTGTTCTTCTGGCGTTTG
CGATTTAAGGTCTCTTAGTATTGATGTGGGAAGGATTATTGCGAGTGAGATATCTGGATGCTGGAGGAAGAAAGTGGGGAAGTTGTTTTTCCCGAATACAATTACAATGC
TTTGCAAGCGAGTAGGGGTTCCGGAGAATGAAGGAGATGCCATATTATTTGACAAGGGAATCATTGATACGCTTAACTTGGCACGACTTCAGAGTACACAAGAGGCACGC
CAGGGTGGGCTTGTTTATGGCATTCACAAGATTTTAGAACAACTTGCATTGTCGGCCAGCAGGCAAGAGTTTGCCGAGAGGCAATCTCAAACTTTCTGGAACTATGTTAA
ACGTCGTGATGGCAATCTGAAGAAGGTGCTACAGGAAAATTTTTCCAAACCATATTCAACCCTTCTAGTGTTCCCTGATGATTTGTTGAACCCCTGGATTCCGCCCCCAC
CAATGGAAAGAGAAAAAGAGGATGATGAAAATGAGCAGGGTCAGGAGGACTGA
Protein sequence
MDAAMCRAGTEPIVANTEEVREENTEEVQEKQTEDAREERTEVAPERGNEQEVQEARVEVIMPEVPRRRHRKQKASRVKKKEAEEKTREKEEKKAEEESLLKQRADKGKG
LAEASPEPKDSEAEEPRLPYNRFVNNLARAKYDELLKRDFLFERGFSGDLPHFLRAGITNHGWELFCYKPEFVNAQVVRKFYANIDEEEGFQVIVRGVAVDWSPGAINAL
YNLQDFPHVGYNEMLVAPSNEQLRDAVLEGTCSSGVCDLRSLSIDVGRIIASEISGCWRKKVGKLFFPNTITMLCKRVGVPENEGDAILFDKGIIDTLNLARLQSTQEAR
QGGLVYGIHKILEQLALSASRQEFAERQSQTFWNYVKRRDGNLKKVLQENFSKPYSTLLVFPDDLLNPWIPPPPMEREKEDDENEQGQED