CuGenDBv2

Spg024622 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg024622
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: RT_RNaseH_2 domain-containing protein
Genome location: scaffold12:18207516..18211258
RNA-Seq Expression: Spg024622
Synteny: Spg024622
Gene Ontology terms: NA
InterPro domains: NA
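
The Genome location field above can be split into a sequence name and a coordinate pair for downstream use. The following is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive at both ends, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based, inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold12:18207516..18211258")
print(seqid, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 3743 bases on scaffold12. If the coordinates were 0-based or half-open, the length would differ by one.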


Homology
GenBank top hits (e value, % identity, alignment)
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]; e value: 6.4e-25; % identity: 28.57
Query:  FVNNIARAKYLEMLKRDFLFERGF------GDDLPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVVVDWSPGAINSLFNL---
        FV+  A+  Y  +  R   FE GF        +L   +   +T H W++F   PVPVN+ IV+EFY+NI +      +VRG+ + ++P AIN  F L   
Subjt:  FVNNIARAKYLEMLKRDFLFERGF------GDDLPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVVVDWSPGAINSLFNL---

Query:  --------QDFPHVGYN---EMAAAP----SNDQLNAAVRESE-----ANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILRSLSIDVGKIISSKIHTFW
                Q+  H  Y    E    P    +  QL     + +        W  F+K +L+PT+H++ VS  R+LL+ +IL   +ID+GKII    H   
Subjt:  --------QDFPHVGYN---EMAAAP----SNDQLNAAVRESE-----ANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILRSLSIDVGKIISSKIHTFW

Query:  RKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL-------QRSQEARQGRLVCGIH------QIQEQLQMHSSRMEFVERQFQTYWNY
        +++   L FPN IT LC++  V   + D IL     ++   +  L        +  EA   R+    H       +++ +Q     +  +  +   Y+ Y
Subjt:  RKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL-------QRSQEARQGRLVCGIH------QIQEQLQMHSSRMEFVERQFQTYWNY

Query:  VKWRDATLRRALQSNFSKLYQA
         K RDA L  AL  +  +L +A
Subjt:  VKWRDATLRRALQSNFSKLYQA

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]; e value: 2.7e-23; % identity: 26.18
Query:  LPYERFVNNIARAKYLEMLKRDFLFE----------RGFGDDLPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVVVDWSPGAI
        + +++F N+ A+A++     R+  FE           GFG D+       +    W +F   P  VN+++V+EFYANI +       VRG  + ++  AI
Subjt:  LPYERFVNNIARAKYLEMLKRDFLFE----------RGFGDDLPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVVVDWSPGAI

Query:  NSLFNLQDF--PHVGYNEMAAAPSND---------------------QLNAAVRESEANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILRSLSIDVGKI
        N  F+LQ+    H  + E A +   D                      +N    +  A  W  F+K +L+PT+H++ VS  R+LL+ +++ S  IDVG+I
Subjt:  NSLFNLQDF--PHVGYNEMAAAPSND---------------------QLNAAVRESEANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILRSLSIDVGKI

Query:  ISSKIHTFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL-------------QRSQEARQGRLVCGIHQIQEQLQMHSSRMEFVE
        I  ++H    KK   L FPN IT LC++  V  +  D IL     I    L  L             ++S    +      +  ++E +    +++  + 
Subjt:  ISSKIHTFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL-------------QRSQEARQGRLVCGIHQIQEQLQMHSSRMEFVE

Query:  RQFQTYWNYVKWRDATLRRALQSNFSKLYQAFPVFPDDLL
           + ++ YVK RD  +    Q       + FP FPD++L
Subjt:  RQFQTYWNYVKWRDATLRRALQSNFSKLYQAFPVFPDDLL

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]; e value: 1.4e-24; % identity: 35.89
Query:  RFVNNIARAKYLEMLK-RDFLFERGFGDD-------LPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVVVDWSPGAINSLFNL
        +F    A  +Y   ++ R    E+GF  D       LP F+   IT H W+QFCA P      +VREFYAN+         VRGV V WS  AIN++F L
Subjt:  RFVNNIARAKYLEMLK-RDFLFERGFGDD-------LPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVVVDWSPGAINSLFNL

Query:  QD--FPHVGYNE---------------MAAAPSNDQLNAA---VRES---EANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILRSLSIDVGKIISSKIH
         D    H  + E               +A A  N     A   +R +    A  W  F+K  LLPTTH   VS+DR+LL+ ++L   SI+VG++I S+I 
Subjt:  QD--FPHVGYNE---------------MAAAPSNDQLNAA---VRES---EANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILRSLSIDVGKIISSKIH

Query:  TFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL
            +K G LFFP+ IT LC+ A  P  + +  L + G ID   +AR+
Subjt:  TFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]; e value: 7.5e-34; % identity: 32.96
Query:  EAAKEEIEEQWLPYERFVNNIARAKYLEMLKRDFLFERGFGDD-------LPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVV
        +A K E E     YE  + N           R    E+GF  D       LP F+   IT H W+QFCA P      +VREFYAN+   E     VRGV 
Subjt:  EAAKEEIEEQWLPYERFVNNIARAKYLEMLKRDFLFERGFGDD-------LPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVV

Query:  VDWSPGAINSLFNLQDFPHVGYNE----------------MAAAPSNDQLNA-----AVRES---EANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILR
        V WS  AIN++F L D P   ++E                +AAA +   ++A      +R +    A  W  F+K RLLPTTH   VS+DR+LL+ ++L 
Subjt:  VDWSPGAINSLFNLQDFPHVGYNE----------------MAAAPSNDQLNA-----AVRES---EANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILR

Query:  SLSIDVGKIISSKIHTFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARLQR------------------SQEARQGRLVCGIHQIQ
          SI+VG++I S+I     +K G LFFP+ IT LC+ A  P  + +  L + G ID   +AR+ +                  S     G ++  +  ++
Subjt:  SLSIDVGKIISSKIHTFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARLQR------------------SQEARQGRLVCGIHQIQ

Query:  EQL------QMH-SSRMEFVERQFQTYWNYVKWRDATLRRALQSNFSKLYQAFPVFPDDLL
        ++L      Q H  S ++   +Q Q +W Y K RD  L++ALQ+NF++    FP FP ++L
Subjt:  EQL------QMH-SSRMEFVERQFQTYWNYVKWRDATLRRALQSNFSKLYQAFPVFPDDLL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]; e value: 5.6e-29; % identity: 35.66
Query:  IVREFYANIDQEEGFQAIVRGVVVDWSPGAINSLFNLQD--FPHVGYNE-------------MAAAPSNDQLNA-----AVRES---EANTWLGFVKLRL
        +VREFYAN+   E     VRGV V WS  AIN++F L D    H  + E             +AAA +   ++A      +R +    A  W  F+K RL
Subjt:  IVREFYANIDQEEGFQAIVRGVVVDWSPGAINSLFNLQD--FPHVGYNE-------------MAAAPSNDQLNA-----AVRES---EANTWLGFVKLRL

Query:  LPTTHDSIVSRDRVLLVFAILRSLSIDVGKIISSKIHTFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL--------------Q
        LPTTH  IVS+DR+LL+ ++L   SI+VG++I S+I     +K G LFFP+ IT LC+ A  P  + +  L + G ID   +AR+               
Subjt:  LPTTHDSIVSRDRVLLVFAILRSLSIDVGKIISSKIHTFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL--------------Q

Query:  RSQEARQGRLVCGIHQIQEQLQMHSSRMEFVERQFQTYWNYVKWRDATLRRALQSNFSKLYQAFPVFPDDLL
        R   A   R    + Q  + L+   S+ E   +Q Q +W Y K RD  L++ALQ+NF++    FP FP ++L
Subjt:  RSQEARQGRLVCGIHQIQEQLQMHSSRMEFVERQFQTYWNYVKWRDATLRRALQSNFSKLYQAFPVFPDDLL

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment); e value: 6.9e-25; % identity: 35.89
Query:  RFVNNIARAKYLEMLK-RDFLFERGFGDD-------LPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVVVDWSPGAINSLFNL
        +F    A  +Y   ++ R    E+GF  D       LP F+   IT H W+QFCA P      +VREFYAN+         VRGV V WS  AIN++F L
Subjt:  RFVNNIARAKYLEMLK-RDFLFERGFGDD-------LPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVVVDWSPGAINSLFNL

Query:  QD--FPHVGYNE---------------MAAAPSNDQLNAA---VRES---EANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILRSLSIDVGKIISSKIH
         D    H  + E               +A A  N     A   +R +    A  W  F+K  LLPTTH   VS+DR+LL+ ++L   SI+VG++I S+I 
Subjt:  QD--FPHVGYNE---------------MAAAPSNDQLNAA---VRES---EANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILRSLSIDVGKIISSKIH

Query:  TFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL
            +K G LFFP+ IT LC+ A  P  + +  L + G ID   +AR+
Subjt:  TFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment); e value: 3.6e-34; % identity: 32.96
Query:  EAAKEEIEEQWLPYERFVNNIARAKYLEMLKRDFLFERGFGDD-------LPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVV
        +A K E E     YE  + N           R    E+GF  D       LP F+   IT H W+QFCA P      +VREFYAN+   E     VRGV 
Subjt:  EAAKEEIEEQWLPYERFVNNIARAKYLEMLKRDFLFERGFGDD-------LPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVV

Query:  VDWSPGAINSLFNLQDFPHVGYNE----------------MAAAPSNDQLNA-----AVRES---EANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILR
        V WS  AIN++F L D P   ++E                +AAA +   ++A      +R +    A  W  F+K RLLPTTH   VS+DR+LL+ ++L 
Subjt:  VDWSPGAINSLFNLQDFPHVGYNE----------------MAAAPSNDQLNA-----AVRES---EANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILR

Query:  SLSIDVGKIISSKIHTFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARLQR------------------SQEARQGRLVCGIHQIQ
          SI+VG++I S+I     +K G LFFP+ IT LC+ A  P  + +  L + G ID   +AR+ +                  S     G ++  +  ++
Subjt:  SLSIDVGKIISSKIHTFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARLQR------------------SQEARQGRLVCGIHQIQ

Query:  EQL------QMH-SSRMEFVERQFQTYWNYVKWRDATLRRALQSNFSKLYQAFPVFPDDLL
        ++L      Q H  S ++   +Q Q +W Y K RD  L++ALQ+NF++    FP FP ++L
Subjt:  EQL------QMH-SSRMEFVERQFQTYWNYVKWRDATLRRALQSNFSKLYQAFPVFPDDLL

A0A2P5DXM3 Uncharacterized protein; e value: 2.7e-29; % identity: 35.66
Query:  IVREFYANIDQEEGFQAIVRGVVVDWSPGAINSLFNLQD--FPHVGYNE-------------MAAAPSNDQLNA-----AVRES---EANTWLGFVKLRL
        +VREFYAN+   E     VRGV V WS  AIN++F L D    H  + E             +AAA +   ++A      +R +    A  W  F+K RL
Subjt:  IVREFYANIDQEEGFQAIVRGVVVDWSPGAINSLFNLQD--FPHVGYNE-------------MAAAPSNDQLNA-----AVRES---EANTWLGFVKLRL

Query:  LPTTHDSIVSRDRVLLVFAILRSLSIDVGKIISSKIHTFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL--------------Q
        LPTTH  IVS+DR+LL+ ++L   SI+VG++I S+I     +K G LFFP+ IT LC+ A  P  + +  L + G ID   +AR+               
Subjt:  LPTTHDSIVSRDRVLLVFAILRSLSIDVGKIISSKIHTFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL--------------Q

Query:  RSQEARQGRLVCGIHQIQEQLQMHSSRMEFVERQFQTYWNYVKWRDATLRRALQSNFSKLYQAFPVFPDDLL
        R   A   R    + Q  + L+   S+ E   +Q Q +W Y K RD  L++ALQ+NF++    FP FP ++L
Subjt:  RSQEARQGRLVCGIHQIQEQLQMHSSRMEFVERQFQTYWNYVKWRDATLRRALQSNFSKLYQAFPVFPDDLL

A0A6A2ZUE4 Uncharacterized protein; e value: 3.1e-25; % identity: 28.57
Query:  FVNNIARAKYLEMLKRDFLFERGF------GDDLPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVVVDWSPGAINSLFNL---
        FV+  A+  Y  +  R   FE GF        +L   +   +T H W++F   PVPVN+ IV+EFY+NI +      +VRG+ + ++P AIN  F L   
Subjt:  FVNNIARAKYLEMLKRDFLFERGF------GDDLPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVVVDWSPGAINSLFNL---

Query:  --------QDFPHVGYN---EMAAAP----SNDQLNAAVRESE-----ANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILRSLSIDVGKIISSKIHTFW
                Q+  H  Y    E    P    +  QL     + +        W  F+K +L+PT+H++ VS  R+LL+ +IL   +ID+GKII    H   
Subjt:  --------QDFPHVGYN---EMAAAP----SNDQLNAAVRESE-----ANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILRSLSIDVGKIISSKIHTFW

Query:  RKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL-------QRSQEARQGRLVCGIH------QIQEQLQMHSSRMEFVERQFQTYWNY
        +++   L FPN IT LC++  V   + D IL     ++   +  L        +  EA   R+    H       +++ +Q     +  +  +   Y+ Y
Subjt:  RKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL-------QRSQEARQGRLVCGIH------QIQEQLQMHSSRMEFVERQFQTYWNY

Query:  VKWRDATLRRALQSNFSKLYQA
         K RDA L  AL  +  +L +A
Subjt:  VKWRDATLRRALQSNFSKLYQA

A0A6A3BU96 Uncharacterized protein; e value: 1.3e-23; % identity: 26.18
Query:  LPYERFVNNIARAKYLEMLKRDFLFE----------RGFGDDLPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVVVDWSPGAI
        + +++F N+ A+A++     R+  FE           GFG D+       +    W +F   P  VN+++V+EFYANI +       VRG  + ++  AI
Subjt:  LPYERFVNNIARAKYLEMLKRDFLFE----------RGFGDDLPHFLRARITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVVVDWSPGAI

Query:  NSLFNLQDF--PHVGYNEMAAAPSND---------------------QLNAAVRESEANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILRSLSIDVGKI
        N  F+LQ+    H  + E A +   D                      +N    +  A  W  F+K +L+PT+H++ VS  R+LL+ +++ S  IDVG+I
Subjt:  NSLFNLQDF--PHVGYNEMAAAPSND---------------------QLNAAVRESEANTWLGFVKLRLLPTTHDSIVSRDRVLLVFAILRSLSIDVGKI

Query:  ISSKIHTFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL-------------QRSQEARQGRLVCGIHQIQEQLQMHSSRMEFVE
        I  ++H    KK   L FPN IT LC++  V  +  D IL     I    L  L             ++S    +      +  ++E +    +++  + 
Subjt:  ISSKIHTFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARL-------------QRSQEARQGRLVCGIHQIQEQLQMHSSRMEFVE

Query:  RQFQTYWNYVKWRDATLRRALQSNFSKLYQAFPVFPDDLL
           + ++ YVK RD  +    Q       + FP FPD++L
Subjt:  RQFQTYWNYVKWRDATLRRALQSNFSKLYQAFPVFPDDLL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCCGGAACCACCACGTCGTCGCCGCTGCAAGCAAAAGGCGGGACGAATTAAGAGGGTTCGGACGGACACCCCATCCCCGCCAACTACAGAATCAGAGAAGGAGGAACT
AGAAAAAGAGGATCAAGAGAAGGAAGAAATTGAAAAGAAGATTAAAGAGGACAGAGGCAAAGGAGTTGTCGAAGCAGCGAAAGAAGAAATTGAGGAACAATGGTTGCCAT
ACGAACGCTTCGTCAACAACATTGCTAGAGCAAAATACTTGGAGATGCTGAAGAGGGATTTTCTTTTTGAAAGAGGATTTGGAGATGACCTACCACACTTTTTACGAGCA
AGGATTACAAACCACGGATGGGAGCAATTTTGTGCAAAACCGGTGCCAGTCAACTCAAATATAGTTCGCGAATTCTACGCGAATATTGACCAAGAAGAGGGCTTCCAGGC
AATTGTTCGAGGAGTCGTGGTGGATTGGAGCCCAGGTGCGATCAATTCCTTATTTAACCTTCAAGATTTCCCACATGTCGGATATAATGAGATGGCGGCAGCGCCATCTA
ATGACCAGTTGAATGCAGCTGTTAGGGAAAGTGAAGCAAACACATGGTTGGGCTTTGTTAAGTTGCGTCTTCTGCCGACAACCCATGACTCCATTGTCTCCCGTGATCGC
GTTCTCCTGGTTTTCGCTATACTGAGGTCATTAAGCATTGATGTCGGCAAAATTATTTCCAGCAAGATTCATACTTTCTGGAGGAAAAAGGTGGGCAAGCTTTTCTTTCC
AAACACAATAACTATGCTCTGTCAAAGAGCTGGGGTTCCTACGAGTATAGAAGATGTCATCCTAATAGATAAGGGAATAATAGACATGCCAAACCTGGCAAGGCTTCAGA
GAAGTCAGGAAGCACGCCAAGGCAGATTGGTGTGTGGCATCCACCAAATACAAGAACAACTGCAAATGCATTCCAGTCGAATGGAGTTTGTCGAGAGGCAATTCCAAACG
TATTGGAATTATGTTAAGTGGAGGGATGCCACACTAAGGAGGGCTTTGCAATCCAACTTTTCCAAGCTATATCAAGCCTTCCCTGTATTCCCTGATGATTTATTAAATTC
ACCAACTGCTTGTGAAATAATCACGAACTACTGCGAACACCACCACTATGGCTACACCGGTATGACTCTGAGACTTCTAGAGGCAGGAGACTGGTCGGTGGATTTTGCTG
CAGCACACGATTTTGCTGGGGGTTCACTGGATCTTCTAGACCCTGCAGGAGACGCGGCACATCAGCCATTGAATTTTGGTTTCTACGTTGCGCTCTCCTTCTTCTCCTAA
mRNA sequence
ATGCCGGAACCACCACGTCGTCGCCGCTGCAAGCAAAAGGCGGGACGAATTAAGAGGGTTCGGACGGACACCCCATCCCCGCCAACTACAGAATCAGAGAAGGAGGAACT
AGAAAAAGAGGATCAAGAGAAGGAAGAAATTGAAAAGAAGATTAAAGAGGACAGAGGCAAAGGAGTTGTCGAAGCAGCGAAAGAAGAAATTGAGGAACAATGGTTGCCAT
ACGAACGCTTCGTCAACAACATTGCTAGAGCAAAATACTTGGAGATGCTGAAGAGGGATTTTCTTTTTGAAAGAGGATTTGGAGATGACCTACCACACTTTTTACGAGCA
AGGATTACAAACCACGGATGGGAGCAATTTTGTGCAAAACCGGTGCCAGTCAACTCAAATATAGTTCGCGAATTCTACGCGAATATTGACCAAGAAGAGGGCTTCCAGGC
AATTGTTCGAGGAGTCGTGGTGGATTGGAGCCCAGGTGCGATCAATTCCTTATTTAACCTTCAAGATTTCCCACATGTCGGATATAATGAGATGGCGGCAGCGCCATCTA
ATGACCAGTTGAATGCAGCTGTTAGGGAAAGTGAAGCAAACACATGGTTGGGCTTTGTTAAGTTGCGTCTTCTGCCGACAACCCATGACTCCATTGTCTCCCGTGATCGC
GTTCTCCTGGTTTTCGCTATACTGAGGTCATTAAGCATTGATGTCGGCAAAATTATTTCCAGCAAGATTCATACTTTCTGGAGGAAAAAGGTGGGCAAGCTTTTCTTTCC
AAACACAATAACTATGCTCTGTCAAAGAGCTGGGGTTCCTACGAGTATAGAAGATGTCATCCTAATAGATAAGGGAATAATAGACATGCCAAACCTGGCAAGGCTTCAGA
GAAGTCAGGAAGCACGCCAAGGCAGATTGGTGTGTGGCATCCACCAAATACAAGAACAACTGCAAATGCATTCCAGTCGAATGGAGTTTGTCGAGAGGCAATTCCAAACG
TATTGGAATTATGTTAAGTGGAGGGATGCCACACTAAGGAGGGCTTTGCAATCCAACTTTTCCAAGCTATATCAAGCCTTCCCTGTATTCCCTGATGATTTATTAAATTC
ACCAACTGCTTGTGAAATAATCACGAACTACTGCGAACACCACCACTATGGCTACACCGGTATGACTCTGAGACTTCTAGAGGCAGGAGACTGGTCGGTGGATTTTGCTG
CAGCACACGATTTTGCTGGGGGTTCACTGGATCTTCTAGACCCTGCAGGAGACGCGGCACATCAGCCATTGAATTTTGGTTTCTACGTTGCGCTCTCCTTCTTCTCCTAA
Protein sequence
MPEPPRRRRCKQKAGRIKRVRTDTPSPPTTESEKEELEKEDQEKEEIEKKIKEDRGKGVVEAAKEEIEEQWLPYERFVNNIARAKYLEMLKRDFLFERGFGDDLPHFLRA
RITNHGWEQFCAKPVPVNSNIVREFYANIDQEEGFQAIVRGVVVDWSPGAINSLFNLQDFPHVGYNEMAAAPSNDQLNAAVRESEANTWLGFVKLRLLPTTHDSIVSRDR
VLLVFAILRSLSIDVGKIISSKIHTFWRKKVGKLFFPNTITMLCQRAGVPTSIEDVILIDKGIIDMPNLARLQRSQEARQGRLVCGIHQIQEQLQMHSSRMEFVERQFQT
YWNYVKWRDATLRRALQSNFSKLYQAFPVFPDDLLNSPTACEIITNYCEHHHYGYTGMTLRLLEAGDWSVDFAAAHDFAGGSLDLLDPAGDAAHQPLNFGFYVALSFFS