; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019899 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019899
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold5:31842751..31855330
RNA-Seq ExpressionSpg019899
SyntenySpg019899
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]9.8e-2328.22Show/hide
Query:  PQLSYVLFVNRLTRAKYDELLKRDFLFERGFSGDLPHFLRVG------ITNHGWELFCSKPESMNAQVVHEFYANIDEEEGFQIIVRGIEVGWSPGAINA
        P  S   FV+   +  Y  +  R   FE GF         +G      +T H W+ F   P  +NA +V EFY+NI E     ++VRGI + ++P AIN 
Subjt:  PQLSYVLFVNRLTRAKYDELLKRDFLFERGFSGDLPHFLRVG------ITNHGWELFCSKPESMNAQVVHEFYANIDEEEGFQIIVRGIEVGWSPGAINA

Query:  LYNLQ-------------------------DFPHTRYNEMAVAPS--------------NEQLSDRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIA
         + LQ                           P TR+N   +                 N  +  +L+PT+H++TVS +R+LL  +IL   +ID+GKII 
Subjt:  LYNLQ-------------------------DFPHTRYNEMAVAPS--------------NEQLSDRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIA

Query:  SEISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARLQRTQEAR------------QGGLVFGIHNILEQLALSASRQEFAERQS
             C K++   L FPN IT LC+K  V +   D IL     ++   +  L   +EA+                V      LEQ A+  + Q   +   
Subjt:  SEISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARLQRTQEAR------------QGGLVFGIHNILEQLALSASRQEFAERQS

Query:  --LTFWSYVKRRDANLKKALEENFSK
          + +++Y KRRDA L  AL E+  +
Subjt:  --LTFWSYVKRRDANLKKALEENFSK

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.6e-2333.86Show/hide
Query:  VLFVNRLTRAKYD-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPESMNAQVVHEFYANIDEEEGFQIIVRGIEVGWSPGAINALYN
        V F       +Y+  +  R    E+GF        G LP   +V IT H W+ FC+ PE     +V EFYAN+ +     + VRG++V WS  AINA++ 
Subjt:  VLFVNRLTRAKYD-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPESMNAQVVHEFYANIDEEEGFQIIVRGIEVGWSPGAINALYN

Query:  LQD--FPHTRYNE-------------MAVAPSNEQLS------------------------DRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEI
        L D    H+ + E             +AVA +   +S                          LLPTTH  TVS++R+LL  ++L   SI+VG++I SEI
Subjt:  LQD--FPHTRYNE-------------MAVAPSNEQLS------------------------DRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEI

Query:  SGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LC+ A  P    +  L + G ID   +AR+  TQE
Subjt:  SGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.5e-3432.18Show/hide
Query:  VLFVNRLTRAKYD-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPESMNAQVVHEFYANIDEEEGFQIIVRGIEVGWSPGAINALYN
        V F       +Y+  +  R    E+GF        G LP   +V IT H W+ FC+ PE     +V EFYAN+ + E   + VRG++V WS  AINA++ 
Subjt:  VLFVNRLTRAKYD-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPESMNAQVVHEFYANIDEEEGFQIIVRGIEVGWSPGAINALYN

Query:  LQDFPHTRYNEM------------------------------------AVAPSNEQ----LSDRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASE
        L D P   ++E                                     A+ P+ +     L  RLLPTTH  TVS++R+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHTRYNEM------------------------------------AVAPSNEQ----LSDRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASE

Query:  ISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARL------QRTQEA-----------RQGGLVFGIHNILEQLALSASRQEF--
        I  C  +K G LFFP+ IT LC+ A  P    +  L + G ID   +AR+      + TQ+            R  G +      LEQ       Q++  
Subjt:  ISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARL------QRTQEA-----------RQGGLVFGIHNILEQLALSASRQEF--

Query:  ------AERQSLTFWSYVKRRDANLKKALEENFSKPYPDLPVFPDDLL
                +Q   FW+Y K RD  LKKAL+ NF++P P  P FP ++L
Subjt:  ------AERQSLTFWSYVKRRDANLKKALEENFSKPYPDLPVFPDDLL

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]2.4e-2137.77Show/hide
Query:  LSDRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARL------QRTQ
        L  RLLPTTH  TVS++R+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LC+ A  P    +  L   G ID   +AR+      + TQ
Subjt:  LSDRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARL------QRTQ

Query:  E-----------ARQGGLVFGIHNILEQLALSASRQEF--------AERQSLTFWSYVKRRDANLKKALEENFSKPYPDLPVFPDDLL
        +           +R  G +      LEQ       Q++          +Q   FW+Y K RD  LKKAL+ NF++P P  P FP +LL
Subjt:  E-----------ARQGGLVFGIHNILEQLALSASRQEF--------AERQSLTFWSYVKRRDANLKKALEENFSKPYPDLPVFPDDLL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.0e-2734.55Show/hide
Query:  VVHEFYANIDEEEGFQIIVRGIEVGWSPGAINALYNLQD--FPHTRYNEMAVAPS-------------------------------------NEQLSDRL
        +V EFYAN+ + E   I VRG++V WS  AINA++ L D    H+ + E    P                                         L  RL
Subjt:  VVHEFYANIDEEEGFQIIVRGIEVGWSPGAINALYNLQD--FPHTRYNEMAVAPS-------------------------------------NEQLSDRL

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARL------QRTQE----
        LPTTH   VS++R+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LC+ A    N+    L + G ID   +AR+      + TQ+    
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARL------QRTQE----

Query:  -------ARQGGLVFGIHNILEQLALSASRQEFAERQSLTFWSYVKRRDANLKKALEENFSKPYPDLPVFPDDLL
               +R  G V      LEQ     S+QE   +Q   FW+Y K RD  LKKAL+ NF++P P  P FP ++L
Subjt:  -------ARQGGLVFGIHNILEQLALSASRQEFAERQSLTFWSYVKRRDANLKKALEENFSKPYPDLPVFPDDLL

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.3e-2333.86Show/hide
Query:  VLFVNRLTRAKYD-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPESMNAQVVHEFYANIDEEEGFQIIVRGIEVGWSPGAINALYN
        V F       +Y+  +  R    E+GF        G LP   +V IT H W+ FC+ PE     +V EFYAN+ +     + VRG++V WS  AINA++ 
Subjt:  VLFVNRLTRAKYD-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPESMNAQVVHEFYANIDEEEGFQIIVRGIEVGWSPGAINALYN

Query:  LQD--FPHTRYNE-------------MAVAPSNEQLS------------------------DRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEI
        L D    H+ + E             +AVA +   +S                          LLPTTH  TVS++R+LL  ++L   SI+VG++I SEI
Subjt:  LQD--FPHTRYNE-------------MAVAPSNEQLS------------------------DRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEI

Query:  SGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LC+ A  P    +  L + G ID   +AR+  TQE
Subjt:  SGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)1.2e-3432.18Show/hide
Query:  VLFVNRLTRAKYD-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPESMNAQVVHEFYANIDEEEGFQIIVRGIEVGWSPGAINALYN
        V F       +Y+  +  R    E+GF        G LP   +V IT H W+ FC+ PE     +V EFYAN+ + E   + VRG++V WS  AINA++ 
Subjt:  VLFVNRLTRAKYD-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPESMNAQVVHEFYANIDEEEGFQIIVRGIEVGWSPGAINALYN

Query:  LQDFPHTRYNEM------------------------------------AVAPSNEQ----LSDRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASE
        L D P   ++E                                     A+ P+ +     L  RLLPTTH  TVS++R+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHTRYNEM------------------------------------AVAPSNEQ----LSDRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASE

Query:  ISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARL------QRTQEA-----------RQGGLVFGIHNILEQLALSASRQEF--
        I  C  +K G LFFP+ IT LC+ A  P    +  L + G ID   +AR+      + TQ+            R  G +      LEQ       Q++  
Subjt:  ISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARL------QRTQEA-----------RQGGLVFGIHNILEQLALSASRQEF--

Query:  ------AERQSLTFWSYVKRRDANLKKALEENFSKPYPDLPVFPDDLL
                +Q   FW+Y K RD  LKKAL+ NF++P P  P FP ++L
Subjt:  ------AERQSLTFWSYVKRRDANLKKALEENFSKPYPDLPVFPDDLL

A0A2P5CEY2 Uncharacterized protein1.2e-2137.77Show/hide
Query:  LSDRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARL------QRTQ
        L  RLLPTTH  TVS++R+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LC+ A  P    +  L   G ID   +AR+      + TQ
Subjt:  LSDRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARL------QRTQ

Query:  E-----------ARQGGLVFGIHNILEQLALSASRQEF--------AERQSLTFWSYVKRRDANLKKALEENFSKPYPDLPVFPDDLL
        +           +R  G +      LEQ       Q++          +Q   FW+Y K RD  LKKAL+ NF++P P  P FP +LL
Subjt:  E-----------ARQGGLVFGIHNILEQLALSASRQEF--------AERQSLTFWSYVKRRDANLKKALEENFSKPYPDLPVFPDDLL

A0A2P5DXM3 Uncharacterized protein4.9e-2834.55Show/hide
Query:  VVHEFYANIDEEEGFQIIVRGIEVGWSPGAINALYNLQD--FPHTRYNEMAVAPS-------------------------------------NEQLSDRL
        +V EFYAN+ + E   I VRG++V WS  AINA++ L D    H+ + E    P                                         L  RL
Subjt:  VVHEFYANIDEEEGFQIIVRGIEVGWSPGAINALYNLQD--FPHTRYNEMAVAPS-------------------------------------NEQLSDRL

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARL------QRTQE----
        LPTTH   VS++R+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LC+ A    N+    L + G ID   +AR+      + TQ+    
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARL------QRTQE----

Query:  -------ARQGGLVFGIHNILEQLALSASRQEFAERQSLTFWSYVKRRDANLKKALEENFSKPYPDLPVFPDDLL
               +R  G V      LEQ     S+QE   +Q   FW+Y K RD  LKKAL+ NF++P P  P FP ++L
Subjt:  -------ARQGGLVFGIHNILEQLALSASRQEFAERQSLTFWSYVKRRDANLKKALEENFSKPYPDLPVFPDDLL

A0A6A2ZUE4 Uncharacterized protein4.8e-2328.22Show/hide
Query:  PQLSYVLFVNRLTRAKYDELLKRDFLFERGFSGDLPHFLRVG------ITNHGWELFCSKPESMNAQVVHEFYANIDEEEGFQIIVRGIEVGWSPGAINA
        P  S   FV+   +  Y  +  R   FE GF         +G      +T H W+ F   P  +NA +V EFY+NI E     ++VRGI + ++P AIN 
Subjt:  PQLSYVLFVNRLTRAKYDELLKRDFLFERGFSGDLPHFLRVG------ITNHGWELFCSKPESMNAQVVHEFYANIDEEEGFQIIVRGIEVGWSPGAINA

Query:  LYNLQ-------------------------DFPHTRYNEMAVAPS--------------NEQLSDRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIA
         + LQ                           P TR+N   +                 N  +  +L+PT+H++TVS +R+LL  +IL   +ID+GKII 
Subjt:  LYNLQ-------------------------DFPHTRYNEMAVAPS--------------NEQLSDRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIA

Query:  SEISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARLQRTQEAR------------QGGLVFGIHNILEQLALSASRQEFAERQS
             C K++   L FPN IT LC+K  V +   D IL     ++   +  L   +EA+                V      LEQ A+  + Q   +   
Subjt:  SEISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARLQRTQEAR------------QGGLVFGIHNILEQLALSASRQEFAERQS

Query:  --LTFWSYVKRRDANLKKALEENFSK
          + +++Y KRRDA L  AL E+  +
Subjt:  --LTFWSYVKRRDANLKKALEENFSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACTCGAGCATGTCAAGCTTGGAAGCTTATGCGCTAGAGCTTGATGAGTTAGGAGCTTGGAGCATGACATGCTGTAGCAGTCTGAAGCATGGAAAGCTTCAGCTGAA
GCATGAAAAGCTTGATGTTGTAGTAGCTAGCGCATGGAAAGCTTATTGCTATAGTAGTCTAGCGCAAGGAATGCATGATGATGCTACAGCAACCTGTCGCATCCATATTT
TATTCTTATTACCCCAGCACAACATCATTAGAATAGGATTTAATTTGAGATATTTTTACTACTCGCTCCTTCCCTGTGGATTCGACCCTAGAATACTCCAGGCTAAAGAA
TTAAAGAGAATCGAAGTTGGATCCACACCAAAGAGGGCTTGCCGAGGACCAGCGTCGAGACCCTTGCCAAAGGGCGTCTCGACGCTCCAACTTTCCCTTTCTGTTTCTGG
CGACAACAGTCACAGCGTCTCGATGCTAGAATTGGGGTTGTGGAAGCTGAACAAAGTGGGGGAAAGCGGGATTCAAGGCTTTGTTCGTGGAGATCGTGACAGGGACATAT
TATATGTTTTAACCCATTTGGAAGAATGGAAGCTTTGGAAGTTATTTGGTGCAGAATATATTGCAAAGCGACTTGAGGGAGCAAAAGCTGTCCTAGAGCAAAGCTGGGAG
CAAAACTGCCACGTCACAGCTCGTGTGAGTTTGGTGCATGAGCTATCTGCCTGGGGAAAGCATGAAGGATCTGACGGCCAGCACAACACTTACATGTCAACGGGATTCAA
ATTTTACCGATCCTTTAAATCTGATTACATGCAAATAGCGTCTTCAATAGCGAAAACGAGAGCAAGAAAAGAGAGAGACAATGAGGAAGAGGAGGTGTCTGTTACCCCCG
AAGTACAGAAAGCTAAGGCAAAGAAAAAGAAAACACCAGAAGAGAAAGAGGCTAAGAAGAGAAGAAGACAGCAGAGGGCTGAGGACCAAGAAATAGTACAGAAGGTAGTC
GAGGAGGTTGTTGCCGAAGCAGTTGAAGAAGGCAATCCGAAGGAACCTGAAGGACAAAACCCAGAGCAGACTGAGCCGAGAATTGCGGATACAGAGGAAATTCGAGAAGA
AAATGCAGAGGAAGTTCAAGAAAGGCAAACTGAGGATAGAAGTGCAGAAAGAGAGGAACAGGAGAAAAAAGAGGCCGAGGATAAAGCAAGAGAGGAAGCAAAGAAAAAGG
CTGAAGAAGAAAGATTGATCAAGCAAAGGGCAGGCAAGGGCAAGAGTGTTGCTGCGGCATCGGAGGAACCTGACGAGATAGAAGAGCCACAGTTGTCGTATGTCCTCTTC
GTCAACCGTCTTACCAGAGCAAAGTATGATGAGTTGCTAAAGAGAGACTTCCTGTTCGAGAGAGGATTTAGTGGTGATCTTCCACATTTTCTGAGGGTCGGCATTACGAA
CCACGGCTGGGAGTTGTTTTGTTCTAAGCCTGAATCAATGAACGCGCAAGTAGTGCACGAATTTTATGCAAATATTGACGAGGAAGAAGGTTTCCAAATTATTGTTCGAG
GTATAGAAGTTGGCTGGAGTCCTGGTGCCATTAACGCCCTGTATAACCTTCAAGATTTCCCCCACACAAGATACAATGAGATGGCTGTGGCGCCATCTAATGAGCAATTG
AGCGATAGGTTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGCGTTCTGCTGGCGTTTGCGATTTTAAGGTCTCTCAGTATTGATGTGGGCAAGATCATTGC
GAGTGAGATATCTGGATGCTGGAAGAAGAAAGTTGGGAAGTTGTTTTTCCCGAATACAATTACAATGCTTTGCAAGAAAGCAGGGGTTCCGAAGAATGATGGAGATGTTA
TATTGTTCGACAAGGGAATCATCGATACGCCTAACTTGGCACGGCTTCAGCGTACGCAAGAGGCACGCCAGGGTGGGCTTGTTTTTGGCATTCACAACATTTTAGAACAA
CTTGCACTGTCGGCCAGCAGGCAAGAGTTTGCTGAGAGGCAATCTCTAACCTTCTGGAGCTATGTTAAACGTCGTGATGCCAACCTGAAGAAGGCGCTAGAGGAAAATTT
TTCCAAACCATACCCAGACCTTCCGGTGTTCCCTGACGATTTATTGAACCCCTGGATTCCGCCGCCACCTGCTGAGAGAGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCACTCGAGCATGTCAAGCTTGGAAGCTTATGCGCTAGAGCTTGATGAGTTAGGAGCTTGGAGCATGACATGCTGTAGCAGTCTGAAGCATGGAAAGCTTCAGCTGAA
GCATGAAAAGCTTGATGTTGTAGTAGCTAGCGCATGGAAAGCTTATTGCTATAGTAGTCTAGCGCAAGGAATGCATGATGATGCTACAGCAACCTGTCGCATCCATATTT
TATTCTTATTACCCCAGCACAACATCATTAGAATAGGATTTAATTTGAGATATTTTTACTACTCGCTCCTTCCCTGTGGATTCGACCCTAGAATACTCCAGGCTAAAGAA
TTAAAGAGAATCGAAGTTGGATCCACACCAAAGAGGGCTTGCCGAGGACCAGCGTCGAGACCCTTGCCAAAGGGCGTCTCGACGCTCCAACTTTCCCTTTCTGTTTCTGG
CGACAACAGTCACAGCGTCTCGATGCTAGAATTGGGGTTGTGGAAGCTGAACAAAGTGGGGGAAAGCGGGATTCAAGGCTTTGTTCGTGGAGATCGTGACAGGGACATAT
TATATGTTTTAACCCATTTGGAAGAATGGAAGCTTTGGAAGTTATTTGGTGCAGAATATATTGCAAAGCGACTTGAGGGAGCAAAAGCTGTCCTAGAGCAAAGCTGGGAG
CAAAACTGCCACGTCACAGCTCGTGTGAGTTTGGTGCATGAGCTATCTGCCTGGGGAAAGCATGAAGGATCTGACGGCCAGCACAACACTTACATGTCAACGGGATTCAA
ATTTTACCGATCCTTTAAATCTGATTACATGCAAATAGCGTCTTCAATAGCGAAAACGAGAGCAAGAAAAGAGAGAGACAATGAGGAAGAGGAGGTGTCTGTTACCCCCG
AAGTACAGAAAGCTAAGGCAAAGAAAAAGAAAACACCAGAAGAGAAAGAGGCTAAGAAGAGAAGAAGACAGCAGAGGGCTGAGGACCAAGAAATAGTACAGAAGGTAGTC
GAGGAGGTTGTTGCCGAAGCAGTTGAAGAAGGCAATCCGAAGGAACCTGAAGGACAAAACCCAGAGCAGACTGAGCCGAGAATTGCGGATACAGAGGAAATTCGAGAAGA
AAATGCAGAGGAAGTTCAAGAAAGGCAAACTGAGGATAGAAGTGCAGAAAGAGAGGAACAGGAGAAAAAAGAGGCCGAGGATAAAGCAAGAGAGGAAGCAAAGAAAAAGG
CTGAAGAAGAAAGATTGATCAAGCAAAGGGCAGGCAAGGGCAAGAGTGTTGCTGCGGCATCGGAGGAACCTGACGAGATAGAAGAGCCACAGTTGTCGTATGTCCTCTTC
GTCAACCGTCTTACCAGAGCAAAGTATGATGAGTTGCTAAAGAGAGACTTCCTGTTCGAGAGAGGATTTAGTGGTGATCTTCCACATTTTCTGAGGGTCGGCATTACGAA
CCACGGCTGGGAGTTGTTTTGTTCTAAGCCTGAATCAATGAACGCGCAAGTAGTGCACGAATTTTATGCAAATATTGACGAGGAAGAAGGTTTCCAAATTATTGTTCGAG
GTATAGAAGTTGGCTGGAGTCCTGGTGCCATTAACGCCCTGTATAACCTTCAAGATTTCCCCCACACAAGATACAATGAGATGGCTGTGGCGCCATCTAATGAGCAATTG
AGCGATAGGTTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGCGTTCTGCTGGCGTTTGCGATTTTAAGGTCTCTCAGTATTGATGTGGGCAAGATCATTGC
GAGTGAGATATCTGGATGCTGGAAGAAGAAAGTTGGGAAGTTGTTTTTCCCGAATACAATTACAATGCTTTGCAAGAAAGCAGGGGTTCCGAAGAATGATGGAGATGTTA
TATTGTTCGACAAGGGAATCATCGATACGCCTAACTTGGCACGGCTTCAGCGTACGCAAGAGGCACGCCAGGGTGGGCTTGTTTTTGGCATTCACAACATTTTAGAACAA
CTTGCACTGTCGGCCAGCAGGCAAGAGTTTGCTGAGAGGCAATCTCTAACCTTCTGGAGCTATGTTAAACGTCGTGATGCCAACCTGAAGAAGGCGCTAGAGGAAAATTT
TTCCAAACCATACCCAGACCTTCCGGTGTTCCCTGACGATTTATTGAACCCCTGGATTCCGCCGCCACCTGCTGAGAGAGAATAA
Protein sequenceShow/hide protein sequence
MHSSMSSLEAYALELDELGAWSMTCCSSLKHGKLQLKHEKLDVVVASAWKAYCYSSLAQGMHDDATATCRIHILFLLPQHNIIRIGFNLRYFYYSLLPCGFDPRILQAKE
LKRIEVGSTPKRACRGPASRPLPKGVSTLQLSLSVSGDNSHSVSMLELGLWKLNKVGESGIQGFVRGDRDRDILYVLTHLEEWKLWKLFGAEYIAKRLEGAKAVLEQSWE
QNCHVTARVSLVHELSAWGKHEGSDGQHNTYMSTGFKFYRSFKSDYMQIASSIAKTRARKERDNEEEEVSVTPEVQKAKAKKKKTPEEKEAKKRRRQQRAEDQEIVQKVV
EEVVAEAVEEGNPKEPEGQNPEQTEPRIADTEEIREENAEEVQERQTEDRSAEREEQEKKEAEDKAREEAKKKAEEERLIKQRAGKGKSVAAASEEPDEIEEPQLSYVLF
VNRLTRAKYDELLKRDFLFERGFSGDLPHFLRVGITNHGWELFCSKPESMNAQVVHEFYANIDEEEGFQIIVRGIEVGWSPGAINALYNLQDFPHTRYNEMAVAPSNEQL
SDRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMLCKKAGVPKNDGDVILFDKGIIDTPNLARLQRTQEARQGGLVFGIHNILEQ
LALSASRQEFAERQSLTFWSYVKRRDANLKKALEENFSKPYPDLPVFPDDLLNPWIPPPPAERE