; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg017129 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg017129
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold4:34123842..34126477
RNA-Seq ExpressionSpg017129
SyntenySpg017129
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN01433.1 hypothetical protein CDL12_26059 [Handroanthus impetiginosus]4.0e-2533.49Show/hide
Query:  EKGFSNRTGALLEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNL--------------------
        E+GF  +  A  E +   + + KW+ F A P+  V+PLVREFYA   E      +VRG+ V F SV IN +Y I  P+ L                    
Subjt:  EKGFSNRTGALLEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNL--------------------

Query:  --RGNEWKESQTKVKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIV
           G +WK ++ +  S   + L   + +WL FI  R++PT H+  ++ D+ +LLYC+M G   ++G II D I+         L+F SLIT+LC R  + 
Subjt:  --RGNEWKESQTKVKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIV

Query:  SSKDEERHFFKPTID
          + EE  F +  ID
Subjt:  SSKDEERHFFKPTID

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.1e-3434.98Show/hide
Query:  MRKRDFLNEKGF---SNRTGALLEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------
        ++ R    EKGF   ++ T   L F+ +VI Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P++          
Subjt:  MRKRDFLNEKGF---SNRTGALLEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------

Query:  ------------LRGNEWKESQTKVKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSL
                    + G EW  S     + + S L P + VW HF+K+ L+PTTH  T+S D+++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SL
Subjt:  ------------LRGNEWKESQTKVKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSL

Query:  ITQLCQRVKIVSSKDEERHFFKPTIDLSLIGKLQQNRPSPSSE
        IT+LC+  +     +EE+      ID   + ++ Q  P+ S++
Subjt:  ITQLCQRVKIVSSKDEERHFFKPTIDLSLIGKLQQNRPSPSSE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]4.5e-3732.58Show/hide
Query:  MRKRDFLNEKGF---SNRTGALLEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------
        ++ R    EKGF   ++ T   L F+ +VI Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P++          
Subjt:  MRKRDFLNEKGF---SNRTGALLEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------

Query:  ------------LRGNEWKESQTKVKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSL
                      G EW  S     + + S L P + VW HF+K+RL+PTTH  T+S D+++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SL
Subjt:  ------------LRGNEWKESQTKVKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSL

Query:  ITQLCQRVKIVSSKDEERHFFKPTIDLSLIGKLQQ-------NRPSPSSEALAIAYRQLDQIRENLKT----------------------------YWAY
        IT+LC+  +     +EE+      ID   + ++ Q        +PS S  A A + R    I + LK                             +WAY
Subjt:  ITQLCQRVKIVSSKDEERHFFKPTIDLSLIGKLQQ-------NRPSPSSEALAIAYRQLDQIRENLKT----------------------------YWAY

Query:  AKERDEAIREFYLSIAPSIAPFFPNFPQSLLPQEEKDSDEEEDEENDDEDDEE
        +KERD A+++   +      P FP FPQ +L    KD D E + E+D +   E
Subjt:  AKERDEAIREFYLSIAPSIAPFFPNFPQSLLPQEEKDSDEEEDEENDDEDDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]7.8e-2936.08Show/hide
Query:  FVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------------------LRGNEWKESQTK
        F+  VI Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   IN ++ +  P++                      + G EW  S   
Subjt:  FVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------------------LRGNEWKESQTK

Query:  VKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSSKDEER
          + + S L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  IN+G +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  VKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSSKDEER

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.8e-2531.19Show/hide
Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------------------LRGNEWKESQTKVKSLVPSDLKPESVVWLHFIKN
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +  P++                        G EW  S     + + S L P + VW HF+K+
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------------------LRGNEWKESQTKVKSLVPSDLKPESVVWLHFIKN

Query:  RLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSSKDEERHFFKPTIDLSLIGKLQQ-------NRPS
        RL+PTTH   +S D+++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+    +   +EE+      ID   + ++ Q        +PS
Subjt:  RLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSSKDEERHFFKPTIDLSLIGKLQQ-------NRPS

Query:  PSSEALAIAYRQLDQIRENLKT-----------------YWAYAKERDEAIREFYLSIAPSIAPFFPNFPQSLLPQEEKDSDEEEDEENDDEDDE
         S  A A + R    + + LK                  +WAY+KERD A+++   +      P FP FPQ +L   + + + E D++  +E  E
Subjt:  PSSEALAIAYRQLDQIRENLKT-----------------YWAYAKERDEAIREFYLSIAPSIAPFFPNFPQSLLPQEEKDSDEEEDEENDDEDDE

TrEMBL top hitse value%identityAlignment
A0A2G9G807 Uncharacterized protein1.9e-2533.49Show/hide
Query:  EKGFSNRTGALLEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNL--------------------
        E+GF  +  A  E +   + + KW+ F A P+  V+PLVREFYA   E      +VRG+ V F SV IN +Y I  P+ L                    
Subjt:  EKGFSNRTGALLEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNL--------------------

Query:  --RGNEWKESQTKVKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIV
           G +WK ++ +  S   + L   + +WL FI  R++PT H+  ++ D+ +LLYC+M G   ++G II D I+         L+F SLIT+LC R  + 
Subjt:  --RGNEWKESQTKVKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIV

Query:  SSKDEERHFFKPTID
          + EE  F +  ID
Subjt:  SSKDEERHFFKPTID

A0A2P5AGA5 Uncharacterized protein (Fragment)1.0e-3434.98Show/hide
Query:  MRKRDFLNEKGF---SNRTGALLEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------
        ++ R    EKGF   ++ T   L F+ +VI Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P++          
Subjt:  MRKRDFLNEKGF---SNRTGALLEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------

Query:  ------------LRGNEWKESQTKVKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSL
                    + G EW  S     + + S L P + VW HF+K+ L+PTTH  T+S D+++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SL
Subjt:  ------------LRGNEWKESQTKVKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSL

Query:  ITQLCQRVKIVSSKDEERHFFKPTIDLSLIGKLQQNRPSPSSE
        IT+LC+  +     +EE+      ID   + ++ Q  P+ S++
Subjt:  ITQLCQRVKIVSSKDEERHFFKPTIDLSLIGKLQQNRPSPSSE

A0A2P5BCG4 Uncharacterized protein (Fragment)2.2e-3732.58Show/hide
Query:  MRKRDFLNEKGF---SNRTGALLEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------
        ++ R    EKGF   ++ T   L F+ +VI Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P++          
Subjt:  MRKRDFLNEKGF---SNRTGALLEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------

Query:  ------------LRGNEWKESQTKVKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSL
                      G EW  S     + + S L P + VW HF+K+RL+PTTH  T+S D+++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SL
Subjt:  ------------LRGNEWKESQTKVKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSL

Query:  ITQLCQRVKIVSSKDEERHFFKPTIDLSLIGKLQQ-------NRPSPSSEALAIAYRQLDQIRENLKT----------------------------YWAY
        IT+LC+  +     +EE+      ID   + ++ Q        +PS S  A A + R    I + LK                             +WAY
Subjt:  ITQLCQRVKIVSSKDEERHFFKPTIDLSLIGKLQQ-------NRPSPSSEALAIAYRQLDQIRENLKT----------------------------YWAY

Query:  AKERDEAIREFYLSIAPSIAPFFPNFPQSLLPQEEKDSDEEEDEENDDEDDEE
        +KERD A+++   +      P FP FPQ +L    KD D E + E+D +   E
Subjt:  AKERDEAIREFYLSIAPSIAPFFPNFPQSLLPQEEKDSDEEEDEENDDEDDEE

A0A2P5DAQ2 Uncharacterized protein3.8e-2936.08Show/hide
Query:  FVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------------------LRGNEWKESQTK
        F+  VI Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   IN ++ +  P++                      + G EW  S   
Subjt:  FVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------------------LRGNEWKESQTK

Query:  VKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSSKDEER
          + + S L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  IN+G +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  VKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSSKDEER

A0A2P5DXM3 Uncharacterized protein8.7e-2631.19Show/hide
Query:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------------------LRGNEWKESQTKVKSLVPSDLKPESVVWLHFIKN
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +  P++                        G EW  S     + + S L P + VW HF+K+
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLN----------------------LRGNEWKESQTKVKSLVPSDLKPESVVWLHFIKN

Query:  RLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSSKDEERHFFKPTIDLSLIGKLQQ-------NRPS
        RL+PTTH   +S D+++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+    +   +EE+      ID   + ++ Q        +PS
Subjt:  RLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSSKDEERHFFKPTIDLSLIGKLQQ-------NRPS

Query:  PSSEALAIAYRQLDQIRENLKT-----------------YWAYAKERDEAIREFYLSIAPSIAPFFPNFPQSLLPQEEKDSDEEEDEENDDEDDE
         S  A A + R    + + LK                  +WAY+KERD A+++   +      P FP FPQ +L   + + + E D++  +E  E
Subjt:  PSSEALAIAYRQLDQIRENLKT-----------------YWAYAKERDEAIREFYLSIAPSIAPFFPNFPQSLLPQEEKDSDEEEDEENDDEDDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAACACTCCAAAATCCTCATCATCACGCAAGAACACTCGATCTCAGAGTGCTCAAGCAACCCACGAAGCTGAAGCAAACATGCGACGGCAAGAACAGAACCCCGA
AACGCCCATGCAAGGCACGCGAAGGACGAGACCCACGGGATTCTTACCGGCTGTCGTGAACGTCGGACCCAACGCTCAAACTCCATCTTCTTCGGCAATGCCGGCCAGTT
TGAGGGAGATACCGAGTTCATCTACACCACGGAGGTTCACGCGCGCCTCTGTCGAGATGGGAGCTCAAAGACGTGCTGCCCTTGAAGAAGAAGGGAAAAGGCAAGATGAA
GAAGAAGCTGCCAAGGCAGCTAGAAGCTCTCGGCAAGGAGAAGCTTCAACGGGTAAGCATTCCGAACCTTTATCTAACCCCTCTTTATCTTGCAAGACTAAACCATTCGT
TACCTATAGTGCAAGGAAGAGGAGCCCGAAGAAGGTTTTGCCTGAAAATTCGATTGAGATTAAGCCCCTCAAAACCGCGAGGATGCCTCCTGACGTATTCGAAGGAATAA
TTCGCCAAGCAGTGGCAAAGGCTCTTGCGATTGCTGAAGGGTACAAGGCTGAACAGGATGCTTTGAAAGAGATTGAGGATGAGAGAGAGATGGAAAATCAGAAAATGGTT
GAGGAAGACGAGCTTGCAAAGGGAAGAGATCTTGAAGAAGAGAAAAGAAGGAGAGAAGAAGAACAAGAGGCCGAGAGGGCCTTAGAAGCTGAAGAAGAAAGAAAATATGA
GGAAAACCTCAGGAGGGCAGCCATGGATTTACAACTTCTTGAGGAAGAGAAAAAGAGAAGGGAAGAAATAAAAGAAGATGAAAAAAGAAGGAAGGAAGCTGAAGACTTCC
TTGCAGCTTTTGAGCCACTCCACAAGGCTCAAAGTGAGTCTAAAGCGTTGCAAGGGAGGGTAGAAGAAGAGGCCCAACAGGGGCTAAGAGAAGAAAATTTAGAAAAAGAA
AAAGAAAGAGAAGTAGAGGAAGAAGGACAGAATGCGACCGCATCTAGGCCGCATTCTGAAGAAGGCCTAGCCGAGGCCACCATTGATCAGCCAGCTGAAGAGGTTTTTGA
GCCTCTATTCACGAATGACCCACCAGCAGCTGATAACACCTCTTCGAGAGAGAAGAGGGACGAAGTGGAGAAGGAAGATGAGGAGGTCGAGACCTCCACTGACTCTGATA
CAGAATCTGATTCGGAGATAAGGAAACTAGATGGCGACCAAGTCCCTATCTCTGCAACATTGAGGAGAAAGAGGAAGAGAGAGATAAAGGCTGAGAGGAGGACAAAGAAC
AAGAATGATCCAATCTTTTCCAAGAGGCCGAGGACGAGGTCCATGGACGCCTCTCCTGCAGTTCCTCCTACCGTCTCACCCGCCAAGCCAAAGGGCAAATCACCCAAGGT
TGCATCTCCTAAAAATCCATTCCTTGAGGTATTTAGAGATGTTAATTTTCAGGAACGGATGGAGATCATGAGAAAGAGAGATTTCCTCAACGAAAAGGGATTCTCTAACA
GAACAGGAGCACTGCTAGAGTTCGTAACAAGAGTTATCTTCCAGTACAAATGGCAGGACTTCTGTGCTCACCCTCAGGAGGCTGTCGTGCCTCTAGTTCGTGAATTTTAC
GCCGGCCTAAGGGAGGAGAGTATTAGCATGGCAGTTGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATTAACAGGGTGTACAGGATCAAGGCACCCCTGAATCT
GAGAGGAAATGAGTGGAAAGAATCACAGACGAAAGTGAAGTCGTTAGTGCCAAGCGATCTAAAGCCAGAATCGGTAGTTTGGCTTCACTTCATCAAAAATCGTTTGATGC
CAACCACCCACAACAGCACGATTTCAGTGGATAAAGTGATGCTACTCTATTGCCTTATGAAGGGGTTGGAGATCAACATAGGGAGCATTATCAGGGATGAGATTTTAGCC
TGTGGGAGAAAGCGAGCAGGCAAGCTTTTCTTTGGATCACTCATCACCCAGCTTTGCCAGAGGGTGAAGATTGTTTCGAGTAAGGACGAGGAGCGTCACTTCTTCAAGCC
GACCATCGACCTGTCCTTGATTGGAAAGCTCCAACAGAACAGGCCTTCACCATCATCGGAAGCCCTAGCTATTGCCTACCGCCAGCTAGATCAAATCAGGGAGAACCTGA
AGACGTATTGGGCGTATGCAAAGGAGAGGGACGAAGCCATTAGAGAGTTCTATCTCTCTATTGCCCCAAGTATCGCTCCGTTCTTTCCAAATTTCCCTCAGTCGCTGCTG
CCTCAAGAAGAAAAGGATTCTGATGAAGAGGAAGATGAAGAGAATGATGATGAAGATGATGAAGAGAAAGAGAGTTCCTCGGACGAGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGAACACTCCAAAATCCTCATCATCACGCAAGAACACTCGATCTCAGAGTGCTCAAGCAACCCACGAAGCTGAAGCAAACATGCGACGGCAAGAACAGAACCCCGA
AACGCCCATGCAAGGCACGCGAAGGACGAGACCCACGGGATTCTTACCGGCTGTCGTGAACGTCGGACCCAACGCTCAAACTCCATCTTCTTCGGCAATGCCGGCCAGTT
TGAGGGAGATACCGAGTTCATCTACACCACGGAGGTTCACGCGCGCCTCTGTCGAGATGGGAGCTCAAAGACGTGCTGCCCTTGAAGAAGAAGGGAAAAGGCAAGATGAA
GAAGAAGCTGCCAAGGCAGCTAGAAGCTCTCGGCAAGGAGAAGCTTCAACGGGTAAGCATTCCGAACCTTTATCTAACCCCTCTTTATCTTGCAAGACTAAACCATTCGT
TACCTATAGTGCAAGGAAGAGGAGCCCGAAGAAGGTTTTGCCTGAAAATTCGATTGAGATTAAGCCCCTCAAAACCGCGAGGATGCCTCCTGACGTATTCGAAGGAATAA
TTCGCCAAGCAGTGGCAAAGGCTCTTGCGATTGCTGAAGGGTACAAGGCTGAACAGGATGCTTTGAAAGAGATTGAGGATGAGAGAGAGATGGAAAATCAGAAAATGGTT
GAGGAAGACGAGCTTGCAAAGGGAAGAGATCTTGAAGAAGAGAAAAGAAGGAGAGAAGAAGAACAAGAGGCCGAGAGGGCCTTAGAAGCTGAAGAAGAAAGAAAATATGA
GGAAAACCTCAGGAGGGCAGCCATGGATTTACAACTTCTTGAGGAAGAGAAAAAGAGAAGGGAAGAAATAAAAGAAGATGAAAAAAGAAGGAAGGAAGCTGAAGACTTCC
TTGCAGCTTTTGAGCCACTCCACAAGGCTCAAAGTGAGTCTAAAGCGTTGCAAGGGAGGGTAGAAGAAGAGGCCCAACAGGGGCTAAGAGAAGAAAATTTAGAAAAAGAA
AAAGAAAGAGAAGTAGAGGAAGAAGGACAGAATGCGACCGCATCTAGGCCGCATTCTGAAGAAGGCCTAGCCGAGGCCACCATTGATCAGCCAGCTGAAGAGGTTTTTGA
GCCTCTATTCACGAATGACCCACCAGCAGCTGATAACACCTCTTCGAGAGAGAAGAGGGACGAAGTGGAGAAGGAAGATGAGGAGGTCGAGACCTCCACTGACTCTGATA
CAGAATCTGATTCGGAGATAAGGAAACTAGATGGCGACCAAGTCCCTATCTCTGCAACATTGAGGAGAAAGAGGAAGAGAGAGATAAAGGCTGAGAGGAGGACAAAGAAC
AAGAATGATCCAATCTTTTCCAAGAGGCCGAGGACGAGGTCCATGGACGCCTCTCCTGCAGTTCCTCCTACCGTCTCACCCGCCAAGCCAAAGGGCAAATCACCCAAGGT
TGCATCTCCTAAAAATCCATTCCTTGAGGTATTTAGAGATGTTAATTTTCAGGAACGGATGGAGATCATGAGAAAGAGAGATTTCCTCAACGAAAAGGGATTCTCTAACA
GAACAGGAGCACTGCTAGAGTTCGTAACAAGAGTTATCTTCCAGTACAAATGGCAGGACTTCTGTGCTCACCCTCAGGAGGCTGTCGTGCCTCTAGTTCGTGAATTTTAC
GCCGGCCTAAGGGAGGAGAGTATTAGCATGGCAGTTGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATTAACAGGGTGTACAGGATCAAGGCACCCCTGAATCT
GAGAGGAAATGAGTGGAAAGAATCACAGACGAAAGTGAAGTCGTTAGTGCCAAGCGATCTAAAGCCAGAATCGGTAGTTTGGCTTCACTTCATCAAAAATCGTTTGATGC
CAACCACCCACAACAGCACGATTTCAGTGGATAAAGTGATGCTACTCTATTGCCTTATGAAGGGGTTGGAGATCAACATAGGGAGCATTATCAGGGATGAGATTTTAGCC
TGTGGGAGAAAGCGAGCAGGCAAGCTTTTCTTTGGATCACTCATCACCCAGCTTTGCCAGAGGGTGAAGATTGTTTCGAGTAAGGACGAGGAGCGTCACTTCTTCAAGCC
GACCATCGACCTGTCCTTGATTGGAAAGCTCCAACAGAACAGGCCTTCACCATCATCGGAAGCCCTAGCTATTGCCTACCGCCAGCTAGATCAAATCAGGGAGAACCTGA
AGACGTATTGGGCGTATGCAAAGGAGAGGGACGAAGCCATTAGAGAGTTCTATCTCTCTATTGCCCCAAGTATCGCTCCGTTCTTTCCAAATTTCCCTCAGTCGCTGCTG
CCTCAAGAAGAAAAGGATTCTGATGAAGAGGAAGATGAAGAGAATGATGATGAAGATGATGAAGAGAAAGAGAGTTCCTCGGACGAGGAATAG
Protein sequenceShow/hide protein sequence
MKNTPKSSSSRKNTRSQSAQATHEAEANMRRQEQNPETPMQGTRRTRPTGFLPAVVNVGPNAQTPSSSAMPASLREIPSSSTPRRFTRASVEMGAQRRAALEEEGKRQDE
EEAAKAARSSRQGEASTGKHSEPLSNPSLSCKTKPFVTYSARKRSPKKVLPENSIEIKPLKTARMPPDVFEGIIRQAVAKALAIAEGYKAEQDALKEIEDEREMENQKMV
EEDELAKGRDLEEEKRRREEEQEAERALEAEEERKYEENLRRAAMDLQLLEEEKKRREEIKEDEKRRKEAEDFLAAFEPLHKAQSESKALQGRVEEEAQQGLREENLEKE
KEREVEEEGQNATASRPHSEEGLAEATIDQPAEEVFEPLFTNDPPAADNTSSREKRDEVEKEDEEVETSTDSDTESDSEIRKLDGDQVPISATLRRKRKREIKAERRTKN
KNDPIFSKRPRTRSMDASPAVPPTVSPAKPKGKSPKVASPKNPFLEVFRDVNFQERMEIMRKRDFLNEKGFSNRTGALLEFVTRVIFQYKWQDFCAHPQEAVVPLVREFY
AGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNLRGNEWKESQTKVKSLVPSDLKPESVVWLHFIKNRLMPTTHNSTISVDKVMLLYCLMKGLEINIGSIIRDEILA
CGRKRAGKLFFGSLITQLCQRVKIVSSKDEERHFFKPTIDLSLIGKLQQNRPSPSSEALAIAYRQLDQIRENLKTYWAYAKERDEAIREFYLSIAPSIAPFFPNFPQSLL
PQEEKDSDEEEDEENDDEDDEEKESSSDEE