CuGenDBv2

Spg032937 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg032937
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: RT_RNaseH_2 domain-containing protein
Genome location: scaffold11:14855521..14857224
RNA-Seq Expression: Spg032937
Synteny: Spg032937
Gene Ontology terms: NA
InterPro domains: NA
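
The genome location field uses a `seqid:start..end` layout. The following is a minimal sketch of how such a string could be split and its span computed. The function names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that this page does not confirm.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold11:14855521..14857224")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 1704 bp on scaffold11.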


Homology
GenBank top hits: e value, %identity, alignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]; e value 1.3e-21; %identity 26.27
Query:  FAKRPRTRSMDASPIVPPTVSPAKPKGKSPKAASPRNSFPEVFRDVNFQERMEIMKKRDFLNEKGFF---DRAGALPEFVTIAIFQYKWQDVCAHPQEAV
        FAKRP + S    P +    + A     S +  S    F +   +  ++E    +  R+ + EKGF          P F++  I    WQ  C HP + +
Subjt:  FAKRPRTRSMDASPIVPPTVSPAKPKGKSPKAASPRNSFPEVFRDVNFQERMEIMKKRDFLNEKGFF---DRAGALPEFVTIAIFQYKWQDVCAHPQEAV

Query:  VPLVREFYAGLREE------------SISMAVVRG--------------------KMMKDALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKN
        VPLV+EFYA L+ +            + +   + G                    + +K+ LK +A  G QW  S     +    +L+P + VW HF+ +
Subjt:  VPLVREFYAGLREE------------SISMAVVRG--------------------KMMKDALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKN

Query:  RLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGSLITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILRKDKAST
        RL+ +TH  TI  +R +LLY ++ G  INVG +I D+I AC  K  G L+F SLI++LC +  +     E        +DL  I ++      R +K+  
Subjt:  RLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGSLITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILRKDKAST

Query:  SQATPQSGSNVASPSQHTPFTGPSPESEALGM-----------VHRQLDQIRENLKTYWVYAKERDEAIREFY
         +   +        + HT     +   E L             +   L Q +E L  +WVY+++RD A+++ +
Subjt:  SQATPQSGSNVASPSQHTPFTGPSPESEALGM-----------VHRQLDQIRENLKTYWVYAKERDEAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]; e value 6.1e-24; %identity 30.74
Query:  MKKRDFLNEKGFF----DRAGALPEFVTIAIFQYKWQDVCAHPQEAVVPLVREFYAGLREESISMAVVRGKMMK--------------------------
        ++ R    EKGF     +  G LP F+   I Q+ W+  CAHP++ +VPLVREFYA L +   +   VRG  +                           
Subjt:  MKKRDFLNEKGFF----DRAGALPEFVTIAIFQYKWQDVCAHPQEAVVPLVREFYAGLREESISMAVVRGKMMK--------------------------

Query:  ------DALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKNRLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGS
                L+ VA  G +W  S     + + S L P + VW HF+K+ L+ TTH  T+  DR++LL+ ++ G  INVG +I  EI AC  +  G LFF S
Subjt:  ------DALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKNRLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGS

Query:  LITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILRKDKASTSQATPQSGSN
        LIT+LC+  +     +EE       ID   + ++ Q       +  +S     + S+
Subjt:  LITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILRKDKASTSQATPQSGSN

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]; e value 5.9e-27; %identity 29.18
Query:  MKKRDFLNEKGFF----DRAGALPEFVTIAIFQYKWQDVCAHPQEAVVPLVREFYAGLREESISMAVVRGKMMK--------------------------
        ++ R    EKGF     +  G LP F+   I Q+ W+  CAHP++ +VPLVREFYA L +   +   VRG  +                           
Subjt:  MKKRDFLNEKGFF----DRAGALPEFVTIAIFQYKWQDVCAHPQEAVVPLVREFYAGLREESISMAVVRGKMMK--------------------------

Query:  ------DALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKNRLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGS
                L+ VA  G +W  S     + + S L P + VW HF+K+RL+ TTH  T+  DR++LL+ ++ G  INVG +I  EI AC  +  G LFF S
Subjt:  ------DALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKNRLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGS

Query:  LITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILR--KDKASTSQATPQSGSNVASPSQHTPFTGPSPESEALGMVHRQ--LDQIRENLKTYWV
        LIT+LC+  +     +EE       ID   + ++ Q       +  +S+  AT  S        Q           + +   H    L    +  + +W 
Subjt:  LITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILR--KDKASTSQATPQSGSNVASPSQHTPFTGPSPESEALGMVHRQ--LDQIRENLKTYWV

Query:  YAKERDEAIREFYLFIAPSIALVFPNFPQSLLS----QEEEDSDEEENEENEE
        Y+KERD A+++            FP FPQ +L     + E +SD++ + E  E
Subjt:  YAKERDEAIREFYLFIAPSIALVFPNFPQSLLS----QEEEDSDEEENEENEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]; e value 8.9e-23; %identity 32.03
Query:  ASPRNSFPEVFRDVNFQERMEIMKKRDFLNEKGF-FDRAGAL--PEFVTIAIFQYKWQDVCAHPQEAVVPLVREFYAGLREESISMAVVRG---------
        AS    F     ++ ++E ++    R    EK F +D +  L  P F+   I Q+ WQ  CAHP++ +VPLVREFY  +         +RG         
Subjt:  ASPRNSFPEVFRDVNFQERMEIMKKRDFLNEKGF-FDRAGAL--PEFVTIAIFQYKWQDVCAHPQEAVVPLVREFYAGLREESISMAVVRG---------

Query:  ---------------KMMKD--------ALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKNRLMSTTHDSTILVDRVMLLYCLMKGLDINVGS
                       + ++D         L+ VA  G +W  S     + + S L P + VW HF+K+RL+ TTH  T+  + V LLY ++ G  INVG 
Subjt:  ---------------KMMKD--------ALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKNRLMSTTHDSTILVDRVMLLYCLMKGLDINVGS

Query:  IIRDEILACGRKWAGKLFFGSLITQLCQRVK
        +I  EI AC  + +G LFF SLIT +C+  +
Subjt:  IIRDEILACGRKWAGKLFFGSLITQLCQRVK

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]; e value 2.7e-19; %identity 28.48
Query:  VPLVREFYAGLREESISMAVVRGKMMK--------------------------------DALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKN
        +PLVREFYA L +   +   VRG  +                                   L+ VA  G +W  S     + + S L P + VW HF+K+
Subjt:  VPLVREFYAGLREESISMAVVRGKMMK--------------------------------DALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKN

Query:  RLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGSLITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILRKDKAST
        RL+ TTH   +  DR++LL+ ++ G  INVG +I  EI AC  +  G LFF SLIT+LC+    +  +++ H+  E  ID   + ++ Q       +  T
Subjt:  RLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGSLITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILRKDKAST

Query:  SQATPQSGSNVASPSQHTPFTGPSPESEALGMVHRQLDQIRENLKTYWVYAKERDEAIREFYLFIAPSIALVFPNFPQSLLS----QEEEDSDEEENEEN
              S S  A+ S          + +AL     Q +   +  + +W Y+KERD A+++            FP FPQ +L     + E +SD++ + E 
Subjt:  SQATPQSGSNVASPSQHTPFTGPSPESEALGMVHRQLDQIRENLKTYWVYAKERDEAIREFYLFIAPSIALVFPNFPQSLLS----QEEEDSDEEENEEN

Query:  EE
         E
Subjt:  EE

TrEMBL top hits: e value, %identity, alignment
A0A2P5AGA5 Uncharacterized protein (Fragment); e value 3.0e-24; %identity 30.74
Query:  MKKRDFLNEKGFF----DRAGALPEFVTIAIFQYKWQDVCAHPQEAVVPLVREFYAGLREESISMAVVRGKMMK--------------------------
        ++ R    EKGF     +  G LP F+   I Q+ W+  CAHP++ +VPLVREFYA L +   +   VRG  +                           
Subjt:  MKKRDFLNEKGFF----DRAGALPEFVTIAIFQYKWQDVCAHPQEAVVPLVREFYAGLREESISMAVVRGKMMK--------------------------

Query:  ------DALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKNRLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGS
                L+ VA  G +W  S     + + S L P + VW HF+K+ L+ TTH  T+  DR++LL+ ++ G  INVG +I  EI AC  +  G LFF S
Subjt:  ------DALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKNRLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGS

Query:  LITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILRKDKASTSQATPQSGSN
        LIT+LC+  +     +EE       ID   + ++ Q       +  +S     + S+
Subjt:  LITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILRKDKASTSQATPQSGSN

A0A2P5BCG4 Uncharacterized protein (Fragment); e value 2.9e-27; %identity 29.18
Query:  MKKRDFLNEKGFF----DRAGALPEFVTIAIFQYKWQDVCAHPQEAVVPLVREFYAGLREESISMAVVRGKMMK--------------------------
        ++ R    EKGF     +  G LP F+   I Q+ W+  CAHP++ +VPLVREFYA L +   +   VRG  +                           
Subjt:  MKKRDFLNEKGFF----DRAGALPEFVTIAIFQYKWQDVCAHPQEAVVPLVREFYAGLREESISMAVVRGKMMK--------------------------

Query:  ------DALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKNRLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGS
                L+ VA  G +W  S     + + S L P + VW HF+K+RL+ TTH  T+  DR++LL+ ++ G  INVG +I  EI AC  +  G LFF S
Subjt:  ------DALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKNRLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGS

Query:  LITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILR--KDKASTSQATPQSGSNVASPSQHTPFTGPSPESEALGMVHRQ--LDQIRENLKTYWV
        LIT+LC+  +     +EE       ID   + ++ Q       +  +S+  AT  S        Q           + +   H    L    +  + +W 
Subjt:  LITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILR--KDKASTSQATPQSGSNVASPSQHTPFTGPSPESEALGMVHRQ--LDQIRENLKTYWV

Query:  YAKERDEAIREFYLFIAPSIALVFPNFPQSLLS----QEEEDSDEEENEENEE
        Y+KERD A+++            FP FPQ +L     + E +SD++ + E  E
Subjt:  YAKERDEAIREFYLFIAPSIALVFPNFPQSLLS----QEEEDSDEEENEENEE

A0A2P5DAQ2 Uncharacterized protein; e value 4.3e-23; %identity 32.03
Query:  ASPRNSFPEVFRDVNFQERMEIMKKRDFLNEKGF-FDRAGAL--PEFVTIAIFQYKWQDVCAHPQEAVVPLVREFYAGLREESISMAVVRG---------
        AS    F     ++ ++E ++    R    EK F +D +  L  P F+   I Q+ WQ  CAHP++ +VPLVREFY  +         +RG         
Subjt:  ASPRNSFPEVFRDVNFQERMEIMKKRDFLNEKGF-FDRAGAL--PEFVTIAIFQYKWQDVCAHPQEAVVPLVREFYAGLREESISMAVVRG---------

Query:  ---------------KMMKD--------ALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKNRLMSTTHDSTILVDRVMLLYCLMKGLDINVGS
                       + ++D         L+ VA  G +W  S     + + S L P + VW HF+K+RL+ TTH  T+  + V LLY ++ G  INVG 
Subjt:  ---------------KMMKD--------ALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKNRLMSTTHDSTILVDRVMLLYCLMKGLDINVGS

Query:  IIRDEILACGRKWAGKLFFGSLITQLCQRVK
        +I  EI AC  + +G LFF SLIT +C+  +
Subjt:  IIRDEILACGRKWAGKLFFGSLITQLCQRVK

A0A2P5DXM3 Uncharacterized protein; e value 1.3e-19; %identity 28.48
Query:  VPLVREFYAGLREESISMAVVRGKMMK--------------------------------DALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKN
        +PLVREFYA L +   +   VRG  +                                   L+ VA  G +W  S     + + S L P + VW HF+K+
Subjt:  VPLVREFYAGLREESISMAVVRGKMMK--------------------------------DALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKN

Query:  RLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGSLITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILRKDKAST
        RL+ TTH   +  DR++LL+ ++ G  INVG +I  EI AC  +  G LFF SLIT+LC+    +  +++ H+  E  ID   + ++ Q       +  T
Subjt:  RLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGSLITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILRKDKAST

Query:  SQATPQSGSNVASPSQHTPFTGPSPESEALGMVHRQLDQIRENLKTYWVYAKERDEAIREFYLFIAPSIALVFPNFPQSLLS----QEEEDSDEEENEEN
              S S  A+ S          + +AL     Q +   +  + +W Y+KERD A+++            FP FPQ +L     + E +SD++ + E 
Subjt:  SQATPQSGSNVASPSQHTPFTGPSPESEALGMVHRQLDQIRENLKTYWVYAKERDEAIREFYLFIAPSIALVFPNFPQSLLS----QEEEDSDEEENEEN

Query:  EE
         E
Subjt:  EE

W9RBS1 Uncharacterized protein; e value 6.2e-22; %identity 26.27
Query:  FAKRPRTRSMDASPIVPPTVSPAKPKGKSPKAASPRNSFPEVFRDVNFQERMEIMKKRDFLNEKGFF---DRAGALPEFVTIAIFQYKWQDVCAHPQEAV
        FAKRP + S    P +    + A     S +  S    F +   +  ++E    +  R+ + EKGF          P F++  I    WQ  C HP + +
Subjt:  FAKRPRTRSMDASPIVPPTVSPAKPKGKSPKAASPRNSFPEVFRDVNFQERMEIMKKRDFLNEKGFF---DRAGALPEFVTIAIFQYKWQDVCAHPQEAV

Query:  VPLVREFYAGLREE------------SISMAVVRG--------------------KMMKDALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKN
        VPLV+EFYA L+ +            + +   + G                    + +K+ LK +A  G QW  S     +    +L+P + VW HF+ +
Subjt:  VPLVREFYAGLREE------------SISMAVVRG--------------------KMMKDALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKN

Query:  RLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGSLITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILRKDKAST
        RL+ +TH  TI  +R +LLY ++ G  INVG +I D+I AC  K  G L+F SLI++LC +  +     E        +DL  I ++      R +K+  
Subjt:  RLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDEILACGRKWAGKLFFGSLITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILRKDKAST

Query:  SQATPQSGSNVASPSQHTPFTGPSPESEALGM-----------VHRQLDQIRENLKTYWVYAKERDEAIREFY
         +   +        + HT     +   E L             +   L Q +E L  +WVY+++RD A+++ +
Subjt:  SQATPQSGSNVASPSQHTPFTGPSPESEALGM-----------VHRQLDQIRENLKTYWVYAKERDEAIREFY

SwissProt top hits: e value, %identity, alignment
No hits found
Arabidopsis top hits: e value, %identity, alignment
No hits found

Sequences
CDS sequence
ATGAAAAAAAGAAGGAAGGAAGCTGAAGACTTCCTTGCAGCTTTTGATCCACTTCATAAGGCTCAAAGTGAGGCTGAAGCGTTGCAAGGAAGGGAAGACCTAGCTGAGGT
CACAGTTGATCAGCCAGCTGAAGAGGTCTTTGAACCTGTATTCACACATGACCCACCAGCTGCTGATAGCACCTCTTCGAGAGAAAAGAGGGTTGAAGAGGAAAAAGAAG
ACGAGGAGGCCGAGACCTCCAATGATTCTGACTCTGATACAGAATCTGATTCAGAGATTAGGGAGCTAGATGGCGACCAAGTCCCTATCTCTGCAGCATTAAGAAGGAAG
AGGAAAAGAGAGATAAAGGCTGAGAGGAGGACAAAGAACAAGAATGATCCGATATTTGCCAAGAGGCCGAGGACAAGGTCCATGGACGCCTCCCCTATAGTTCCTCCGAC
CGTCTCACCCGCCAAGCCAAAGGGCAAATCACCCAAGGCTGCATCTCCCAGAAATTCGTTCCCTGAGGTATTTAGAGATGTTAATTTTCAGGAACGAATGGAGATCATGA
AGAAAAGAGATTTCCTCAATGAGAAAGGATTCTTTGATAGAGCTGGAGCACTGCCTGAGTTCGTAACAATAGCTATCTTCCAGTACAAGTGGCAGGACGTCTGTGCTCAC
CCTCAGGAGGCTGTTGTGCCTTTAGTGCGAGAATTTTATGCTGGCCTGAGGGAGGAGAGCATTAGCATGGCAGTGGTGAGGGGGAAGATGATGAAAGACGCATTGAAGCT
TGTGGCCAACAAGGGGGTCCAATGGAAAGAATCGCAGACAAAAGTGAAGTCTTTAGTGTCAAGCGATCTAAAGCCAGAATCGGTAGTTTGGCTTCACTTCATCAAAAACC
GTTTGATGTCAACCACCCATGACAGCACAATTTTAGTGGATAGGGTAATGCTACTCTATTGCCTTATGAAGGGGTTGGACATCAACGTGGGGAGCATAATCAGGGATGAG
ATCTTAGCCTGTGGAAGGAAATGGGCAGGCAAGCTTTTCTTTGGCTCACTCATCACCCAACTCTGTCAGAGGGTGAAGATTGTGCCAGGCAAGGACGAGGAGCATCATTT
CTTTGAACCAACCATTGACTTGTCCTTGATAGGAAAGCTTCAACAGAACAACATCCTGAGGAAGGATAAAGCCTCCACATCACAGGCCACACCTCAATCAGGGTCGAATG
TAGCCTCTCCATCCCAGCATACTCCTTTCACAGGGCCTTCACCAGAATCGGAAGCCCTAGGTATGGTCCACCGCCAGTTAGATCAAATCAGGGAGAACCTGAAGACGTAC
TGGGTATATGCAAAGGAGAGGGATGAAGCTATTAGAGAGTTCTATCTCTTTATCGCCCCTAGTATCGCTCTGGTCTTTCCAAATTTCCCTCAGTCGCTGCTGTCCCAAGA
AGAAGAGGATTCTGATGAAGAGGAAAATGAAGAGAATGAAGAGAAAGAGAGTTCCTCGGACGAGGACTAG
mRNA sequence
ATGAAAAAAAGAAGGAAGGAAGCTGAAGACTTCCTTGCAGCTTTTGATCCACTTCATAAGGCTCAAAGTGAGGCTGAAGCGTTGCAAGGAAGGGAAGACCTAGCTGAGGT
CACAGTTGATCAGCCAGCTGAAGAGGTCTTTGAACCTGTATTCACACATGACCCACCAGCTGCTGATAGCACCTCTTCGAGAGAAAAGAGGGTTGAAGAGGAAAAAGAAG
ACGAGGAGGCCGAGACCTCCAATGATTCTGACTCTGATACAGAATCTGATTCAGAGATTAGGGAGCTAGATGGCGACCAAGTCCCTATCTCTGCAGCATTAAGAAGGAAG
AGGAAAAGAGAGATAAAGGCTGAGAGGAGGACAAAGAACAAGAATGATCCGATATTTGCCAAGAGGCCGAGGACAAGGTCCATGGACGCCTCCCCTATAGTTCCTCCGAC
CGTCTCACCCGCCAAGCCAAAGGGCAAATCACCCAAGGCTGCATCTCCCAGAAATTCGTTCCCTGAGGTATTTAGAGATGTTAATTTTCAGGAACGAATGGAGATCATGA
AGAAAAGAGATTTCCTCAATGAGAAAGGATTCTTTGATAGAGCTGGAGCACTGCCTGAGTTCGTAACAATAGCTATCTTCCAGTACAAGTGGCAGGACGTCTGTGCTCAC
CCTCAGGAGGCTGTTGTGCCTTTAGTGCGAGAATTTTATGCTGGCCTGAGGGAGGAGAGCATTAGCATGGCAGTGGTGAGGGGGAAGATGATGAAAGACGCATTGAAGCT
TGTGGCCAACAAGGGGGTCCAATGGAAAGAATCGCAGACAAAAGTGAAGTCTTTAGTGTCAAGCGATCTAAAGCCAGAATCGGTAGTTTGGCTTCACTTCATCAAAAACC
GTTTGATGTCAACCACCCATGACAGCACAATTTTAGTGGATAGGGTAATGCTACTCTATTGCCTTATGAAGGGGTTGGACATCAACGTGGGGAGCATAATCAGGGATGAG
ATCTTAGCCTGTGGAAGGAAATGGGCAGGCAAGCTTTTCTTTGGCTCACTCATCACCCAACTCTGTCAGAGGGTGAAGATTGTGCCAGGCAAGGACGAGGAGCATCATTT
CTTTGAACCAACCATTGACTTGTCCTTGATAGGAAAGCTTCAACAGAACAACATCCTGAGGAAGGATAAAGCCTCCACATCACAGGCCACACCTCAATCAGGGTCGAATG
TAGCCTCTCCATCCCAGCATACTCCTTTCACAGGGCCTTCACCAGAATCGGAAGCCCTAGGTATGGTCCACCGCCAGTTAGATCAAATCAGGGAGAACCTGAAGACGTAC
TGGGTATATGCAAAGGAGAGGGATGAAGCTATTAGAGAGTTCTATCTCTTTATCGCCCCTAGTATCGCTCTGGTCTTTCCAAATTTCCCTCAGTCGCTGCTGTCCCAAGA
AGAAGAGGATTCTGATGAAGAGGAAAATGAAGAGAATGAAGAGAAAGAGAGTTCCTCGGACGAGGACTAG
Protein sequence
MKKRRKEAEDFLAAFDPLHKAQSEAEALQGREDLAEVTVDQPAEEVFEPVFTHDPPAADSTSSREKRVEEEKEDEEAETSNDSDSDTESDSEIRELDGDQVPISAALRRK
RKREIKAERRTKNKNDPIFAKRPRTRSMDASPIVPPTVSPAKPKGKSPKAASPRNSFPEVFRDVNFQERMEIMKKRDFLNEKGFFDRAGALPEFVTIAIFQYKWQDVCAH
PQEAVVPLVREFYAGLREESISMAVVRGKMMKDALKLVANKGVQWKESQTKVKSLVSSDLKPESVVWLHFIKNRLMSTTHDSTILVDRVMLLYCLMKGLDINVGSIIRDE
ILACGRKWAGKLFFGSLITQLCQRVKIVPGKDEEHHFFEPTIDLSLIGKLQQNNILRKDKASTSQATPQSGSNVASPSQHTPFTGPSPESEALGMVHRQLDQIRENLKTY
WVYAKERDEAIREFYLFIAPSIALVFPNFPQSLLSQEEEDSDEEENEENEEKESSSDED
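
As a consistency check between the CDS and protein records above, the CDS can be translated with the standard genetic code. The sketch below is illustrative only: the names `CODON_TABLE` and `translate` are hypothetical, and the use of the standard nuclear code (NCBI translation table 1) is an assumption, since the page does not say how the protein sequence was derived.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions. ASSUMPTION: this is
# the table that applies to this gene.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped records
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS record above.
print(translate("ATGAAAAAAAGAAGGAAGGAAGCTGAAGAC"))  # MKKRRKEAED
print(translate("GACGAGGACTAG"))                    # DED
```

Both fragments agree with the start (`MKKRRKEAED`) and end (`DED`) of the protein sequence listed above. Passing the full 1500 nt CDS, with its line breaks, to `translate` would extend the same check to the whole record.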