; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018912 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018912
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold12:25902668..25905287
RNA-Seq ExpressionSpg018912
SyntenySpg018912
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]2.0e-2328.84Show/hide
Query:  FAKRPRTRSMDASPAVPPTVSLAKPKGKSHKATSPKNSFPEVFRDVNFQERMEIMKKRDFLNEKGF----SDRAGALPEFVTRVIFQYKWQDLCAHPQEV
        FAKRP + S    PA+    + A     S +  S    F +   +  ++E    +  R+ + EKGF    S   G  P F++ VI    WQ  C HP + 
Subjt:  FAKRPRTRSMDASPAVPPTVSLAKPKGKSHKATSPKNSFPEVFRDVNFQERMEIMKKRDFLNEKGF----SDRAGALPEFVTRVIFQYKWQDLCAHPQEV

Query:  VVPLVHEFYAGLREE------------SISMAVMRGKM-VRN------DVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIK
        +VPLV EFYA L+ +            + +   + G + + N      ++I +   +++KE LK +A  G QW  S     +    +L+P + VW HF+ 
Subjt:  VVPLVHEFYAGLREE------------SISMAVMRGKM-VRN------DVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIK

Query:  NRLMPTTHNSTISVDRVMLLYCLMKGLDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKAS
        +RL+ +TH  TIS +R +LLY ++ G  INV  +I D+I AC  K  G L+F SLI++LC +  +     E R      +DL  I ++     ++ +K  
Subjt:  NRLMPTTHNSTISVDRVMLLYCLMKGLDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKAS

Query:  TSQ--------ATPPTGSNVASPSQHTPFTGPSPASEALGMVHRQLDQIRENPKTYWVYAKERDEAIREFY
          +        +T  T S  A+ SQ       S        +   L Q +E    +WVY+++RD A+++ +
Subjt:  TSQ--------ATPPTGSNVASPSQHTPFTGPSPASEALGMVHRQLDQIRENPKTYWVYAKERDEAIREFY

EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]1.9e-2132.26Show/hide
Query:  PEFVTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGL---REESISMAVMRGKMVRN----------------DVIRNPSAKKMKEALKLVANKGVQWKESQ
        P F+TRVI Q+ W+  C HP   +VPLV EFYA L    +E++ +  ++                      D     + ++++  L  VA +G  W+ S 
Subjt:  PEFVTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGL---REESISMAVMRGKMVRN----------------DVIRNPSAKKMKEALKLVANKGVQWKESQ

Query:  TKVKSLVPSDLKPESAVWLHFIKNRLMPTTHNSTISVDRVMLLYCLMKGLDINVVSIIRDEILAC--GRKRAGKLFFGSLITQLCQRVKIVSGKDEK-RH
            + +  +LK  + +W HF+  R MP+TH  T++ DRV+LLY ++ G+ +N+  I   EI AC   RKR G L+F SLITQL  +  +   KDE   H
Subjt:  TKVKSLVPSDLKPESAVWLHFIKNRLMPTTHNSTISVDRVMLLYCLMKGLDINVVSIIRDEILAC--GRKRAGKLFFGSLITQLCQRVKIVSGKDEK-RH

Query:  FFKPTIDLSLIRKLQQNSIQ-------RKDKASTSQATPPTGSNVASP
               LS+ R  Q  ++        R    S+ Q T  T S   SP
Subjt:  FFKPTIDLSLIRKLQQNSIQ-------RKDKASTSQATPPTGSNVASP

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]3.6e-2832.63Show/hide
Query:  KSHKATSPKNSFPEVFRDVNFQERMEIMKKRDFLN-EKGF----SDRAGALPEFVTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGLREESISMAVMRGKM
        K+HKA   +    E   + N Q R         LN EKGF    S+  G LP F+ +VI Q+ W+  CAHP++ +VPLV EFYA L +   +   +RG  
Subjt:  KSHKATSPKNSFPEVFRDVNFQERMEIMKKRDFLN-EKGF----SDRAGALPEFVTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGLREESISMAVMRGKM

Query:  V-------------------RNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHNSTISVDRVMLLYCLMKG
        V                    ++ I N +   +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G
Subjt:  V-------------------RNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHNSTISVDRVMLLYCLMKG

Query:  LDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKASTSQATPPTGSN
          INV  +I  EI AC  ++ G LFF SLIT+LC+  +     +E++      ID   + ++ Q     +     S + P T S+
Subjt:  LDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKASTSQATPPTGSN

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]4.1e-3230.91Show/hide
Query:  KSHKATSPKNSFPEVFRDVNFQERMEIMKKRDFLN-EKGF----SDRAGALPEFVTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGLREESISMAVMRGKM
        K+HKA   +        + N Q R         LN EKGF    S+  G LP F+ +VI Q+ W+  CAHP++ +VPLV EFYA L +   +   +RG  
Subjt:  KSHKATSPKNSFPEVFRDVNFQERMEIMKKRDFLN-EKGF----SDRAGALPEFVTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGLREESISMAVMRGKM

Query:  V-------------------RNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHNSTISVDRVMLLYCLMKG
        V                    ++ I+N + + +   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G
Subjt:  V-------------------RNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHNSTISVDRVMLLYCLMKG

Query:  LDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKI-VSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKASTSQ-ATPPTGSNVASPSQHTPFTGP
          INV  +I  EI AC  ++ G LFF SLIT+LC+  +      +EK H       +++ R  Q+   +   + S+S+ AT  +        Q       
Subjt:  LDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKI-VSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKASTSQ-ATPPTGSNVASPSQHTPFTGP

Query:  SPASEALGMVHRQ--LDQIRENPKTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEENDEKESSSDEE
          + + +   H    L    +  + +W Y+KERD A+++   +      P FP FPQ +L    KD D E + E+D+  S+   E
Subjt:  SPASEALGMVHRQ--LDQIRENPKTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEENDEKESSSDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]4.9e-2532.03Show/hide
Query:  KATSPKNSFPEVFRDVNFQER-MEIMKKRDFLNEKGFSDRAGALPEFVTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGLREESISMAVMRGKMV------
        KA   ++   E+  + N Q R + + K+  + N K         P F+  VI Q+ WQ  CAHP++ +VPLV EFY  +         +RG  V      
Subjt:  KATSPKNSFPEVFRDVNFQER-MEIMKKRDFLNEKGFSDRAGALPEFVTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGLREESISMAVMRGKMV------

Query:  -------------RNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHNSTISVDRVMLLYCLMKGLDINVVS
                      ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INV  
Subjt:  -------------RNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHNSTISVDRVMLLYCLMKGLDINVVS

Query:  IIRDEILACGRKRAGKLFFGSLITQLCQRVK
        +I  EI AC  +++G LFF SLIT +C+  +
Subjt:  IIRDEILACGRKRAGKLFFGSLITQLCQRVK

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.7e-2832.63Show/hide
Query:  KSHKATSPKNSFPEVFRDVNFQERMEIMKKRDFLN-EKGF----SDRAGALPEFVTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGLREESISMAVMRGKM
        K+HKA   +    E   + N Q R         LN EKGF    S+  G LP F+ +VI Q+ W+  CAHP++ +VPLV EFYA L +   +   +RG  
Subjt:  KSHKATSPKNSFPEVFRDVNFQERMEIMKKRDFLN-EKGF----SDRAGALPEFVTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGLREESISMAVMRGKM

Query:  V-------------------RNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHNSTISVDRVMLLYCLMKG
        V                    ++ I N +   +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G
Subjt:  V-------------------RNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHNSTISVDRVMLLYCLMKG

Query:  LDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKASTSQATPPTGSN
          INV  +I  EI AC  ++ G LFF SLIT+LC+  +     +E++      ID   + ++ Q     +     S + P T S+
Subjt:  LDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKASTSQATPPTGSN

A0A2P5BCG4 Uncharacterized protein (Fragment)2.0e-3230.91Show/hide
Query:  KSHKATSPKNSFPEVFRDVNFQERMEIMKKRDFLN-EKGF----SDRAGALPEFVTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGLREESISMAVMRGKM
        K+HKA   +        + N Q R         LN EKGF    S+  G LP F+ +VI Q+ W+  CAHP++ +VPLV EFYA L +   +   +RG  
Subjt:  KSHKATSPKNSFPEVFRDVNFQERMEIMKKRDFLN-EKGF----SDRAGALPEFVTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGLREESISMAVMRGKM

Query:  V-------------------RNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHNSTISVDRVMLLYCLMKG
        V                    ++ I+N + + +   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G
Subjt:  V-------------------RNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHNSTISVDRVMLLYCLMKG

Query:  LDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKI-VSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKASTSQ-ATPPTGSNVASPSQHTPFTGP
          INV  +I  EI AC  ++ G LFF SLIT+LC+  +      +EK H       +++ R  Q+   +   + S+S+ AT  +        Q       
Subjt:  LDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKI-VSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKASTSQ-ATPPTGSNVASPSQHTPFTGP

Query:  SPASEALGMVHRQ--LDQIRENPKTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEENDEKESSSDEE
          + + +   H    L    +  + +W Y+KERD A+++   +      P FP FPQ +L    KD D E + E+D+  S+   E
Subjt:  SPASEALGMVHRQ--LDQIRENPKTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEENDEKESSSDEE

A0A2P5DAQ2 Uncharacterized protein2.4e-2532.03Show/hide
Query:  KATSPKNSFPEVFRDVNFQER-MEIMKKRDFLNEKGFSDRAGALPEFVTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGLREESISMAVMRGKMV------
        KA   ++   E+  + N Q R + + K+  + N K         P F+  VI Q+ WQ  CAHP++ +VPLV EFY  +         +RG  V      
Subjt:  KATSPKNSFPEVFRDVNFQER-MEIMKKRDFLNEKGFSDRAGALPEFVTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGLREESISMAVMRGKMV------

Query:  -------------RNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHNSTISVDRVMLLYCLMKGLDINVVS
                      ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INV  
Subjt:  -------------RNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHNSTISVDRVMLLYCLMKGLDINVVS

Query:  IIRDEILACGRKRAGKLFFGSLITQLCQRVK
        +I  EI AC  +++G LFF SLIT +C+  +
Subjt:  IIRDEILACGRKRAGKLFFGSLITQLCQRVK

A0A2P5DXM3 Uncharacterized protein9.3e-2230.07Show/hide
Query:  VPLVHEFYAGLREESISMAVMRGKMV-------------------RNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKN
        +PLV EFYA L +   +   +RG  V                    ++ I N +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+
Subjt:  VPLVHEFYAGLREESISMAVMRGKMV-------------------RNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKN

Query:  RLMPTTHNSTISVDRVMLLYCLMKGLDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKAST
        RL+PTTH   +S DR++LL+ ++ G  INV  +I  EI AC  ++ G LFF SLIT+LC+    +   +EK H     ID   + ++ Q       +  T
Subjt:  RLMPTTHNSTISVDRVMLLYCLMKGLDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKAST

Query:  SQATPPTGSNVASPSQHTPFTGPSPASEALGMVHRQLDQIRENPKTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEENDEKE
             P+ S  A+ S            +AL     Q +   +  + +W Y+KERD A+++   +      P FP FPQ +L    +D D E + E+D+  
Subjt:  SQATPPTGSNVASPSQHTPFTGPSPASEALGMVHRQLDQIRENPKTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEENDEKE

Query:  SSSDEE
        S+   E
Subjt:  SSSDEE

W9RBS1 Uncharacterized protein9.9e-2428.84Show/hide
Query:  FAKRPRTRSMDASPAVPPTVSLAKPKGKSHKATSPKNSFPEVFRDVNFQERMEIMKKRDFLNEKGF----SDRAGALPEFVTRVIFQYKWQDLCAHPQEV
        FAKRP + S    PA+    + A     S +  S    F +   +  ++E    +  R+ + EKGF    S   G  P F++ VI    WQ  C HP + 
Subjt:  FAKRPRTRSMDASPAVPPTVSLAKPKGKSHKATSPKNSFPEVFRDVNFQERMEIMKKRDFLNEKGF----SDRAGALPEFVTRVIFQYKWQDLCAHPQEV

Query:  VVPLVHEFYAGLREE------------SISMAVMRGKM-VRN------DVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIK
        +VPLV EFYA L+ +            + +   + G + + N      ++I +   +++KE LK +A  G QW  S     +    +L+P + VW HF+ 
Subjt:  VVPLVHEFYAGLREE------------SISMAVMRGKM-VRN------DVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIK

Query:  NRLMPTTHNSTISVDRVMLLYCLMKGLDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKAS
        +RL+ +TH  TIS +R +LLY ++ G  INV  +I D+I AC  K  G L+F SLI++LC +  +     E R      +DL  I ++     ++ +K  
Subjt:  NRLMPTTHNSTISVDRVMLLYCLMKGLDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKAS

Query:  TSQ--------ATPPTGSNVASPSQHTPFTGPSPASEALGMVHRQLDQIRENPKTYWVYAKERDEAIREFY
          +        +T  T S  A+ SQ       S        +   L Q +E    +WVY+++RD A+++ +
Subjt:  TSQ--------ATPPTGSNVASPSQHTPFTGPSPASEALGMVHRQLDQIRENPKTYWVYAKERDEAIREFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAACACTCAAGGATCATCATCCTCACGCAAAAACACTCGATCTCAAAGTGCTCAAGCAACCCACGAAGCTGAAGCAAGTAACCGACGGCAAGAGGAGAACCCCGA
AATGCCCATGCAAGGCACGCGAAGAACGAGACCCACAGGATTCTCGCCGGCGGTCGTGAACCAAGCGCCCAACGCTCCAACTCCATCCTCTTCGACAATGTCGGCTAGTT
CGAAGGGGATGCCGAGTTCATCTACGCCGAGACGGTTCACGCGCGCCACTGCTGTCCATCAAACCCAAAAACCCGCTGCTCAACAGTTCAAGAAACGTTCGCGGGAGTGG
TTTGCAATGATCCGTGAGATGGGTGCTCAAAGACGTGCTACCCTTGAAGAAGAAGGGAATCAGCAAGATGAAAAAGAAGTCGCCAAGGCAGCTGGAAGCTCTCGGCAAGG
AGAAGCTTCGATGGGTAAGAGATTGAGGATGAGAGAGATGGAAAATCAGAAAATGACTGAGGAGGATGAGTTTGCAAAGAAAAGAGACCGAGAAGAAGAGAAAAGAAGAA
GAGAAGAAGAGCAAGAGGCCGAGAGGGCCTTAGAAGCTGAGTATGAGTATGAGGAAAACCTCAGGAGGGCAGCCATTGATTTGCGACTCCTTGAGGAAGAGAAAAAGAGA
AGGGAAGAAATAAAAGAAAATGAAAAAAGAAGGAAGGAAGCTGAAGACTTCCTTGCAGCCTTTGAGCCACTCCACAAGGCTCAAAGTGAAGCTGAAGCACTGCAAGGAAG
GAATGCGACCGCATCTGGGCCGCATTCTGAAGAAGGCCTAGCCGAGGCCACCATTGATCAGCCAGCTGAAGAGGTTTTTGAGCCTCTATTCACAAATGACCCACCAGCAG
CTGATAGCACCTCTTCGGGAGAGAAGAGGGTTGAAGAGGAAAAAGAAGACGAGGAGGCCGAGACCTCTAGTGATTCTGATTCTGAAACAGAATCTAACTCAGAGATAAGG
GAGCTAGATGGCGACCAAGTCCCTATCTCTGCAGCGTTGAGAAGAAAGAGAAAGAGAGAGATTAAGGCTGGGAGGAGGACAAAGAACAAGAATGACCCAATATTTGCCAA
GAGGCCGAGGACAAGGTCCATGGACGCCTCTCCTGCAGTTCCTCCTACCGTCTCACTCGCCAAGCCAAAGGGCAAATCACATAAGGCTACATCTCCCAAAAATTCGTTCC
CTGAGGTATTTAGAGATGTTAATTTTCAGGAACGAATGGAGATCATGAAGAAAAGAGATTTCCTCAACGAGAAGGGATTCTCTGACAGAGCTGGAGCACTGCCTGAGTTC
GTAACAAGAGTTATCTTCCAGTACAAGTGGCAGGACTTATGTGCTCACCCTCAAGAGGTTGTTGTGCCTTTAGTTCATGAATTCTACGCTGGCCTGAGGGAGGAGAGTAT
TAGCATGGCGGTGATGAGGGGGAAGATGGTCAGGAATGATGTGATCAGGAACCCTTCGGCCAAGAAAATGAAGGAAGCTCTTAAACTTGTGGCCAACAAGGGGGTTCAAT
GGAAAGAATCACAGACCAAAGTGAAGTCTTTAGTGCCAAGCGATCTAAAGCCAGAATCAGCAGTTTGGCTTCACTTCATCAAAAATCGTTTGATGCCAACCACCCACAAC
AGCACGATTTCAGTGGATAGAGTGATGCTACTCTATTGCCTTATGAAGGGGTTGGACATCAATGTGGTGAGCATTATCAGGGACGAGATTTTAGCCTGTGGGAGAAAGCG
AGCAGGCAAGCTTTTCTTTGGATCACTCATCACCCAGCTTTGCCAAAGGGTGAAGATCGTTTCAGGCAAGGATGAGAAGCGTCACTTCTTCAAGCCGACCATCGACCTGT
CCTTGATCAGGAAGCTCCAGCAGAACAGTATCCAGAGGAAAGACAAAGCCTCTACATCTCAGGCTACTCCACCTACAGGGTCGAATGTAGCTTCTCCATCCCAGCACACT
CCTTTCACAGGGCCTTCACCAGCATCGGAAGCCCTAGGTATGGTCCACCGCCAGTTAGATCAAATCAGGGAGAACCCGAAGACGTACTGGGTATATGCAAAGGAGCGGGA
TGAAGCTATTAGAGAGTTCTATCTCTCGATTGCCCCAAGTATTGCTCCGGTCTTTCCAAATTTCCCTCAGTCGCTGCTGCCCCAAGAAGAAAAGGATTCTGATGAAGAGG
AAGATGAAGAGAATGATGAGAAAGAGAGTTCCTCGGACGAGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGAACACTCAAGGATCATCATCCTCACGCAAAAACACTCGATCTCAAAGTGCTCAAGCAACCCACGAAGCTGAAGCAAGTAACCGACGGCAAGAGGAGAACCCCGA
AATGCCCATGCAAGGCACGCGAAGAACGAGACCCACAGGATTCTCGCCGGCGGTCGTGAACCAAGCGCCCAACGCTCCAACTCCATCCTCTTCGACAATGTCGGCTAGTT
CGAAGGGGATGCCGAGTTCATCTACGCCGAGACGGTTCACGCGCGCCACTGCTGTCCATCAAACCCAAAAACCCGCTGCTCAACAGTTCAAGAAACGTTCGCGGGAGTGG
TTTGCAATGATCCGTGAGATGGGTGCTCAAAGACGTGCTACCCTTGAAGAAGAAGGGAATCAGCAAGATGAAAAAGAAGTCGCCAAGGCAGCTGGAAGCTCTCGGCAAGG
AGAAGCTTCGATGGGTAAGAGATTGAGGATGAGAGAGATGGAAAATCAGAAAATGACTGAGGAGGATGAGTTTGCAAAGAAAAGAGACCGAGAAGAAGAGAAAAGAAGAA
GAGAAGAAGAGCAAGAGGCCGAGAGGGCCTTAGAAGCTGAGTATGAGTATGAGGAAAACCTCAGGAGGGCAGCCATTGATTTGCGACTCCTTGAGGAAGAGAAAAAGAGA
AGGGAAGAAATAAAAGAAAATGAAAAAAGAAGGAAGGAAGCTGAAGACTTCCTTGCAGCCTTTGAGCCACTCCACAAGGCTCAAAGTGAAGCTGAAGCACTGCAAGGAAG
GAATGCGACCGCATCTGGGCCGCATTCTGAAGAAGGCCTAGCCGAGGCCACCATTGATCAGCCAGCTGAAGAGGTTTTTGAGCCTCTATTCACAAATGACCCACCAGCAG
CTGATAGCACCTCTTCGGGAGAGAAGAGGGTTGAAGAGGAAAAAGAAGACGAGGAGGCCGAGACCTCTAGTGATTCTGATTCTGAAACAGAATCTAACTCAGAGATAAGG
GAGCTAGATGGCGACCAAGTCCCTATCTCTGCAGCGTTGAGAAGAAAGAGAAAGAGAGAGATTAAGGCTGGGAGGAGGACAAAGAACAAGAATGACCCAATATTTGCCAA
GAGGCCGAGGACAAGGTCCATGGACGCCTCTCCTGCAGTTCCTCCTACCGTCTCACTCGCCAAGCCAAAGGGCAAATCACATAAGGCTACATCTCCCAAAAATTCGTTCC
CTGAGGTATTTAGAGATGTTAATTTTCAGGAACGAATGGAGATCATGAAGAAAAGAGATTTCCTCAACGAGAAGGGATTCTCTGACAGAGCTGGAGCACTGCCTGAGTTC
GTAACAAGAGTTATCTTCCAGTACAAGTGGCAGGACTTATGTGCTCACCCTCAAGAGGTTGTTGTGCCTTTAGTTCATGAATTCTACGCTGGCCTGAGGGAGGAGAGTAT
TAGCATGGCGGTGATGAGGGGGAAGATGGTCAGGAATGATGTGATCAGGAACCCTTCGGCCAAGAAAATGAAGGAAGCTCTTAAACTTGTGGCCAACAAGGGGGTTCAAT
GGAAAGAATCACAGACCAAAGTGAAGTCTTTAGTGCCAAGCGATCTAAAGCCAGAATCAGCAGTTTGGCTTCACTTCATCAAAAATCGTTTGATGCCAACCACCCACAAC
AGCACGATTTCAGTGGATAGAGTGATGCTACTCTATTGCCTTATGAAGGGGTTGGACATCAATGTGGTGAGCATTATCAGGGACGAGATTTTAGCCTGTGGGAGAAAGCG
AGCAGGCAAGCTTTTCTTTGGATCACTCATCACCCAGCTTTGCCAAAGGGTGAAGATCGTTTCAGGCAAGGATGAGAAGCGTCACTTCTTCAAGCCGACCATCGACCTGT
CCTTGATCAGGAAGCTCCAGCAGAACAGTATCCAGAGGAAAGACAAAGCCTCTACATCTCAGGCTACTCCACCTACAGGGTCGAATGTAGCTTCTCCATCCCAGCACACT
CCTTTCACAGGGCCTTCACCAGCATCGGAAGCCCTAGGTATGGTCCACCGCCAGTTAGATCAAATCAGGGAGAACCCGAAGACGTACTGGGTATATGCAAAGGAGCGGGA
TGAAGCTATTAGAGAGTTCTATCTCTCGATTGCCCCAAGTATTGCTCCGGTCTTTCCAAATTTCCCTCAGTCGCTGCTGCCCCAAGAAGAAAAGGATTCTGATGAAGAGG
AAGATGAAGAGAATGATGAGAAAGAGAGTTCCTCGGACGAGGAATAG
Protein sequenceShow/hide protein sequence
MKNTQGSSSSRKNTRSQSAQATHEAEASNRRQEENPEMPMQGTRRTRPTGFSPAVVNQAPNAPTPSSSTMSASSKGMPSSSTPRRFTRATAVHQTQKPAAQQFKKRSREW
FAMIREMGAQRRATLEEEGNQQDEKEVAKAAGSSRQGEASMGKRLRMREMENQKMTEEDEFAKKRDREEEKRRREEEQEAERALEAEYEYEENLRRAAIDLRLLEEEKKR
REEIKENEKRRKEAEDFLAAFEPLHKAQSEAEALQGRNATASGPHSEEGLAEATIDQPAEEVFEPLFTNDPPAADSTSSGEKRVEEEKEDEEAETSSDSDSETESNSEIR
ELDGDQVPISAALRRKRKREIKAGRRTKNKNDPIFAKRPRTRSMDASPAVPPTVSLAKPKGKSHKATSPKNSFPEVFRDVNFQERMEIMKKRDFLNEKGFSDRAGALPEF
VTRVIFQYKWQDLCAHPQEVVVPLVHEFYAGLREESISMAVMRGKMVRNDVIRNPSAKKMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHN
STISVDRVMLLYCLMKGLDINVVSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEKRHFFKPTIDLSLIRKLQQNSIQRKDKASTSQATPPTGSNVASPSQHT
PFTGPSPASEALGMVHRQLDQIRENPKTYWVYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEKDSDEEEDEENDEKESSSDEE