; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020743 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020743
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold10:26013912..26016423
RNA-Seq ExpressionSpg020743
SyntenySpg020743
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]1.8e-1428.63Show/hide
Query:  PEFVTRVITQYKWQELCAHPQEAVVPLVREFYAGLREESMSMAVVRGKMVSFSSVDINRVFNGK----------------------------------SP
        P F+TRVI Q+ W++ C HP   +VPLVREFYA L + +     V+   V F++  IN +F  +                                  SP
Subjt:  PEFVTRVITQYKWQELCAHPQEAVVPLVREFYAGLREESMSMAVVRGKMVSFSSVDINRVFNGK----------------------------------SP

Query:  K----------------------LRLMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSC-GRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFF
        +                       R MP+TH  T++ +RV+LLYSI+ G+ +NI     +EI +C   +K G L+F SLIT L  +  +   KDE     
Subjt:  K----------------------LRLMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSC-GRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFF

Query:  KPTINLPLIGKLQQNNAQRKDKASTSQ
           I+   I ++ Q  A    K   ++
Subjt:  KPTINLPLIGKLQQNNAQRKDKASTSQ

KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]5.5e-1626.77Show/hide
Query:  PSPKNPFPEVFRDVNFQERMKIMRKRDFLNEKGF---SNRAGTLPEFVTRVITQYKWQELCAHPQEAVVPLVREFYAGLREESMSMAVVRGKMVSFSSVD
        P+P  P P  F D   +E  + ++ R    E GF         L   V  V+T++KWQ+   HP      +V+EFY+ + E +    +VRG  + F+   
Subjt:  PSPKNPFPEVFRDVNFQERMKIMRKRDFLNEKGF---SNRAGTLPEFVTRVITQYKWQELCAHPQEAVVPLVREFYAGLREESMSMAVVRGKMVSFSSVD

Query:  INRVF-----------------------------------NGKS---------------------PKLRLMPTTHNNTISVERVMLLYSIMKGLEINIGS
        INR F                                   NG+                       K +LMPT+HN T+S +R++LL+SI+ G  I+IG 
Subjt:  INRVF-----------------------------------NGKS---------------------PKLRLMPTTHNNTISVERVMLLYSIMKGLEINIGS

Query:  NIREEILSCGRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKL--QQNNAQRKDKASTSQATPPSGLNPASPSQHTPFSGPSPSSEAL
         I E    C +++A  L F +LIT LC++ K+     +E       +N   I  L   +    +K +A+TS+          + S H   S  +   +A+
Subjt:  NIREEILSCGRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKL--QQNNAQRKDKASTSQATPPSGLNPASPSQHTPFSGPSPSSEAL

Query:  EIAYQQLDQIRNNLRTYWAYAKERD
        +  +Q + Q+ + L  Y+AYAK RD
Subjt:  EIAYQQLDQIRNNLRTYWAYAKERD

KAF4375842.1 hypothetical protein G4B88_026421 [Cannabis sativa]8.0e-1527.54Show/hide
Query:  KNPFPEVFRDVNFQERMKIMRKRDFLNEKGF---SNRAGTLPEFVTRVITQYKWQELCAHPQEAVVPLVREFYAG-LREESMSMAVVRGKMVSFSSVDIN
        KN F E+ +    ++ +  +R ++F  ++G        G++P ++   I +  W +LC  P  AV  +V+EFYA  L  E  +   VR   V FS  DIN
Subjt:  KNPFPEVFRDVNFQERMKIMRKRDFLNEKGF---SNRAGTLPEFVTRVITQYKWQELCAHPQEAVVPLVREFYAG-LREESMSMAVVRGKMVSFSSVDIN

Query:  R---------------------------VFNGKSPKLR-------------LMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSCGRKKAGKLF
                                    VF  K   L+             L+PT+H++T+S ER+ +LY I+KG +IN+G  I +EI  C  +  GKLF
Subjt:  R---------------------------VFNGKSPKLR-------------LMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSCGRKKAGKLF

Query:  FGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNAQRKDKASTSQATPPSGLNPASPSQHTPFSGPSPSSEALEIAYQQLDQIRNNLRTYWAY
        F  LIT  C+                   N+P++    ++   RK   S     P       +PS  T  +   P  E L        Q+   L+T+W Y
Subjt:  FGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNAQRKDKASTSQATPPSGLNPASPSQHTPFSGPSPSSEALEIAYQQLDQIRNNLRTYWAY

Query:  AKERD
         +ERD
Subjt:  AKERD

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.4e-2729.97Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLPEFVTRVITQYKWQELCAHPQEAVVPLVREFYAGLREESMSMAVVRGKMVSFSSVDINRVFNGKSP-----------
        ++ R    EKGF    S   G LP F+ +VITQ+ W++ CAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN VF    P           
Subjt:  MRKRDFLNEKGF----SNRAGTLPEFVTRVITQYKWQELCAHPQEAVVPLVREFYAGLREESMSMAVVRGKMVSFSSVDINRVFNGKSP-----------

Query:  ---------------------------------------------KLRLMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSCGRKKAGKLFFGS
                                                     K RL+PTTH  T+S +R++LL+S++ G  IN+G  I  EI +C  +K G LFF S
Subjt:  ---------------------------------------------KLRLMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSCGRKKAGKLFFGS

Query:  LITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNAQRKDKASTSQATPPSGLNPASPSQHTPFSGPSPSSEALEIAYQQ-----------LDQIRN
        LIT LC+  +     +EE       I+   + ++ Q       +  T     PS   PA+ S +          +ALE    Q           L     
Subjt:  LITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNAQRKDKASTSQATPPSGLNPASPSQHTPFSGPSPSSEALEIAYQQ-----------LDQIRN

Query:  NLRTYWAYAKERDEVIREFYLSITPSIAPVFPDFPQSLLPQEEKDSEDERESSSDEE
          + +WAY+KERD  +++   +      P FP FPQ +L    KD + E E+ SD++
Subjt:  NLRTYWAYAKERDEVIREFYLSITPSIAPVFPDFPQSLLPQEEKDSEDERESSSDEE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]4.7e-1532.66Show/hide
Query:  KLRLMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSCGRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNAQRKDKA
        K RL+PTTH   +S +R++LL+S++ G  IN+G  I  EI +C  +K G LFF SLIT LC+    +   +EE       I+   + ++ Q       + 
Subjt:  KLRLMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSCGRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNAQRKDKA

Query:  STSQATPPSGLNPASPSQHTPFSGPSPSSEALEIAYQQLDQIRNNLRTYWAYAKERDEVIREFYLSITPSIAPVFPDFPQSLLPQEEKDSEDERESSSD
         T     PS   PA+ S            +ALE    Q +      + +WAY+KERD  +++   +      P FP FPQ +L  ++ D E E ES  D
Subjt:  STSQATPPSGLNPASPSQHTPFSGPSPSSEALEIAYQQLDQIRNNLRTYWAYAKERDEVIREFYLSITPSIAPVFPDFPQSLLPQEEKDSEDERESSSD

TrEMBL top hitse value%identityAlignment
A0A2P5BCG4 Uncharacterized protein (Fragment)1.2e-2729.97Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLPEFVTRVITQYKWQELCAHPQEAVVPLVREFYAGLREESMSMAVVRGKMVSFSSVDINRVFNGKSP-----------
        ++ R    EKGF    S   G LP F+ +VITQ+ W++ CAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN VF    P           
Subjt:  MRKRDFLNEKGF----SNRAGTLPEFVTRVITQYKWQELCAHPQEAVVPLVREFYAGLREESMSMAVVRGKMVSFSSVDINRVFNGKSP-----------

Query:  ---------------------------------------------KLRLMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSCGRKKAGKLFFGS
                                                     K RL+PTTH  T+S +R++LL+S++ G  IN+G  I  EI +C  +K G LFF S
Subjt:  ---------------------------------------------KLRLMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSCGRKKAGKLFFGS

Query:  LITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNAQRKDKASTSQATPPSGLNPASPSQHTPFSGPSPSSEALEIAYQQ-----------LDQIRN
        LIT LC+  +     +EE       I+   + ++ Q       +  T     PS   PA+ S +          +ALE    Q           L     
Subjt:  LITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNAQRKDKASTSQATPPSGLNPASPSQHTPFSGPSPSSEALEIAYQQ-----------LDQIRN

Query:  NLRTYWAYAKERDEVIREFYLSITPSIAPVFPDFPQSLLPQEEKDSEDERESSSDEE
          + +WAY+KERD  +++   +      P FP FPQ +L    KD + E E+ SD++
Subjt:  NLRTYWAYAKERDEVIREFYLSITPSIAPVFPDFPQSLLPQEEKDSEDERESSSDEE

A0A2P5CEY2 Uncharacterized protein1.1e-1432.38Show/hide
Query:  KLRLMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSCGRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNAQRKDKA
        K RL+PTTH  T+S +R++LLYS++ G  IN+G  I  EI +C  +K+G LFF SLIT LC+  +     +EE       I+   + ++ Q       + 
Subjt:  KLRLMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSCGRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNAQRKDKA

Query:  STSQATPPSGLNPASPSQHTPFSGPSPSSEALEIAYQQ-----------LDQIRNNLRTYWAYAKERDEVIREFYLSITPSIAPVFPDFPQSLLPQEEKD
         T     PS   P   S            +ALE    Q           L       + +WAY+KERD  +++   +      P FP FPQ LL  ++ D
Subjt:  STSQATPPSGLNPASPSQHTPFSGPSPSSEALEIAYQQ-----------LDQIRNNLRTYWAYAKERDEVIREFYLSITPSIAPVFPDFPQSLLPQEEKD

Query:  SEDERESSSD
         E E ES  D
Subjt:  SEDERESSSD

A0A2P5DXM3 Uncharacterized protein2.3e-1532.66Show/hide
Query:  KLRLMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSCGRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNAQRKDKA
        K RL+PTTH   +S +R++LL+S++ G  IN+G  I  EI +C  +K G LFF SLIT LC+    +   +EE       I+   + ++ Q       + 
Subjt:  KLRLMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSCGRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNAQRKDKA

Query:  STSQATPPSGLNPASPSQHTPFSGPSPSSEALEIAYQQLDQIRNNLRTYWAYAKERDEVIREFYLSITPSIAPVFPDFPQSLLPQEEKDSEDERESSSD
         T     PS   PA+ S            +ALE    Q +      + +WAY+KERD  +++   +      P FP FPQ +L  ++ D E E ES  D
Subjt:  STSQATPPSGLNPASPSQHTPFSGPSPSSEALEIAYQQLDQIRNNLRTYWAYAKERDEVIREFYLSITPSIAPVFPDFPQSLLPQEEKDSEDERESSSD

A0A6A2ZUE4 Uncharacterized protein2.7e-1626.77Show/hide
Query:  PSPKNPFPEVFRDVNFQERMKIMRKRDFLNEKGF---SNRAGTLPEFVTRVITQYKWQELCAHPQEAVVPLVREFYAGLREESMSMAVVRGKMVSFSSVD
        P+P  P P  F D   +E  + ++ R    E GF         L   V  V+T++KWQ+   HP      +V+EFY+ + E +    +VRG  + F+   
Subjt:  PSPKNPFPEVFRDVNFQERMKIMRKRDFLNEKGF---SNRAGTLPEFVTRVITQYKWQELCAHPQEAVVPLVREFYAGLREESMSMAVVRGKMVSFSSVD

Query:  INRVF-----------------------------------NGKS---------------------PKLRLMPTTHNNTISVERVMLLYSIMKGLEINIGS
        INR F                                   NG+                       K +LMPT+HN T+S +R++LL+SI+ G  I+IG 
Subjt:  INRVF-----------------------------------NGKS---------------------PKLRLMPTTHNNTISVERVMLLYSIMKGLEINIGS

Query:  NIREEILSCGRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKL--QQNNAQRKDKASTSQATPPSGLNPASPSQHTPFSGPSPSSEAL
         I E    C +++A  L F +LIT LC++ K+     +E       +N   I  L   +    +K +A+TS+          + S H   S  +   +A+
Subjt:  NIREEILSCGRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKL--QQNNAQRKDKASTSQATPPSGLNPASPSQHTPFSGPSPSSEAL

Query:  EIAYQQLDQIRNNLRTYWAYAKERD
        +  +Q + Q+ + L  Y+AYAK RD
Subjt:  EIAYQQLDQIRNNLRTYWAYAKERD

A0A6A3BU96 Uncharacterized protein5.6e-1422.62Show/hide
Query:  PSPKNPFP-EVFRDVNFQERMKIMRKRDFLNEKGF---SNRAGTLPEFVTRVITQYKWQELCAHPQEAVVPLVREFYAGLREESMSMAVVRGKMVSFSSV
        P+P +    + F +   + R +  + R+   E GF       G     V  ++   KW +   HP      LV+EFYA + + +     VRGK + F+S+
Subjt:  PSPKNPFP-EVFRDVNFQERMKIMRKRDFLNEKGF---SNRAGTLPEFVTRVITQYKWQELCAHPQEAVVPLVREFYAGLREESMSMAVVRGKMVSFSSV

Query:  DINRVF-----------------------------------NGKSP---------------------KLRLMPTTHNNTISVERVMLLYSIMKGLEINIG
         INR F                                   NG+                       K +LMPT+HN T+S+ R++LL+S++    I++G
Subjt:  DINRVF-----------------------------------NGKSP---------------------KLRLMPTTHNNTISVERVMLLYSIMKGLEINIG

Query:  SNIREEILSCGRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNAQRKDKASTSQATPPSGLNPASPSQHTPFSGPSPSSEALE
          I +++  C  KKA  L F +LIT LC++ K+     +E         LP +  + ++           ++  P        ++           EA+ 
Subjt:  SNIREEILSCGRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNAQRKDKASTSQATPPSGLNPASPSQHTPFSGPSPSSEALE

Query:  IAYQQLDQIRNNLRTYWAYAKERDEVIREFYLSITPSIAPVFPDFPQSLLPQEEKDSEDERESSSDE
            QL  +  ++R ++ Y K RD ++   +  I P     FP FP  +LP    ++  E E  + +
Subjt:  IAYQQLDQIRNNLRTYWAYAKERDEVIREFYLSITPSIAPVFPDFPQSLLPQEEKDSEDERESSSDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAACACACCAAAATCATCTTCATCCCGCAAGATTACTCAATCTCAGAGTAATCAACCTGCCCAAGGTGCTGAAGTAAGTGCTCGACGCCAAGAAGAGCAACCCGA
AGTCACCATGCACGACACGAGAGGGACGAGACCTACAGAATTCGAGTTCACCAAGAAGGTCCACACGCACCGATTCCCTCCTCAAAACCCAAAACCCGCGAGCCAACAGT
ACAACAAACACTCAAGGGAATGGTTTAAGATGATTAGGGAGATGAGAACCCAAAGGTGTGCGGCTCTTGAAGAAGAAGCAAGACGGCGTGATGAAGAAGAAGCCACCAAG
GCTAGAGAGAGCTCTCAACATGGAGAGACTCTAACGGGTAAAAGTTCCAACCCTAAAACTAACCCTTCTTCGTCTTGCAGGGATAGGCCTTTTGTTACTTACAGTGCAAG
AAAGAAAGAGTCAAAGAAGGCTGCGCCTGAAAAGCCTCTTGTCATCGAGCCCCTCAAAGTAGCTAGAATGCATCCAAATGTGTTCGAGGACATGATCCACCAAGTCGTGG
CACAAGCCCTTATTATTGCTGAAGGTTATAGGGCTGAGCAAGATGCATTAAGGGAGATAAGGGCTGAAAGAGAGATGGAAAATCAGAGCATGAGGGAAGAGGATGAGTTT
GCGAGAAGAAGGGACTTAGAAGAAGAAAAGGAAGCTGAAAGAAAGAAGGAAGAAGAAAAGAGAGTGGCTGCTGAATTGCAACTCCTTGAGGAAGAAAAAGAAAAAAGAGA
AATTTTGAAAGAAGAAGAGAAAAGAATAAAAGAATCTGAGGACTTCCTTGCAGCCTTTGAGCCACTGCAAAAGGCCCAAAGTGAGGTTGATTTACTGCAAGGAAGAGAAG
AAGAGGCCCTTGAGGGGCCAAGTGAAGAAGACCAAGAAAAAGAAGAAGAAAAAGAAGAAGTAAATGAAGGCCAAAATGCGACCGCATTTGGGCCACATTATGAAGAAGGC
AAAGAGAAGGCCAATGAAGAACAGCTAGCTGATGAAACCTTGGATCCTCTATTTGAGTATGATGTGAGAGGACCTCCACCTGCAGTTGAGAGCACCTCTTCAGGAAAGAA
GAGGGATGAAGAAGAAATTGCAAATAAGGAGGTCGAGACCTCCAGTGATTCAGAAACAGAATCTGATTCAGAGATTAAGGAATTGGATGACGACCAAATTCCTATCTCTG
CAGCATTGAGAAAAAAGAGAAGAAGAGAGATTAGGGTCGAGAGGAGGACCAAGAATAAAAATGATTCGATTTTTGCCAAGAGGCCGAGGGCAAGGTCCATGGACGCCTCT
CTTGCAGCTCCTCCAACCGTCTCACATGCCAAGCCGAAAGCCAAATCACCTAAGGCTCCATCTCCTAAAAATCCATTCCCAGAAGTCTTCAGGGATGTCAATTTTCAGGA
AAGGATGAAGATCATGAGGAAAAGAGACTTCCTAAACGAGAAGGGATTCTCTAACAGAGCTGGGACGCTGCCAGAGTTTGTAACAAGAGTTATCACACAATACAAGTGGC
AGGAGCTCTGTGCTCACCCTCAAGAGGCCGTGGTGCCTTTAGTTCGAGAATTTTACGCCGGACTGAGGGAAGAAAGCATGAGTATGGCAGTGGTGAGAGGCAAGATGGTC
AGCTTCTCTTCTGTTGACATCAACAGGGTGTTCAATGGAAAGAGTCCCAAACTAAGGTTGATGCCAACAACCCACAACAACACCATTTCAGTAGAGAGAGTCATGCTCCT
CTACAGTATTATGAAGGGGTTGGAGATAAACATCGGGAGCAATATCAGGGAGGAGATCCTTTCGTGTGGAAGGAAGAAAGCAGGGAAGCTTTTCTTTGGGTCACTTATTA
CCCTATTATGTCAGAGAGTCAAAATAGTTCCTGGAAAGGATGAAGAGTGCCACTTCTTCAAGCCTACCATCAATTTACCTCTGATTGGGAAGCTCCAACAGAACAACGCC
CAAAGAAAAGACAAAGCTTCCACATCTCAAGCCACTCCACCATCAGGGTTGAATCCGGCTTCTCCATCTCAACACACTCCTTTTTCAGGGCCCTCACCATCATCTGAAGC
CCTAGAAATTGCCTATCAACAGCTGGATCAAATCAGGAACAACCTGAGGACTTATTGGGCTTATGCCAAGGAAAGAGATGAAGTCATTAGAGAGTTTTACCTTTCTATCA
CCCCGAGTATTGCTCCTGTCTTCCCTGACTTCCCTCAATCGTTGTTGCCTCAAGAAGAAAAGGATTCTGAAGATGAAAGAGAGAGTTCCTCGGATGAGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGAACACACCAAAATCATCTTCATCCCGCAAGATTACTCAATCTCAGAGTAATCAACCTGCCCAAGGTGCTGAAGTAAGTGCTCGACGCCAAGAAGAGCAACCCGA
AGTCACCATGCACGACACGAGAGGGACGAGACCTACAGAATTCGAGTTCACCAAGAAGGTCCACACGCACCGATTCCCTCCTCAAAACCCAAAACCCGCGAGCCAACAGT
ACAACAAACACTCAAGGGAATGGTTTAAGATGATTAGGGAGATGAGAACCCAAAGGTGTGCGGCTCTTGAAGAAGAAGCAAGACGGCGTGATGAAGAAGAAGCCACCAAG
GCTAGAGAGAGCTCTCAACATGGAGAGACTCTAACGGGTAAAAGTTCCAACCCTAAAACTAACCCTTCTTCGTCTTGCAGGGATAGGCCTTTTGTTACTTACAGTGCAAG
AAAGAAAGAGTCAAAGAAGGCTGCGCCTGAAAAGCCTCTTGTCATCGAGCCCCTCAAAGTAGCTAGAATGCATCCAAATGTGTTCGAGGACATGATCCACCAAGTCGTGG
CACAAGCCCTTATTATTGCTGAAGGTTATAGGGCTGAGCAAGATGCATTAAGGGAGATAAGGGCTGAAAGAGAGATGGAAAATCAGAGCATGAGGGAAGAGGATGAGTTT
GCGAGAAGAAGGGACTTAGAAGAAGAAAAGGAAGCTGAAAGAAAGAAGGAAGAAGAAAAGAGAGTGGCTGCTGAATTGCAACTCCTTGAGGAAGAAAAAGAAAAAAGAGA
AATTTTGAAAGAAGAAGAGAAAAGAATAAAAGAATCTGAGGACTTCCTTGCAGCCTTTGAGCCACTGCAAAAGGCCCAAAGTGAGGTTGATTTACTGCAAGGAAGAGAAG
AAGAGGCCCTTGAGGGGCCAAGTGAAGAAGACCAAGAAAAAGAAGAAGAAAAAGAAGAAGTAAATGAAGGCCAAAATGCGACCGCATTTGGGCCACATTATGAAGAAGGC
AAAGAGAAGGCCAATGAAGAACAGCTAGCTGATGAAACCTTGGATCCTCTATTTGAGTATGATGTGAGAGGACCTCCACCTGCAGTTGAGAGCACCTCTTCAGGAAAGAA
GAGGGATGAAGAAGAAATTGCAAATAAGGAGGTCGAGACCTCCAGTGATTCAGAAACAGAATCTGATTCAGAGATTAAGGAATTGGATGACGACCAAATTCCTATCTCTG
CAGCATTGAGAAAAAAGAGAAGAAGAGAGATTAGGGTCGAGAGGAGGACCAAGAATAAAAATGATTCGATTTTTGCCAAGAGGCCGAGGGCAAGGTCCATGGACGCCTCT
CTTGCAGCTCCTCCAACCGTCTCACATGCCAAGCCGAAAGCCAAATCACCTAAGGCTCCATCTCCTAAAAATCCATTCCCAGAAGTCTTCAGGGATGTCAATTTTCAGGA
AAGGATGAAGATCATGAGGAAAAGAGACTTCCTAAACGAGAAGGGATTCTCTAACAGAGCTGGGACGCTGCCAGAGTTTGTAACAAGAGTTATCACACAATACAAGTGGC
AGGAGCTCTGTGCTCACCCTCAAGAGGCCGTGGTGCCTTTAGTTCGAGAATTTTACGCCGGACTGAGGGAAGAAAGCATGAGTATGGCAGTGGTGAGAGGCAAGATGGTC
AGCTTCTCTTCTGTTGACATCAACAGGGTGTTCAATGGAAAGAGTCCCAAACTAAGGTTGATGCCAACAACCCACAACAACACCATTTCAGTAGAGAGAGTCATGCTCCT
CTACAGTATTATGAAGGGGTTGGAGATAAACATCGGGAGCAATATCAGGGAGGAGATCCTTTCGTGTGGAAGGAAGAAAGCAGGGAAGCTTTTCTTTGGGTCACTTATTA
CCCTATTATGTCAGAGAGTCAAAATAGTTCCTGGAAAGGATGAAGAGTGCCACTTCTTCAAGCCTACCATCAATTTACCTCTGATTGGGAAGCTCCAACAGAACAACGCC
CAAAGAAAAGACAAAGCTTCCACATCTCAAGCCACTCCACCATCAGGGTTGAATCCGGCTTCTCCATCTCAACACACTCCTTTTTCAGGGCCCTCACCATCATCTGAAGC
CCTAGAAATTGCCTATCAACAGCTGGATCAAATCAGGAACAACCTGAGGACTTATTGGGCTTATGCCAAGGAAAGAGATGAAGTCATTAGAGAGTTTTACCTTTCTATCA
CCCCGAGTATTGCTCCTGTCTTCCCTGACTTCCCTCAATCGTTGTTGCCTCAAGAAGAAAAGGATTCTGAAGATGAAAGAGAGAGTTCCTCGGATGAGGAATAG
Protein sequenceShow/hide protein sequence
MKNTPKSSSSRKITQSQSNQPAQGAEVSARRQEEQPEVTMHDTRGTRPTEFEFTKKVHTHRFPPQNPKPASQQYNKHSREWFKMIREMRTQRCAALEEEARRRDEEEATK
ARESSQHGETLTGKSSNPKTNPSSSCRDRPFVTYSARKKESKKAAPEKPLVIEPLKVARMHPNVFEDMIHQVVAQALIIAEGYRAEQDALREIRAEREMENQSMREEDEF
ARRRDLEEEKEAERKKEEEKRVAAELQLLEEEKEKREILKEEEKRIKESEDFLAAFEPLQKAQSEVDLLQGREEEALEGPSEEDQEKEEEKEEVNEGQNATAFGPHYEEG
KEKANEEQLADETLDPLFEYDVRGPPPAVESTSSGKKRDEEEIANKEVETSSDSETESDSEIKELDDDQIPISAALRKKRRREIRVERRTKNKNDSIFAKRPRARSMDAS
LAAPPTVSHAKPKAKSPKAPSPKNPFPEVFRDVNFQERMKIMRKRDFLNEKGFSNRAGTLPEFVTRVITQYKWQELCAHPQEAVVPLVREFYAGLREESMSMAVVRGKMV
SFSSVDINRVFNGKSPKLRLMPTTHNNTISVERVMLLYSIMKGLEINIGSNIREEILSCGRKKAGKLFFGSLITLLCQRVKIVPGKDEECHFFKPTINLPLIGKLQQNNA
QRKDKASTSQATPPSGLNPASPSQHTPFSGPSPSSEALEIAYQQLDQIRNNLRTYWAYAKERDEVIREFYLSITPSIAPVFPDFPQSLLPQEEKDSEDERESSSDEE