; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020595 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020595
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold9:15113883..15120377
RNA-Seq ExpressionSpg020595
SyntenySpg020595
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]1.3e-1024.84Show/hide
Query:  FVNNIAKAKYLEMLKRDFLFERGFGDDLPHFLRVG------ITNHGWEQFCGKSEPDNSNIVREFYGNIDEEEVFQAIVRGVAVDWSPGAINSLFNL---
        FV+  AK  Y  +  R   FE GF         +G      +T H W++F     P N+ IV+EFY NI E      +VRG+++ ++P AIN  F L   
Subjt:  FVNNIAKAKYLEMLKRDFLFERGFGDDLPHFLRVG------ITNHGWEQFCGKSEPDNSNIVREFYGNIDEEEVFQAIVRGVAVDWSPGAINSLFNL---

Query:  --------QDFPHAGYN---EMVATP----SNEQL------------------------------NAAVS---------------IDADKVIDNEIHTCW
                Q+  H  Y    E +  P    + +QL                              N  VS               ID  K+I    H C 
Subjt:  --------QDFPHAGYN---EMVATP----SNEQL------------------------------NAAVS---------------IDADKVIDNEIHTCW

Query:  RKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARL-------QQTQEARQGGLVCDIHLILEQLQLSASRQVFAERQAQ------TYWTY
        +++   L FPN IT LC +  V     D IL     ++ + +  L        +  EA    +    H+      L  + Q   +   Q       Y+ Y
Subjt:  RKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARL-------QQTQEARQGGLVCDIHLILEQLQLSASRQVFAERQAQ------TYWTY

Query:  VKRRDATLRKALQSNFSK
         KRRDA L  AL  +  +
Subjt:  VKRRDATLRKALQSNFSK

KAE8725369.1 hypothetical protein F3Y22_tig00008957pilonHSYRG00158 [Hibiscus syriacus]4.9e-1025Show/hide
Query:  FLFERGFGDDLPHFLRVGITNHGWEQFCGKSEPDNSNIVREFYGNIDEEEVFQAIVRGVAVDWSPGAINSLFNLQ-------DFPHAGYNE---------
        F+F      DL   +   +T H W+ F       N+ IV+EFY NI E   +  +V G+++ ++  AIN  F LQ        F     NE         
Subjt:  FLFERGFGDDLPHFLRVGITNHGWEQFCGKSEPDNSNIVREFYGNIDEEEVFQAIVRGVAVDWSPGAINSLFNLQ-------DFPHAGYNE---------

Query:  -MVATPSNEQLNAAVSIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVIL------------LEKGIIDTSNMARLQQTQEARQGGLV
         +  T  N Q  +  ++D ++++  ++H C ++    L FPN I  LC +  VP    D +L            L  G  +     ++  T         
Subjt:  -MVATPSNEQLNAAVSIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVIL------------LEKGIIDTSNMARLQQTQEARQGGLV

Query:  CDIHLILEQLQLSASRQV-FAERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNP
              LEQ+       V     +  TY+ Y K RDA L  AL            +FP+ L+ P
Subjt:  CDIHLILEQLQLSASRQV-FAERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNP

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]3.1e-2026.56Show/hide
Query:  KGVAKAAAEEEIEEQRMQYKRFVNNIAKAKYLEMLKRDFLFERGFGDD-------LPHFLRVGITNHGWEQFCGKSEPDNSNIVREFYGNIDEEEVFQAI
        K VA+ A +    E      R+ NNI          R    E+GF  D       LP   +V IT H W+QFC   E     +VREFY N+ + E     
Subjt:  KGVAKAAAEEEIEEQRMQYKRFVNNIAKAKYLEMLKRDFLFERGFGDD-------LPHFLRVGITNHGWEQFCGKSEPDNSNIVREFYGNIDEEEVFQAI

Query:  VRGVAVDWSPGAINSLFNLQDFPHAGYNEMVATPSNEQLNAAV---------------------------------------------------------
        VRGV V WS  AIN++F L D P   ++E +   + + L   +                                                         
Subjt:  VRGVAVDWSPGAINSLFNLQDFPHAGYNEMVATPSNEQLNAAV---------------------------------------------------------

Query:  -------SIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ---TQEARQ---------------GGLVCD
               SI+  ++I +EI  C  +K G LFFP+ IT LC  A  P    +  L   G ID   +AR+ Q   T+  +Q               G ++  
Subjt:  -------SIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ---TQEARQ---------------GGLVCD

Query:  IHLILEQLQLSASRQV-------FAERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEEDD
        +  + ++L     +Q           +Q Q +W Y K RD  L+KALQ+NF++P   FP FP ++L        ++ E E E D
Subjt:  IHLILEQLQLSASRQV-------FAERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEEDD

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]4.4e-1132.77Show/hide
Query:  SIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ------TQE-----------ARQGGLVCDIHLILEQL
        SI+  ++I +EI  C  +K G LFFP+ IT LC  A  P    +  L   G ID   +AR+ Q      TQ+           +R  G +      LEQ 
Subjt:  SIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ------TQE-----------ARQGGLVCDIHLILEQL

Query:  QLSASRQVF--------AERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEEDD
              Q +          +Q Q +W Y K RD  L+KALQ+NF++P   FP FP +LL        ++ E E E D
Subjt:  QLSASRQVF--------AERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEEDD

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.3e-1528.18Show/hide
Query:  IVREFYGNIDEEEVFQAIVRGVAVDWSPGAINSLFNLQD--FPHAGYNEMVATPS---------------------------------------------
        +VREFY N+ + E     VRGV V WS  AIN++F L D    H+ + E +  P                                              
Subjt:  IVREFYGNIDEEEVFQAIVRGVAVDWSPGAINSLFNLQD--FPHAGYNEMVATPS---------------------------------------------

Query:  -----------------NEQLNAAVSIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ------TQE---
                         +  LN   SI+  ++I +EI  C  +K G LFFP+ IT LC  A  P    +  L   G ID   +AR+ Q      TQ+   
Subjt:  -----------------NEQLNAAVSIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ------TQE---

Query:  -----ARQGGLVCDIHLILEQLQLSASRQVFAERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEEDD
             A       D+   L+ L+   S+Q    +Q Q +W Y K RD  L+KALQ+NF++P   FP FP ++L        ++ E E E D
Subjt:  -----ARQGGLVCDIHLILEQLQLSASRQVFAERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEEDD

TrEMBL top hitse value%identityAlignment
A0A1S2Z475 uncharacterized protein LOC101493401 isoform X39.0e-1022.75Show/hide
Query:  YKRFVNNIAKAKYLEMLK-RDFLFERGFGDD-------LPHFLRVGITNHGWEQFCGKSEPDNSNIVREFYGNIDEEEVFQAIVRGVAVDWSPGAINSLF
        + +F+N   + K+  ++K R+F  E GF  +       LP  L   I  H W+ F   S    ++IVREFY  I E +    +VRGV V ++P  +N  F
Subjt:  YKRFVNNIAKAKYLEMLK-RDFLFERGFGDD-------LPHFLRVGITNHGWEQFCGKSEPDNSNIVREFYGNIDEEEVFQAIVRGVAVDWSPGAINSLF

Query:  NLQDFPHAGYNEMV-------ATPSNEQLNAAV-----------------------------------------------------------------SI
        NL        N++V        T S+E+LN+ +                                                                 SI
Subjt:  NLQDFPHAGYNEMV-------ATPSNEQLNAAV-----------------------------------------------------------------SI

Query:  DADKVIDNEIHTC--WRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ---------------------TQEARQGGLVCDIHLI
        +  K+I +EI  C   +KK  +L FP+ I+ LC R GV    +D +++ +  I   ++ R  +                     T+E R  G   +  + 
Subjt:  DADKVIDNEIHTC--WRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ---------------------TQEARQGGLVCDIHLI

Query:  LEQLQLSASRQVFAE-------RQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEE
         E+ Q     + F         +Q + +W + K      RK  + NF K     P FPD++L P++  P  E+ + ++
Subjt:  LEQLQLSASRQVFAE-------RQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEE

A0A2P5AGA5 Uncharacterized protein (Fragment)9.0e-1026.67Show/hide
Query:  KGVAKAAAEEEIEEQRMQYKRFVNNIAKAKYLEMLKRDFLFERGFGDD-------LPHFLRVGITNHGWEQFCGKSEPDNSNIVREFYGNIDEEEVFQAI
        K VA+ A +    E      R+ NNI          R    E+GF  D       LP   +V IT H W+QFC   E     +VREFY N+ +       
Subjt:  KGVAKAAAEEEIEEQRMQYKRFVNNIAKAKYLEMLKRDFLFERGFGDD-------LPHFLRVGITNHGWEQFCGKSEPDNSNIVREFYGNIDEEEVFQAI

Query:  VRGVAVDWSPGAINSLFNLQDFPHAGYNEMVATPSNEQLNAAV---------------------------------------------------------
        VRGV V WS  AIN++F L D P   ++E +   +   L   +                                                         
Subjt:  VRGVAVDWSPGAINSLFNLQDFPHAGYNEMVATPSNEQLNAAV---------------------------------------------------------

Query:  -------SIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ
               SI+  ++I +EI  C  +K G LFFP+ IT LC  A  P    +  L   G ID   +AR+ Q
Subjt:  -------SIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ

A0A2P5BCG4 Uncharacterized protein (Fragment)1.5e-2026.56Show/hide
Query:  KGVAKAAAEEEIEEQRMQYKRFVNNIAKAKYLEMLKRDFLFERGFGDD-------LPHFLRVGITNHGWEQFCGKSEPDNSNIVREFYGNIDEEEVFQAI
        K VA+ A +    E      R+ NNI          R    E+GF  D       LP   +V IT H W+QFC   E     +VREFY N+ + E     
Subjt:  KGVAKAAAEEEIEEQRMQYKRFVNNIAKAKYLEMLKRDFLFERGFGDD-------LPHFLRVGITNHGWEQFCGKSEPDNSNIVREFYGNIDEEEVFQAI

Query:  VRGVAVDWSPGAINSLFNLQDFPHAGYNEMVATPSNEQLNAAV---------------------------------------------------------
        VRGV V WS  AIN++F L D P   ++E +   + + L   +                                                         
Subjt:  VRGVAVDWSPGAINSLFNLQDFPHAGYNEMVATPSNEQLNAAV---------------------------------------------------------

Query:  -------SIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ---TQEARQ---------------GGLVCD
               SI+  ++I +EI  C  +K G LFFP+ IT LC  A  P    +  L   G ID   +AR+ Q   T+  +Q               G ++  
Subjt:  -------SIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ---TQEARQ---------------GGLVCD

Query:  IHLILEQLQLSASRQV-------FAERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEEDD
        +  + ++L     +Q           +Q Q +W Y K RD  L+KALQ+NF++P   FP FP ++L        ++ E E E D
Subjt:  IHLILEQLQLSASRQV-------FAERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEEDD

A0A2P5CEY2 Uncharacterized protein2.1e-1132.77Show/hide
Query:  SIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ------TQE-----------ARQGGLVCDIHLILEQL
        SI+  ++I +EI  C  +K G LFFP+ IT LC  A  P    +  L   G ID   +AR+ Q      TQ+           +R  G +      LEQ 
Subjt:  SIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ------TQE-----------ARQGGLVCDIHLILEQL

Query:  QLSASRQVF--------AERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEEDD
              Q +          +Q Q +W Y K RD  L+KALQ+NF++P   FP FP +LL        ++ E E E D
Subjt:  QLSASRQVF--------AERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEEDD

A0A2P5DXM3 Uncharacterized protein6.4e-1628.18Show/hide
Query:  IVREFYGNIDEEEVFQAIVRGVAVDWSPGAINSLFNLQD--FPHAGYNEMVATPS---------------------------------------------
        +VREFY N+ + E     VRGV V WS  AIN++F L D    H+ + E +  P                                              
Subjt:  IVREFYGNIDEEEVFQAIVRGVAVDWSPGAINSLFNLQD--FPHAGYNEMVATPS---------------------------------------------

Query:  -----------------NEQLNAAVSIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ------TQE---
                         +  LN   SI+  ++I +EI  C  +K G LFFP+ IT LC  A  P    +  L   G ID   +AR+ Q      TQ+   
Subjt:  -----------------NEQLNAAVSIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSNMARLQQ------TQE---

Query:  -----ARQGGLVCDIHLILEQLQLSASRQVFAERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEEDD
             A       D+   L+ L+   S+Q    +Q Q +W Y K RD  L+KALQ+NF++P   FP FP ++L        ++ E E E D
Subjt:  -----ARQGGLVCDIHLILEQLQLSASRQVFAERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEEDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAGACAAGGGGTAGAAGAGAGATGGAAGTTGAGGAGGAGGAAGTGCCAGTGACACCGGAGGCACCGAAAAAGAAAACAAAGAGAAGAAGAACACCTGACGAAAG
AGATGCCAAATGGTTGAGAAGACAACAACAGGCTGCGGTTACAGAGATTCTGCAAAAAGTAATTGAGGATGTTGTAGAGGAAGTGGCTGGGGAAGAGCAGCCAACAGGCC
CTAAAGTAGGAAAAGGTCTGGAGCAAGGAGATCAACCAGTCGAAACTCAACAGGAAGTACAAGATAAGCGAGCACAAGATGTGTCAGAGCAAGGGGATCGTCAAGAAGTT
CAAGAACAGCAGAAGGAAAACACAGAAAAAGAGGATCAAGAGAATGAAGAGACCGAGAATAAAGCTGAAGAGGATAGAGGCAAAGGAGTTGCCAAAGCAGCAGCGGAAGA
AGAAATTGAGGAGCAGCGGATGCAATACAAACGCTTCGTCAACAACATTGCCAAAGCAAAATATTTAGAAATGCTAAAGAGGGATTTTCTTTTTGAAAGAGGATTTGGAG
ATGACCTACCACACTTTCTGCGAGTAGGGATTACTAATCACGGATGGGAGCAGTTTTGTGGTAAGTCGGAGCCAGACAACTCAAATATAGTTCGCGAGTTCTACGGGAAT
ATTGACGAAGAAGAGGTCTTCCAGGCAATTGTTCGAGGGGTCGCTGTGGATTGGAGCCCAGGTGCGATTAATTCTTTATTTAATCTCCAAGACTTCCCCCATGCCGGTTA
CAATGAGATGGTGGCAACGCCATCTAATGAGCAGCTGAATGCGGCTGTTAGTATCGATGCCGATAAGGTGATTGACAATGAGATTCATACTTGCTGGCGAAAGAAGGTGG
GCAAGCTTTTCTTTCCAAACACTATAACTATGTTATGTCATAGGGCAGGGGTGCCTACAAGTGCAGAAGATGTTATCTTATTAGAGAAGGGAATTATAGACACATCTAAC
ATGGCGAGGCTTCAGCAGACTCAAGAAGCGCGTCAAGGCGGGTTGGTGTGCGACATCCATCTGATTTTAGAACAACTCCAACTTTCAGCCAGTAGGCAGGTGTTTGCTGA
AAGGCAAGCTCAGACATACTGGACTTATGTTAAACGGAGGGATGCCACGTTAAGGAAGGCACTGCAATCGAACTTTTCAAAACCATATCAAGCCTTCCCTGTATTCCCTG
ATGATTTGTTGAACCCATGGATCCTGCCACCGCCGGTCGAAAGAGAGGAAGAAGAGGAGGATGATGTTGAAACCTTTTGCTTGAGCATTCCTTCTAGCCTGGTCATCGAT
GCGGCAAGAAGTTCTGAGGGGAGAATCCGGACCTTGCTTGCTGCCCAAGACGCACAGCGTCGCGACGCTGTGACGTCATCCTTTCTTGCCGCTCAAGTTGGTGCAGTGTC
GCGACGCTACACAGACAGCGTCGCGATTCTGGAATTCTCTTCAGAGCTTGCCAAGCGTGGTGATTATTTGTTCAATGCTGAAACAACTGTTTTGCTGCATCAATGCGTGG
TTTTACAGGATGCTCAGGTAAAGGTTGAAGGTAGTGTTGGCTTATCTGTTTTAATTGAGTTGTGCTATGATCAATGTTTTGAATTTTCAAGGGAAGAATGGATGCTTTGG
AAGTTGTTTTGCGGAAAAATTGATGCTGAGCGACTTGACGGCGCAAATTCTATGATCCAGCAAAGCACAGAGCAAAAGTGGCCACGTCCTCACCAGAGCTTTGGAAGCGA
TTCGGGGCTTAAACGAGCGAAACTGGAGCGTAAAATGACCAAACTGCCCCTGGAGCCATCGCAGCGTCGAGACGCTGCCATAAGAGGGTCGGGACGCTGCTCAAATAAGG
AAAATTCCGTTGGCGCGCAGTTGAAGCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAGACAAGGGGTAGAAGAGAGATGGAAGTTGAGGAGGAGGAAGTGCCAGTGACACCGGAGGCACCGAAAAAGAAAACAAAGAGAAGAAGAACACCTGACGAAAG
AGATGCCAAATGGTTGAGAAGACAACAACAGGCTGCGGTTACAGAGATTCTGCAAAAAGTAATTGAGGATGTTGTAGAGGAAGTGGCTGGGGAAGAGCAGCCAACAGGCC
CTAAAGTAGGAAAAGGTCTGGAGCAAGGAGATCAACCAGTCGAAACTCAACAGGAAGTACAAGATAAGCGAGCACAAGATGTGTCAGAGCAAGGGGATCGTCAAGAAGTT
CAAGAACAGCAGAAGGAAAACACAGAAAAAGAGGATCAAGAGAATGAAGAGACCGAGAATAAAGCTGAAGAGGATAGAGGCAAAGGAGTTGCCAAAGCAGCAGCGGAAGA
AGAAATTGAGGAGCAGCGGATGCAATACAAACGCTTCGTCAACAACATTGCCAAAGCAAAATATTTAGAAATGCTAAAGAGGGATTTTCTTTTTGAAAGAGGATTTGGAG
ATGACCTACCACACTTTCTGCGAGTAGGGATTACTAATCACGGATGGGAGCAGTTTTGTGGTAAGTCGGAGCCAGACAACTCAAATATAGTTCGCGAGTTCTACGGGAAT
ATTGACGAAGAAGAGGTCTTCCAGGCAATTGTTCGAGGGGTCGCTGTGGATTGGAGCCCAGGTGCGATTAATTCTTTATTTAATCTCCAAGACTTCCCCCATGCCGGTTA
CAATGAGATGGTGGCAACGCCATCTAATGAGCAGCTGAATGCGGCTGTTAGTATCGATGCCGATAAGGTGATTGACAATGAGATTCATACTTGCTGGCGAAAGAAGGTGG
GCAAGCTTTTCTTTCCAAACACTATAACTATGTTATGTCATAGGGCAGGGGTGCCTACAAGTGCAGAAGATGTTATCTTATTAGAGAAGGGAATTATAGACACATCTAAC
ATGGCGAGGCTTCAGCAGACTCAAGAAGCGCGTCAAGGCGGGTTGGTGTGCGACATCCATCTGATTTTAGAACAACTCCAACTTTCAGCCAGTAGGCAGGTGTTTGCTGA
AAGGCAAGCTCAGACATACTGGACTTATGTTAAACGGAGGGATGCCACGTTAAGGAAGGCACTGCAATCGAACTTTTCAAAACCATATCAAGCCTTCCCTGTATTCCCTG
ATGATTTGTTGAACCCATGGATCCTGCCACCGCCGGTCGAAAGAGAGGAAGAAGAGGAGGATGATGTTGAAACCTTTTGCTTGAGCATTCCTTCTAGCCTGGTCATCGAT
GCGGCAAGAAGTTCTGAGGGGAGAATCCGGACCTTGCTTGCTGCCCAAGACGCACAGCGTCGCGACGCTGTGACGTCATCCTTTCTTGCCGCTCAAGTTGGTGCAGTGTC
GCGACGCTACACAGACAGCGTCGCGATTCTGGAATTCTCTTCAGAGCTTGCCAAGCGTGGTGATTATTTGTTCAATGCTGAAACAACTGTTTTGCTGCATCAATGCGTGG
TTTTACAGGATGCTCAGGTAAAGGTTGAAGGTAGTGTTGGCTTATCTGTTTTAATTGAGTTGTGCTATGATCAATGTTTTGAATTTTCAAGGGAAGAATGGATGCTTTGG
AAGTTGTTTTGCGGAAAAATTGATGCTGAGCGACTTGACGGCGCAAATTCTATGATCCAGCAAAGCACAGAGCAAAAGTGGCCACGTCCTCACCAGAGCTTTGGAAGCGA
TTCGGGGCTTAAACGAGCGAAACTGGAGCGTAAAATGACCAAACTGCCCCTGGAGCCATCGCAGCGTCGAGACGCTGCCATAAGAGGGTCGGGACGCTGCTCAAATAAGG
AAAATTCCGTTGGCGCGCAGTTGAAGCAGTAG
Protein sequenceShow/hide protein sequence
MAKTRGRREMEVEEEEVPVTPEAPKKKTKRRRTPDERDAKWLRRQQQAAVTEILQKVIEDVVEEVAGEEQPTGPKVGKGLEQGDQPVETQQEVQDKRAQDVSEQGDRQEV
QEQQKENTEKEDQENEETENKAEEDRGKGVAKAAAEEEIEEQRMQYKRFVNNIAKAKYLEMLKRDFLFERGFGDDLPHFLRVGITNHGWEQFCGKSEPDNSNIVREFYGN
IDEEEVFQAIVRGVAVDWSPGAINSLFNLQDFPHAGYNEMVATPSNEQLNAAVSIDADKVIDNEIHTCWRKKVGKLFFPNTITMLCHRAGVPTSAEDVILLEKGIIDTSN
MARLQQTQEARQGGLVCDIHLILEQLQLSASRQVFAERQAQTYWTYVKRRDATLRKALQSNFSKPYQAFPVFPDDLLNPWILPPPVEREEEEEDDVETFCLSIPSSLVID
AARSSEGRIRTLLAAQDAQRRDAVTSSFLAAQVGAVSRRYTDSVAILEFSSELAKRGDYLFNAETTVLLHQCVVLQDAQVKVEGSVGLSVLIELCYDQCFEFSREEWMLW
KLFCGKIDAERLDGANSMIQQSTEQKWPRPHQSFGSDSGLKRAKLERKMTKLPLEPSQRRDAAIRGSGRCSNKENSVGAQLKQ