CuGenDBv2

Lag0007771 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0007771
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNase H domain-containing protein
Genome location: chr9:4509276..4510783
RNA-Seq Expression: Lag0007771
Synteny: Lag0007771
Gene Ontology terms: NA
InterPro domains: NA
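
The genome location field above packs the chromosome and both coordinates into one string. A minimal sketch of splitting it and computing the gene span follows; the helper name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr9:4509276..4510783")
# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 1508 bp.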


Homology
GenBank top hits (e value, % identity, alignment)
XP_021847414.1 uncharacterized protein LOC110787151 [Spinacia oleracea] (e value: 1.7e-37, identity: 32.48%)
Query:  WLQWGDRNSKWFHQRATQRRRHNRIEGL-ENRCGSWITEEELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTALKQM
        WL  GD+N+K+FHQRA+ R+R N I  L +++      +E   K+   YF+ +F ++N      +   LA ++P +    N  L  ++  +++   L QM
Subjt:  WLQWGDRNSKWFHQRATQRRRHNRIEGL-ENRCGSWITEEELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTALKQM

Query:  GPAKAPGRMAFRHFS---------------------------TKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRFA
         P KAPG   ++  S                             DN ++ FE  +++K   +   G+ A+KLDMSKAYDR+E  FLE  +  LG D  + 
Subjt:  GPAKAPGRMAFRHFS---------------------------TKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRFA

Query:  R------VTEADMIAKCLKAYSKLTGQEINYGKSGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWKLSNFS
        R       TEAD I + L  Y   +GQ++N+ K+ +  S  V   ++  L++ L VR+V  HDRYLGLP      K + +K +K++LW  +  WK    S
Subjt:  R------VTEADMIAKCLKAYSKLTGQEINYGKSGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWKLSNFS

Query:  AGGKEVLIKVVLQA
          G+EV+IK V Q+
Subjt:  AGGKEVLIKVVLQA

XP_024163940.1 uncharacterized protein LOC112170898 [Rosa chinensis] (e value: 7.7e-38, identity: 39.13%)
Query:  TQRRRHNRIEGLENRCGSWITEE-ELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTALKQMGPAKAPGRMAF-----
        T R++ N I+GL N  G W  E+ EL+ IV +YF  LF+S     ++S   F   I P I  E N  L +    E++  ALKQM P KA G   F     
Subjt:  TQRRRHNRIEGLENRCGSWITEE-ELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTALKQMGPAKAPGRMAF-----

Query:  RHF------STKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRFARVTEADMIAKCLKAYSKLTGQEINYGKSGLCL
        + F         DNS+L FE  + +KR+++G VG+ ALKLDMSKAYDRVE  FLE+ M  LG    +    E  ++    + + KL+GQ+INY KS +  
Subjt:  RHF------STKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRFARVTEADMIAKCLKAYSKLTGQEINYGKSGLCL

Query:  SPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA
        S NV    +  LA+ L V  V  HD+YLGLP      K  +  F+++++      WK    S  GKEVL+K V+Q+
Subjt:  SPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA

XP_024171861.1 uncharacterized protein LOC112177844 [Rosa chinensis] (e value: 5.0e-37, identity: 32.2%)
Query:  WLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWITEEE-LEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTALKQM
        WL+ GDRN+ +FH++A  R R N I GL +  G W  ++E +EK+V  YF N+FS+++    E++   LA I PC+ +  NE L      +++  AL QM
Subjt:  WLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWITEEE-LEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTALKQM

Query:  GPAKAPG-----RMAFRHF--------------------STKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFM------------
         P K+PG      + F+H+                       DN ++  E  + V  KK+G   + ALKLD+SKAYDR+E  FL K +            
Subjt:  GPAKAPG-----RMAFRHF--------------------STKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFM------------

Query:  -----TTLGLDGRFARVTEADMIAKCLKAYSKLTGQEINYGKSGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCI
             TT+ +    A + +   I   ++ Y + +GQ +N+ KS +  S N+   M++ ++S + V +V  H+RYLGLP     +K  +  +IK+ L   +
Subjt:  -----TTLGLDGRFARVTEADMIAKCLKAYSKLTGQEINYGKSGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCI

Query:  HKWKLSNFSAGGKEVLIKVVLQA
          W+    S  GK++LI+VV QA
Subjt:  HKWKLSNFSAGGKEVLIKVVLQA

XP_030505522.1 uncharacterized protein LOC115720515 [Cannabis sativa] (e value: 7.7e-38, identity: 34%)
Query:  RCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWITE-EELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEE------
        R  WLQ GD+N+K+FHQ+A  R++ N I+GL NR   W ++ E++ KI+  ++  LF++        + E L  ++P +  E N +L + F EE      
Subjt:  RCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWITE-EELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEE------

Query:  -------DMLTALKQMGPAKAPGRMAFRHFSTKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRFARVTEADMIAKC
                 L  +     A  PGR+        DN+++ +E L+ ++R   G   +AA+KLDMSKAYDRVE  F+E+ +  LG + ++      D + KC
Subjt:  -------DMLTALKQMGPAKAPGRMAFRHFSTKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRFARVTEADMIAKC

Query:  ---LKAYSKLTGQEINYGKSGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA
           ++ Y+  +GQ INY KS L  SPN    ++    S+L + +    + YLGLP V    K    + IKD++WS ++ W    FS  GKE+L+K V+Q+
Subjt:  ---LKAYSKLTGQEINYGKSGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA

XP_030924745.1 uncharacterized protein LOC115951731 [Quercus lobata] (e value: 2.0e-38, identity: 30.72%)
Query:  RCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWI-TEEELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTAL
        R  WL+ GDRN+ +FH RATQR + N I GLE+  G W+  EE+L ++VE YFQN+F+S+N    E   E LA +QP I  E + SL R +  E++L AL
Subjt:  RCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWI-TEEELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTAL

Query:  KQMGPAKAPG----------------------------------------------------RMA-FRHFS-----------------------------
        KQM P  APG                                                    R+A FR  S                             
Subjt:  KQMGPAKAPG----------------------------------------------------RMA-FRHFS-----------------------------

Query:  --------TKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRF-----------------------------------
                  DN ++ FE L+++KRK QG +G+ ALKLDMSKAYD+VE  FL K M  LG   R                                    
Subjt:  --------TKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRF-----------------------------------

Query:  ------------------------ARVTEADMIAKCLKAYSKLTGQEINYGKSGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSK
                                ARV E   I   L  Y K +GQ+IN  K+ +  S N    M+  +   L V  +  +++YLGLPA+    K  S  
Subjt:  ------------------------ARVTEADMIAKCLKAYSKLTGQEINYGKSGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSK

Query:  FIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA
        +IK+R+W  +  WK    S  G+EVLIK V+QA
Subjt:  FIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA

TrEMBL top hits (e value, % identity, alignment)
A0A2N9ESC9 Uncharacterized protein (e value: 1.5e-39, identity: 32.56%)
Query:  RCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWITEEELEKIVET-YFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTAL
        R  WL+ GDRN+K+FH RA+ RRR N I  +    G  I +  L     T Y+Q LF++      E +   L  +QPC+  E N+ L   F E+++++A+
Subjt:  RCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWITEEELEKIVET-YFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTAL

Query:  KQMGPAKAPGRMAFRHFSTKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRF-------------------------
        KQMGP KAPG          DN ++ FE L H+  K+ G VG  ALKLDMSKAYDRVE  FL++ M  +G   ++                         
Subjt:  KQMGPAKAPGRMAFRHFSTKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRF-------------------------

Query:  -----------------------------------ARVTEADMIAKCLKAYSKLTGQEINYGKSGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPA
                                           A + E + I + LK Y K +GQ++N  K+ L  S N     ++ +   L V  +  +++YLGLP+
Subjt:  -----------------------------------ARVTEADMIAKCLKAYSKLTGQEINYGKSGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPA

Query:  VFPGRKAISSKFIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA
        +    K      IK+R+WS +  WK    S  G+EVLIK V+QA
Subjt:  VFPGRKAISSKFIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA

A0A2N9GBI0 FAD-binding PCMH-type domain-containing protein (e value: 2.8e-41, identity: 33.86%)
Query:  RCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWIT-EEELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTAL
        R  WLQ GDRN+++FH +AT R+R N I+G+ +  G+W   E+++E+ + +Y+++LF+S+N    E   E L  +   +  E N  L R F   ++  AL
Subjt:  RCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWIT-EEELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTAL

Query:  KQMGPAKAPG--------RMAFRHFSTK------------DNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGL--------D
         QMGP KAPG        ++   H  ++            DN ++ FE L+H+ + +QG  G+ ALKLDMSKAYDRVE  FLEK     G+        D
Subjt:  KQMGPAKAPG--------RMAFRHFSTK------------DNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGL--------D

Query:  GRF-------ARVTEADMIAKCLKAYSKLTGQEINYGKSGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWK
        G +       A + E +++ + L+ Y + +GQ+IN  K+ L  S +     ++ +   L +  +  ++ YLGLP++    K  S   +K+R+W  +  WK
Subjt:  GRF-------ARVTEADMIAKCLKAYSKLTGQEINYGKSGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWK

Query:  LSNFSAGGKEVLIKVVLQA
            +  GKE+LIK V+QA
Subjt:  LSNFSAGGKEVLIKVVLQA

A0A2N9HS90 RNase H domain-containing protein (e value: 3.4e-39, identity: 32.59%)
Query:  RCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWITEEEL--EKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTA
        R  WL+ GD N+K+FH RA+ R+R N I  L    G  +T+E L   + +E Y+Q LF++      E +   L  IQPC+ +E N+SL   F EE++  A
Subjt:  RCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWITEEEL--EKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTA

Query:  LKQMGPAKAPG--------RMAFRHFSTK-------------------------DNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFM
        +KQMGP KAPG          ++ H   K                         DN ++ FE L+H+  K+ G VG  ALKLDMSKAYDRVE  F+EK M
Subjt:  LKQMGPAKAPG--------RMAFRHFSTK-------------------------DNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFM

Query:  TTLGLDGRF-----------------------------------------ARVTEADMIAKCLKAYSKLTGQEINYGKSGLCLSPNVEMQMKKALASTLN
          +G   ++                                         A + E   I + L  Y K +GQ++N  K+ L  S N    +++ +   L 
Subjt:  TTLGLDGRF-----------------------------------------ARVTEADMIAKCLKAYSKLTGQEINYGKSGLCLSPNVEMQMKKALASTLN

Query:  VRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA
        V  +  +++YLGLP++    K      IK+R+WS +  WK    S  G+EVLIK V+QA
Subjt:  VRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA

A0A2N9IFR8 RNase H domain-containing protein (e value: 4.1e-37, identity: 36.12%)
Query:  RCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSW-ITEEELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTAL
        R  WLQ GDRN+++FH +ATQRRR N I+ L +  G W  +E E+E+++ +Y+ +LF++++    E   E +  +   I  E N+ L   F   ++  AL
Subjt:  RCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSW-ITEEELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTAL

Query:  KQMGPAKAPGRMAFRHFSTKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRFA--------RVTEADMIAKCLKAY-
        KQM P KAPG    R  S  DN ++ FE L+H++  K G +G+ ALKLDMSKAYDRVE  FLEK M  +G   R+          V+ + +I + L+A  
Subjt:  KQMGPAKAPGRMAFRHFSTKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRFA--------RVTEADMIAKCLKAY-

Query:  ------SKLTGQEINYGKSGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA
               K+ G  I+ G   L      +  +    A+  +   +  +D+YLGLP++    K  +   +K+R+W+ +  WK    S  G+EVLIK V+QA
Subjt:  ------SKLTGQEINYGKSGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA

A0A803QCP7 Uncharacterized protein (e value: 5.7e-39, identity: 31.64%)
Query:  RCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWIT-EEELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTAL
        R  WL+ GD+N+K FH++A+ R+  N I+GL +   +W+T    + K+   YF+NLF+S N+   E L EF   +  CI R  NE L   F  ED+   +
Subjt:  RCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWIT-EEELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIEEDMLTAL

Query:  KQMGPAKAPGRMAF------RHFST---------------------------------------------------------------------------
        + + P KAPG          +H+ST                                                                           
Subjt:  KQMGPAKAPGRMAF------RHFST---------------------------------------------------------------------------

Query:  --------KDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRFARVTEADMIAKCLKAYSKLTGQEINYGKSGLCLSPN
                +DN+I+ FE L+ +K K+ G     ALKLDMSKAYDRVE  FL   M  LG     A+    D +   L+ YS+L GQ+IN  KS +C+   
Subjt:  --------KDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRFARVTEADMIAKCLKAYSKLTGQEINYGKSGLCLSPN

Query:  VEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA
        +       LA+ L VRLV  H +YLGLP+    RK    + IKD++W+ +  WK S FS  G+E+LIK ++QA
Subjt:  VEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 8.3e-06, identity: 39.44%)
Query:  MGPAKA---PGRMAFRHFSTKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLG
        +GPA+A   PGR+      + DN +   E + H  R+K+G  GW  LKLD+ KAYDR+   +LE  + + G
Subjt:  MGPAKA---PGRMAFRHFSTKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLG
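
The % identity values listed for each hit can be cross-checked against the alignment itself. In BLAST-style protein output, letters on the middle line mark identical residues and `+` marks conservative substitutions, and identity is the number of identical columns divided by the alignment length, gap columns included. The sketch below applies that to the Arabidopsis hit above; the function name `percent_identity` is hypothetical and the two strings are copied from the alignment.

```python
def percent_identity(query_aln: str, midline: str) -> float:
    """Identity = identical columns / alignment length, as a percentage."""
    # Letters on the midline are identical residues; '+' and spaces are not.
    identities = sum(ch.isalpha() for ch in midline)
    # The aligned query line (gaps included) gives the alignment length.
    return round(100.0 * identities / len(query_aln), 2)

query_aln = "MGPAKA---PGRMAFRHFSTKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLG"
midline = "+GPA+A   PGR+      + DN +   E + H  R+K+G  GW  LKLD+ KAYDR+   +LE  + + G"

print(percent_identity(query_aln, midline))
```

For this hit the midline holds 28 identical residues over 71 alignment columns, which agrees with the 39.44 listed for AT4G20520.1.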


Sequences
CDS sequence
ATGAAAGCGAATATCCTTGGCCCGGCAGCAAGTGCAAACGGCCCTGTCCAATGGAGATGTGATTGGTTGCAGTGGGGTGATAGGAATTCAAAGTGGTTCCATCAACGGGC
AACTCAGAGAAGAAGGCATAACAGAATAGAGGGGCTGGAAAATCGTTGTGGTAGTTGGATAACAGAGGAGGAGTTGGAGAAAATAGTAGAAACTTACTTCCAGAATCTCT
TCTCATCGAACAACCAACAGGGTAGTGAGAGCCTTCGAGAATTCTTGGCGCATATACAGCCGTGCATTGGTAGAGAGGAGAATGAGTCTCTTGGGAGGTCTTTTATAGAG
GAGGATATGCTTACTGCTCTAAAACAGATGGGTCCTGCTAAGGCACCGGGGAGGATGGCCTTCCGGCACTTTTCTACCAAAGACAATTCCATACTTGGTTTTGAATGCTT
GAACCATGTCAAAAGGAAGAAACAAGGAACAGTAGGGTGGGCGGCCTTGAAGCTCGATATGAGCAAAGCTTATGATAGGGTGGAGCGGTTCTTTTTAGAGAAGTTCATGA
CTACCCTAGGTTTGGATGGTAGGTTTGCAAGAGTGACAGAAGCTGATATGATTGCTAAATGTTTGAAAGCCTATTCAAAGCTAACAGGTCAGGAAATCAACTATGGAAAG
TCTGGGCTTTGTTTGAGTCCTAATGTGGAAATGCAAATGAAAAAGGCTCTAGCTTCAACTTTGAATGTGCGTCTGGTTGGCTTCCATGACCGTTATCTAGGCCTTCCAGC
AGTTTTCCCAGGTCGTAAGGCTATATCATCGAAGTTTATAAAAGATCGATTGTGGTCTTGTATACACAAGTGGAAGCTTTCAAACTTTTCAGCAGGAGGGAAGGAGGTTC
TTATAAAGGTTGTGTTGCAGGCTTGA
mRNA sequence
ATGAAAGCGAATATCCTTGGCCCGGCAGCAAGTGCAAACGGCCCTGTCCAATGGAGATGTGATTGGTTGCAGTGGGGTGATAGGAATTCAAAGTGGTTCCATCAACGGGC
AACTCAGAGAAGAAGGCATAACAGAATAGAGGGGCTGGAAAATCGTTGTGGTAGTTGGATAACAGAGGAGGAGTTGGAGAAAATAGTAGAAACTTACTTCCAGAATCTCT
TCTCATCGAACAACCAACAGGGTAGTGAGAGCCTTCGAGAATTCTTGGCGCATATACAGCCGTGCATTGGTAGAGAGGAGAATGAGTCTCTTGGGAGGTCTTTTATAGAG
GAGGATATGCTTACTGCTCTAAAACAGATGGGTCCTGCTAAGGCACCGGGGAGGATGGCCTTCCGGCACTTTTCTACCAAAGACAATTCCATACTTGGTTTTGAATGCTT
GAACCATGTCAAAAGGAAGAAACAAGGAACAGTAGGGTGGGCGGCCTTGAAGCTCGATATGAGCAAAGCTTATGATAGGGTGGAGCGGTTCTTTTTAGAGAAGTTCATGA
CTACCCTAGGTTTGGATGGTAGGTTTGCAAGAGTGACAGAAGCTGATATGATTGCTAAATGTTTGAAAGCCTATTCAAAGCTAACAGGTCAGGAAATCAACTATGGAAAG
TCTGGGCTTTGTTTGAGTCCTAATGTGGAAATGCAAATGAAAAAGGCTCTAGCTTCAACTTTGAATGTGCGTCTGGTTGGCTTCCATGACCGTTATCTAGGCCTTCCAGC
AGTTTTCCCAGGTCGTAAGGCTATATCATCGAAGTTTATAAAAGATCGATTGTGGTCTTGTATACACAAGTGGAAGCTTTCAAACTTTTCAGCAGGAGGGAAGGAGGTTC
TTATAAAGGTTGTGTTGCAGGCTTGA
Protein sequence
MKANILGPAASANGPVQWRCDWLQWGDRNSKWFHQRATQRRRHNRIEGLENRCGSWITEEELEKIVETYFQNLFSSNNQQGSESLREFLAHIQPCIGREENESLGRSFIE
EDMLTALKQMGPAKAPGRMAFRHFSTKDNSILGFECLNHVKRKKQGTVGWAALKLDMSKAYDRVERFFLEKFMTTLGLDGRFARVTEADMIAKCLKAYSKLTGQEINYGK
SGLCLSPNVEMQMKKALASTLNVRLVGFHDRYLGLPAVFPGRKAISSKFIKDRLWSCIHKWKLSNFSAGGKEVLIKVVLQA
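
As a consistency check, the CDS listed above can be translated and compared with the protein sequence. The following is a minimal sketch that assumes the standard genetic code (the record does not name a translation table); `translate` is a hypothetical helper, and only the first 54 and last 24 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed; the record does not name a table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 54 nt and last 24 nt of the CDS shown above, both in frame.
cds_start = "ATGAAAGCGAATATCCTTGGCCCGGCAGCAAGTGCAAACGGCCCTGTCCAATGG"
cds_end = "ATAAAGGTTGTGTTGCAGGCTTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MKANILGPAASANGPVQW` and `IKVVLQA`, which match the start and end of the protein sequence listed above.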