; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017651 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017651
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr5:6399920..6402081
RNA-Seq ExpressionLag0017651
SyntenyLag0017651
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4351405.1 hypothetical protein F8388_001025, partial [Cannabis sativa]1.2e-6728.12Show/hide
Query:  DLVSRLNS-WRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIES-AGENLFAIQFWSRGEKVRVMSTGPWAFD
        +LVSRL    RL D +  ++ L +        SL+L +VGKV   K  N +  ++ M+  W++H   ++E  + +N+F   F SR ++ RV   GPW FD
Subjt:  DLVSRLNS-WRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIES-AGENLFAIQFWSRGEKVRVMSTGPWAFD

Query:  RSLVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKI----------GTMK-----EVGGPREGGASVYRMRRARSFGVRFCMNDYQI
        + L+   +P   G +  + F+  +FW+ I+NVP+   T   AR    +I          GTMK      +  P + G  V    +     + F       
Subjt:  RSLVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKI----------GTMK-----EVGGPREGGASVYRMRRARSFGVRFCMNDYQI

Query:  FCDECGCLGHSRRECGAVGEGALEQQCE--QYGDWM-----------------------RAVVGFGEVGR------------------------------
        FC +CG +GH   +C     G    + +  +YG WM                       R ++  GE  R                              
Subjt:  FCDECGCLGHSRRECGAVGEGALEQQCE--QYGDWM-----------------------RAVVGFGEVGR------------------------------

Query:  QTQEG--GVDN---------AMNRKPEAPKPSTEGHVIHDSVP--EEAGK-----EIAVSRGI-------KGRGSQEGGSVGVVARGEGVADRGKGKQKV
        + +EG  GVD          +++ +P   + S  G  +  +     E GK     ++ V  G        KG+ + EGG   ++     ++D GKGK+ +
Subjt:  QTQEG--GVDN---------AMNRKPEAPKPSTEGHVIHDSVP--EEAGK-----EIAVSRGI-------KGRGSQEGGSVGVVARGEGVADRGKGKQKV

Query:  ESITAESGVSE------GGDDVMLVD--------SVGQGKEVEAIGKGTNSRSWKRMARDILSDVTNRDVS-----AGSVIGKRKAGSVEGVELGGK---
        E   +   VS+         DV +V+        SV +GK V  I       S+ +  +D    +     +       S  G+   G    + +  K   
Subjt:  ESITAESGVSE------GGDDVMLVD--------SVGQGKEVEAIGKGTNSRSWKRMARDILSDVTNRDVS-----AGSVIGKRKAGSVEGVELGGK---

Query:  KLRGSCVRCIGSDCVGGGWFPTPLGI------------MSLIFWNVRGSGSPRTFRRLTKLVKEKRPQVLFLSETKVSASRMSSAKRLLGFDNCFCVDCH
         L  + +  +     GGG     + +             S +     G G+P     L  +V++  P ++FLSETK+        +R + F N F VDC 
Subjt:  KLRGSCVRCIGSDCVGGGWFPTPLGI------------MSLIFWNVRGSGSPRTFRRLTKLVKEKRPQVLFLSETKVSASRMSSAKRLLGFDNCFCVDCH

Query:  GRSGGLALMWVSSVSFSLLSFSKNHIDGWILWDGCR-WQLTGFYGFPLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAILCQDEKDGGRDKLLSELAAFQ
        G+SGGL L+W      S+ SFS  HID  +   G   W+ TGFYG P       SW LL RLKG  D PW+  GDFN IL  +EK GG D+ LS ++ FQ
Subjt:  GRSGGLALMWVSSVSFSLLSFSKNHIDGWILWDGCR-WQLTGFYGFPLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAILCQDEKDGGRDKLLSELAAFQ

Query:  SVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPMELILNPPPQCWSQSDQRIA-RFEETWLR
          +D C L+D+GF G  FTW N+R G   + ERLD  F    W +L+P+  V + D+  SDHRP+  IL     C  Q D++ + RFE  WL+
Subjt:  SVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPMELILNPPPQCWSQSDQRIA-RFEETWLR

MCH80348.1 hypothetical protein [Trifolium medium]4.6e-6728.1Show/hide
Query:  MEDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFD
        ME  VS+    R  D+E E + + +        S    +VGK+++    N+ AF+  M  AWR      I+   +NL+  +F ++ E   V   GPW+FD
Subjt:  MEDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFD

Query:  RSLVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREGGASVY-RMR-----------------RARSFGVRFCMND
        R+L+IL          ++  +  +FWV+++++P++ ++  +A+ L   +GT +E+          + RMR                 + +   V +    
Subjt:  RSLVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREGGASVY-RMR-----------------RARSFGVRFCMND

Query:  YQIFCDECGCLGHSRRECGAVGE------GALEQQCEQYGDWMRAVVGFGEVGRQTQEGGVDNAMNRKPEAPKPSTEGHVIHDS-VPEEAGKEIAVSRGI
           FC  CG +GH  R+C  + +        LE++ + +G W+RA      + + + E   +++ +   ++  PST      +S   +E  +E+   R  
Subjt:  YQIFCDECGCLGHSRRECGAVGE------GALEQQCEQYGDWMRAVVGFGEVGRQTQEGGVDNAMNRKPEAPKPSTEGHVIHDS-VPEEAGKEIAVSRGI

Query:  KGRGSQEGGSVGVVAR-GEGVADRGKGKQKVESITAESGVSEGGDDVMLVDSVGQGKEVEAIGKGTNSRSWKRMARDILSDVTNRDVSAGSVIGKRKA--
          + SQ+  + G +++  EG   + +  QK   + AE   S G   +     +G+G +    GKG   R W R  + +      ++      IGKR    
Subjt:  KGRGSQEGGSVGVVAR-GEGVADRGKGKQKVESITAESGVSEGGDDVMLVDSVGQGKEVEAIGKGTNSRSWKRMARDILSDVTNRDVSAGSVIGKRKA--

Query:  -----GSVEGVELGGKKLRGSCVRCIGSDCVGGGWFPTPLGIMSLIFWNVRGSGSPRTFRRLTKLVKEKRPQVLFLSETKVSASRMSSAKRLLGFDNCFC
             G++E ++ G KK+RG  V     DCV                     +GSPR  R L +L + + PQ++FL ET++  + + + +  LGF NC  
Subjt:  -----GSVEGVELGGKKLRGSCVRCIGSDCVGGGWFPTPLGIMSLIFWNVRGSGSPRTFRRLTKLVKEKRPQVLFLSETKVSASRMSSAKRLLGFDNCFC

Query:  VDCHG----RSGGLALMWVSSVSFSLLSFSKNHIDGWILWD--GCRWQLTGFYGFPLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAILCQDEKDGGRDK
        VDC+G    R+GGLALMW+  +S ++ SFS NHI G    +  G  W LTG YG+P      ++W L+  L       WL  GDFN IL   EK GG  +
Subjt:  VDCHG----RSGGLALMWVSSVSFSLLSFSKNHIDGWILWD--GCRWQLTGFYGFPLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAILCQDEKDGGRDK

Query:  LLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPMELILNPPPQCWSQSDQRIARFEETWL
          ++ +  +  V    L+DLGF G  FTW N R  GE +  RLD A     + + +    VNHL    SDH  + + L  P    ++  +R+ RFEE+W 
Subjt:  LLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPMELILNPPPQCWSQSDQRIARFEETWL

Query:  R
        +
Subjt:  R

TXG63812.1 hypothetical protein EZV62_010806 [Acer yangbiense]4.1e-6830.41Show/hide
Query:  DLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFDRS
        DL     +  + DE+ E+ ++ +    + +  +E C+VGKV + K VN EAF++V+   W       IE  GEN+F   F +  ++ R+   GPW FDRS
Subjt:  DLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFDRS

Query:  LVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREGGASVYRMRRARSFGVRFCMNDYQIFCDECGCLGHSRRECGA
        L++L +P   G +  + FN   FWVQIH++P+    +   + L  +IGT+ E+  P E      R  R ++   RF +         CG +GH   EC  
Subjt:  LVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREGGASVYRMRRARSFGVRFCMNDYQIFCDECGCLGHSRRECGA

Query:  V--GEGALEQQCEQYGDWMRAVV-----------GFG----------------EVGRQTQEGGVDNAMNRKPE-----APKPSTEGHVIHDSVPEEAGKE
        V   + ALE    +YG W++A             G G                 +G   +E  +    N  PE     A + + E H    S  E AGKE
Subjt:  V--GEGALEQQCEQYGDWMRAVV-----------GFG----------------EVGRQTQEGGVDNAMNRKPE-----APKPSTEGHVIHDSVPEEAGKE

Query:  IAVSRGIKGRGSQEGGSVGVVARGEGVADRGKGKQKVESIT-------AESGVSEGGDDVMLVDSVGQGKEVEA-IGKGTNSRSWKRMARDI----LSDV
              + G  +   G V   A+G   +  G G+ K+  ++             E   D   V  + Q  + E   G     + WKR AR++     SD+
Subjt:  IAVSRGIKGRGSQEGGSVGVVARGEGVADRGKGKQKVESIT-------AESGVSEGGDDVMLVDSVGQGKEVEA-IGKGTNSRSWKRMARDI----LSDV

Query:  --------------------------TNRDVSAGSVIGKRKAGS-------------VEGVELGGKKLRGSCVRCIGSDCVGGGWFPTPLGIMSLIFWNV
                                  T   +S  +   KRKAGS             + G E   K+ +GS  R +     G G    P   M  + WNV
Subjt:  --------------------------TNRDVSAGSVIGKRKAGS-------------VEGVELGGKKLRGSCVRCIGSDCVGGGWFPTPLGIMSLIFWNV

Query:  RGSGSPRTFRRLTKLVKEKRPQVLFLSETKVSASRMSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWI-LWDGCRWQLTGFYGF
        RG G+P T   L KLVK+  P ++FLSETK+  +R    +  LG+   F VD  G SGGL L+W    + S+ SFS  HID  I + DG  W+ +G YG 
Subjt:  RGSGSPRTFRRLTKLVKEKRPQVLFLSETKVSASRMSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWI-LWDGCRWQLTGFYGF

Query:  PLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAILCQDEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETI
        P  +  P  W+L+ RL+     PW+  GDFN +L   EK GG  K + ++  F+ VV+ C L+DLGF G  FTW N+R G + +
Subjt:  PLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAILCQDEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETI

XP_018826186.1 uncharacterized protein LOC108995141 [Juglans regia]1.6e-6728.32Show/hide
Query:  MEDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFD
        M+ L  +    RL ++E   + L E T+   +   +  V+GK+FSS+ ++ E   S M   W++  A +      N FAI F +  +K+RV    PW FD
Subjt:  MEDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFD

Query:  RSLVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREGGA---------------SVYRMR----RARSFGVRFCMN
          ++++KE      +  V F+   FW++IHN+P+   T+     + G +G ++ V    +G                  V R R    R   F   F   
Subjt:  RSLVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREGGA---------------SVYRMR----RARSFGVRFCMN

Query:  DYQIFCDECGCLGHSRRECGAVGEGALEQQCEQY------GDWMRAVVGFGEVGRQTQEGGVDNAMNRKPEAPKPSTEGHVIHDSVPEEAGKEIAVSRGI
             C  CGC+ H    C   G+G  E+  +++       + +    G  E G+       D    RK E  +  +EG           GK++    G 
Subjt:  DYQIFCDECGCLGHSRRECGAVGEGALEQQCEQY------GDWMRAVVGFGEVGRQTQEGGVDNAMNRKPEAPKPSTEGHVIHDSVPEEAGKEIAVSRGI

Query:  KGRGSQEGGSVGVVARGEGVADRGKGKQKVESITAESGVSEGGDDVMLVDSVGQGKEVEAIGKG--------TNSRSWKRMARDILSDVTNRDVSAGSVI
              +    G++ RGE   D+   +++ E++           D  L ++ GQ  EV + GKG             WKR AR                 
Subjt:  KGRGSQEGGSVGVVARGEGVADRGKGKQKVESITAESGVSEGGDDVMLVDSVGQGKEVEAIGKG--------TNSRSWKRMARDILSDVTNRDVSAGSVI

Query:  GKRKAGSVEGVELGGKKLRGSCVRCIGSDCVGGGWFPTPLGIMSLIFWNVRGSGSPRTFRRLTKLVKEKRPQVLFLSETKVSASRMSSAKRLLGFDNCFC
                      GK+        + SD      +P P  IM ++ WN RG G+PRT + L  +V++ +P ++F+ ETK       S +R L  + CF 
Subjt:  GKRKAGSVEGVELGGKKLRGSCVRCIGSDCVGGGWFPTPLGIMSLIFWNVRGSGSPRTFRRLTKLVKEKRPQVLFLSETKVSASRMSSAKRLLGFDNCFC

Query:  VDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWILWDG-CRWQLTGFYGFPLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAILCQDEKDGGRDKLLSEL
        V+  G+ GGL L+W S V   ++++S++HI+ WI  +G  +W LT FYG P  +   +SWSLL+ LK  A+  W I GDFN IL  DEK GGR +   ++
Subjt:  VDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWILWDG-CRWQLTGFYGFPLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAILCQDEKDGGRDKLLSEL

Query:  AAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPMEL-ILNPPPQCWSQSDQRIARFE
          F+ V++   L DLG+ GD+FTW N         ERLD A +   W ++Y    V  +  S SDH+P+ L +L    + W  + QR  ++E
Subjt:  AAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPMEL-ILNPPPQCWSQSDQRIARFE

XP_042990668.1 uncharacterized protein LOC122317666 [Carya illinoinensis]9.2e-6828.57Show/hide
Query:  EDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFDR
        ED+ +R  S +LT+EE + + L E   + S    ELC++ K+F+ ++ N EAF++ M   W   G    +    N++ I+F    +K +V+   PW+FDR
Subjt:  EDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFDR

Query:  SLVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREG-GASVYRMRRA------------------RSFGVRFCMND
         LV +KE     S+ +V F+   FW+Q+HN+P    ++EM   +   IG + EV    EG G   Y   +A                  +   + F    
Subjt:  SLVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREG-GASVYRMRRA------------------RSFGVRFCMND

Query:  YQIFCDECGCLGHSRRECGAVGEGALEQQCEQYGDWMRAVVGFGEVGRQTQEGGVDNAMNRKPEAPKPSTEGHVIHDSVPEEAGKEIAVSRGIKGRGSQE
          +FC +CG   H +  C     GA     +QYG W+RA     +     + GG+      K + P  ST                     G +  G   
Subjt:  YQIFCDECGCLGHSRRECGAVGEGALEQQCEQYGDWMRAVVGFGEVGRQTQEGGVDNAMNRKPEAPKPSTEGHVIHDSVPEEAGKEIAVSRGIKGRGSQE

Query:  GGSVGVVARGEGVADRGKGKQKVESITAESGVSEGG-----DDVMLVDSVGQGKEVEAIGKGTNSRSW-KRMARD-------IL---SDVTNRDV-----
         G  GV     G ++     +  E++    G +EG          + D+    K  +         SW +R+++D       +L   S  ++ DV     
Subjt:  GGSVGVVARGEGVADRGKGKQKVESITAESGVSEGG-----DDVMLVDSVGQGKEVEAIGKGTNSRSW-KRMARD-------IL---SDVTNRDV-----

Query:  -SAGSVIGKRKAGSVEGVELGGKKLRGSCVRCIGSDCVGGGW------FPTPLGIMSLIF----------------WNVRGSGSPR---------TFRRL
         + G    +++ G V   E   + L  S      S   G  W       PTPL  ++ I                 ++ R  G  +         + + L
Subjt:  -SAGSVIGKRKAGSVEGVELGGKKLRGSCVRCIGSDCVGGGW------FPTPLGIMSLIF----------------WNVRGSGSPR---------TFRRL

Query:  TKLVKEKRPQVLFLSETKVSASRMSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWIL--WDGCRWQLTGFYGFPLVDLHPQSWS
          LVK K+P ++FL+ETK +  R+   K  LG++NCF V+  G+SG LAL+W  SV   +++++  HI   I    D  +WQLTGFYG P     P+SW 
Subjt:  TKLVKEKRPQVLFLSETKVSASRMSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWIL--WDGCRWQLTGFYGFPLVDLHPQSWS

Query:  LLSRLKGCADTPWLIDGDFNAILCQDEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDY
        LL  LK   + PWL  GDFN I  Q EK G  ++   ++  F++ +  C L DLGF GD+FTW N R G +   ERLD A     W  L+ N+ V+HLD 
Subjt:  LLSRLKGCADTPWLIDGDFNAILCQDEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDY

Query:  SRSDHRPMELILNPPPQCWSQSDQRIARFEETWLR
        ++SDH+   L++            R+ RFE  W +
Subjt:  SRSDHRPMELILNPPPQCWSQSDQRIARFEETWLR

TrEMBL top hitse value%identityAlignment
A0A2N9EV35 Uncharacterized protein7.1e-7429.78Show/hide
Query:  DLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFDRS
        D++  L   +LT EE E + L +   V  +    L + GK  S+KS N  A +  M+  W+++   +I   G ++   +F +  +   V+  GPW+F+ +
Subjt:  DLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFDRS

Query:  LVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIG---------------------TMKEVGGPREGGASVYRMRRARSFGVRFCMN
        L++L+      S  ++ F    FW+QI  +P    + +    + G+IG                        +V  P   G  V      +++ V +   
Subjt:  LVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIG---------------------TMKEVGGPREGGASVYRMRRARSFGVRFCMN

Query:  DYQIFCDECGCLGHSRRECGAVGEGALEQQCEQYGDWMRAVVGFGE--VGRQTQEG----GVDNAMNRKPEAPKP----STEGHVIHDSVPE--------
            FC  CG LGH  R+C +  E  +   C  YG+W+RA  G  E  +GR   +G       N +     AP+P    S E  V  D V E        
Subjt:  DYQIFCDECGCLGHSRRECGAVGEGALEQQCEQYGDWMRAVVGFGE--VGRQTQEG----GVDNAMNRKPEAPKP----STEGHVIHDSVPE--------

Query:  --EAGKEIAVSRGIKGRGSQ--EGGSVGVVARGEGVADRGKGKQKVE---SITAESGVSEGGDDVMLVDSVGQGKEVEAI-----GKGTNSRSWKRMARD
          E  +++      + +  +  E    G     E     GKGK   E   S  A SG    GD+V L   VG+  +V A+     G     +    M   
Subjt:  --EAGKEIAVSRGIKGRGSQ--EGGSVGVVARGEGVADRGKGKQKVE---SITAESGVSEGGDDVMLVDSVGQGKEVEAI-----GKGTNSRSWKRMARD

Query:  ILSDVTNRDVSAGSVIGKRKAGSV----EGVELGG-KKLRGSCVRCIGSDC------VGGGWFPTPLGIMSLIFWNVRGSGSPRTFRRLTKLVKEKRPQV
         L+  T+ +      + +     V    + VE G   K +   + C G  C      +G GW P P   + L+ WN RG G+P   R L  +VK + P V
Subjt:  ILSDVTNRDVSAGSVIGKRKAGSV----EGVELGG-KKLRGSCVRCIGSDC------VGGGWFPTPLGIMSLIFWNVRGSGSPRTFRRLTKLVKEKRPQV

Query:  LFLSETKVSASRMSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWI-LWDGCRWQLTGFYGFPLVDLHPQSWSLLSRLKGCADTP
        LFL ETK+    M   +  LG+ N F V  HGRSGGLAL+W + +  ++ +F+ NHID  + + DG  W+LT F G P      +SW+LL  L    + P
Subjt:  LFLSETKVSASRMSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWI-LWDGCRWQLTGFYGFPLVDLHPQSWSLLSRLKGCADTP

Query:  WLIDGDFNAILCQDEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPMELIL
        WL  GDFN I+ Q+EK G   + L+++ AF+ V + C LLD+GF G  FTW N R G   + ER+D AF+   W++ +PN  V+HL    SDH P+ + +
Subjt:  WLIDGDFNAILCQDEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPMELIL

Query:  NPPPQCWSQSDQRIARFEETWL
                +  ++  RFEE W+
Subjt:  NPPPQCWSQSDQRIARFEETWL

A0A2N9EWI8 Uncharacterized protein1.3e-7030.07Show/hide
Query:  MEDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFD
        +E++  RL   +L+ +EA  + LG  T   S    +  V+ K+ + K  N +A +S +R  W   G   I    +NLF   F +  +  R+    PW FD
Subjt:  MEDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFD

Query:  RSLVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREG-GASVY-RMR----------RAR--------------SF
        + L+ + +         V+F   AFWV++ N+P++  TRE+   +   +G   EV  P +G G   Y R+R          R R               F
Subjt:  RSLVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREG-GASVY-RMR----------RAR--------------SF

Query:  GVRFCMNDYQIFCDECGCLGHSRREC--GAVGEGALEQQCEQYGDWMRAVVGFGEVGRQTQ-------EGGVDNAMNRKPEAPKPSTEGHVIHDSVPEEA
         V F      IFC  CG LGHS  +C  G    G LE    +YG W+RA+     V RQ +       +G   NA          ++  HV    V  E+
Subjt:  GVRFCMNDYQIFCDECGCLGHSRREC--GAVGEGALEQQCEQYGDWMRAVVGFGEVGRQTQ-------EGGVDNAMNRKPEAPKPSTEGHVIHDSVPEEA

Query:  GKEIAVSRGIKGRGSQEGGSVGVVAR---GEGVADRGKGKQKVESITAESGVSEGGDDVMLVDSVGQGKEVEAIGKGTNSRSWKRMARDILSDVTNRDVS
           +        R  +E  S   + R           + +++   + A+S +    D+   ++      EV  +          R    ++ D TN+   
Subjt:  GKEIAVSRGIKGRGSQEGGSVGVVAR---GEGVADRGKGKQKVESITAESGVSEGGDDVMLVDSVGQGKEVEAIGKGTNSRSWKRMARDILSDVTNRDVS

Query:  AGSVIGKRKAGSVEGVELGGKKLRGSCVRCIGS--DCVGGGWFP-------TPLGIMSLIFWNVRGSGSPRTFRRLTKLVKEKRPQVLFLSETKVSASRM
              K +   V+   +G + +RG+    + +  D   G   P        P G M  I  N RG G+P T   L   VK++ PQ++FL ET++   ++
Subjt:  AGSVIGKRKAGSVEGVELGGKKLRGSCVRCIGS--DCVGGGWFP-------TPLGIMSLIFWNVRGSGSPRTFRRLTKLVKEKRPQVLFLSETKVSASRM

Query:  SSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWI-LWDGCRWQLTGFYGFPLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAILCQ
           +  LG   CF V+  G  GGLAL+W  SV   + S+SK+HID W+    G  W+ TGFYG P+      SW LL RLKG +D PWL+ GDFN I+  
Subjt:  SSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWI-LWDGCRWQLTGFYGFPLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAILCQ

Query:  DEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPMELILNPPPQCWSQSDQR
        DEK G   +  +++A F+  ++ C LLDLGF G  FTW N R   E + ERLD   +   W DL+P   + H+ ++ SDH  M L+LN   +   Q  QR
Subjt:  DEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPMELILNPPPQCWSQSDQR

Query:  IAR-----FEETWLR
         +R     FE  WLR
Subjt:  IAR-----FEETWLR

A0A2N9GXZ5 CCHC-type domain-containing protein9.6e-7129.2Show/hide
Query:  MEDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFD
        +E+L  R+   RL++ E + +R+ +   + S    +  ++ K+ ++++ N +AF++ +   W  HG   + +  +NLF   F S      V ST PW FD
Subjt:  MEDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFD

Query:  RSLVILKEPRSAGSVMD--VEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREGGASVYRMRRAR---------------------SFGVR
        + L+++   R  G +    V+F   AFW+++ N+P++  TREM   +  ++G +  V  P + G +  R  R R                        V 
Subjt:  RSLVILKEPRSAGSVMD--VEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREGGASVYRMRRAR---------------------SFGVR

Query:  FCMNDYQIFCDECGCLGHSRRECGAVGEGALEQQCEQYGDWMRAVVGFG-EVGRQTQEGGVD-NAMNRKPEAPKPSTEGHVIHDSVPEEAGKEIAVSRGI
        F      IFC  CG LGHS  +C      ++  + +QYG W+RA+     + GR+ ++GG+D N+ +   +  + S  G   H    + A    A+    
Subjt:  FCMNDYQIFCDECGCLGHSRRECGAVGEGALEQQCEQYGDWMRAVVGFG-EVGRQTQEGGVD-NAMNRKPEAPKPSTEGHVIHDSVPEEAGKEIAVSRGI

Query:  KGRGSQEGGSVGVVARGEGVADRGKGKQKVESITAESGVSEGGDDVMLVDSVGQGKEVEAIGKGTNSRSWKRMARDILSDVTNRDVSAGSVIGKRKAGSV
         G    +G                     VE +  E G ++     + +D  G       IG G N+        ++  + T+ D S     GK  AG  
Subjt:  KGRGSQEGGSVGVVARGEGVADRGKGKQKVESITAESGVSEGGDDVMLVDSVGQGKEVEAIGKGTNSRSWKRMARDILSDVTNRDVSAGSVIGKRKAGSV

Query:  EGV---------ELGGKKLRGSCVR---------CIGSDCVGGG-----WFPTPLGIMSLIFWNVRGSGSPRTFRRLTKLVKEKRPQVLFLSETKVSASR
          +             KK R +C+           + S    G          P G M +   N RG G+P T   L ++VK++ P ++FL ET++    
Subjt:  EGV---------ELGGKKLRGSCVR---------CIGSDCVGGG-----WFPTPLGIMSLIFWNVRGSGSPRTFRRLTKLVKEKRPQVLFLSETKVSASR

Query:  MSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWIL-WDGCRWQLTGFYGFPLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAILC
        +   +  LG   C  VD HG  GGLAL+W SS+  ++ SFS+NHID  ++  DG RW+ TGFYG P   L   SW+LL  L G  + PWL+ GDFN I  
Subjt:  MSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWIL-WDGCRWQLTGFYGFPLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAILC

Query:  QDEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPME
          E+ G  D+ L+++AAF+ V+  C L DLGF    FTW NRR  G+ +  RLD + +   W  L+PN  V H+  + SDH   E
Subjt:  QDEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPME

A0A2N9I946 Uncharacterized protein1.5e-7128.55Show/hide
Query:  MEDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFD
        +E++  R+N   L+D+E   + L +     S    +  ++ K+ +++  N +AF+  +R  W VHG   +    +NLF   F S     R+ +  PW FD
Subjt:  MEDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFD

Query:  RSLVILKEPRSAGSVMD--VEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREGGASVYRMRRAR---------------------SFGVR
        + L++L   R  G +    V+F   AFW++I N+P++  TRE+   +  ++G + +V  P E G +  R  R R                        V 
Subjt:  RSLVILKEPRSAGSVMD--VEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREGGASVYRMRRAR---------------------SFGVR

Query:  FCMNDYQIFCDECGCLGHSRRECGA--VGEGALEQQCEQYGDWMRAV-----------------VG-------FGEVGRQTQEGGVD------------N
        F      IFC +CG LGHS  +C A    E A     +QYG W+RA                  +G        GE G    EGGV+            +
Subjt:  FCMNDYQIFCDECGCLGHSRRECGA--VGEGALEQQCEQYGDWMRAV-----------------VG-------FGEVGRQTQEGGVD------------N

Query:  AMNRKPEAPKP-----------STEGHVIHDSVPEEAGKEIAVSRGIKG------RGSQEGGSVGV------VARGEGVADRGKGKQKVESITAESGVSE
          N+ P++               TE  V+H  +P     ++ VS G  G       G + G           + +     D G   Q +++   +  V  
Subjt:  AMNRKPEAPKP-----------STEGHVIHDSVPEEAGKEIAVSRGIKG------RGSQEGGSVGV------VARGEGVADRGKGKQKVESITAESGVSE

Query:  GGDDVMLVDSVGQGKEVEAIGKGTN---------------SRSWKRMARDILS----DVTNRDVSAGSVIGKRKAGSVEGVELGG-KKLRGSCVRCIGSD
        G    +  D   + K++EA  +  +                 +WK+ AR   S      + + V+       R     EG E    KK + +    +   
Subjt:  GGDDVMLVDSVGQGKEVEAIGKGTN---------------SRSWKRMARDILS----DVTNRDVSAGSVIGKRKAGSVEGVELGG-KKLRGSCVRCIGSD

Query:  CVGGGWFPTPLGIMSLIFWNVRGSGSPRTFRRLTKLVKEKRPQVLFLSETKVSASRMSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNH
                 P   M     N RG G+P T R L   V+++ P V+FL ET++    +   +  LG   CF V+ +G  GGLAL+W SSV+  + SFS NH
Subjt:  CVGGGWFPTPLGIMSLIFWNVRGSGSPRTFRRLTKLVKEKRPQVLFLSETKVSASRMSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNH

Query:  IDG-WILWDGCRWQLTGFYGFPLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAILCQDEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRP
        ID   ++ DG +W++TGFYG P   L   SW+LL +L    + PWL+ GDFN +L  +E+ G  D+ LS++AAF+  +  C L DLG+ G  F+W NRR 
Subjt:  IDG-WILWDGCRWQLTGFYGFPLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAILCQDEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRP

Query:  GGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPMELILNPPPQCWSQSDQRIARFEETWLR
         G  +  RLD   +   W  L+P+Y V+H+ ++ SDH  + +ILNPPP   S + ++  RFE  W+R
Subjt:  GGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPMELILNPPPQCWSQSDQRIARFEETWLR

A0A7N2LIH6 Uncharacterized protein2.7e-7330.35Show/hide
Query:  EDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFDR
        E+L        +T+ E E ++LG  ++  +    + CVV K+ + ++V +EA +  MR+ W+     +I   GE+LF ++F    +K +VM   PW++++
Subjt:  EDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFDR

Query:  SLVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREG--GASVYRMR----------RARSFGV-----RFCMNDYQ
         L++++E        +++     FWVQI N+P++  TRE    +  KIG + EV  P +G       R+R          R +   +     R+    Y+
Subjt:  SLVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREG--GASVYRMR----------RARSFGV-----RFCMNDYQ

Query:  ---IFCDECGCLGHSRREC--GAVGEGALEQQCEQYGDWMRAVVGFGEVGRQTQEGGVDNAMNRKPEAPKPSTEGHV-IHDSVPEEA--GKEIAVSRGIK
            FC +CG L H  ++C     GE   +++ +QYG W+R   G    GR     G +    R+ +  +  TE    +   + E A  G++    + I 
Subjt:  ---IFCDECGCLGHSRREC--GAVGEGALEQQCEQYGDWMRAVVGFGEVGRQTQEGGVDNAMNRKPEAPKPSTEGHV-IHDSVPEEA--GKEIAVSRGIK

Query:  GRGSQEGGSVGVVARGEGVADRGKGKQKVE-----------------SITAESGVSEGG-----------------DDVMLVDSVGQGKEVEAIGKGT--
          G ++G    +V  G GV  +G   QKVE                  IT     S  G                 DDV  V S    K+ + + KGT  
Subjt:  GRGSQEGGSVGVVARGEGVADRGKGKQKVE-----------------SITAESGVSEGG-----------------DDVMLVDSVGQGKEVEAIGKGT--

Query:  ----------NSRSWKRMARDILSDVTNRDVSAGSVIGKRKAGSVEGVELGGKKLRGSCVRCIGSDCVGGGWFPTPLGIMSLIFWNVRGSGSPRTFRRLT
                  N  S   M  D     T+  +   S   KR A   +        + GS  R  G  C GGG    P   M+++ WN RG G+    R LT
Subjt:  ----------NSRSWKRMARDILSDVTNRDVSAGSVIGKRKAGSVEGVELGGKKLRGSCVRCIGSDCVGGGWFPTPLGIMSLIFWNVRGSGSPRTFRRLT

Query:  KLVKEKRPQVLFLSETKVSASRMSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWI--LWDGCRWQLTGFYGFPLVDLHPQSWSL
          VK+K P ++FL ETK S  +M   +  LGF     V   GRSGGLAL+W         S S +HID  +     G  W+ TGFYG P       SW L
Subjt:  KLVKEKRPQVLFLSETKVSASRMSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWI--LWDGCRWQLTGFYGFPLVDLHPQSWSL

Query:  LSRLKGCADTPWLIDGDFNAILCQDEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYS
        L  L    + PWL+ GDFN I+  DEK G +D+  +++ AF+ V+  CGL+DLGFVG RFTWCN R G +    RLD   +   W+ ++P   V+H+  S
Subjt:  LSRLKGCADTPWLIDGDFNAILCQDEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYS

Query:  RSDHRPMELILNPPPQCWSQSDQRIAR----FEETWLR
         SDH  + L LN        ++QR  +    FEE W R
Subjt:  RSDHRPMELILNPPPQCWSQSDQRIAR----FEETWLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding3.3e-0723.58Show/hide
Query:  FWSRGEKVRVMSTGPWAFDRSLVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREGGASVYRMRRARSFGVRFCMN
        F S      ++  GPW+F+  + +++      S  D EF    FW+QI  +P+R+ T  +   +  ++G   E    R+     ++  + ++        
Subjt:  FWSRGEKVRVMSTGPWAFDRSLVILKEPRSAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREGGASVYRMRRARSFGVRFCMN

Query:  DYQIFCDECGCLGHSRRECGAVG
            FC  CG L H   EC   G
Subjt:  DYQIFCDECGCLGHSRRECGAVG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGATCTAGTTAGCCGTTTGAATTCTTGGAGACTGACGGATGAAGAGGCGGAGATGGTGCGGCTGGGGGAGGGAACATCGGTTCTGTCAATGGGCTCGTTAGAGCT
GTGTGTGGTAGGAAAAGTTTTCTCGTCAAAGAGTGTGAATGTGGAGGCTTTTCAGAGTGTTATGAGGGTAGCATGGAGAGTTCATGGGGCGACGCGCATAGAATCTGCTG
GCGAGAATTTATTTGCTATTCAGTTCTGGTCTAGAGGGGAAAAGGTGCGAGTGATGTCGACGGGTCCATGGGCTTTTGATCGGTCCTTGGTAATTCTGAAGGAGCCAAGA
TCTGCAGGATCGGTAATGGATGTGGAGTTTAATGACTGTGCTTTTTGGGTCCAGATTCATAATGTACCCATGCGTTGGCAGACGAGAGAGATGGCTCGACATTTGGAGGG
TAAGATCGGAACCATGAAGGAAGTTGGTGGACCGCGGGAAGGAGGGGCTTCGGTTTACAGGATGAGGAGGGCAAGGAGTTTCGGTGTCCGGTTTTGTATGAACGACTACC
AGATTTTCTGTGATGAGTGTGGTTGTCTGGGGCACTCGAGGAGGGAGTGCGGGGCTGTTGGTGAAGGTGCTCTGGAGCAGCAGTGTGAGCAGTACGGTGATTGGATGAGA
GCGGTGGTAGGGTTTGGGGAGGTTGGGAGGCAGACTCAAGAGGGAGGGGTGGATAATGCAATGAATCGGAAGCCTGAGGCCCCAAAGCCTTCTACTGAAGGACATGTGAT
CCATGACTCTGTTCCCGAGGAAGCTGGGAAGGAGATTGCTGTGAGCAGGGGGATCAAGGGAAGGGGGAGTCAAGAGGGGGGCAGTGTGGGTGTGGTAGCGAGGGGGGAGG
GGGTGGCTGATAGGGGAAAGGGAAAGCAGAAGGTGGAGAGTATCACGGCTGAAAGTGGGGTCAGTGAGGGAGGGGATGATGTGATGCTGGTTGACTCTGTGGGCCAGGGG
AAAGAGGTGGAAGCGATAGGGAAGGGGACTAATAGTAGGAGCTGGAAGAGAATGGCGAGGGATATCTTGTCTGATGTTACCAATCGAGATGTTTCTGCGGGGTCTGTTAT
TGGTAAGAGGAAGGCAGGAAGTGTGGAGGGTGTGGAGTTAGGGGGAAAGAAATTGAGAGGGAGTTGTGTCAGGTGTATCGGATCCGACTGTGTTGGCGGCGGCTGGTTCC
CAACCCCGCTAGGGATTATGAGTTTGATATTCTGGAATGTTCGGGGATCGGGGTCACCCCGAACATTCAGGCGCCTGACCAAGCTGGTTAAGGAGAAACGACCTCAGGTG
CTCTTCTTATCAGAAACAAAGGTGTCTGCTAGTAGGATGTCTTCTGCGAAGCGTTTGTTGGGCTTTGATAACTGTTTTTGTGTTGATTGTCATGGACGGAGTGGTGGGTT
GGCCCTCATGTGGGTTTCCTCGGTATCCTTTAGTCTTCTTTCATTCTCCAAGAATCACATTGATGGATGGATCTTATGGGATGGCTGTAGGTGGCAGCTCACCGGTTTCT
ATGGTTTCCCTTTAGTGGACCTACATCCCCAGTCTTGGTCTCTCCTATCTAGACTGAAGGGTTGTGCTGATACACCGTGGCTGATTGACGGGGATTTTAACGCCATCTTA
TGTCAGGATGAGAAAGATGGAGGCAGGGATAAGCTGCTGTCTGAGCTAGCTGCTTTTCAGAGTGTGGTTGACTCGTGTGGTCTGCTGGATTTGGGGTTTGTGGGAGATCG
CTTCACTTGGTGTAACAGACGGCCGGGAGGTGAAACGATTTATGAGCGGTTGGATTGGGCTTTTAGCATCACACCTTGGACAGACCTCTATCCGAATTATGTGGTTAACC
ATCTTGACTATAGTCGGTCTGACCATAGGCCGATGGAACTAATTCTTAACCCCCCTCCGCAGTGTTGGTCTCAGAGTGACCAGCGGATTGCTCGTTTTGAGGAAACTTGG
CTTAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGATCTAGTTAGCCGTTTGAATTCTTGGAGACTGACGGATGAAGAGGCGGAGATGGTGCGGCTGGGGGAGGGAACATCGGTTCTGTCAATGGGCTCGTTAGAGCT
GTGTGTGGTAGGAAAAGTTTTCTCGTCAAAGAGTGTGAATGTGGAGGCTTTTCAGAGTGTTATGAGGGTAGCATGGAGAGTTCATGGGGCGACGCGCATAGAATCTGCTG
GCGAGAATTTATTTGCTATTCAGTTCTGGTCTAGAGGGGAAAAGGTGCGAGTGATGTCGACGGGTCCATGGGCTTTTGATCGGTCCTTGGTAATTCTGAAGGAGCCAAGA
TCTGCAGGATCGGTAATGGATGTGGAGTTTAATGACTGTGCTTTTTGGGTCCAGATTCATAATGTACCCATGCGTTGGCAGACGAGAGAGATGGCTCGACATTTGGAGGG
TAAGATCGGAACCATGAAGGAAGTTGGTGGACCGCGGGAAGGAGGGGCTTCGGTTTACAGGATGAGGAGGGCAAGGAGTTTCGGTGTCCGGTTTTGTATGAACGACTACC
AGATTTTCTGTGATGAGTGTGGTTGTCTGGGGCACTCGAGGAGGGAGTGCGGGGCTGTTGGTGAAGGTGCTCTGGAGCAGCAGTGTGAGCAGTACGGTGATTGGATGAGA
GCGGTGGTAGGGTTTGGGGAGGTTGGGAGGCAGACTCAAGAGGGAGGGGTGGATAATGCAATGAATCGGAAGCCTGAGGCCCCAAAGCCTTCTACTGAAGGACATGTGAT
CCATGACTCTGTTCCCGAGGAAGCTGGGAAGGAGATTGCTGTGAGCAGGGGGATCAAGGGAAGGGGGAGTCAAGAGGGGGGCAGTGTGGGTGTGGTAGCGAGGGGGGAGG
GGGTGGCTGATAGGGGAAAGGGAAAGCAGAAGGTGGAGAGTATCACGGCTGAAAGTGGGGTCAGTGAGGGAGGGGATGATGTGATGCTGGTTGACTCTGTGGGCCAGGGG
AAAGAGGTGGAAGCGATAGGGAAGGGGACTAATAGTAGGAGCTGGAAGAGAATGGCGAGGGATATCTTGTCTGATGTTACCAATCGAGATGTTTCTGCGGGGTCTGTTAT
TGGTAAGAGGAAGGCAGGAAGTGTGGAGGGTGTGGAGTTAGGGGGAAAGAAATTGAGAGGGAGTTGTGTCAGGTGTATCGGATCCGACTGTGTTGGCGGCGGCTGGTTCC
CAACCCCGCTAGGGATTATGAGTTTGATATTCTGGAATGTTCGGGGATCGGGGTCACCCCGAACATTCAGGCGCCTGACCAAGCTGGTTAAGGAGAAACGACCTCAGGTG
CTCTTCTTATCAGAAACAAAGGTGTCTGCTAGTAGGATGTCTTCTGCGAAGCGTTTGTTGGGCTTTGATAACTGTTTTTGTGTTGATTGTCATGGACGGAGTGGTGGGTT
GGCCCTCATGTGGGTTTCCTCGGTATCCTTTAGTCTTCTTTCATTCTCCAAGAATCACATTGATGGATGGATCTTATGGGATGGCTGTAGGTGGCAGCTCACCGGTTTCT
ATGGTTTCCCTTTAGTGGACCTACATCCCCAGTCTTGGTCTCTCCTATCTAGACTGAAGGGTTGTGCTGATACACCGTGGCTGATTGACGGGGATTTTAACGCCATCTTA
TGTCAGGATGAGAAAGATGGAGGCAGGGATAAGCTGCTGTCTGAGCTAGCTGCTTTTCAGAGTGTGGTTGACTCGTGTGGTCTGCTGGATTTGGGGTTTGTGGGAGATCG
CTTCACTTGGTGTAACAGACGGCCGGGAGGTGAAACGATTTATGAGCGGTTGGATTGGGCTTTTAGCATCACACCTTGGACAGACCTCTATCCGAATTATGTGGTTAACC
ATCTTGACTATAGTCGGTCTGACCATAGGCCGATGGAACTAATTCTTAACCCCCCTCCGCAGTGTTGGTCTCAGAGTGACCAGCGGATTGCTCGTTTTGAGGAAACTTGG
CTTAGGTAA
Protein sequenceShow/hide protein sequence
MEDLVSRLNSWRLTDEEAEMVRLGEGTSVLSMGSLELCVVGKVFSSKSVNVEAFQSVMRVAWRVHGATRIESAGENLFAIQFWSRGEKVRVMSTGPWAFDRSLVILKEPR
SAGSVMDVEFNDCAFWVQIHNVPMRWQTREMARHLEGKIGTMKEVGGPREGGASVYRMRRARSFGVRFCMNDYQIFCDECGCLGHSRRECGAVGEGALEQQCEQYGDWMR
AVVGFGEVGRQTQEGGVDNAMNRKPEAPKPSTEGHVIHDSVPEEAGKEIAVSRGIKGRGSQEGGSVGVVARGEGVADRGKGKQKVESITAESGVSEGGDDVMLVDSVGQG
KEVEAIGKGTNSRSWKRMARDILSDVTNRDVSAGSVIGKRKAGSVEGVELGGKKLRGSCVRCIGSDCVGGGWFPTPLGIMSLIFWNVRGSGSPRTFRRLTKLVKEKRPQV
LFLSETKVSASRMSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSVSFSLLSFSKNHIDGWILWDGCRWQLTGFYGFPLVDLHPQSWSLLSRLKGCADTPWLIDGDFNAIL
CQDEKDGGRDKLLSELAAFQSVVDSCGLLDLGFVGDRFTWCNRRPGGETIYERLDWAFSITPWTDLYPNYVVNHLDYSRSDHRPMELILNPPPQCWSQSDQRIARFEETW
LR