; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041280 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041280
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr13:14892520..14894296
RNA-Seq ExpressionLag0041280
SyntenyLag0041280
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7133372.1 hypothetical protein RHSIM_Rhsim09G0106200 [Rhododendron simsii]2.1e-4627.51Show/hide
Query:  MDEPIEEVLARFNIS-EKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSW-QCKSVKFVEVDENVLLFCFADAASMLYVQNQGPWL
        M + + +++  F++S E+E E + I  E     +E   + L+G+++T +K+N    ++++  +W   +++  VEV +N+  F F +  S+  V N GPW 
Subjt:  MDEPIEEVLARFNIS-EKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSW-QCKSVKFVEVDENVLLFCFADAASMLYVQNQGPWL

Query:  FEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSV--QQRKFIRIKVELDVTKLLMK-GFIVMTGGSKK
        F   LL L  W P +K          FW+Q+ GLPF+    +  +++ +KIG        VD R+V   + +FIR++V + + K L + GFI +  GSK 
Subjt:  FEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSV--QQRKFIRIKVELDVTKLLMK-GFIVMTGGSKK

Query:  WVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPA---PPVFGDWLRA---------------------GPV------PTMAKQGERSGHWRRKEQGQG
        WV +K+ERL  FC  CG + H    C  K             FG W++A                     GP       PT      + G+++  E  +G
Subjt:  WVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPA---PPVFGDWLRA---------------------GPV------PTMAKQGERSGHWRRKEQGQG

Query:  K-APESGVPSVGTESEKAPGINLGGD---------------------------------VSSEPPTGI--------VEPRVENEQN--------------
            +SG  S   E + +  I + G                                   S+ P  G+          P +E E +              
Subjt:  K-APESGVPSVGTESEKAPGINLGGD---------------------------------VSSEPPTGI--------VEPRVENEQN--------------

Query:  ----------ISENLELNEVLQVMGQV-MGLTWPT--GKHYGKAVDQGPGGTLHRTNADGA---------KVRGSSVSLIGFKRKCR----AGGVGTSEQ
                  I  NL+  EV Q  G+  +GL  P   G    K ++   G T  R     A         K    ++S++G +R+      A G  +   
Subjt:  ----------ISENLELNEVLQVMGQV-MGLTWPT--GKHYGKAVDQGPGGTLHRTNADGA---------KVRGSSVSLIGFKRKCR----AGGVGTSEQ

Query:  AEMGSSKRAKGSD-----KGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQYADFFIEAVI
           G  KR   SD      GNPLTVR LKE    +SP+V+FL+ETKNK  +LE +++ +  E    VEP G+  G CLM K+  +VE+ +     I+A +
Subjt:  AEMGSSKRAKGSD-----KGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQYADFFIEAVI

Query:  RPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEG
            G       GVYAST++  R +   ++ S++G  Q+  V+ GDFNDI+ N EK G
Subjt:  RPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEG

KAF7141361.1 hypothetical protein RHSIM_Rhsim06G0043400 [Rhododendron simsii]4.8e-4628Show/hide
Query:  SEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSW-QCKSVKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNI
        SE+E E + +S    S  +    + L+G ++T++KFN+ AF+  +  +W    +++ VEV +N+  F FA+   M  V   GPW F++ +++L +W   +
Subjt:  SEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSW-QCKSVKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNI

Query:  KTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSR--SVQQRKFIRIKVELDV-TKLLMKGFIVMTGGSKKWVWFKYERLPRFCSK
               +  + W+Q+HGLPF+  GQE  +V+  KIG    E  EVD R    +Q +FIR++V L V T L   G IV  GG K WV +KYER+P FC  
Subjt:  KTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSR--SVQQRKFIRIKVELDV-TKLLMKGFIVMTGGSKKWVWFKYERLPRFCSK

Query:  CGVMGHTAHWCSAKHLQTSPA---PPVFGDWLRAG-------------------------------------PVPTMAKQGER-------SGHWRRKEQG
        CG + H  + C  K             +G+W++A                                      P     K GER        G WR  +  
Subjt:  CGVMGHTAHWCSAKHLQTSPA---PPVFGDWLRAG-------------------------------------PVPTMAKQGER-------SGHWRRKEQG

Query:  Q----------GKAPESG--VPS---------------------VGTESEKAPGINLGG-DVSSEPPTGIVEPRV------------ENEQNISENLELN
        +          GK+   G   PS                     + +  +  PG  L G  V +   TG+ + R             +N +    NL L 
Subjt:  Q----------GKAPESG--VPS---------------------VGTESEKAPGINLGG-DVSSEPPTGIVEPRV------------ENEQNISENLELN

Query:  EVL--QVMGQVMGL-----------TWPTGKHYG------------KAVDQG-PGGTLH-----RTNADGAKVRGSSVSLIGFKRKCRAGGVGTSEQAEM
        E +  + MG   G+             P G   G             +++ G  GGT       +    G    G + SL G       GG G  ++  +
Subjt:  EVL--QVMGQVMGL-----------TWPTGKHYG------------KAVDQG-PGGTLH-----RTNADGAKVRGSSVSLIGFKRKCRAGGVGTSEQAEM

Query:  GSSKRAKGSDK---------------------GNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVE
          S    G  K                     GNPLTV +LK   KLHSP  +FL+ETKN+  ++E + R+LG+   FVV+P G+  GLC MWK    V 
Subjt:  GSSKRAKGSDK---------------------GNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVE

Query:  IHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKERE---QQLSNLTSRIGLSQD-NYVVGGDFNDIVCNGEKEG
        +   ++++++  +R + G   W + G+YASTD ++R    +++SNL    G+  D  +V+ GDFN IV N EK G
Subjt:  IHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKERE---QQLSNLTSRIGLSQD-NYVVGGDFNDIVCNGEKEG

KAF7150653.1 hypothetical protein RHSIM_Rhsim02G0038900 [Rhododendron simsii]4.0e-4526.42Show/hide
Query:  MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSW-QCKSVKFVEVDENVLLFCFADAASMLYVQNQGPWLF
        M + + E++   ++S++E + + I+ E     +E   + L+G+++T++KFN    + ++  +W   +++  VEV +N+  F F    ++  V N GPW F
Subjt:  MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSW-QCKSVKFVEVDENVLLFCFADAASMLYVQNQGPWLF

Query:  EESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSV--QQRKFIRIKVELDVTKLLMK-GFIVMTGGSKKW
        +  LLVL +W   +K +    ++  FWVQ+ GLPF+       +++ ++IG      + VD+R+V  ++ +FIR++V + V K L + GFI +  G+K W
Subjt:  EESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSV--QQRKFIRIKVELDVTKLLMK-GFIVMTGGSKKW

Query:  VWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPA---PPVFGDWLRAGPVPTMAKQGERSGHWRRKEQGQGKAPESGVPSVGTESEKAPGINLGGDVS-
        V +K+ERL RFC  CG + H    C  +             FG W++AG     A + + +  W       G A +  + +VG+   K  G      +S 
Subjt:  VWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPA---PPVFGDWLRAGPVPTMAKQGERSGHWRRKEQGQGKAPESGVPSVGTESEKAPGINLGGDVS-

Query:  SEPPTGIVEPRVENEQNIS-ENLELNEVL-----QVMGQVMGLTWPTGKHYGKAVDQGP------GGTLHRTNADGA----KVRGSSVSLIGFKRKCRAG
         E  +G++E +  +   I+ +   + ++L      V+G   G+  P+G         GP      G  L + N +GA     +  S +   G +   +  
Subjt:  SEPPTGIVEPRVENEQNIS-ENLELNEVL-----QVMGQVMGLTWPTGKHYGKAVDQGP------GGTLHRTNADGA----KVRGSSVSLIGFKRKCRAG

Query:  GVGTSEQ-----------------------------------AEMGSSKRAKGSDK-----------------------------------------GNP
        GVG  E                                    +  G +KR+KG  +                                         GNP
Subjt:  GVGTSEQ-----------------------------------AEMGSSKRAKGSDK-----------------------------------------GNP

Query:  LTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKE
        LT+R LK    L+SP+++FL+ETKNK  ++E I + +G      VEP GL  G CL+ K   ++++ +     I+  +    G       GVYASTD  E
Subjt:  LTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKE

Query:  REQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEG
        R      L  ++  SQ+  V+GGDFN I+ N EK G
Subjt:  REQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEG

XP_035540109.1 uncharacterized protein LOC118344190 [Juglans regia]1.8e-4527.07Show/hide
Query:  ISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQ-CKSVKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPN
        ++E+E E + I  +A+ V  E     +IG++  +R    +    TM   W+  K   F EV  N+ +  F +    + V +  PWLF+  L  L  +   
Subjt:  ISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQ-CKSVKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPN

Query:  IKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVTKLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCG
         +          FWVQIH LP  C  +E  +++ + +G++   E +V +  +   KF+R++VE+ + K + +G ++   G + W+  +YE+LPR C KCG
Subjt:  IKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVTKLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCG

Query:  VMGHTAHWCS---AKHLQTSPAPPVFGDWLRAGP---VPTMAKQGERSG-HWRRKEQG------QGKAPESGVPSVGTESEKAPGINLGGDVSSEPPTGI
         + H    C       ++       FG WLRA P         QG   G  W RK +        G    SG   + +E     G   G DV        
Subjt:  VMGHTAHWCS---AKHLQTSPAPPVFGDWLRAGP---VPTMAKQGERSG-HWRRKEQG------QGKAPESGVPSVGTESEKAPGINLGGDVSSEPPTGI

Query:  VEPRVENEQNISENLELN-----EVLQVMGQVMGLTWPTGKHYGKAVDQGPGGTLHRTNADGAKVRGSSVSLIGFKRKCRAGGVGTSE---QAEMGSSKR
        V  R+   +   E  ++      +++Q+     G       +  + + +G    +    ++G     ++V L G K K RA G+G SE    + +G  + 
Subjt:  VEPRVENEQNISENLELN-----EVLQVMGQVMGLTWPTGKHYGKAVDQGPGGTLHRTNADGAKVRGSSVSLIGFKRKCRAGGVGTSE---QAEMGSSKR

Query:  AKGSDKGNPLTVRSLKEQ------------------------VKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIH
          G D+ + +  R++K+                         VK   P+++FL ETK ++N++E I+R+ G++GC VVEP GL  GL +MWK VDEVE+ 
Subjt:  AKGSDKGNPLTVRSLKEQ------------------------VKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIH

Query:  QYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEG
         Y+ + I   +R +  N +W   G Y   D  +R+     L+     ++  + V GDFN+IV   EK G
Subjt:  QYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEG

XP_035544642.1 uncharacterized protein LOC109020982 [Juglans regia]6.4e-4327.34Show/hide
Query:  MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQCKS-VKFVEVDENVLLFCFADAASMLYVQNQGPWLF
        M E +      F ++E+E   + + S+   + +   K+CL+G ++  +  N+EAFR TM+  W+ +  V+F EV EN  L  F        V    PW F
Subjt:  MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQCKS-VKFVEVDENVLLFCFADAASMLYVQNQGPWLF

Query:  EESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVTKLLMKGFIVMTGGSKKWVWF
        +  LL L  +  N+          +FW+Q+H +P     +E  K + + +G+V   +   D R +   KF+RI+VE+ +TK LM+G  +   G K WV F
Subjt:  EESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVTKLLMKGFIVMTGGSKKWVWF

Query:  KYERLPRFCSKCGVMGHTAHWCSAKHLQTSP-APP--VFGDWLRAGPVPTMAKQGE----RSGHWRRKEQGQGKAPESGVPSVGTESEKAPGINLGGDVS
        KYERLP FC KCGV+ H    C     Q+SP A P   +G WLRA      AK+G+    R GH   ++             VG E      +NLG  VS
Subjt:  KYERLPRFCSKCGVMGHTAHWCSAKHLQTSP-APP--VFGDWLRAGPVPTMAKQGE----RSGHWRRKEQGQGKAPESGVPSVGTESEKAPGINLGGDVS

Query:  SE---PPTG----------IVEPRVENEQNISENLEL--NEVLQVMGQVMGLTWPTGKHYGKAVDQGPGGTLHRTN--ADGAKVRGSSVSLIGFKRKCRA
         E    P             ++P  E   ++ E+L     ++ +V   +M L   TG    K  D  P   + + +    G   R SS+         R 
Subjt:  SE---PPTG----------IVEPRVENEQNISENLEL--NEVLQVMGQVMGLTWPTGKHYGKAVDQGPGGTLHRTN--ADGAKVRGSSVSLIGFKRKCRA

Query:  GGVGTSEQAEMG------SSKRAKGSDKGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQY
          +  +  + +         +   G ++     +      +K   P+ +FL ETK K +++E ++R + ++  FV++ +G   GL  +WK   E E+H Y
Subjt:  GGVGTSEQAEMG------SSKRAKGSDKGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQY

Query:  ADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEG
        +   I  +++       W   G Y S     R+     L +   +    ++  GDFN+I+  G+K G
Subjt:  ADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEG

TrEMBL top hitse value%identityAlignment
A0A2N9FJM4 Reverse transcriptase domain-containing protein3.1e-4325.62Show/hide
Query:  MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQC-KSVKFVEVDENVLLFCFADAASMLYVQNQGPWLF
        M + + + L +  ++++E +++ I+   +++ LE+ +  L+G  +T+R  N  A + T+  +W+    V+ V+V   ++ F F +A  M +V    PW F
Subjt:  MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQC-KSVKFVEVDENVLLFCFADAASMLYVQNQGPWLF

Query:  EESLLVLAKWSPNIKTKAELPKV-CDFWVQIHGLPFDCKGQEAAKVVAQKIGR---VTDEEWEVDSRSVQQRKFIRIKVELDVTK-LLMKGFIVMTGGSK
        +  LL+L +W   + T A L      FW+Q+ G+PFD   +E  + + +KIGR   V    W  D     Q   +RI+VE+ + K LL  GF++   G +
Subjt:  EESLLVLAKWSPNIKTKAELPKV-CDFWVQIHGLPFDCKGQEAAKVVAQKIGR---VTDEEWEVDSRSVQQRKFIRIKVELDVTK-LLMKGFIVMTGGSK

Query:  KWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPAPPVFGDWLRAG---------------------------PVPTMAKQGERSGHWRRKE------
         WV +KYERL  FC +CG+MGH    C+    Q   +   +G+WL+AG                           P P      E   H    +      
Subjt:  KWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPAPPVFGDWLRAG---------------------------PVPTMAKQGERSGHWRRKE------

Query:  ----QGQGKAP-----------------ESGVPSVGTESEKAPGI-----NLGGDVSSEPPTGIVE-PRVENEQNISENLELNEVLQVMGQVMGL-TWPT
               G  P                 + G P + T +   P I      LGGD  ++ P  +   P++E     S N   N+  ++  +   L +W  
Subjt:  ----QGQGKAP-----------------ESGVPSVGTESEKAPGI-----NLGGDVSSEPPTGIVE-PRVENEQNISENLELNEVLQVMGQVMGL-TWPT

Query:  GKHYGKAVDQGPGGTLHRTNADGAKVRGSSVSLIGFKRKCRAGGVGTSEQAEMGSSKRAKGSDKGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGI
             K  ++G   TL  +N             +G KR   A     +E +      +   +  GNP  +R L+   K   P ++FL ETK    R+E +
Subjt:  GKHYGKAVDQGPGGTLHRTNADGAKVRGSSVSLIGFKRKCRAGGVGTSEQAEMGSSKRAKGSDKGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGI

Query:  RRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGE
        R  +G++  F V  +G   GL LMWK   EV +  ++   ++  +   T N  W   G Y   +  +R +    L      +Q  ++  GDFN+I+   E
Subjt:  RRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGE

Query:  KEGAY
        K G +
Subjt:  KEGAY

A0A2N9HE28 Reverse transcriptase domain-containing protein3.1e-4325.62Show/hide
Query:  MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQC-KSVKFVEVDENVLLFCFADAASMLYVQNQGPWLF
        M + + + L +  ++++E +++ I+   +++ LE+ +  L+G  +T+R  N  A + T+  +W+    V+ V+V   ++ F F +A  M +V    PW F
Subjt:  MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQC-KSVKFVEVDENVLLFCFADAASMLYVQNQGPWLF

Query:  EESLLVLAKWSPNIKTKAELPKV-CDFWVQIHGLPFDCKGQEAAKVVAQKIGR---VTDEEWEVDSRSVQQRKFIRIKVELDVTK-LLMKGFIVMTGGSK
        +  LL+L +W   + T A L      FW+Q+ G+PFD   +E  + + +KIGR   V    W  D     Q   +RI+VE+ + K LL  GF++   G +
Subjt:  EESLLVLAKWSPNIKTKAELPKV-CDFWVQIHGLPFDCKGQEAAKVVAQKIGR---VTDEEWEVDSRSVQQRKFIRIKVELDVTK-LLMKGFIVMTGGSK

Query:  KWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPAPPVFGDWLRAG---------------------------PVPTMAKQGERSGHWRRKE------
         WV +KYERL  FC +CG+MGH    C+    Q   +   +G+WL+AG                           P P      E   H    +      
Subjt:  KWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPAPPVFGDWLRAG---------------------------PVPTMAKQGERSGHWRRKE------

Query:  ----QGQGKAP-----------------ESGVPSVGTESEKAPGI-----NLGGDVSSEPPTGIVE-PRVENEQNISENLELNEVLQVMGQVMGL-TWPT
               G  P                 + G P + T +   P I      LGGD  ++ P  +   P++E     S N   N+  ++  +   L +W  
Subjt:  ----QGQGKAP-----------------ESGVPSVGTESEKAPGI-----NLGGDVSSEPPTGIVE-PRVENEQNISENLELNEVLQVMGQVMGL-TWPT

Query:  GKHYGKAVDQGPGGTLHRTNADGAKVRGSSVSLIGFKRKCRAGGVGTSEQAEMGSSKRAKGSDKGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGI
             K  ++G   TL  +N             +G KR   A     +E +      +   +  GNP  +R L+   K   P ++FL ETK    R+E +
Subjt:  GKHYGKAVDQGPGGTLHRTNADGAKVRGSSVSLIGFKRKCRAGGVGTSEQAEMGSSKRAKGSDKGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGI

Query:  RRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGE
        R  +G++  F V  +G   GL LMWK   EV +  ++   ++  +   T N  W   G Y   +  +R +    L      +Q  ++  GDFN+I+   E
Subjt:  RRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGE

Query:  KEGAY
        K G +
Subjt:  KEGAY

A0A6P9DXY5 uncharacterized protein LOC1183441908.8e-4627.07Show/hide
Query:  ISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQ-CKSVKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPN
        ++E+E E + I  +A+ V  E     +IG++  +R    +    TM   W+  K   F EV  N+ +  F +    + V +  PWLF+  L  L  +   
Subjt:  ISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQ-CKSVKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPN

Query:  IKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVTKLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCG
         +          FWVQIH LP  C  +E  +++ + +G++   E +V +  +   KF+R++VE+ + K + +G ++   G + W+  +YE+LPR C KCG
Subjt:  IKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVTKLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCG

Query:  VMGHTAHWCS---AKHLQTSPAPPVFGDWLRAGP---VPTMAKQGERSG-HWRRKEQG------QGKAPESGVPSVGTESEKAPGINLGGDVSSEPPTGI
         + H    C       ++       FG WLRA P         QG   G  W RK +        G    SG   + +E     G   G DV        
Subjt:  VMGHTAHWCS---AKHLQTSPAPPVFGDWLRAGP---VPTMAKQGERSG-HWRRKEQG------QGKAPESGVPSVGTESEKAPGINLGGDVSSEPPTGI

Query:  VEPRVENEQNISENLELN-----EVLQVMGQVMGLTWPTGKHYGKAVDQGPGGTLHRTNADGAKVRGSSVSLIGFKRKCRAGGVGTSE---QAEMGSSKR
        V  R+   +   E  ++      +++Q+     G       +  + + +G    +    ++G     ++V L G K K RA G+G SE    + +G  + 
Subjt:  VEPRVENEQNISENLELN-----EVLQVMGQVMGLTWPTGKHYGKAVDQGPGGTLHRTNADGAKVRGSSVSLIGFKRKCRAGGVGTSE---QAEMGSSKR

Query:  AKGSDKGNPLTVRSLKEQ------------------------VKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIH
          G D+ + +  R++K+                         VK   P+++FL ETK ++N++E I+R+ G++GC VVEP GL  GL +MWK VDEVE+ 
Subjt:  AKGSDKGNPLTVRSLKEQ------------------------VKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIH

Query:  QYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEG
         Y+ + I   +R +  N +W   G Y   D  +R+     L+     ++  + V GDFN+IV   EK G
Subjt:  QYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEG

A0A6P9EQ08 uncharacterized protein LOC1090209823.1e-4327.34Show/hide
Query:  MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQCKS-VKFVEVDENVLLFCFADAASMLYVQNQGPWLF
        M E +      F ++E+E   + + S+   + +   K+CL+G ++  +  N+EAFR TM+  W+ +  V+F EV EN  L  F        V    PW F
Subjt:  MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQCKS-VKFVEVDENVLLFCFADAASMLYVQNQGPWLF

Query:  EESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVTKLLMKGFIVMTGGSKKWVWF
        +  LL L  +  N+          +FW+Q+H +P     +E  K + + +G+V   +   D R +   KF+RI+VE+ +TK LM+G  +   G K WV F
Subjt:  EESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVTKLLMKGFIVMTGGSKKWVWF

Query:  KYERLPRFCSKCGVMGHTAHWCSAKHLQTSP-APP--VFGDWLRAGPVPTMAKQGE----RSGHWRRKEQGQGKAPESGVPSVGTESEKAPGINLGGDVS
        KYERLP FC KCGV+ H    C     Q+SP A P   +G WLRA      AK+G+    R GH   ++             VG E      +NLG  VS
Subjt:  KYERLPRFCSKCGVMGHTAHWCSAKHLQTSP-APP--VFGDWLRAGPVPTMAKQGE----RSGHWRRKEQGQGKAPESGVPSVGTESEKAPGINLGGDVS

Query:  SE---PPTG----------IVEPRVENEQNISENLEL--NEVLQVMGQVMGLTWPTGKHYGKAVDQGPGGTLHRTN--ADGAKVRGSSVSLIGFKRKCRA
         E    P             ++P  E   ++ E+L     ++ +V   +M L   TG    K  D  P   + + +    G   R SS+         R 
Subjt:  SE---PPTG----------IVEPRVENEQNISENLEL--NEVLQVMGQVMGLTWPTGKHYGKAVDQGPGGTLHRTN--ADGAKVRGSSVSLIGFKRKCRA

Query:  GGVGTSEQAEMG------SSKRAKGSDKGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQY
          +  +  + +         +   G ++     +      +K   P+ +FL ETK K +++E ++R + ++  FV++ +G   GL  +WK   E E+H Y
Subjt:  GGVGTSEQAEMG------SSKRAKGSDKGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQY

Query:  ADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEG
        +   I  +++       W   G Y S     R+     L +   +    ++  GDFN+I+  G+K G
Subjt:  ADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEG

A0A7N2M6Y1 CCHC-type domain-containing protein1.5e-5028.07Show/hide
Query:  MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQCKS-VKFVEVDENVLLFCFADAASMLYVQNQGPWLF
        MD+ +   L    ++ +E E + +++ + S +LE+    L G ++++R  N  A + T+  +W+  S ++ VEV  ++L F F     + +V+  GPW F
Subjt:  MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQCKS-VKFVEVDENVLLFCFADAASMLYVQNQGPWLF

Query:  EESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQ--QRKFIRIKVELDVTKLLMK-GFIVMTGGSKKW
        E +LL+L +W   + +K        FWVQI GLPF+   ++  + +  KIG+V     EVD R++Q  Q KF+R++VE+ + K L + GF+      + W
Subjt:  EESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQ--QRKFIRIKVELDVTKLLMK-GFIVMTGGSKKW

Query:  VWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPAPPVFGDWLRAGPVPTMAKQGERSGHWRRKEQGQGKAPESGVPSVGTES----------EKAPGIN
        V F+YERLP FC KCG++GH    C    ++ +PA   +G+WL+AG      K G   G+++ K+  Q  A   G  S+G  S             P + 
Subjt:  VWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPAPPVFGDWLRAGPVPTMAKQGERSGHWRRKEQGQGKAPESGVPSVGTES----------EKAPGIN

Query:  LGGDVSSEPPTGIVEPRVENEQNI-------------SENLELNEVLQVMGQVMGLTWPTGKHYGKAVDQGPGGTLHRTNADGAKV------RGSSVSL-
        +GG      P   V    E  +               S   E  E ++    V      +G+      D  PGG     N  G +       RG + ++ 
Subjt:  LGGDVSSEPPTGIVEPRVENEQNI-------------SENLELNEVLQVMGQVMGLTWPTGKHYGKAVDQGPGGTLHRTNADGAKV------RGSSVSL-

Query:  IGFKRKCRAGGVG----TSEQAEMGSSKRAKGSD---------------------KGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEG
          FKR  R  G       +E+A+  S+KR   +D                      GNP +VR L+E V+   P ++FLSETK K  ++  ++ ++G   
Subjt:  IGFKRKCRAGGVG----TSEQAEMGSSKRAKGSD---------------------KGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEG

Query:  CFVVEPRGLKAGLCLMWKIVDEVEIHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEG
          +V   G   GL ++W     +E+  Y+ +FI+AV+       KW   G Y + +   R++    L S   +    ++  GDFN++V   EK G
Subjt:  CFVVEPRGLKAGLCLMWKIVDEVEIHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02103.1 unknown protein6.7e-1428Show/hide
Query:  VDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRI
        +D   + F FA+   +L VQ + PWLF    +   +W   +     L    D WVQI G+P     +E    +A  +G +   ++     +  Q  FIR+
Subjt:  VDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRI

Query:  KVELDVT-KLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWC
        +V   +T +L     ++   G    + F+YERL R CS C    H   +C
Subjt:  KVELDVT-KLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWC

AT2G17920.1 nucleic acid binding;zinc ion binding7.9e-1527.05Show/hide
Query:  KECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQCKS-VKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKT
        +E  ++ I +EA  +     +  +I   +  R  N +A    +  +W   + V    +D+  + F F     +L VQ + PWLF    +   +W P    
Subjt:  KECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQCKS-VKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKT

Query:  KAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVT-KLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVM
                D WVQ+ G+PF    +E A  +AQ+IG +   ++  D+ S  Q  +IR++V + +T  L     I    G    + F+YERL R CS C   
Subjt:  KAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVT-KLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVM

Query:  GHTAHWC
         H  ++C
Subjt:  GHTAHWC

AT3G31430.1 unknown protein9.4e-1628.04Show/hide
Query:  DKKWCLIGEVVTNRKFNREAFRKTMMNSW-QCKSVKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLP
        + ++ L G  V  R+ N  +   +M   W Q   V    ++     F F    S+  V  +GPW F + +++L +W P I     +P    FWVQI G+P
Subjt:  DKKWCLIGEVVTNRKFNREAFRKTMMNSW-QCKSVKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLP

Query:  FDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVT-KLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWC
        F    +   + + + +G+V D ++ V+   V +  F R+ +  D+T  L  +     T G    + F+YERL  FC  CG++ H    C
Subjt:  FDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVT-KLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWC

AT5G18636.1 unknown protein1.4e-1428.12Show/hide
Query:  VDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRI
        +D   + F FA+   ++ VQ + PWLF    +   +W   +     L    D WVQI G+P     +E    +AQ +G +   ++     +  Q  FIR+
Subjt:  VDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRI

Query:  KVELDVT-KLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPA
        +V   +T +L     I+   G    + F+YERL R CS C    H   +C  +    S A
Subjt:  KVELDVT-KLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPA

AT5G25200.1 unknown protein3.0e-1428.67Show/hide
Query:  VDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRI
        +D   + F FA+   ++ VQ + PWLF    +   +W   +     L    D WVQI G+P     +E    +AQ +G +   ++     +  Q  FIR+
Subjt:  VDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRI

Query:  KVELDVT-KLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWC
        +V   +T +L     I+   G    + F+YERL R CS C    H   +C
Subjt:  KVELDVT-KLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGATGAGCCTATTGAGGAAGTTCTGGCGAGGTTTAACATATCGGAAAAGGAATGCGAGACAGTAACAATTTCAAGCGAGGCCAAATCGGTTGATCTAGAGGACAA
GAAGTGGTGCCTCATTGGTGAGGTAGTGACAAACCGGAAGTTCAACAGAGAAGCCTTCCGGAAGACGATGATGAACAGTTGGCAATGTAAGTCGGTTAAATTCGTAGAAG
TTGATGAAAATGTGCTTTTGTTTTGCTTTGCGGATGCTGCGTCCATGTTATATGTACAAAACCAGGGGCCATGGCTCTTTGAGGAATCACTGTTGGTCCTTGCCAAGTGG
AGCCCAAATATAAAAACAAAAGCAGAGTTGCCAAAGGTGTGTGACTTTTGGGTACAGATACATGGGCTTCCTTTTGACTGTAAAGGGCAGGAGGCAGCAAAAGTAGTGGC
GCAAAAGATAGGTCGGGTTACAGATGAAGAGTGGGAGGTGGACTCTCGGAGCGTGCAACAACGAAAATTTATCAGAATTAAAGTGGAGCTAGATGTAACGAAATTGCTCA
TGAAAGGCTTTATAGTTATGACTGGGGGTTCCAAGAAGTGGGTATGGTTCAAGTATGAACGTCTGCCTAGGTTTTGCTCAAAGTGTGGAGTGATGGGGCATACAGCTCAC
TGGTGCAGTGCGAAACATCTCCAAACTTCCCCTGCGCCGCCAGTGTTTGGAGATTGGCTAAGGGCGGGTCCGGTACCGACAATGGCGAAACAAGGGGAAAGATCAGGTCA
TTGGCGTAGGAAGGAACAAGGTCAGGGGAAGGCACCGGAGAGCGGTGTGCCGTCGGTGGGAACAGAATCGGAGAAGGCCCCGGGCATAAATCTCGGAGGAGACGTATCTT
CGGAACCACCTACGGGAATCGTGGAACCGAGGGTGGAAAATGAGCAGAATATCTCGGAGAATTTGGAACTGAATGAGGTACTACAGGTAATGGGCCAAGTAATGGGGCTA
ACCTGGCCGACGGGCAAACATTATGGGAAAGCTGTTGATCAAGGCCCAGGGGGCACGCTACATAGGACCAATGCAGATGGGGCTAAGGTCAGAGGGTCATCAGTCTCACT
GATTGGGTTTAAAAGGAAGTGTAGAGCTGGCGGTGTAGGTACGAGTGAACAAGCTGAGATGGGTTCGTCGAAAAGAGCTAAAGGGAGTGATAAGGGGAACCCCCTGACAG
TTCGATCTCTTAAGGAGCAAGTGAAGCTCCATTCCCCAAATGTTATATTCTTGTCCGAAACAAAAAATAAGGCAAATAGGTTGGAGGGAATAAGGAGGCAATTGGGCTAC
GAGGGGTGTTTTGTGGTCGAGCCCCGTGGGCTCAAGGCAGGCCTTTGTCTGATGTGGAAGATTGTTGATGAGGTTGAGATCCATCAATATGCAGATTTCTTCATTGAGGC
TGTGATTCGGCCTAAGACTGGCAACCCAAAATGGCATTTCTTTGGAGTCTATGCAAGCACGGATGAGAAGGAAAGAGAGCAACAACTCAGTAACCTGACTTCTAGAATTG
GACTCTCACAGGATAATTACGTGGTAGGCGGGGACTTTAATGATATTGTTTGCAATGGGGAGAAGGAGGGGGCCTATACCGATCTCAAAGAAGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGGATGAGCCTATTGAGGAAGTTCTGGCGAGGTTTAACATATCGGAAAAGGAATGCGAGACAGTAACAATTTCAAGCGAGGCCAAATCGGTTGATCTAGAGGACAA
GAAGTGGTGCCTCATTGGTGAGGTAGTGACAAACCGGAAGTTCAACAGAGAAGCCTTCCGGAAGACGATGATGAACAGTTGGCAATGTAAGTCGGTTAAATTCGTAGAAG
TTGATGAAAATGTGCTTTTGTTTTGCTTTGCGGATGCTGCGTCCATGTTATATGTACAAAACCAGGGGCCATGGCTCTTTGAGGAATCACTGTTGGTCCTTGCCAAGTGG
AGCCCAAATATAAAAACAAAAGCAGAGTTGCCAAAGGTGTGTGACTTTTGGGTACAGATACATGGGCTTCCTTTTGACTGTAAAGGGCAGGAGGCAGCAAAAGTAGTGGC
GCAAAAGATAGGTCGGGTTACAGATGAAGAGTGGGAGGTGGACTCTCGGAGCGTGCAACAACGAAAATTTATCAGAATTAAAGTGGAGCTAGATGTAACGAAATTGCTCA
TGAAAGGCTTTATAGTTATGACTGGGGGTTCCAAGAAGTGGGTATGGTTCAAGTATGAACGTCTGCCTAGGTTTTGCTCAAAGTGTGGAGTGATGGGGCATACAGCTCAC
TGGTGCAGTGCGAAACATCTCCAAACTTCCCCTGCGCCGCCAGTGTTTGGAGATTGGCTAAGGGCGGGTCCGGTACCGACAATGGCGAAACAAGGGGAAAGATCAGGTCA
TTGGCGTAGGAAGGAACAAGGTCAGGGGAAGGCACCGGAGAGCGGTGTGCCGTCGGTGGGAACAGAATCGGAGAAGGCCCCGGGCATAAATCTCGGAGGAGACGTATCTT
CGGAACCACCTACGGGAATCGTGGAACCGAGGGTGGAAAATGAGCAGAATATCTCGGAGAATTTGGAACTGAATGAGGTACTACAGGTAATGGGCCAAGTAATGGGGCTA
ACCTGGCCGACGGGCAAACATTATGGGAAAGCTGTTGATCAAGGCCCAGGGGGCACGCTACATAGGACCAATGCAGATGGGGCTAAGGTCAGAGGGTCATCAGTCTCACT
GATTGGGTTTAAAAGGAAGTGTAGAGCTGGCGGTGTAGGTACGAGTGAACAAGCTGAGATGGGTTCGTCGAAAAGAGCTAAAGGGAGTGATAAGGGGAACCCCCTGACAG
TTCGATCTCTTAAGGAGCAAGTGAAGCTCCATTCCCCAAATGTTATATTCTTGTCCGAAACAAAAAATAAGGCAAATAGGTTGGAGGGAATAAGGAGGCAATTGGGCTAC
GAGGGGTGTTTTGTGGTCGAGCCCCGTGGGCTCAAGGCAGGCCTTTGTCTGATGTGGAAGATTGTTGATGAGGTTGAGATCCATCAATATGCAGATTTCTTCATTGAGGC
TGTGATTCGGCCTAAGACTGGCAACCCAAAATGGCATTTCTTTGGAGTCTATGCAAGCACGGATGAGAAGGAAAGAGAGCAACAACTCAGTAACCTGACTTCTAGAATTG
GACTCTCACAGGATAATTACGTGGTAGGCGGGGACTTTAATGATATTGTTTGCAATGGGGAGAAGGAGGGGGCCTATACCGATCTCAAAGAAGTTTAG
Protein sequenceShow/hide protein sequence
MMDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQCKSVKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKW
SPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVTKLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAH
WCSAKHLQTSPAPPVFGDWLRAGPVPTMAKQGERSGHWRRKEQGQGKAPESGVPSVGTESEKAPGINLGGDVSSEPPTGIVEPRVENEQNISENLELNEVLQVMGQVMGL
TWPTGKHYGKAVDQGPGGTLHRTNADGAKVRGSSVSLIGFKRKCRAGGVGTSEQAEMGSSKRAKGSDKGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGY
EGCFVVEPRGLKAGLCLMWKIVDEVEIHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEGAYTDLKEV