; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020254 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020254
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationscaffold1:28132307..28140648
RNA-Seq ExpressionSpg020254
SyntenySpg020254
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]4.6e-0926.15Show/hide
Query:  KKRGRPRQEKAMRDFREVIDWCAVRDPSFIGPEFTWCN-----NHV--------------NTSDHKPI--LAKWTEDIREDMI------PNFHRPKR---
        K  G  R    M +F+E I  C + D  F G +FTW N     N++              +T  + P   LA W  D    M          H  K    
Subjt:  KKRGRPRQEKAMRDFREVIDWCAVRDPSFIGPEFTWCN-----NHV--------------NTSDHKPI--LAKWTEDIREDMI------PNFHRPKR---

Query:  ---FEEAWARYEECHEIV-------------------------------------------QQSKL------------------ELRDKEKQLEILLEDD
           +E+ W+ YE C  IV                                           +Q++L                  E+R  E Q+  +L D+
Subjt:  ---FEEAWARYEECHEIV-------------------------------------------QQSKL------------------ELRDKEKQLEILLEDD

Query:  EIYWKQRAREEWLQWGDRNTKWFHLQANKRRKVNRIKGLFDENGQWTEGDLDLESIANQY
        E+YWKQR+R +WL+ GD+NTK+FH +A+ RR+ N+I G+ D+ G W +   D E I  ++
Subjt:  EIYWKQRAREEWLQWGDRNTKWFHLQANKRRKVNRIKGLFDENGQWTEGDLDLESIANQY

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]1.5e-4728.85Show/hide
Query:  PEEGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQNQQNLEASVSSHKEEEMI
        P   + + ++   + +N+ V++L + +G WN  L+R  FL DDA+ IL++P      +DS+ W+ D +G + V+S Y++ +  ++ +    SS  +    
Subjt:  PEEGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQNQQNLEASVSSHKEEEMI

Query:  WKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDL--LFLNDRKEWTTPDYFDSIWRGTR
        W+  WK  +P+K KI  W+ +   LPT + L +R +D+   C +C D +E+ TH+ W C +   +W        +  + ++D      P    S+WR   
Subjt:  WKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDL--LFLNDRKEWTTPDYFDSIWRGTR

Query:  DGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGLWKLNCDATWNDQHQRGGIGWI
           VD       +I  W++WT++N + H +       + T ID + +EF    R   + +H + +    S W  P  G +K+NCDA++  +  + G+G I
Subjt:  DGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGLWKLNCDATWNDQHQRGGIGWI

Query:  VRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIR-SILIHAPPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTSSRNINGFFHVNRKHNSLAHK
        +RD++G  + A    +     +  +EA A  EGI  +I I    + IES+A  +I+LL+ Q    TEL   I    AL +S N+  +  V R+ NS+AH 
Subjt:  VRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIR-SILIHAPPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTSSRNINGFFHVNRKHNSLAHK

Query:  LARKACLSGLFELWTE
        +A+ A       +W E
Subjt:  LARKACLSGLFELWTE

XP_023902041.1 uncharacterized protein LOC112013897 [Quercus suber]6.8e-4529.95Show/hide
Query:  KWKIGKGFNIEASNDPWIPEEGTCKPI-VPHPETQNLTVAQLINR-NGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLG
        +W++G G  I    D W+P   T K I  P P      V+ LI+R    W + +VR LFL  +A  IL+IPL+ N  ED IIW  + KG F VKSAY + 
Subjt:  KWKIGKGFNIEASNDPWIPEEGTCKPI-VPHPETQNLTVAQLINR-NGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLG

Query:  IQNQQNLEASVSSHKE-EEMIWKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSS-TDLLFL
        +    N+E   SS  +   ++W+  W   +P K++I  W++  + LPT  NL ++G++I  +C  C  + E+  H+F +C+  + +W  +  +  DL+ +
Subjt:  IQNQQNLEASVSSHKE-EEMIWKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSS-TDLLFL

Query:  NDRKEWTTPDYFDSIWRGTRDGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGLW
        N        D  D   +    G    S L    ++ W IW ++N I  ++L   P+ +     KYI EF   S  Y     SQ + +    W+ P  G++
Subjt:  NDRKEWTTPDYFDSIWRGTRDGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGLW

Query:  KLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIRSILIH-APPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTS
        K+N D   ++  +   +G I+RD  G    A     + Q+ +  +EA+A+  G+        P I +ES+AL ++  +N  +     L +  Q + +L S
Subjt:  KLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIRSILIH-APPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTS

Query:  SRNINGFFHVNRKHNSLAHKLARKACLSGLFELW
        S +     HV R++N  AH+LA+ A L    ++W
Subjt:  SRNINGFFHVNRKHNSLAHKLARKACLSGLFELW

XP_023902041.1 uncharacterized protein LOC112013897 [Quercus suber]6.0e-0944.16Show/hide
Query:  ELRDKEKQLEILLEDDEIYWKQRAREEWLQWGDRNTKWFHLQANKRRKVNRIKGLFDENGQWTEGDLDLESIANQYF
        E+    K++  LL+ +EI W+QR+R +WL  GDRNTK+FH +A+ RR+ N I G+ DENG W +    +  +A  YF
Subjt:  ELRDKEKQLEILLEDDEIYWKQRAREEWLQWGDRNTKWFHLQANKRRKVNRIKGLFDENGQWTEGDLDLESIANQYF

XP_023915006.1 uncharacterized protein LOC112026546 [Quercus suber]1.8e-4529.1Show/hide
Query:  KWKIGKGFNIEASNDPWIPEEGTCKPIVPHPETQNL-TVAQLINRNGT-WNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLG
        +W++G G  I+  +D W+P   T K I P  +  +   V+ LI+ +   W   +VR +FL  +A+ IL IPL+ N  ED ++W  + KG F VKSAY + 
Subjt:  KWKIGKGFNIEASNDPWIPEEGTCKPIVPHPETQNL-TVAQLINRNGT-WNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLG

Query:  IQ-NQQNLEASVSSHKEEEMIWKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFLN
              N +   SS  E   +WK  W  ++P K+KI  WR+  + LPT+ NL+ RG+  +  C LC    ET  H    C   +  W  +++   +L   
Subjt:  IQ-NQQNLEASVSSHKEEEMIWKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFLN

Query:  DRKEWTTPDYFDSIWRGTRDGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGLWK
         ++     D    I +         S L   + + W IW ++N   H+     P Q+     + + E+    +    FS+   V   L  W PP  G  K
Subjt:  DRKEWTTPDYFDSIWRGTRDGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGLWK

Query:  LNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIRSIL-IHAPPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTSS
        +N D   +D  +    G I+RD +G  + A  R +   +  +  EA+A+ +G+   L +    +  ES+AL II+ +N +     E+ + +Q +  L SS
Subjt:  LNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIRSIL-IHAPPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTSS

Query:  RNINGFFHVNRKHNSLAHKLARKACLSGLFELW
         +   F H  R+ N  AH+LAR A ++G+ ++W
Subjt:  RNINGFFHVNRKHNSLAHKLARKACLSGLFELW

XP_024190234.1 uncharacterized protein LOC112194221 [Rosa chinensis]3.0e-5328.93Show/hide
Query:  KWKIGKGFNIEASNDPWIPEEGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQ
        +W++G G NI    D W+P   T +P+ PH    NL V++L+   G WNE L+R  F   + + IL+IP+   + +DSI+W+    G + VKS   L  +
Subjt:  KWKIGKGFNIEASNDPWIPEEGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQ

Query:  NQQNLEASV----SSHKEEEMIWKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFL
         Q+  E SV    SS+KE   +W   WK ++ +K+K+  WR  K  LP  +NL +R +  + LC  C  + ET  H  W C  ++ +W          FL
Subjt:  NQQNLEASV----SSHKEEEMIWKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFL

Query:  ND-RKEWTTPDYFDSIWRGTRDGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGL
        N   K+W  P + D      +    +E +L    +ICW +W  +N   H+ +  +   +    +++++ F    R+  + +     +R    W PP+   
Subjt:  ND-RKEWTTPDYFDSIWRGTRDGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGL

Query:  WKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIRSILIHAPPIQ---IESNALQIIRLLNRQDQDETELLNFIQEVHA
         KLN DA  + + ++  +G +VRD EG+   AG + +     I  +EA+A+  G+  +L      Q   +ES++  +I  LN+ + D +     + ++  
Subjt:  WKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIRSILIHAPPIQ---IESNALQIIRLLNRQDQDETELLNFIQEVHA

Query:  LTSSRNINGFFHVNRKHNSLAHKLARKACLSGLFELWTE
        L S+     +  VNR+ N  AHK+A+ A ++    LW E
Subjt:  LTSSRNINGFFHVNRKHNSLAHKLARKACLSGLFELWTE

XP_024190234.1 uncharacterized protein LOC112194221 [Rosa chinensis]1.4e-0238.89Show/hide
Query:  EKQLEILLEDDEIYWKQRAREEWLQWGDRNTKWFHLQANKRRKVNRIKGLFDENGQWTEGDLDLESIANQYF
        +K LE  +  +E  W+QR+R EWL+ GD NT++FH  A  R   NR+ G+ D NG W +    ++     YF
Subjt:  EKQLEILLEDDEIYWKQRAREEWLQWGDRNTKWFHLQANKRRKVNRIKGLFDENGQWTEGDLDLESIANQYF

XP_024190234.1 uncharacterized protein LOC112194221 [Rosa chinensis]2.3e-4830.37Show/hide
Query:  KWKIGKGFNIEASNDPWIPEEGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQ
        +W+IG G  +    D WIP   T +PI P        VA LI+    W    +   F+++D   IL I L   + ED ++W+ D KG + VKS Y+L + 
Subjt:  KWKIGKGFNIEASNDPWIPEEGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQ

Query:  NQQNLEASVSSHKEEEMIWKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDL--LFLND
          QN      S      +WK+ W   LP K+KI  WR  K+ILPT  NL KR      +C  C+ + ET +H+  ECKA R +W       DL  L +  
Subjt:  NQQNLEASVSSHKEEEMIWKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDL--LFLND

Query:  RKEWTTPDYFDSIWRGTRDGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPL--STWIPPSTGLW
         K+    D+F +I          E++L   ++ CW IW+ +N    +  K D   L  + D  +  +   S+      H  K +R +    W PPS  + 
Subjt:  RKEWTTPDYFDSIWRGTRDGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPL--STWIPPSTGLW

Query:  KLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIR-SILIHAPPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTS
        KLN DA  + + Q+ G+G IVRD EG+ L  G +  + + +++  EA AI  G++ +  I +  + +ES+  +++ LLN      TE+   + +V   + 
Subjt:  KLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIR-SILIHAPPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTS

Query:  SRNINGFFHVNRKHNSLAHKLARKACLSGLFELWTEFF
              F  + R  N+ AH LA+ A  +   ++W   F
Subjt:  SRNINGFFHVNRKHNSLAHKLARKACLSGLFELWTEFF

TrEMBL top hitse value%identityAlignment
A0A5C7H0P0 Uncharacterized protein7.1e-4828.85Show/hide
Query:  PEEGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQNQQNLEASVSSHKEEEMI
        P   + + ++   + +N+ V++L + +G WN  L+R  FL DDA+ IL++P      +DS+ W+ D +G + V+S Y++ +  ++ +    SS  +    
Subjt:  PEEGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQNQQNLEASVSSHKEEEMI

Query:  WKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDL--LFLNDRKEWTTPDYFDSIWRGTR
        W+  WK  +P+K KI  W+ +   LPT + L +R +D+   C +C D +E+ TH+ W C +   +W        +  + ++D      P    S+WR   
Subjt:  WKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDL--LFLNDRKEWTTPDYFDSIWRGTR

Query:  DGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGLWKLNCDATWNDQHQRGGIGWI
           VD       +I  W++WT++N + H +       + T ID + +EF    R   + +H + +    S W  P  G +K+NCDA++  +  + G+G I
Subjt:  DGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGLWKLNCDATWNDQHQRGGIGWI

Query:  VRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIR-SILIHAPPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTSSRNINGFFHVNRKHNSLAHK
        +RD++G  + A    +     +  +EA A  EGI  +I I    + IES+A  +I+LL+ Q    TEL   I    AL +S N+  +  V R+ NS+AH 
Subjt:  VRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIR-SILIHAPPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTSSRNINGFFHVNRKHNSLAHK

Query:  LARKACLSGLFELWTE
        +A+ A       +W E
Subjt:  LARKACLSGLFELWTE

A0A803NML1 Uncharacterized protein2.8e-4428.6Show/hide
Query:  KWKIGKGFNIEASNDPWIPEEGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQ
        ++KIG G +++   DPWIP     KP V +  + +L V+  I  N  WN  L+   F + D ++IL+IPL      D ++W+  P G++ VK+ + L   
Subjt:  KWKIGKGFNIEASNDPWIPEEGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQ

Query:  NQQNLEASVSSHKEEEMIWKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFLNDRK
          ++  +S +S+K+ E  WK FW  +LP KI+I  W+++++ILPT   L KR +  +  C LC    E+  H  + CK  + +W       D    +   
Subjt:  NQQNLEASVSSHKEEEMIWKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFLNDRK

Query:  EWTTPDYFDSIWRGTRDGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPR-------SRTYRDFS--HSQKVERPLSTWIPP
             DY   +       I  +      L + W IWT +N + H      P  +     K+ ++F+         + T R  S   +   ++ +  W PP
Subjt:  EWTTPDYFDSIWRGTRDGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPR-------SRTYRDFS--HSQKVERPLSTWIPP

Query:  STGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIRSILIHAPPI-QIESNALQIIRLLNRQDQDETELLNFIQEV
            +KLN DA  N + ++ GIG I+RD +G  L A  + ++  +K + MEA A+   +  +     P+  IE++A ++   LNR + D +   + I ++
Subjt:  STGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIRSILIHAPPI-QIESNALQIIRLLNRQDQDETELLNFIQEV

Query:  HALTSSRNINGFFHVNRKHNSLAHKLARKA
          L SS       HV R  N  AH LA+ A
Subjt:  HALTSSRNINGFFHVNRKHNSLAHKLARKA

A0A803NML1 Uncharacterized protein7.1e-0831.11Show/hide
Query:  PFVPDRLESATAIGSLLGKVEHVDLEEEKDQSWGKSLRIKIQIEVESPLKCGIFLKSEKEGKHKWIAVTYEKLPDFCYGCGLLGHTIREC
        PF+      A A+G+++G+ + V  E+  ++ WG  LR+++ ++V  PLK G  +         W+   YE+LP++C  CG++GH   +C
Subjt:  PFVPDRLESATAIGSLLGKVEHVDLEEEKDQSWGKSLRIKIQIEVESPLKCGIFLKSEKEGKHKWIAVTYEKLPDFCYGCGLLGHTIREC

A0A803NML1 Uncharacterized protein6.7e-0632.17Show/hide
Query:  ELRDKEKQLEILLEDDEIYWKQRAREEWLQWGDRNTKWFHLQANKRRKVNRIKGLFDENGQWTEGDLDLESIANQYFYKWKIGKGFNIEASNDPWIPEEG
        EL+D E  L+ LL+ +E YW QR+R + LQ GD+NT +FH  A  R+  N IK L +  G       ++ ++ + Y+       G + ++ ND       
Subjt:  ELRDKEKQLEILLEDDEIYWKQRAREEWLQWGDRNTKWFHLQANKRRKVNRIKGLFDENGQWTEGDLDLESIANQYFYKWKIGKGFNIEASNDPWIPEEG

Query:  TCKPIVPHPETQNLT
        T    + H   Q+LT
Subjt:  TCKPIVPHPETQNLT

A0A803P5M6 Uncharacterized protein1.9e-4821.19Show/hide
Query:  PFVPDRLESATAIGSLLGKVEHVDLEEEKDQSWGKSLRIKIQIEVESPLKCGIFLKSEKEGKHKWIAVTYEKLPDFCYGCGLLGHTIRECEGNCGSIDE-
        PF+      A A+G+++G+ + V  E+  ++ WG  LR+++ ++V  PLK G  +         W+   YE+LP++C  CG++GH   +C      +D  
Subjt:  PFVPDRLESATAIGSLLGKVEHVDLEEEKDQSWGKSLRIKIQIEVESPLKCGIFLKSEKEGKHKWIAVTYEKLPDFCYGCGLLGHTIRECEGNCGSIDE-

Query:  ---ELPYGPSLRELT--------------------ILKMRETETMNNPYPSFFGRGMGRGRECARGSWRNLPTEEVENTHAGFQNHNHEEEDLAPPENST
            L Y P ++  T                    ++     +++ +  P    RG    R    G        E  NT+  F++ N+  +D+    +  
Subjt:  ---ELPYGPSLRELT--------------------ILKMRETETMNNPYPSFFGRGMGRGRECARGSWRNLPTEEVENTHAGFQNHNHEEEDLAPPENST

Query:  P---IRPDS---GKPTIPTTARRVPVVL---SKKESEVNLDDVRNGEISGETIKENPNGQNLSKSINDKDSDLLLMDIDQKWVGPQEGRK----------
        P   I+P S      T+ ++A+ +   +   + K+S  N  D+ +  +        PN  N   +   + S   L DI  K   P               
Subjt:  P---IRPDS---GKPTIPTTARRVPVVL---SKKESEVNLDDVRNGEISGETIKENPNGQNLSKSINDKDSDLLLMDIDQKWVGPQEGRK----------

Query:  -TSCKMETKADSNGPNFKPTEQ-KSMEESQSVNDC-----NSESNGSKNKNVDTYGTSKSQENRESGKFRTWRRLSRMQDSEEPEKKRGRP----RQEKA
         +   M    ++  PN +   Q + +   Q++  C     N  ++      VD +  S    +    K  + R LS    S       G      +    
Subjt:  -TSCKMETKADSNGPNFKPTEQ-KSMEESQSVNDC-----NSESNGSKNKNVDTYGTSKSQENRESGKFRTWRRLSRMQDSEEPEKKRGRP----RQEKA

Query:  MRDFREVIDWCAVRDP-SFIGPEFTWCNNHVNTSDHKPILAKWTEDIREDMIPNFHRPKRFEEAW--------------------------ARYEECHEI
            +E +DWC + D    I    T  +    +SDH+ I             P      RFE+ W                          +  + C + 
Subjt:  MRDFREVIDWCAVRDP-SFIGPEFTWCNNHVNTSDHKPILAKWTEDIREDMIPNFHRPKRFEEAW--------------------------ARYEECHEI

Query:  VQQ--------------------SKL------------ELRDKEKQLEILLEDDEIYWKQRAREEWLQWGDRNTKWFHLQANKRRKVNRIKGLFDENGQW
        +QQ                    S+L            EL+D E  L+ LLE +E YW QR+R +WLQ GD+NT +FH  A  R+  N IK L +  G  
Subjt:  VQQ--------------------SKL------------ELRDKEKQLEILLEDDEIYWKQRAREEWLQWGDRNTKWFHLQANKRRKVNRIKGLFDENGQW

Query:  TEGDLDLESIANQYFYK-----------------------------------------------------------------------------------
             ++ ++   Y+                                                                                     
Subjt:  TEGDLDLESIANQYFYK-----------------------------------------------------------------------------------

Query:  ----------------WKI------------GK--------------------------------GFNIEASNDPWIPEEGTCKPIVPHPETQNLTVAQL
                        WK+            GK                                G ++++  DPWIP     KP V +  + +L V+  
Subjt:  ----------------WKI------------GK--------------------------------GFNIEASNDPWIPEEGTCKPIVPHPETQNLTVAQL

Query:  INRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQNQQNLEASVSSHKEEEMIWKVFWKAQLPSKIKICGWRIYKD
        I  N  WN  L+   F + D ++IL+IPL      D ++W+  P G++ VK+ + L     ++  +S +S+K+ E  WK FW  +LP KI+I  W+++++
Subjt:  INRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQNQQNLEASVSSHKEEEMIWKVFWKAQLPSKIKICGWRIYKD

Query:  ILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFLNDRKEWTTPDYFDSIWRGTRDGIVDESKLTKSLIICWQIWTHKNY
        ILPT   L KR +  +  C LC    E+  H  + CK  + +W       D    +        DY   +       I  +      L + W IWT +N 
Subjt:  ILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFLNDRKEWTTPDYFDSIWRGTRDGIVDESKLTKSLIICWQIWTHKNY

Query:  ISHQNLKPDPDQLKTQIDKYIDEFH---------PRSRTYRDFSHSQKVERPLSTWIPPSTGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCI
        + H      P  +     K+ ++F+           + ++     +   ++ +  W PP    +KLN DA  N + ++ GIG I+RD +G  L A  + +
Subjt:  ISHQNLKPDPDQLKTQIDKYIDEFH---------PRSRTYRDFSHSQKVERPLSTWIPPSTGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCI

Query:  KRQWKINWMEAMAISEGIRSILIHAPPI-QIESNALQIIRLLNRQDQDETELLNFIQEVHALTSSRNINGFFHVNRKHNSLAHKLARKA
        +  +K + MEA A+   +  +     P+  IE++A ++   LNR + D +   + I ++  L SS       HV R  N  AH LA+ A
Subjt:  KRQWKINWMEAMAISEGIRSILIHAPPI-QIESNALQIIRLLNRQDQDETELLNFIQEVHALTSSRNINGFFHVNRKHNSLAHKLARKA

A0A803PJK4 Uncharacterized protein1.5e-4528.6Show/hide
Query:  QYFYKWKIGKGFNIEASNDPWIPEEGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYR
        Q+  +WK+G G  I+ + DPW+P   + KP++      NLTV++LI+ +  W+   ++  FL+ D +KIL+IPL+    +D IIW+ +  G++ VKS Y 
Subjt:  QYFYKWKIGKGFNIEASNDPWIPEEGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYR

Query:  LGIQNQQNLEASVSSHKEEEMIWKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFL
        L  +       SVSSH  +  +WK FWK  +PSK++I  W+  ++ LP  + L K  +  + +C LCR   E+  H  + CK  + +W     S D +  
Subjt:  LGIQNQQNLEASVSSHKEEEMIWKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFL

Query:  NDRKEWTTPDYF---DSIWRGTRDGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDP-DQLKTQIDKYIDEFHPRSRT----YRDFSHSQKVERPLSTW
           +  T  D F      W         + +L +   I W IWT +N   H   KP P + L      Y+ E+    +      R+  H          W
Subjt:  NDRKEWTTPDYF---DSIWRGTRDGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDP-DQLKTQIDKYIDEFHPRSRT----YRDFSHSQKVERPLSTW

Query:  IPPSTGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIRSILIHAPPIQ-IESNALQIIRLLNRQDQDETELLNFI
        + P +G  KLN DA  ++ ++  G G I+RD +G+ + A  +     +K   MEA+A+   ++ +     PI  IE+++L +++ L  +  + ++  + +
Subjt:  IPPSTGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIRSILIHAPPIQ-IESNALQIIRLLNRQDQDETELLNFI

Query:  QEVHALTSSRNINGFFHVNRKHNSLAHKLARKACLSGLFELWTE
         ++  L S+       HV R  N+ AH LA+ A       +W E
Subjt:  QEVHALTSSRNINGFFHVNRKHNSLAHKLARKACLSGLFELWTE

A0A803PJK4 Uncharacterized protein2.6e-1034.55Show/hide
Query:  PFVPDRLESATAIGSLLGKVEHVDLEEEK-DQSWGKSLRIKIQIEVESPLKCGIFLKSEKEGKHKWIAVTYEKLPDFCYGCGLLGHTIRECEGNCGSIDE
        PF+    +    IG L+G   ++D+ E+  ++ WG  LRI+++I+V  PL  G  +         W+   YE+LPDFCY CG++GH   +C+     ID+
Subjt:  PFVPDRLESATAIGSLLGKVEHVDLEEEK-DQSWGKSLRIKIQIEVESPLKCGIFLKSEKEGKHKWIAVTYEKLPDFCYGCGLLGHTIRECEGNCGSIDE

Query:  ----ELPYGP
            +L YGP
Subjt:  ----ELPYGP

A0A803PJK4 Uncharacterized protein1.1e-0541.56Show/hide
Query:  ELRDKEKQLEILLEDDEIYWKQRAREEWLQWGDRNTKWFHLQANKRRKVNRIKGLFDENGQWTEGDLDLESIANQYF
        +L + EK L+ LL  +E YW QR+R  WL+ GD NTK+FH +A+ R+  N+I  L D NG        + +I  +YF
Subjt:  ELRDKEKQLEILLEDDEIYWKQRAREEWLQWGDRNTKWFHLQANKRRKVNRIKGLFDENGQWTEGDLDLESIANQYF

A0A803PJK4 Uncharacterized protein1.2e-4429.86Show/hide
Query:  KWKIGKGFNIEASNDPWIPE-EGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGI
        +W+IG G       D WIP   GT     P  E Q   + +LIN +G W    ++  F E+D   +  IP++    ED++ W   P G ++VKS YR+G 
Subjt:  KWKIGKGFNIEASNDPWIPE-EGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGI

Query:  QNQQNLEASVSSHKEE-EMIWKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFLND
          + NL  + SS+ E+    WK+ W   LP ++K+ GWR+  + LP   NL  RGMD+NL C LC  + ET TH  W C   + +W        + + + 
Subjt:  QNQQNLEASVSSHKEE-EMIWKVFWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFLND

Query:  RKEWTTPDYFDSIWRGTRDGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYR------DFSHSQKVERPLSTWIPPS
           +     FD +   T    + +S+  +++ I W IW ++N   + N  P  + +  Q+  +I   +P SR  +      D  H Q     L  WI P 
Subjt:  RKEWTTPDYFDSIWRGTRDGIVDESKLTKSLIICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYR------DFSHSQKVERPLSTWIPPS

Query:  TGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIRSILIHA-PPIQIESNALQIIRLLNRQDQDETELLNFIQEVH
        TG   +NCDA  N+     G G+I RD+EG  L+AG    +    +   EA AI E ++     A    +I+S+  +I+  +  +D   + +   + ++ 
Subjt:  TGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIRSILIHA-PPIQIESNALQIIRLLNRQDQDETELLNFIQEVH

Query:  ALTSSRNINGFFHVNRKHNSLAHKLARKACLSGLFELWTEFF
           +S N     HV+R +N  AH LARK   +    ++T  F
Subjt:  ALTSSRNINGFFHVNRKHNSLAHKLARKACLSGLFELWTEFF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G33160.1 glycoside hydrolase family 28 protein / polygalacturonase (pectinase) family protein3.6e-0424Show/hide
Query:  LIICWQIWTHKN--YISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLI
        L I W++W  +N  Y   +++  +      Q+D  + E++         SH   +   +  W  P+ G  K N D ++ + +  G  GWIVRD  GR   
Subjt:  LIICWQIWTHKN--YISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLI

Query:  AG---FRCIKRQWKINWMEAMAI-----SEGIRSILIHAPPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTSSRNINGFFHVNRKHNSLAHKLAR
        AG     CI    +  +   +       S+G R I         E +  ++  L+N          N+I+++    S      F   NR++N  A  LAR
Subjt:  AG---FRCIKRQWKINWMEAMAI-----SEGIRSILIHAPPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTSSRNINGFFHVNRKHNSLAHKLAR

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.7e-1425.18Show/hide
Query:  CFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFLNDRKEWTTPDYFDSIWRGTRDGIVDESKLTKSLI--ICWQIWTHKNYISHQNLKPDPDQLKTQ
        C  C D  ET  HL ++C   R +W     +   +      EWT   Y +  W    +  + +     +L+  + W++W  +N +  +  + D  ++  +
Subjt:  CFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFLNDRKEWTTPDYFDSIWRGTRDGIVDESKLTKSLI--ICWQIWTHKNYISHQNLKPDPDQLKTQ

Query:  IDKYIDEFHPRSRTYRDFSHSQKVERPLST-WIPPSTGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIRSI-LI
          +  +E+  R R     +   +VER LS  W  P     K N DATW  ++ R GIGWI+R+  G  L  G R + R   +   E  A+   + ++   
Subjt:  IDKYIDEFHPRSRTYRDFSHSQKVERPLST-WIPPSTGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWMEAMAISEGIRSI-LI

Query:  HAPPIQIESNALQIIRLLNRQD---------QDETELLNFIQEVHALTSSRNINGFFHVNRKHNSLAHKLARKA
        +   I  ES+A  ++ LLN  D         +D  +LL+  +EV           F    R  N +A ++AR++
Subjt:  HAPPIQIESNALQIIRLLNRQD---------QDETELLNFIQEVHALTSSRNINGFFHVNRKHNSLAHKLARKA

AT4G29090.1 Ribonuclease H-like superfamily protein2.9e-2523.79Show/hide
Query:  LTVAQLINRNG-TWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQ--NQQNLEASVSSHKEEEMIWKVFWKAQLPSKIK
        L V+ LI+ +G  W + ++  LF E +   I  +     +  DS  W+    G + VKS Y +  Q  N+++    VS      +  K+ WK+Q   KI+
Subjt:  LTVAQLINRNG-TWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQ--NQQNLEASVSSHKEEEMIWKVFWKAQLPSKIK

Query:  ICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFLNDRKEWTTPDYFDSIWRGTRDGIVDESKLTKSLI--
           W+   + LP    L  R +     C  C    ET  HL ++C   R  W     +   + +    EW    Y +  W         + +    L+  
Subjt:  ICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFLNDRKEWTTPDYFDSIWRGTRDGIVDESKLTKSLI--

Query:  ICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFR
        + W++W ++N +  +  + +  ++  + +  ++E+  R+      +  Q        W PP     K N DATWN  ++R GIGW++R+ +G     G R
Subjt:  ICWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFR

Query:  CIKRQWKINWMEAMAISEGIRSI-LIHAPPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTSSRNINGFFHVNRKHNSLAHKLARKA
         + +   +   E  A+   + S+       +  ES++  +I +LN  D+    L   IQ++  L S      F  + R+ N+LA ++AR++
Subjt:  CIKRQWKINWMEAMAISEGIRSI-LIHAPPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTSSRNINGFFHVNRKHNSLAHKLARKA

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.5e-0525.83Show/hide
Query:  NRNGTWNEHLVRG----LFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQNQQNLEASVSSHKEEEMIWKVFWKAQLPSKIKICGWRI
        +RNG W     R     LFL   A  +  +P + ++ +DS +W  +  G +L   + R       +    +  H       KV W  +   +  +  W  
Subjt:  NRNGTWNEHLVRG----LFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQNQQNLEASVSSHKEEEMIWKVFWKAQLPSKIKICGWRI

Query:  YKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYS
        + + LPT   L   GM+I     LC +  ET  HLF+EC  +  +W  + S
Subjt:  YKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTTTTCAACGATCCGTGGATTCCAAAAGAAACAACTTTCAAACCGATACCTATTCGACTAGGGGTTAATCGAGGAGCAATTAAAGTATCAGACTTTATCACTCC
ATCTATTGGATGGGATTTGGGTAAGCTCTGTAGTGAGGTGATTGTGGAAGATGCGAATTTAATTGCCACTATTCCTATTAGTCTTAGGAATGAGGAAAACAGATGGATTT
GGCATTATGCTTCAAATGGGGAGTACTCGGATAAAAGCAGGAATAGCTGGTCCAGGGACGTGAAGATTATGGAATGGGAGCAGCGATGCAAATGGATACACGGGTATTGG
ACGGAAACAAGGCCAAAAGTAAGATGTGAGCGGGATGATGGTATCAGTCAAGTGGGGGATCCAGTTCCTTCGAGAGATGGTTGCATATTGTTCACTGATGCAGCGATTAA
CCCGCAAAATAACGGGACGGGTTACGACGATCCTACATGGTCTACGTTTACTGCAACGTTTGCAGTATATGGAAGCTCAAGTATTTTCAGACTCTTCAAACGCGATCAGC
ATGATAACAGGGGATTTATATCCTTCTTCAGAAGTTTATCATTGGATCATTCAAATCTGAGATATGAGTACTCATGGGTTCGTCAACTCACTGCAAATGTAAGATGTATT
GTGACTGTTTGTGCTAATGAAATGGAAGCGGTGGACATTGGCAGCCTTGGTGCTGATCGGAGCAGTATATCTGCTGGTGGTGGGGCTTTCCCAAGTAGTTGTTGCAAACC
TTTTGTTCCTGATCGTCTGGAATCAGCTACGGCTATAGGCAGCCTCCTAGGCAAAGTCGAACATGTGGATTTAGAAGAAGAAAAAGATCAAAGCTGGGGAAAATCCTTAA
GGATCAAGATTCAGATTGAAGTAGAAAGTCCCTTGAAATGTGGAATCTTCTTGAAATCTGAAAAAGAAGGCAAGCATAAATGGATTGCTGTAACGTATGAGAAATTACCA
GATTTTTGCTATGGTTGTGGTTTATTGGGGCACACCATTAGAGAATGTGAAGGAAATTGTGGCTCAATTGATGAGGAACTACCATACGGACCGAGCTTGCGGGAACTTAC
AATACTAAAAATGCGTGAAACAGAGACAATGAATAATCCATATCCCAGTTTTTTCGGTAGAGGAATGGGTAGAGGAAGGGAATGTGCCAGAGGTAGTTGGAGAAACTTGC
CAACAGAGGAGGTGGAAAATACCCATGCTGGTTTTCAGAACCATAATCATGAGGAAGAAGACCTTGCACCGCCAGAAAACTCCACACCAATCCGACCGGATTCCGGCAAG
CCGACAATCCCCACGACGGCGAGAAGGGTACCGGTCGTTTTGTCCAAAAAGGAAAGTGAGGTAAATCTGGATGACGTTAGAAATGGAGAAATTAGCGGTGAGACAATAAA
AGAAAATCCCAACGGTCAAAATCTATCCAAATCAATAAATGATAAAGATTCTGATTTATTGCTAATGGATATTGACCAAAAGTGGGTTGGGCCACAAGAAGGGAGAAAGA
CAAGCTGCAAAATGGAAACAAAGGCTGACTCAAATGGGCCTAATTTCAAACCAACGGAACAGAAAAGTATGGAAGAATCTCAAAGCGTAAATGACTGCAATTCAGAATCT
AATGGGAGTAAGAACAAAAATGTGGATACATATGGAACTTCGAAATCTCAAGAGAACAGAGAGAGTGGGAAATTCAGAACTTGGCGTAGGTTATCACGTATGCAGGATTC
AGAGGAGCCAGAAAAAAAGAGAGGTAGACCTCGGCAGGAAAAGGCGATGCGAGATTTTCGTGAGGTTATTGATTGGTGCGCTGTTCGAGATCCGAGTTTCATTGGTCCTG
AGTTCACGTGGTGTAATAACCATGTGAACACGTCAGATCATAAACCGATATTAGCAAAATGGACCGAAGACATTAGGGAGGATATGATACCTAATTTTCATCGTCCGAAA
CGATTTGAAGAAGCTTGGGCTCGTTATGAAGAGTGCCATGAGATTGTGCAGCAAAGTAAGCTTGAGCTGAGGGATAAAGAAAAGCAGTTAGAGATTCTGTTAGAAGACGA
TGAAATTTACTGGAAACAAAGAGCTAGGGAAGAGTGGCTTCAATGGGGAGATCGAAACACGAAATGGTTCCACTTACAGGCGAATAAAAGGAGGAAAGTGAATAGAATCA
AAGGTTTATTTGATGAAAATGGTCAATGGACAGAGGGTGATTTAGATCTTGAATCTATTGCTAATCAGTATTTTTATAAGTGGAAAATAGGAAAAGGTTTCAACATTGAG
GCATCCAATGATCCATGGATCCCAGAGGAGGGTACTTGTAAACCAATTGTCCCTCATCCTGAAACACAAAACCTCACAGTGGCTCAGCTAATCAATAGAAATGGAACTTG
GAACGAACATCTGGTTCGAGGTCTATTCTTGGAAGATGACGCCAATAAGATTCTGAATATCCCTCTTAATCCGAATCAATCGGAGGATAGCATTATTTGGAATCCTGATC
CAAAAGGTTTATTCTTGGTAAAGAGTGCATATAGGCTAGGAATCCAGAATCAGCAAAATCTTGAAGCCTCGGTTTCAAGTCATAAGGAGGAGGAGATGATCTGGAAAGTC
TTTTGGAAAGCTCAATTGCCATCTAAAATCAAGATTTGTGGGTGGAGAATCTATAAAGATATTCTCCCAACACTATCCAATCTGAACAAACGAGGGATGGACATAAATCT
GTTATGCTTCCTGTGCAGAGACAAAACAGAGACGGCTACTCACCTCTTTTGGGAATGTAAAGCTACTAGAGGTTTGTGGCATTCTTATTACTCATCTACTGATCTTTTGT
TTTTGAATGACAGGAAGGAGTGGACTACACCAGATTATTTTGATAGCATTTGGAGGGGAACAAGAGATGGGATCGTGGACGAGAGCAAGCTGACCAAGAGCCTTATCATT
TGTTGGCAAATTTGGACGCACAAAAATTACATCTCACATCAAAATCTGAAGCCAGATCCAGATCAATTAAAGACACAAATTGACAAATATATCGATGAATTCCATCCCAG
AAGCAGAACTTACCGTGATTTTTCTCACTCCCAGAAAGTCGAAAGGCCGTTATCAACGTGGATTCCGCCATCGACGGGCCTATGGAAGCTTAACTGTGACGCGACGTGGA
ACGATCAACATCAACGAGGAGGCATTGGCTGGATTGTCAGAGACTGGGAAGGAAGACCTCTGATCGCTGGCTTTCGTTGCATTAAAAGGCAGTGGAAGATCAATTGGATG
GAGGCTATGGCGATCTCCGAAGGAATTCGTAGTATCCTGATACACGCTCCTCCAATTCAAATTGAAAGCAACGCCCTTCAGATTATCCGTCTTCTCAATCGGCAAGACCA
GGATGAAACTGAGCTGCTAAACTTCATTCAAGAGGTTCATGCCCTAACTTCTTCGAGAAACATTAATGGTTTTTTCCATGTGAACAGGAAGCACAATAGTTTGGCCCATA
AATTGGCACGAAAAGCTTGTTTAAGTGGACTGTTTGAGTTGTGGACTGAGTTTTTCCTACTTGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTTTTCAACGATCCGTGGATTCCAAAAGAAACAACTTTCAAACCGATACCTATTCGACTAGGGGTTAATCGAGGAGCAATTAAAGTATCAGACTTTATCACTCC
ATCTATTGGATGGGATTTGGGTAAGCTCTGTAGTGAGGTGATTGTGGAAGATGCGAATTTAATTGCCACTATTCCTATTAGTCTTAGGAATGAGGAAAACAGATGGATTT
GGCATTATGCTTCAAATGGGGAGTACTCGGATAAAAGCAGGAATAGCTGGTCCAGGGACGTGAAGATTATGGAATGGGAGCAGCGATGCAAATGGATACACGGGTATTGG
ACGGAAACAAGGCCAAAAGTAAGATGTGAGCGGGATGATGGTATCAGTCAAGTGGGGGATCCAGTTCCTTCGAGAGATGGTTGCATATTGTTCACTGATGCAGCGATTAA
CCCGCAAAATAACGGGACGGGTTACGACGATCCTACATGGTCTACGTTTACTGCAACGTTTGCAGTATATGGAAGCTCAAGTATTTTCAGACTCTTCAAACGCGATCAGC
ATGATAACAGGGGATTTATATCCTTCTTCAGAAGTTTATCATTGGATCATTCAAATCTGAGATATGAGTACTCATGGGTTCGTCAACTCACTGCAAATGTAAGATGTATT
GTGACTGTTTGTGCTAATGAAATGGAAGCGGTGGACATTGGCAGCCTTGGTGCTGATCGGAGCAGTATATCTGCTGGTGGTGGGGCTTTCCCAAGTAGTTGTTGCAAACC
TTTTGTTCCTGATCGTCTGGAATCAGCTACGGCTATAGGCAGCCTCCTAGGCAAAGTCGAACATGTGGATTTAGAAGAAGAAAAAGATCAAAGCTGGGGAAAATCCTTAA
GGATCAAGATTCAGATTGAAGTAGAAAGTCCCTTGAAATGTGGAATCTTCTTGAAATCTGAAAAAGAAGGCAAGCATAAATGGATTGCTGTAACGTATGAGAAATTACCA
GATTTTTGCTATGGTTGTGGTTTATTGGGGCACACCATTAGAGAATGTGAAGGAAATTGTGGCTCAATTGATGAGGAACTACCATACGGACCGAGCTTGCGGGAACTTAC
AATACTAAAAATGCGTGAAACAGAGACAATGAATAATCCATATCCCAGTTTTTTCGGTAGAGGAATGGGTAGAGGAAGGGAATGTGCCAGAGGTAGTTGGAGAAACTTGC
CAACAGAGGAGGTGGAAAATACCCATGCTGGTTTTCAGAACCATAATCATGAGGAAGAAGACCTTGCACCGCCAGAAAACTCCACACCAATCCGACCGGATTCCGGCAAG
CCGACAATCCCCACGACGGCGAGAAGGGTACCGGTCGTTTTGTCCAAAAAGGAAAGTGAGGTAAATCTGGATGACGTTAGAAATGGAGAAATTAGCGGTGAGACAATAAA
AGAAAATCCCAACGGTCAAAATCTATCCAAATCAATAAATGATAAAGATTCTGATTTATTGCTAATGGATATTGACCAAAAGTGGGTTGGGCCACAAGAAGGGAGAAAGA
CAAGCTGCAAAATGGAAACAAAGGCTGACTCAAATGGGCCTAATTTCAAACCAACGGAACAGAAAAGTATGGAAGAATCTCAAAGCGTAAATGACTGCAATTCAGAATCT
AATGGGAGTAAGAACAAAAATGTGGATACATATGGAACTTCGAAATCTCAAGAGAACAGAGAGAGTGGGAAATTCAGAACTTGGCGTAGGTTATCACGTATGCAGGATTC
AGAGGAGCCAGAAAAAAAGAGAGGTAGACCTCGGCAGGAAAAGGCGATGCGAGATTTTCGTGAGGTTATTGATTGGTGCGCTGTTCGAGATCCGAGTTTCATTGGTCCTG
AGTTCACGTGGTGTAATAACCATGTGAACACGTCAGATCATAAACCGATATTAGCAAAATGGACCGAAGACATTAGGGAGGATATGATACCTAATTTTCATCGTCCGAAA
CGATTTGAAGAAGCTTGGGCTCGTTATGAAGAGTGCCATGAGATTGTGCAGCAAAGTAAGCTTGAGCTGAGGGATAAAGAAAAGCAGTTAGAGATTCTGTTAGAAGACGA
TGAAATTTACTGGAAACAAAGAGCTAGGGAAGAGTGGCTTCAATGGGGAGATCGAAACACGAAATGGTTCCACTTACAGGCGAATAAAAGGAGGAAAGTGAATAGAATCA
AAGGTTTATTTGATGAAAATGGTCAATGGACAGAGGGTGATTTAGATCTTGAATCTATTGCTAATCAGTATTTTTATAAGTGGAAAATAGGAAAAGGTTTCAACATTGAG
GCATCCAATGATCCATGGATCCCAGAGGAGGGTACTTGTAAACCAATTGTCCCTCATCCTGAAACACAAAACCTCACAGTGGCTCAGCTAATCAATAGAAATGGAACTTG
GAACGAACATCTGGTTCGAGGTCTATTCTTGGAAGATGACGCCAATAAGATTCTGAATATCCCTCTTAATCCGAATCAATCGGAGGATAGCATTATTTGGAATCCTGATC
CAAAAGGTTTATTCTTGGTAAAGAGTGCATATAGGCTAGGAATCCAGAATCAGCAAAATCTTGAAGCCTCGGTTTCAAGTCATAAGGAGGAGGAGATGATCTGGAAAGTC
TTTTGGAAAGCTCAATTGCCATCTAAAATCAAGATTTGTGGGTGGAGAATCTATAAAGATATTCTCCCAACACTATCCAATCTGAACAAACGAGGGATGGACATAAATCT
GTTATGCTTCCTGTGCAGAGACAAAACAGAGACGGCTACTCACCTCTTTTGGGAATGTAAAGCTACTAGAGGTTTGTGGCATTCTTATTACTCATCTACTGATCTTTTGT
TTTTGAATGACAGGAAGGAGTGGACTACACCAGATTATTTTGATAGCATTTGGAGGGGAACAAGAGATGGGATCGTGGACGAGAGCAAGCTGACCAAGAGCCTTATCATT
TGTTGGCAAATTTGGACGCACAAAAATTACATCTCACATCAAAATCTGAAGCCAGATCCAGATCAATTAAAGACACAAATTGACAAATATATCGATGAATTCCATCCCAG
AAGCAGAACTTACCGTGATTTTTCTCACTCCCAGAAAGTCGAAAGGCCGTTATCAACGTGGATTCCGCCATCGACGGGCCTATGGAAGCTTAACTGTGACGCGACGTGGA
ACGATCAACATCAACGAGGAGGCATTGGCTGGATTGTCAGAGACTGGGAAGGAAGACCTCTGATCGCTGGCTTTCGTTGCATTAAAAGGCAGTGGAAGATCAATTGGATG
GAGGCTATGGCGATCTCCGAAGGAATTCGTAGTATCCTGATACACGCTCCTCCAATTCAAATTGAAAGCAACGCCCTTCAGATTATCCGTCTTCTCAATCGGCAAGACCA
GGATGAAACTGAGCTGCTAAACTTCATTCAAGAGGTTCATGCCCTAACTTCTTCGAGAAACATTAATGGTTTTTTCCATGTGAACAGGAAGCACAATAGTTTGGCCCATA
AATTGGCACGAAAAGCTTGTTTAAGTGGACTGTTTGAGTTGTGGACTGAGTTTTTCCTACTTGGTTAA
Protein sequenceShow/hide protein sequence
MDFFNDPWIPKETTFKPIPIRLGVNRGAIKVSDFITPSIGWDLGKLCSEVIVEDANLIATIPISLRNEENRWIWHYASNGEYSDKSRNSWSRDVKIMEWEQRCKWIHGYW
TETRPKVRCERDDGISQVGDPVPSRDGCILFTDAAINPQNNGTGYDDPTWSTFTATFAVYGSSSIFRLFKRDQHDNRGFISFFRSLSLDHSNLRYEYSWVRQLTANVRCI
VTVCANEMEAVDIGSLGADRSSISAGGGAFPSSCCKPFVPDRLESATAIGSLLGKVEHVDLEEEKDQSWGKSLRIKIQIEVESPLKCGIFLKSEKEGKHKWIAVTYEKLP
DFCYGCGLLGHTIRECEGNCGSIDEELPYGPSLRELTILKMRETETMNNPYPSFFGRGMGRGRECARGSWRNLPTEEVENTHAGFQNHNHEEEDLAPPENSTPIRPDSGK
PTIPTTARRVPVVLSKKESEVNLDDVRNGEISGETIKENPNGQNLSKSINDKDSDLLLMDIDQKWVGPQEGRKTSCKMETKADSNGPNFKPTEQKSMEESQSVNDCNSES
NGSKNKNVDTYGTSKSQENRESGKFRTWRRLSRMQDSEEPEKKRGRPRQEKAMRDFREVIDWCAVRDPSFIGPEFTWCNNHVNTSDHKPILAKWTEDIREDMIPNFHRPK
RFEEAWARYEECHEIVQQSKLELRDKEKQLEILLEDDEIYWKQRAREEWLQWGDRNTKWFHLQANKRRKVNRIKGLFDENGQWTEGDLDLESIANQYFYKWKIGKGFNIE
ASNDPWIPEEGTCKPIVPHPETQNLTVAQLINRNGTWNEHLVRGLFLEDDANKILNIPLNPNQSEDSIIWNPDPKGLFLVKSAYRLGIQNQQNLEASVSSHKEEEMIWKV
FWKAQLPSKIKICGWRIYKDILPTLSNLNKRGMDINLLCFLCRDKTETATHLFWECKATRGLWHSYYSSTDLLFLNDRKEWTTPDYFDSIWRGTRDGIVDESKLTKSLII
CWQIWTHKNYISHQNLKPDPDQLKTQIDKYIDEFHPRSRTYRDFSHSQKVERPLSTWIPPSTGLWKLNCDATWNDQHQRGGIGWIVRDWEGRPLIAGFRCIKRQWKINWM
EAMAISEGIRSILIHAPPIQIESNALQIIRLLNRQDQDETELLNFIQEVHALTSSRNINGFFHVNRKHNSLAHKLARKACLSGLFELWTEFFLLG