CuGenDBv2

Spg030079 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg030079
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold6:9195107..9202067
RNA-Seq Expression: Spg030079
Synteny: Spg030079
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type
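
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, the string can be split into its parts and the span of the gene computed. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced here for illustration, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'scaffold6:9195107..9202067'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return GenomeLocation(seqid, start, end)


loc = parse_location("scaffold6:9195107..9202067")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the reported span covers both end positions; if the database uses half-open intervals instead, the `+ 1` should be dropped.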


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAF7824053.1 hypothetical protein G2W53_022197 [Senna tora]  e value: 1.0e-43  % identity: 24.84
Query:  TGFYGNPDQSKRSDSWKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKERLDRFFAN
        TG +G P++  +  +W LL+ L   ++LPW+  GDFNEIMF+ EK+GG +K    M  F DA  +CG  D+GF GY FTW   +    + +ERLDR FA 
Subjt:  TGFYGNPDQSKRSDSWKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKERLDRFFAN

Query:  SKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFHKNTIKFEQGWLKYEGSK------------------------------------------
         +        +V  +    SDH A+ +  D    +   R  +   +FE+ W   E  K                                          
Subjt:  SKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFHKNTIKFEQGWLKYEGSK------------------------------------------

Query:  ---------------------KAFEEGWKSEVT--------------------INNFNFT---KCIQNGIDQAA----------KDAKDEIVWHPDKKGS
                               +E+ W S +T                    IN+ N T     I +     A          ++ +D+ +W  +K  S
Subjt:  ---------------------KAFEEGWKSEVT--------------------INNFNFT---KCIQNGIDQAA----------KDAKDEIVWHPDKKGS

Query:  FSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLVREVWPILI
        +SVKSAY +  K   S   + S        W ++W   V P+ ++  W++ ++ LPT  N+ K+G+ +   C  CR   E T H   GC   + VW +  
Subjt:  FSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLVREVWPILI

Query:  PNSLTLLSVCRENWSAKDYWGWMVDNLNKEELTKGIILMWKIWEFRNKATVHNQQISADWILQSSEASIKEWDSSYLKTHLSERPRNLVSQAQWEKPPPN
         + L    +   N +  D+    +++ + E L     + W IW  RN    + + +S +  + +  + + E+ S + K  LS  P      ++W+KP   
Subjt:  PNSLTLLSVCRENWSAKDYWGWMVDNLNKEELTKGIILMWKIWEFRNKATVHNQQISADWILQSSEASIKEWDSSYLKTHLSERPRNLVSQAQWEKPPPN

Query:  HWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAILEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDI
          K+N DAA    ++   LG +V D  G L      +I+        EA AI  G+  +    ++    L++E+D L   K  +  ADD   +  I  D 
Subjt:  HWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAILEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDI

Query:  KALADRFVSIDFSHCSRFLNTESHCVAR
         +L++ F         R LN  +H VA+
Subjt:  KALADRFVSIDFSHCSRFLNTESHCVAR

KAF8408042.1 hypothetical protein HHK36_007182 [Tetracentron sinense]  e value: 1.4e-48  % identity: 25.68
Query:  LSERPRGGFTGFYGNPDQSKRSDSWKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTK
        + ER     TG YG+P+ +K+ ++W+L++ L+ + ++PW+  GDFNEI  ++EK G   K    M  F +AI  C L+ +GF G  FTW   ++   + +
Subjt:  LSERPRGGFTGFYGNPDQSKRSDSWKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTK

Query:  ERLDRFFANSKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFHKNTIKFEQGWLKYEGSKKAFEEGW----------------------KSEV
        ERLDR  A S          V+ L  H SDH  +L+  D      +I   K + +FE  W+      +  +  W                      K + 
Subjt:  ERLDRFFANSKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFHKNTIKFEQGWLKYEGSKKAFEEGW----------------------KSEV

Query:  TINNFNFTKCIQ-------NGIDQAAKDAKDEIVWHPDKKGSFSVKSAYRLAM----KEDISHEATQS---DNNRESSLWNKLWNSNVFPRAKVCIWKII
        T N+               + I  + +   D+ VWH   KG FSV+SAY L      +E  +  +T S   + +     W+++W   + P+ K+ IWK+ 
Subjt:  TINNFNFTKCIQ-------NGIDQAAKDAKDEIVWHPDKKGSFSVKSAYRLAM----KEDISHEATQS---DNNRESSLWNKLWNSNVFPRAKVCIWKII

Query:  NDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLVREVWPILIPNSLTLLSVCRENWSAKDYWGWMVDNL---NKEELTKGIILMWKIWEFRNK
         +ILP + N+ K+ I +   C  C  + E+  H++  C   R+VW       L+ L +  +  SA     W+ + +    +E L+   ++ W IW+ RN+
Subjt:  NDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLVREVWPILIPNSLTLLSVCRENWSAKDYWGWMVDNL---NKEELTKGIILMWKIWEFRNK

Query:  ATVHNQQISADWILQSSEASIKEWDSSYLKTHLSERPRNLVSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLE
              +++    +Q +   + ++ ++  +      P ++ +   W  PP + +K+N D A + ++   G+G +V D NG LI    ++I    +   +E
Subjt:  ATVHNQQISADWILQSSEASIKEWDSSYLKTHLSERPRNLVSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLE

Query:  AKAILEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDIK
        A A  EG+       I   I   +ESD++  I+ L  + ++     ++ DD K
Subjt:  AKAILEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDIK

KMS97072.1 hypothetical protein BVRB_7g179330 [Beta vulgaris subsp. vulgaris]  e value: 1.6e-44  % identity: 24.69
Query:  GREFARNTSGQPLGYRRRLKFSPAER--HESPLSERPRGG----FTGFYGNPDQSKRSDSWKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNL
        G E  R   G  L +   +  S      H   ++ +  GG    FTG YG  + S++  +W++++ L E  NL W++ GDFNE+++  EK       F  
Subjt:  GREFARNTSGQPLGYRRRLKFSPAER--HESPLSERPRGG----FTGFYGNPDQSKRSDSWKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNL

Query:  MNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKERLDRFFANSKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFHKNTIKFEQGWLKYE
        +  F   +    L D+G  GY FTW+  +      +ERLDR  A+    D      V  L +  SDH  I+M++     N      +   +FE  WL+  
Subjt:  MNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKERLDRFFANSKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFHKNTIKFEQGWLKYE

Query:  G----SKKAFEE-----------------------------------------------------------GWKSEVTINNFNFTKCIQ--NGIDQAAKD
              K+ +EE                                                            W+ EV    F F   ++    I  + + 
Subjt:  G----SKKAFEE-----------------------------------------------------------GWKSEVTINNFNFTKCIQ--NGIDQAAKD

Query:  AKDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLI
         +DEI W     G F V+ AYRLA++ D     T S +     +W  +W   + P+    +W+   DILP   N+ KK       C  C    E+T H  
Subjt:  AKDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLI

Query:  WGCKLVREVWPILIPNSLTLLSVCRENWSAKDYWGWMVDNLNKEELTKGIILMWKIWEFRNKATVHNQQISADWILQSSEASIKEWDSSYLKTHLSERPR
          C+ VRE W          +  C    + K++  W+ +   KE+    ++ +W++W+ RN+    N   SA    + +   ++     + K + S+R  
Subjt:  WGCKLVREVWPILIPNSLTLLSVCRENWSAKDYWGWMVDNLNKEELTKGIILMWKIWEFRNKATVHNQQISADWILQSSEASIKEWDSSYLKTHLSERPR

Query:  NLVSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAIL----EGISQIYHTCISRRIPLQIESDALEVIKV
           ++ +W  P P   K+N DAA N +  R GLG +  D NG ++    + +   W  +  EA A+L    E I+Q +   +       +ESDA  +I  
Subjt:  NLVSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAIL----EGISQIYHTCISRRIPLQIESDALEVIKV

Query:  LNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARRA
        +N       D++ + +D+  L   F +I FS C R  N  +H +A+ A
Subjt:  LNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARRA

RYR09735.1 hypothetical protein Ahy_B05g078136 [Arachis hypogaea]  e value: 1.2e-44  % identity: 26.01
Query:  RRRLKFSPAERHESPLSERPRGGFTGFYGNPDQSKRSDSWKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGY
        +  L        E  L     G F   YGNP+ +KR D W+ +   N    +P +  GDFN+I+  +EK G   KP N + +F   +    L+D+   G 
Subjt:  RRRLKFSPAERHESPLSERPRGGFTGFYGNPDQSKRSDSWKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGY

Query:  KFTWTRNKNKYDDTKERLDRFFANSKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFH--KNTIKFEQGWLKYEGSKKAFEEGWKSEVTINNF
        KFTW  N      T+E++DR  AN +     +   +  +    SDH  +++ +        I  H  +   KFE  W  +E      + GW  E   N  
Subjt:  KFTWTRNKNKYDDTKERLDRFFANSKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFH--KNTIKFEQGWLKYEGSKKAFEEGWKSEVTINNF

Query:  NFTKCIQNGIDQAAKDAKDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNP
         + K     I +   + K+E+     K G ++ K  Y    +E+ S    + D      LW ++W   V P+ K+ +WK   +ILP  T + KK I  +P
Subjt:  NFTKCIQNGIDQAAKDAKDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNP

Query:  YCYFCRNKRESTSHLIWGCKLVREVW----PILIPNSLTLLSVCRENWSAKDYWGWMVDNLNK------EELTKGI----ILMWKIWEFRNKATVHNQQI
         C  C  + ES  H +  C+ VR VW       IP + T+ S+  EN        W++DN+ K      E+  K I     L+W++W+ RN      ++I
Subjt:  YCYFCRNKRESTSHLIWGCKLVREVW----PILIPNSLTLLSVCRENWSAKDYWGWMVDNLNK------EELTKGI----ILMWKIWEFRNKATVHNQQI

Query:  SADWILQSSE-ASIKEWDSSYLK--THLSERPRNLVSQAQWEKPPPNHW-KMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAI
        +  W ++ ++      W ++  K  T   E  R   +  +W +PPPN W K+N DA + ++   G +  +V D  G ++     +I+   +I   EA AI
Subjt:  SADWILQSSE-ASIKEWDSSYLK--THLSERPRNLVSQAQWEKPPPNHW-KMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAI

Query:  LEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARRAAFAPILPSESSSSSTSFC
         + +  + +  + + +   IESD   +++ +  ++  +     I  DI+ L +       +   R  N  +H VA+ A    + P+ S    T  C
Subjt:  LEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARRAAFAPILPSESSSSSTSFC

VFR00443.1 unnamed protein product [Cuscuta campestris]  e value: 1.8e-43  % identity: 24.21
Query:  LVCKILTEKHISPDIFKTMIPKIWGLVGRVNIKKAGENLYECTFDTAQGKRKVMDGRPWIYDNMLLMFEEIKGTERYASMQFTQAPFWIHFLNLPRICFN
        +V + LT+  I  DI + ++   W     V I +  +NLY   F   +   ++++  PW +DN  L+ + +   E   +        W+   +LP     
Subjt:  LVCKILTEKHISPDIFKTMIPKIWGLVGRVNIKKAGENLYECTFDTAQGKRKVMDGRPWIYDNMLLMFEEIKGTERYASMQFTQAPFWIHFLNLPRICFN

Query:  RKWAEALGNAVGAFERVDFDKDEYE-DIIRWGKGRKKSWKMSEPKKKAKCSGSPTRQGQTTKEISTQTPGPRK---------RNPGN---VEQEKEPYKA
                 ++   E V+ + +EY  D      GRKK    S   +  +       +G          PGP +         R  GN   V++  E   A
Subjt:  RKWAEALGNAVGAFERVDFDKDEYE-DIIRWGKGRKKSWKMSEPKKKAKCSGSPTRQGQTTKEISTQTPGPRK---------RNPGN---VEQEKEPYKA

Query:  KEKQKRRLQKKTGSMKKSRKRKLEKGRE--FARNTSGQPLGY------RRRLKFSPAERHESPLSERPRGG----FTGFYGNPDQSKRSDSWKLLKRLNE
        K+     L +  G+  K+   +++ G E  FA ++ G   G        R +      R    +    +G      TGFYGNP +  R  SW LL+ L  
Subjt:  KEKQKRRLQKKTGSMKKSRKRKLEKGRE--FARNTSGQPLGY------RRRLKFSPAERHESPLSERPRGG----FTGFYGNPDQSKRSDSWKLLKRLNE

Query:  ATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKERLDRFFANSKMWDNVRGLKVERLQFHHSDHRA
         ++LPW++ GDFN +  S EK+GG  +P +L++ F   +  C L D+G  GY FTW   +   +  +ERLDR  +  +         V       SDH A
Subjt:  ATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKERLDRFFANSKMWDNVRGLKVERLQFHHSDHRA

Query:  ILMHIDWGGLNQQIRFHKNTIKFEQGWLKYEGSKKAFEEGWKSEVTINNFN-------------FTKCIQ-------------------NGIDQAA----
        + + +      ++++      +FE  WL+ +G K   +E W S  T++  +               +CI                    NG++  A    
Subjt:  ILMHIDWGGLNQQIRFHKNTIKFEQGWLKYEGSKKAFEEGWKSEVTINNFN-------------FTKCIQ-------------------NGIDQAA----

Query:  -----------------------------KDAKDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDIL
                                        KD   W   + G +SVKS YR  M      EATQ+      S W++LW   V P+ KVC+W+ +  IL
Subjt:  -----------------------------KDAKDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDIL

Query:  PTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLVREVWPILIPNSLTLLSVCRENWSAKDYWGWMV-DNLNKEELTKGIILMWKIWEFRNKAT----
        PT + +  KG++++  C  C    E+  HL   C     VW        +     R   S+   W  +V       EL   +  +W IW+ RN A     
Subjt:  PTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLVREVWPILIPNSLTLLSVCRENWSAKDYWGWMV-DNLNKEELTKGIILMWKIWEFRNKAT----

Query:  VHNQQISADWILQSSEASIKEWDSSYLKTHLSERPRNLVSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIK---TL
        V   +++   +L  S    K W++++      E P  L   A+   PP         A ++     G  G I   S      F      + W  +     
Subjt:  VHNQQISADWILQSSEASIKEWDSSYLKTHLSERPRNLVSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIK---TL

Query:  EAKAILEGISQIYHTCISRRIPLQIESDALEVIKVLN--GEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARR
        EA A  E +S +    +   +   I SD   V++ +N  G +D       + DD KAL   F S+  SH  R LN  +H +ARR
Subjt:  EAKAILEGISQIYHTCISRRIPLQIESDALEVIKVLN--GEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARR

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0J8BAU9 Uncharacterized protein  e value: 7.8e-45  % identity: 24.69
Query:  GREFARNTSGQPLGYRRRLKFSPAER--HESPLSERPRGG----FTGFYGNPDQSKRSDSWKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNL
        G E  R   G  L +   +  S      H   ++ +  GG    FTG YG  + S++  +W++++ L E  NL W++ GDFNE+++  EK       F  
Subjt:  GREFARNTSGQPLGYRRRLKFSPAER--HESPLSERPRGG----FTGFYGNPDQSKRSDSWKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNL

Query:  MNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKERLDRFFANSKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFHKNTIKFEQGWLKYE
        +  F   +    L D+G  GY FTW+  +      +ERLDR  A+    D      V  L +  SDH  I+M++     N      +   +FE  WL+  
Subjt:  MNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKERLDRFFANSKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFHKNTIKFEQGWLKYE

Query:  G----SKKAFEE-----------------------------------------------------------GWKSEVTINNFNFTKCIQ--NGIDQAAKD
              K+ +EE                                                            W+ EV    F F   ++    I  + + 
Subjt:  G----SKKAFEE-----------------------------------------------------------GWKSEVTINNFNFTKCIQ--NGIDQAAKD

Query:  AKDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLI
         +DEI W     G F V+ AYRLA++ D     T S +     +W  +W   + P+    +W+   DILP   N+ KK       C  C    E+T H  
Subjt:  AKDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLI

Query:  WGCKLVREVWPILIPNSLTLLSVCRENWSAKDYWGWMVDNLNKEELTKGIILMWKIWEFRNKATVHNQQISADWILQSSEASIKEWDSSYLKTHLSERPR
          C+ VRE W          +  C    + K++  W+ +   KE+    ++ +W++W+ RN+    N   SA    + +   ++     + K + S+R  
Subjt:  WGCKLVREVWPILIPNSLTLLSVCRENWSAKDYWGWMVDNLNKEELTKGIILMWKIWEFRNKATVHNQQISADWILQSSEASIKEWDSSYLKTHLSERPR

Query:  NLVSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAIL----EGISQIYHTCISRRIPLQIESDALEVIKV
           ++ +W  P P   K+N DAA N +  R GLG +  D NG ++    + +   W  +  EA A+L    E I+Q +   +       +ESDA  +I  
Subjt:  NLVSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAIL----EGISQIYHTCISRRIPLQIESDALEVIKV

Query:  LNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARRA
        +N       D++ + +D+  L   F +I FS C R  N  +H +A+ A
Subjt:  LNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARRA

A0A2Z6N4T0 Uncharacterized protein  e value: 3.3e-43  % identity: 22.81
Query:  ERPRGGFTGFYGNPDQSKRSDSWKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKER
        +R +   TGFYG P+ S+R DSW  L++L+ A+ LPW I GDFN+I+ S EK+G   +P  L+N F +A+   GLVD+ + GY FTW ++       +E+
Subjt:  ERPRGGFTGFYGNPDQSKRSDSWKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKER

Query:  LDRFFANSKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFHKNTIKFEQGWL---------------------------------KYEGSK--
        LDR  AN    +  +   VE L    SDH  +L+  D   +  +   H    KFE  W                                   + G K  
Subjt:  LDRFFANSKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFHKNTIKFEQGWL---------------------------------KYEGSK--

Query:  -------KAFEEGWKSEVT-----------INNFNFTKCIQNGIDQ-----------------------AAKDAKDEIVWHPDKKGSFSVKSAYRLAMKE
                 +++ W S+ T           +NN      + +   +                             D+I W  +K G ++VKSAYR  +  
Subjt:  -------KAFEEGWKSEVT-----------INNFNFTKCIQNGIDQ-----------------------AAKDAKDEIVWHPDKKGSFSVKSAYRLAMKE

Query:  DISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLVREVW-------PILIPNSLTL
                 D +R    W+ +W + + P+ K  +W+I  + LPT+  +  +G+     C  C    E ++HL + C+     W        I+   +LT+
Subjt:  DISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLVREVW-------PILIPNSLTL

Query:  LSVCRENWSAKDYWGWMVDNLNKEELTKGIILMWKIWEFRNKATVHNQQISADWILQSSEASIKEWDSSYLKTHLSERPRNLVSQAQWEKPPPNHWKMNT
                S K+    ++  LN++       +MW IW+ RN     N+++    + + + + +  W ++          ++   + +W +P    WK N 
Subjt:  LSVCRENWSAKDYWGWMVDNLNKEELTKGIILMWKIWEFRNKATVHNQQISADWILQSSEASIKEWDSSYLKTHLSERPRNLVSQAQWEKPPPNHWKMNT

Query:  DAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAILEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADD
        DA+++   ++ G+G  + D  G  +    +       + T EA  +L  +  +    ++  +    E D+  V+   N    D
Subjt:  DAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAILEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADD

A0A444Z6E6 Uncharacterized protein  e value: 6.0e-45  % identity: 26.01
Query:  RRRLKFSPAERHESPLSERPRGGFTGFYGNPDQSKRSDSWKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGY
        +  L        E  L     G F   YGNP+ +KR D W+ +   N    +P +  GDFN+I+  +EK G   KP N + +F   +    L+D+   G 
Subjt:  RRRLKFSPAERHESPLSERPRGGFTGFYGNPDQSKRSDSWKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGY

Query:  KFTWTRNKNKYDDTKERLDRFFANSKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFH--KNTIKFEQGWLKYEGSKKAFEEGWKSEVTINNF
        KFTW  N      T+E++DR  AN +     +   +  +    SDH  +++ +        I  H  +   KFE  W  +E      + GW  E   N  
Subjt:  KFTWTRNKNKYDDTKERLDRFFANSKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFH--KNTIKFEQGWLKYEGSKKAFEEGWKSEVTINNF

Query:  NFTKCIQNGIDQAAKDAKDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNP
         + K     I +   + K+E+     K G ++ K  Y    +E+ S    + D      LW ++W   V P+ K+ +WK   +ILP  T + KK I  +P
Subjt:  NFTKCIQNGIDQAAKDAKDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNP

Query:  YCYFCRNKRESTSHLIWGCKLVREVW----PILIPNSLTLLSVCRENWSAKDYWGWMVDNLNK------EELTKGI----ILMWKIWEFRNKATVHNQQI
         C  C  + ES  H +  C+ VR VW       IP + T+ S+  EN        W++DN+ K      E+  K I     L+W++W+ RN      ++I
Subjt:  YCYFCRNKRESTSHLIWGCKLVREVW----PILIPNSLTLLSVCRENWSAKDYWGWMVDNLNK------EELTKGI----ILMWKIWEFRNKATVHNQQI

Query:  SADWILQSSE-ASIKEWDSSYLK--THLSERPRNLVSQAQWEKPPPNHW-KMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAI
        +  W ++ ++      W ++  K  T   E  R   +  +W +PPPN W K+N DA + ++   G +  +V D  G ++     +I+   +I   EA AI
Subjt:  SADWILQSSE-ASIKEWDSSYLK--THLSERPRNLVSQAQWEKPPPNHW-KMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAI

Query:  LEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARRAAFAPILPSESSSSSTSFC
         + +  + +  + + +   IESD   +++ +  ++  +     I  DI+ L +       +   R  N  +H VA+ A    + P+ S    T  C
Subjt:  LEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARRAAFAPILPSESSSSSTSFC

A0A484NGL5 Uncharacterized protein  e value: 8.6e-44  % identity: 24.21
Query:  LVCKILTEKHISPDIFKTMIPKIWGLVGRVNIKKAGENLYECTFDTAQGKRKVMDGRPWIYDNMLLMFEEIKGTERYASMQFTQAPFWIHFLNLPRICFN
        +V + LT+  I  DI + ++   W     V I +  +NLY   F   +   ++++  PW +DN  L+ + +   E   +        W+   +LP     
Subjt:  LVCKILTEKHISPDIFKTMIPKIWGLVGRVNIKKAGENLYECTFDTAQGKRKVMDGRPWIYDNMLLMFEEIKGTERYASMQFTQAPFWIHFLNLPRICFN

Query:  RKWAEALGNAVGAFERVDFDKDEYE-DIIRWGKGRKKSWKMSEPKKKAKCSGSPTRQGQTTKEISTQTPGPRK---------RNPGN---VEQEKEPYKA
                 ++   E V+ + +EY  D      GRKK    S   +  +       +G          PGP +         R  GN   V++  E   A
Subjt:  RKWAEALGNAVGAFERVDFDKDEYE-DIIRWGKGRKKSWKMSEPKKKAKCSGSPTRQGQTTKEISTQTPGPRK---------RNPGN---VEQEKEPYKA

Query:  KEKQKRRLQKKTGSMKKSRKRKLEKGRE--FARNTSGQPLGY------RRRLKFSPAERHESPLSERPRGG----FTGFYGNPDQSKRSDSWKLLKRLNE
        K+     L +  G+  K+   +++ G E  FA ++ G   G        R +      R    +    +G      TGFYGNP +  R  SW LL+ L  
Subjt:  KEKQKRRLQKKTGSMKKSRKRKLEKGRE--FARNTSGQPLGY------RRRLKFSPAERHESPLSERPRGG----FTGFYGNPDQSKRSDSWKLLKRLNE

Query:  ATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKERLDRFFANSKMWDNVRGLKVERLQFHHSDHRA
         ++LPW++ GDFN +  S EK+GG  +P +L++ F   +  C L D+G  GY FTW   +   +  +ERLDR  +  +         V       SDH A
Subjt:  ATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKERLDRFFANSKMWDNVRGLKVERLQFHHSDHRA

Query:  ILMHIDWGGLNQQIRFHKNTIKFEQGWLKYEGSKKAFEEGWKSEVTINNFN-------------FTKCIQ-------------------NGIDQAA----
        + + +      ++++      +FE  WL+ +G K   +E W S  T++  +               +CI                    NG++  A    
Subjt:  ILMHIDWGGLNQQIRFHKNTIKFEQGWLKYEGSKKAFEEGWKSEVTINNFN-------------FTKCIQ-------------------NGIDQAA----

Query:  -----------------------------KDAKDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDIL
                                        KD   W   + G +SVKS YR  M      EATQ+      S W++LW   V P+ KVC+W+ +  IL
Subjt:  -----------------------------KDAKDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDIL

Query:  PTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLVREVWPILIPNSLTLLSVCRENWSAKDYWGWMV-DNLNKEELTKGIILMWKIWEFRNKAT----
        PT + +  KG++++  C  C    E+  HL   C     VW        +     R   S+   W  +V       EL   +  +W IW+ RN A     
Subjt:  PTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLVREVWPILIPNSLTLLSVCRENWSAKDYWGWMV-DNLNKEELTKGIILMWKIWEFRNKAT----

Query:  VHNQQISADWILQSSEASIKEWDSSYLKTHLSERPRNLVSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIK---TL
        V   +++   +L  S    K W++++      E P  L   A+   PP         A ++     G  G I   S      F      + W  +     
Subjt:  VHNQQISADWILQSSEASIKEWDSSYLKTHLSERPRNLVSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIK---TL

Query:  EAKAILEGISQIYHTCISRRIPLQIESDALEVIKVLN--GEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARR
        EA A  E +S +    +   +   I SD   V++ +N  G +D       + DD KAL   F S+  SH  R LN  +H +ARR
Subjt:  EAKAILEGISQIYHTCISRRIPLQIESDALEVIKVLN--GEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARR

A0A803PQM1 Uncharacterized protein  e value: 9.9e-40  % identity: 20.85
Query:  FTGFYGNPDQSKRSDSWKLLKRLNE-ATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKERLDRFF
        FT FYG P    R  SW LLKRL + A  LPW+I GDFNEI+++  K+GG  +  + M +F   + +C L ++ F+G  FTWT+++++ +  +ERLD  F
Subjt:  FTGFYGNPDQSKRSDSWKLLKRLNE-ATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKERLDRFF

Query:  ANSKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFHKNTIKFEQGWLKYEGSKKAFEEGWK--------------------------------
         N     + + L    L F+ SDHRAI ++I    L+QQ    K   +FE+ WLK E +    ++ WK                                
Subjt:  ANSKMWDNVRGLKVERLQFHHSDHRAILMHIDWGGLNQQIRFHKNTIKFEQGWLKYEGSKKAFEEGWK--------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------SEVT----------------------------------------------------INN----FNF-
                                         S VT                                                    +NN     +F 
Subjt:  ---------------------------------SEVT----------------------------------------------------INN----FNF-

Query:  --TKCIQNGIDQAAKDA---------------------------KDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRA
          T C QN      +DA                           KD ++WH    GS++VKS + LA   +   +++ SD NR+   W   WN ++ P+ 
Subjt:  --TKCIQNGIDQAAKDA---------------------------KDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRA

Query:  KVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLVREVWPILIPNSLTLLSVCR-ENWSAKDYWGWMVDNLNKEELTKGIILMWKI
        ++  WK+I  ILP    + K+ +  +  C  C +  ES  H ++GC   + +W      S  ++   + ++    DY  ++     +E+    I L+W I
Subjt:  KVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLVREVWPILIPNSLTLLSVCR-ENWSAKDYWGWMVDNLNKEELTKGIILMWKI

Query:  WEFRNKATVHNQQISADWILQSSEASIKEWD-------------SSYLKTHLSERPRNLVSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGS
        W  RN+     +   +  I+  +    +++              + +  TH +   + L     W  P  N +K+N DAA N  + + G+G I+   +G 
Subjt:  WEFRNKATVHNQQISADWILQSSEASIKEWD-------------SSYLKTHLSERPRNLVSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGS

Query:  LICFGMQQIEKKWAIKTLEAKAILEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARR
        ++    + ++  +    +EAKA+   ++ +  + +S      IE+DAL V   LN    DL     +  DI+ L   F S+  SH  R  N  +H +A+ 
Subjt:  LICFGMQQIEKKWAIKTLEAKAILEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARR

Query:  A
        A
Subjt:  A

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G52990.1 thioredoxin family protein  e value: 5.1e-04  % identity: 25.78
Query:  KMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAILEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDIKA
        K N DA+ +E +   GLGW++ +S G+++  GM + + +   +  E  A++  I     T       +  E D   V +++N ++D+   LK   D IK+
Subjt:  KMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAILEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDIKA

Query:  LADRFVSIDFSHCSRFLNTESHCVARRA
            F S +F    R  N  +  + ++A
Subjt:  LADRFVSIDFSHCSRFLNTESHCVARRA

AT3G09510.1 Ribonuclease H-like superfamily protein  e value: 2.3e-25  % identity: 23.44
Query:  AKDAK-DEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKREST
        AK  K D+I+W+ +  G ++V+S Y L   +  ++    +  +    L  ++WN  + P+ K  +W+ ++  L T   +  +G+ ++P C  C  + ES 
Subjt:  AKDAK-DEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLWNKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKREST

Query:  SHLIWGCKLVREVWPILIPNSLTLLSVCRENWSAKDYWGWMVDNLNKEELTK--------GIILMWKIWEFRNKATVHNQQISADWILQSSEASIKEWDS
        +H ++ C      W       L+  S+ R    + D+   + + LN  + T          + L+W+IW+ RN    +  + S    + S++A   +W +
Subjt:  SHLIWGCKLVREVWPILIPNSLTLLSVCRENWSAKDYWGWMVDNLNKEELTK--------GIILMWKIWEFRNKATVHNQQISADWILQSSEASIKEWDS

Query:  SYLKTHLSERPRNLVSQ--AQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAILEGISQIYHTCISRRIPLQI
        +      +  P   +++   +W  PP  + K N DA ++ ++     GWI+ +  G+ I +G  ++         E KA+L  + Q   T I     + +
Subjt:  SYLKTHLSERPRNLVSQ--AQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKAILEGISQIYHTCISRRIPLQI

Query:  ESDALEVIKVLNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARRAAFAPILPSESSS----SSTSFCRNSN
        E D   +I ++NG +     L    +DI   A++F SI F    R  N  +H +A+         S S S        FC +SN
Subjt:  ESDALEVIKVLNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARRAAFAPILPSESSS----SSTSFCRNSN

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein  e value: 1.4e-06  % identity: 33.33
Query:  LWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLV-REV
        +W+  + P+ K+ IWK +N+ LP    +L + I + P+C  CR+  E+ +H+++ C    REV
Subjt:  LWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLV-REV

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein  e value: 1.8e-09  % identity: 24.1
Query:  LMWKIWEFRNKATVHNQQISADWILQSSEASIKEWDSSYLKTHLSERPRNL--VSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGM
        LMW+IW+  N    ++ +      ++ +    KEW  + +        RN       +W  P  +  K N DA+ +E+ +  GLGWI+ +S G++I  GM
Subjt:  LMWKIWEFRNKATVHNQQISADWILQSSEASIKEWDSSYLKTHLSERPRNL--VSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGM

Query:  QQIEKKWAIKTLEAKAILEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARRA
         + + +   +  E   ++  I   Y     + I    E D   + +++N ++ +   L+   D I++    F SI+FS   R  N  +  +A++A
Subjt:  QQIEKKWAIKTLEAKAILEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARRA


Sequences
CDS sequence
ATGGGGGATTCTACAGAAATAGAAGACAAAAGGGAAAAGAACGGCCATACGAAACAAGGGTCATCCCAGGGGGAGGAATCGTCCATTAGTAGTCAAATGGAAAAACTTAA
TATTACAGTGCAAGAGAGAGGGAACGTAGTTAGAATGGAAGATGACGAACTGAAGCAAGAAGTAAAAGATCTGGAGAACATTCTGGTATGCAAAATATTAACAGAGAAAC
ATATAAGTCCAGACATTTTCAAAACCATGATTCCCAAAATATGGGGATTAGTGGGAAGAGTTAACATTAAAAAGGCAGGGGAGAATCTCTACGAGTGCACCTTCGACACG
GCTCAGGGGAAGAGAAAGGTAATGGATGGGAGGCCATGGATTTACGATAACATGCTTCTCATGTTTGAAGAAATTAAAGGAACAGAGAGATATGCAAGTATGCAGTTTAC
TCAAGCTCCATTTTGGATACATTTCCTGAATTTGCCAAGAATTTGTTTCAATAGAAAATGGGCTGAGGCTCTGGGTAACGCGGTCGGAGCATTTGAACGAGTAGACTTCG
ACAAAGATGAATATGAGGATATTATCCGGTGGGGAAAGGGAAGGAAAAAGTCATGGAAAATGTCAGAACCCAAGAAGAAAGCCAAATGTTCTGGGAGTCCAACACGCCAA
GGTCAAACAACTAAGGAAATTAGCACCCAAACACCTGGGCCCAGGAAAAGAAATCCTGGAAACGTAGAGCAAGAGAAGGAACCTTACAAAGCCAAGGAGAAGCAGAAAAG
AAGACTACAAAAGAAAACAGGAAGCATGAAAAAGAGCCGCAAGAGGAAGTTAGAGAAGGGAAGAGAGTTTGCAAGGAACACTTCGGGTCAACCGTTGGGATATCGGCGGA
GGCTGAAATTCAGCCCCGCCGAACGCCATGAATCTCCTCTGTCGGAACGTCCGAGGGGGGGATTTACCGGCTTCTATGGAAATCCCGATCAATCCAAGAGGTCAGATTCG
TGGAAGCTGCTGAAACGACTGAACGAGGCTACAAACCTCCCATGGATCATAGGAGGAGATTTCAACGAGATCATGTTCAGCAAGGAAAAAAAAGGGGGCCCCTCTAAACC
TTTTAATCTCATGAATGATTTTTGTGATGCTATTGGTTATTGCGGTCTTGTTGATGTTGGTTTTTCTGGCTACAAGTTCACTTGGACTCGGAACAAAAATAAATATGATG
ATACCAAAGAGAGACTTGACCGCTTCTTTGCCAACTCGAAGATGTGGGACAATGTTAGAGGGCTGAAAGTGGAGCGCCTCCAATTCCATCATTCGGATCACAGAGCCATT
TTGATGCATATTGATTGGGGGGGTCTCAATCAACAGATTCGGTTTCACAAGAACACAATCAAATTCGAGCAGGGATGGTTGAAATACGAAGGAAGCAAAAAGGCCTTTGA
GGAAGGATGGAAGTCTGAAGTCACGATCAACAATTTCAACTTTACCAAATGCATTCAAAATGGAATTGATCAAGCTGCAAAAGACGCTAAGGATGAAATAGTGTGGCACC
CCGACAAGAAAGGATCATTCTCTGTCAAAAGTGCTTATAGACTGGCTATGAAAGAGGATATATCCCATGAAGCCACCCAATCCGACAACAACAGAGAATCATCCCTATGG
AACAAGCTATGGAATTCCAACGTTTTCCCTCGGGCTAAAGTGTGTATTTGGAAGATCATAAATGACATTCTTCCTACTAAAACAAACATTCTCAAAAAGGGAATTGACCT
CAATCCTTACTGCTATTTTTGTAGGAATAAGAGAGAGTCTACATCCCATTTGATTTGGGGATGCAAGCTGGTGAGAGAAGTGTGGCCTATATTAATTCCTAATTCTCTCA
CTTTACTGTCTGTGTGCAGGGAAAATTGGTCAGCTAAGGACTACTGGGGATGGATGGTCGATAACTTAAACAAAGAGGAATTAACAAAAGGAATCATCCTAATGTGGAAA
ATCTGGGAATTCAGAAACAAAGCAACAGTTCACAATCAACAAATCTCAGCAGATTGGATTCTTCAAAGCTCCGAAGCCAGCATTAAGGAATGGGATTCTTCTTACCTGAA
GACTCATCTGTCGGAAAGACCAAGGAACCTCGTGAGTCAAGCACAGTGGGAGAAGCCTCCGCCTAATCACTGGAAAATGAACACCGACGCGGCCTGGAACGAGAAAGAAA
GCAGAGGTGGGCTCGGCTGGATCGTGCATGACTCAAACGGATCCCTGATCTGTTTCGGAATGCAACAAATCGAGAAAAAATGGGCTATTAAGACGCTAGAGGCGAAGGCG
ATTTTAGAAGGAATTAGTCAAATTTATCATACCTGTATTAGTCGTCGTATCCCTCTCCAGATTGAATCGGACGCTCTTGAGGTGATTAAAGTTCTTAATGGTGAGGCTGA
TGATCTCTTCGACCTGAAGATCATCACAGACGACATCAAAGCCTTGGCAGATCGCTTCGTCTCTATCGATTTTAGCCATTGCAGTCGTTTTTTGAACACAGAATCGCACT
GTGTTGCGAGGAGAGCTGCGTTCGCGCCAATCCTTCCGTCGGAGTCTTCGTCTTCTTCGACCAGTTTCTGTAGAAACAGCAACTCTCTTTCGCGGGTTAATGGGCGTGTA
TTTTGGGCCCCTTCTATTCCTTATCTAAAACGAAGATGTGAGATGAGAGGACAGAGATCTTCGATTTCGGCTCGAAACCAAGTTTCAGATCGAGATTTGCAAAGAGTATT
CGAACACGGTTGCAGAGGTGTTTCAAACCAAGTTTCAAATCGAGGCTTGAAACCAAGTTTCCGAACGAAACTTGCAGGAGAGATCCGGAAGCGTTTGAGGAAAGGGCGGT
TGAAGGAATCGTTTGAACACTTCGTTTTTCTTCTTCGATTTGAGGCTTGTCTGCGACGATGGATCGATTGA
mRNA sequence
ATGGGGGATTCTACAGAAATAGAAGACAAAAGGGAAAAGAACGGCCATACGAAACAAGGGTCATCCCAGGGGGAGGAATCGTCCATTAGTAGTCAAATGGAAAAACTTAA
TATTACAGTGCAAGAGAGAGGGAACGTAGTTAGAATGGAAGATGACGAACTGAAGCAAGAAGTAAAAGATCTGGAGAACATTCTGGTATGCAAAATATTAACAGAGAAAC
ATATAAGTCCAGACATTTTCAAAACCATGATTCCCAAAATATGGGGATTAGTGGGAAGAGTTAACATTAAAAAGGCAGGGGAGAATCTCTACGAGTGCACCTTCGACACG
GCTCAGGGGAAGAGAAAGGTAATGGATGGGAGGCCATGGATTTACGATAACATGCTTCTCATGTTTGAAGAAATTAAAGGAACAGAGAGATATGCAAGTATGCAGTTTAC
TCAAGCTCCATTTTGGATACATTTCCTGAATTTGCCAAGAATTTGTTTCAATAGAAAATGGGCTGAGGCTCTGGGTAACGCGGTCGGAGCATTTGAACGAGTAGACTTCG
ACAAAGATGAATATGAGGATATTATCCGGTGGGGAAAGGGAAGGAAAAAGTCATGGAAAATGTCAGAACCCAAGAAGAAAGCCAAATGTTCTGGGAGTCCAACACGCCAA
GGTCAAACAACTAAGGAAATTAGCACCCAAACACCTGGGCCCAGGAAAAGAAATCCTGGAAACGTAGAGCAAGAGAAGGAACCTTACAAAGCCAAGGAGAAGCAGAAAAG
AAGACTACAAAAGAAAACAGGAAGCATGAAAAAGAGCCGCAAGAGGAAGTTAGAGAAGGGAAGAGAGTTTGCAAGGAACACTTCGGGTCAACCGTTGGGATATCGGCGGA
GGCTGAAATTCAGCCCCGCCGAACGCCATGAATCTCCTCTGTCGGAACGTCCGAGGGGGGGATTTACCGGCTTCTATGGAAATCCCGATCAATCCAAGAGGTCAGATTCG
TGGAAGCTGCTGAAACGACTGAACGAGGCTACAAACCTCCCATGGATCATAGGAGGAGATTTCAACGAGATCATGTTCAGCAAGGAAAAAAAAGGGGGCCCCTCTAAACC
TTTTAATCTCATGAATGATTTTTGTGATGCTATTGGTTATTGCGGTCTTGTTGATGTTGGTTTTTCTGGCTACAAGTTCACTTGGACTCGGAACAAAAATAAATATGATG
ATACCAAAGAGAGACTTGACCGCTTCTTTGCCAACTCGAAGATGTGGGACAATGTTAGAGGGCTGAAAGTGGAGCGCCTCCAATTCCATCATTCGGATCACAGAGCCATT
TTGATGCATATTGATTGGGGGGGTCTCAATCAACAGATTCGGTTTCACAAGAACACAATCAAATTCGAGCAGGGATGGTTGAAATACGAAGGAAGCAAAAAGGCCTTTGA
GGAAGGATGGAAGTCTGAAGTCACGATCAACAATTTCAACTTTACCAAATGCATTCAAAATGGAATTGATCAAGCTGCAAAAGACGCTAAGGATGAAATAGTGTGGCACC
CCGACAAGAAAGGATCATTCTCTGTCAAAAGTGCTTATAGACTGGCTATGAAAGAGGATATATCCCATGAAGCCACCCAATCCGACAACAACAGAGAATCATCCCTATGG
AACAAGCTATGGAATTCCAACGTTTTCCCTCGGGCTAAAGTGTGTATTTGGAAGATCATAAATGACATTCTTCCTACTAAAACAAACATTCTCAAAAAGGGAATTGACCT
CAATCCTTACTGCTATTTTTGTAGGAATAAGAGAGAGTCTACATCCCATTTGATTTGGGGATGCAAGCTGGTGAGAGAAGTGTGGCCTATATTAATTCCTAATTCTCTCA
CTTTACTGTCTGTGTGCAGGGAAAATTGGTCAGCTAAGGACTACTGGGGATGGATGGTCGATAACTTAAACAAAGAGGAATTAACAAAAGGAATCATCCTAATGTGGAAA
ATCTGGGAATTCAGAAACAAAGCAACAGTTCACAATCAACAAATCTCAGCAGATTGGATTCTTCAAAGCTCCGAAGCCAGCATTAAGGAATGGGATTCTTCTTACCTGAA
GACTCATCTGTCGGAAAGACCAAGGAACCTCGTGAGTCAAGCACAGTGGGAGAAGCCTCCGCCTAATCACTGGAAAATGAACACCGACGCGGCCTGGAACGAGAAAGAAA
GCAGAGGTGGGCTCGGCTGGATCGTGCATGACTCAAACGGATCCCTGATCTGTTTCGGAATGCAACAAATCGAGAAAAAATGGGCTATTAAGACGCTAGAGGCGAAGGCG
ATTTTAGAAGGAATTAGTCAAATTTATCATACCTGTATTAGTCGTCGTATCCCTCTCCAGATTGAATCGGACGCTCTTGAGGTGATTAAAGTTCTTAATGGTGAGGCTGA
TGATCTCTTCGACCTGAAGATCATCACAGACGACATCAAAGCCTTGGCAGATCGCTTCGTCTCTATCGATTTTAGCCATTGCAGTCGTTTTTTGAACACAGAATCGCACT
GTGTTGCGAGGAGAGCTGCGTTCGCGCCAATCCTTCCGTCGGAGTCTTCGTCTTCTTCGACCAGTTTCTGTAGAAACAGCAACTCTCTTTCGCGGGTTAATGGGCGTGTA
TTTTGGGCCCCTTCTATTCCTTATCTAAAACGAAGATGTGAGATGAGAGGACAGAGATCTTCGATTTCGGCTCGAAACCAAGTTTCAGATCGAGATTTGCAAAGAGTATT
CGAACACGGTTGCAGAGGTGTTTCAAACCAAGTTTCAAATCGAGGCTTGAAACCAAGTTTCCGAACGAAACTTGCAGGAGAGATCCGGAAGCGTTTGAGGAAAGGGCGGT
TGAAGGAATCGTTTGAACACTTCGTTTTTCTTCTTCGATTTGAGGCTTGTCTGCGACGATGGATCGATTGA
Protein sequence
MGDSTEIEDKREKNGHTKQGSSQGEESSISSQMEKLNITVQERGNVVRMEDDELKQEVKDLENILVCKILTEKHISPDIFKTMIPKIWGLVGRVNIKKAGENLYECTFDT
AQGKRKVMDGRPWIYDNMLLMFEEIKGTERYASMQFTQAPFWIHFLNLPRICFNRKWAEALGNAVGAFERVDFDKDEYEDIIRWGKGRKKSWKMSEPKKKAKCSGSPTRQ
GQTTKEISTQTPGPRKRNPGNVEQEKEPYKAKEKQKRRLQKKTGSMKKSRKRKLEKGREFARNTSGQPLGYRRRLKFSPAERHESPLSERPRGGFTGFYGNPDQSKRSDS
WKLLKRLNEATNLPWIIGGDFNEIMFSKEKKGGPSKPFNLMNDFCDAIGYCGLVDVGFSGYKFTWTRNKNKYDDTKERLDRFFANSKMWDNVRGLKVERLQFHHSDHRAI
LMHIDWGGLNQQIRFHKNTIKFEQGWLKYEGSKKAFEEGWKSEVTINNFNFTKCIQNGIDQAAKDAKDEIVWHPDKKGSFSVKSAYRLAMKEDISHEATQSDNNRESSLW
NKLWNSNVFPRAKVCIWKIINDILPTKTNILKKGIDLNPYCYFCRNKRESTSHLIWGCKLVREVWPILIPNSLTLLSVCRENWSAKDYWGWMVDNLNKEELTKGIILMWK
IWEFRNKATVHNQQISADWILQSSEASIKEWDSSYLKTHLSERPRNLVSQAQWEKPPPNHWKMNTDAAWNEKESRGGLGWIVHDSNGSLICFGMQQIEKKWAIKTLEAKA
ILEGISQIYHTCISRRIPLQIESDALEVIKVLNGEADDLFDLKIITDDIKALADRFVSIDFSHCSRFLNTESHCVARRAAFAPILPSESSSSSTSFCRNSNSLSRVNGRV
FWAPSIPYLKRRCEMRGQRSSISARNQVSDRDLQRVFEHGCRGVSNQVSNRGLKPSFRTKLAGEIRKRLRKGRLKESFEHFVFLLRFEACLRRWID