; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006760 (gene) of Snake gourd v1 genome

Gene IDTan0006760
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSmr domain-containing protein
Genome locationLG06:10845343..10851602
RNA-Seq ExpressionTan0006760
SyntenyTan0006760
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576051.1 hypothetical protein SDJN03_26690, partial [Cucurbita argyrosperma subsp. sororia]1.2e-25278.33Show/hide
Query:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSPENFGAKKTVLGDSSIQSGKKAVEETTD
        MSW RGKSPGWAA NLKQ +SGLQD  DP+PFPPMSTALS LPPREN+HRVNGRSGRSFSS PLPS DSL SPENFGAKKT+ G+SSIQS KK VEE+TD
Subjt:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSPENFGAKKTVLGDSSIQSGKKAVEETTD

Query:  V-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIGPEPLPPISTA
        V  FWKLKELHSWAD SLI+DIMEAVNNNF+EAS LLKTMVSSDNFE+NN+ STLGL+SSND+S VRGKSPGW  FN  QQNRGLQD I P+P PP+ +A
Subjt:  V-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIGPEPLPPISTA

Query:  LPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNNNFNEASTLLK
        L  LPP E LH V+G  GRS SS+ L  ADSLT  ENY A+K ILGDS  ++G+KVVEETTD+  FWKLKEL++WADFSLI+DIMEAV+NNFNEAST L 
Subjt:  LPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNNNFNEASTLLK

Query:  IMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLKDV---HQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLALGCPKSVPIE
         MVSSD+ EI NEMSTLGLHS++   CNGKNDV+ISL + V  PI SS+LKDV   HQN N       KL ENNY+ERNF H+VGNPK+AL C KS PIE
Subjt:  IMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLKDV---HQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLALGCPKSVPIE

Query:  PEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEAVQALQEHLLK
        PEWEEDDIYLSHRKDAIA+MRSASQHSRAATNAYLR DHASAKYHSSRAQEQWLAAKMLN KAANEIL+TRN +NGLWKLDLHGLHAAEAVQALQ+HLLK
Subjt:  PEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEAVQALQEHLLK

Query:  IETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPGTINVRPKFHR
        IET+NASNRSLSPKKAERKGF R SSLE LSC+  KLDKE QSP  RHRPTSLEVITG+GKHSRGEAALPKAVT+FLSENGYRFEQLRPGTI+VRPKF R
Subjt:  IETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPGTINVRPKFHR

XP_008451240.1 PREDICTED: uncharacterized protein LOC103492590 [Cucumis melo]1.5e-25076.23Show/hide
Query:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSP----------ENFGAKKTVLGDSSIQS
        MSWVRGKS GWAAFNLKQQ++G+QD  D +PFPPMST LSSLPPRENL  VNGRSGRSFS AP+PS DS T P           NF AKKT+LG S+IQS
Subjt:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSP----------ENFGAKKTVLGDSSIQS

Query:  GKKAVEETTDV-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIG
        GKK VEET DV +FWKLKELH WAD SLIMDIMEAVNN+F+EASTLL TMVSSDN E+NN+ S LGL+SSNDLSW+ GKSPGW  FN  Q NRGLQ E  
Subjt:  GKKAVEETTDV-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIG

Query:  PEPLPPISTALPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNN
        PE  PP+ T  P LPP E LH V+GR GRSF+S  L  ADSLT   NYGA+ TI  DSG +SGKKVVEE TD+  FWKLKE++SWADFSLI+DIM+AVNN
Subjt:  PEPLPPISTALPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNN

Query:  NFNEASTLLKIMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLK---DVHQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLA
        NF+EASTLLK MVSSD+FEINNE+STLGLHS+ND LCNG NDVSIS E+ +  PILS +LK    +HQN+N+  ED TKL  N+Y+ERNF  + GN K+A
Subjt:  NFNEASTLLKIMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLK---DVHQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLA

Query:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEA
        LGC KSVPIEPEWEEDDIYLSHRKDAIA+MRSASQHSRAATNAY R DHASAKYHSSRAQEQWLAAKMLN+KAANEIL+TRN KNGLWKLDLHGLHAAEA
Subjt:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPG
        VQALQ+HLLKIETQNASNRSLSPKKAERKGF RASSLE LSC+D+KLDKE  SPSSRHRPTSLEVITGIGKHS+GEAALPKAVT+FL+ENGYRFEQ RPG
Subjt:  VQALQEHLLKIETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPG

Query:  TINVRPKFHR
        TI+VRPKF R
Subjt:  TINVRPKFHR

XP_022953352.1 uncharacterized protein LOC111455928 isoform X1 [Cucurbita moschata]4.4e-25578.67Show/hide
Query:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSPENFGAKKTVLGDSSIQSGKKAVEETTD
        MSW RGKSPGWAA NLKQQ+SGLQD  DP+PFPPMSTALS LPPREN+HRVNGRSGRSFSS PLPS DSL SPENFG KKT+ G+SSI+SGKK VEE+TD
Subjt:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSPENFGAKKTVLGDSSIQSGKKAVEETTD

Query:  V-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIGPEPLPPISTA
        V  FWKLKELHSWAD SLI+DIMEAVNNNF+EAS LLKTMVSSDNFE+NN+ STLGL+SSND+S VRGKSPGW  FN  QQNRGLQD I P+P PP+ +A
Subjt:  V-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIGPEPLPPISTA

Query:  LPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNNNFNEASTLLK
        L  LPP E LH V+GR GRS SS+ L  ADSLTL ENY A+K ILGDS  ++G+KVVEETTD+  FWKLKEL++WADFSLI+DIMEAV+NNFNEAST L 
Subjt:  LPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNNNFNEASTLLK

Query:  IMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLKDV---HQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLALGCPKSVPIE
         MVSSD+ EI NEMSTLGLHS++   CNGKNDV+ISL + V  PI SS+LKDV   HQN N       KL ENNY+ERNF H+VGNPK+AL C KS PIE
Subjt:  IMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLKDV---HQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLALGCPKSVPIE

Query:  PEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEAVQALQEHLLK
        PEWEEDDIYLSHRKDAIA+MRSASQHSRAATNAYLR DHASAKYHSSRAQEQWLAAKMLN KAANEIL+TRN +NGLWKLDLHGLHAAEAVQALQ+HLLK
Subjt:  PEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEAVQALQEHLLK

Query:  IETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPGTINVRPKFHR
        IET+NASNRSLSPKKAERKGF R SSLE LSC+  KLDKE QSP  RHRPTSLEVITG+GKHSRGEAALPKAVT+FLSENGYRFEQLRPGTI+VRPKF R
Subjt:  IETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPGTINVRPKFHR

XP_023548349.1 uncharacterized protein LOC111807017 [Cucurbita pepo subsp. pepo]5.4e-25378.33Show/hide
Query:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSPENFGAKKTVLGDSSIQSGKKAVEETTD
        MSW RGKSPGWAA NLKQQ+SGLQD  DP+PFPPMSTALS LPPREN+HRVNGRSGRSFSS PLPS DSL SP+NFGAKKT+ G+SSI+SGKK VEE+TD
Subjt:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSPENFGAKKTVLGDSSIQSGKKAVEETTD

Query:  V-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIGPEPLPPISTA
        V  FWKLKELHSWAD SLI+DIMEAVNNNF+EAS LLKTMVSSDNFE+NN+ STLGL+SSND+S VRGKSPGW  +N  QQNRGLQD I P+P PP+ +A
Subjt:  V-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIGPEPLPPISTA

Query:  LPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNNNFNEASTLLK
        L  LPP E LH V GR GRS SS+ L  ADSLT  ENY A+K ILGDS  ++G+KVVEETTD+  FWKLKEL++WADFSLI+DIMEAV+NNFNEAST L 
Subjt:  LPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNNNFNEASTLLK

Query:  IMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLKDV---HQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLALGCPKSVPIE
         MVSSD+ EI NEMSTLGLHS++   C GKNDV+ISL + V  PI SS+LKDV   HQN N       KL ENNY+ERNF H+VGNPK+AL C KS PIE
Subjt:  IMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLKDV---HQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLALGCPKSVPIE

Query:  PEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEAVQALQEHLLK
        PEWEEDDIYLSHRKDAIA+MRSASQHSRAATNAYLR DHASAKYHSSRAQEQWLAAKMLN KAANEIL+TRN +NGLWKLDLHGLHAAEAVQALQ+HLLK
Subjt:  PEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEAVQALQEHLLK

Query:  IETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPGTINVRPKFHR
        IET+NASNRSLSPKKAERKGF R SSLE LSC+  KLDKE QSP  RHRPTSLEVITGIGKHSRGEAALPKAVT+FLSENGYRFEQLRPGTI+VRPKF R
Subjt:  IETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPGTINVRPKFHR

XP_038898473.1 uncharacterized protein LOC120086100 [Benincasa hispida]1.0e-25677.54Show/hide
Query:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSP----------ENFGAKKTVLGDSSIQS
        MSWV+GKSPGWAAFNLKQQ++GLQD  D +PFPP+ST LSSLPP EN H VNGRSGRSFS AP PS +SLTSP          EN GAKKT+L  S++Q+
Subjt:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSP----------ENFGAKKTVLGDSSIQS

Query:  GKKAVEETTDV-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIG
        GKK VEET DV +FWKLKELHSWAD SLIMD+MEAVNNNFDEASTLLKTMV+SDNFE+NN+ STLGL  SNDLSWV G  PGW  FN  Q NRGLQDE  
Subjt:  GKKAVEETTDV-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIG

Query:  PEPLPPISTALPFLPPCEKLHRVHGRSGRSFSSA--SLADSLTLAENYGARKTILGDSGSKSGKKVVEETTD-ITFWKLKELYSWADFSLIMDIMEAVNN
         EPLPP+ T    LPPCE LHRV+G SG+SFSS   + ADSLT  ENYGA+KTI  DS  +SGKKVVEE+ D + FWKLKEL+SWADFSLI+DIMEAVNN
Subjt:  PEPLPPISTALPFLPPCEKLHRVHGRSGRSFSSA--SLADSLTLAENYGARKTILGDSGSKSGKKVVEETTD-ITFWKLKELYSWADFSLIMDIMEAVNN

Query:  NFNEASTLLKIMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLKD---VHQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLA
        NFNEASTLLK MVSSD+F+IN+EMSTL L S+ND LCNGKNDVS SLE+   IPI SS+LKD   VHQNNN+C E+ TKL ENNY+ERNF H+ G PK+ 
Subjt:  NFNEASTLLKIMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLKD---VHQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLA

Query:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEA
        LG  KSVPIEPEWEEDDIYLSHRKDAIA+MRSASQHSRAATNAYLR DHASAKYHSSRAQEQWLAAKMLN+KAANEIL+TRN KNGLWKLDLHGLHAAEA
Subjt:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPG
        VQALQEHLLKIET+NASNRSLSPKK+ERKGF  ASSLE LSC+DSK+DKE  SPSSRHRPTSLEVITGIGKHSRGEA LPKAVT+FLSENGYRFEQLRPG
Subjt:  VQALQEHLLKIETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPG

Query:  TINVRPKFHR
        TI++RPKF R
Subjt:  TINVRPKFHR

TrEMBL top hitse value%identityAlignment
A0A0A0KA90 Smr domain-containing protein6.0e-25075.25Show/hide
Query:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSP----------ENFGAKKTVLGDSSIQS
        MSWVRGKS GWAAFNLKQQ++GLQD  D +PFPPMST LSSLPPRENL  VNG SG+SFS AP+PS DS T P           NFGAKKT+LG ++IQS
Subjt:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSP----------ENFGAKKTVLGDSSIQS

Query:  GKKAVEETTDV-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIG
        GKK VEET DV +FWKLKELH WAD SLIMDIMEAVNN+F+EASTLL TMVSSDN E+NN  STLGL+SSNDL W+ GKSPGW  FN  Q N+GLQDE+ 
Subjt:  GKKAVEETTDV-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIG

Query:  PEPLPPISTALPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNN
         E  PP+ T    LPP E LH V+GRSGRSF+S  L   DSLT  ENYGA+ TI  DS  +SGKKVVEE TD+  FWKLKE++SWADFSLI+DIM+AVNN
Subjt:  PEPLPPISTALPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNN

Query:  NFNEASTLLKIMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLK---DVHQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLA
        NF+EASTLLK MVSSD+FEINNE+STLGLHS+ND LCNG NDVSI+ E+ +  PILSS++K    +HQNNN+  ED TKL  N+Y+ERN  H+ GN K+A
Subjt:  NFNEASTLLKIMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLK---DVHQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLA

Query:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEA
        LGC KSVPIEPEWEEDDIYLSHRKDAIA+MRSASQHSRAATNAY R DHASAKYHSSRA+EQWLAAKMLN+KAANEIL+TRN KNGLWKLDLHGLHAAEA
Subjt:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPG
        VQAL +HLLKIETQNASNRSLSPKKAERKGF RASSLE LSC++SKLDKE  SPSSRHRPTSLEVITGIGKHS+GEAALPKAV +FL+ENGYRFEQ RPG
Subjt:  VQALQEHLLKIETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPG

Query:  TINVRPKFHR
        TI+VRPKF R
Subjt:  TINVRPKFHR

A0A1S3BRS7 uncharacterized protein LOC1034925907.1e-25176.23Show/hide
Query:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSP----------ENFGAKKTVLGDSSIQS
        MSWVRGKS GWAAFNLKQQ++G+QD  D +PFPPMST LSSLPPRENL  VNGRSGRSFS AP+PS DS T P           NF AKKT+LG S+IQS
Subjt:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSP----------ENFGAKKTVLGDSSIQS

Query:  GKKAVEETTDV-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIG
        GKK VEET DV +FWKLKELH WAD SLIMDIMEAVNN+F+EASTLL TMVSSDN E+NN+ S LGL+SSNDLSW+ GKSPGW  FN  Q NRGLQ E  
Subjt:  GKKAVEETTDV-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIG

Query:  PEPLPPISTALPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNN
        PE  PP+ T  P LPP E LH V+GR GRSF+S  L  ADSLT   NYGA+ TI  DSG +SGKKVVEE TD+  FWKLKE++SWADFSLI+DIM+AVNN
Subjt:  PEPLPPISTALPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNN

Query:  NFNEASTLLKIMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLK---DVHQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLA
        NF+EASTLLK MVSSD+FEINNE+STLGLHS+ND LCNG NDVSIS E+ +  PILS +LK    +HQN+N+  ED TKL  N+Y+ERNF  + GN K+A
Subjt:  NFNEASTLLKIMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLK---DVHQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLA

Query:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEA
        LGC KSVPIEPEWEEDDIYLSHRKDAIA+MRSASQHSRAATNAY R DHASAKYHSSRAQEQWLAAKMLN+KAANEIL+TRN KNGLWKLDLHGLHAAEA
Subjt:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPG
        VQALQ+HLLKIETQNASNRSLSPKKAERKGF RASSLE LSC+D+KLDKE  SPSSRHRPTSLEVITGIGKHS+GEAALPKAVT+FL+ENGYRFEQ RPG
Subjt:  VQALQEHLLKIETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPG

Query:  TINVRPKFHR
        TI+VRPKF R
Subjt:  TINVRPKFHR

A0A5D3CAF0 Smr (Small MutS Related) domain-containing protein, putative isoform 17.8e-25075.9Show/hide
Query:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSP----------ENFGAKKTVLGDSSIQS
        MSWVRGKS GWAAFNLKQQ++G+QD  D +PFPPMST LSSLPPRENL  VNGRSGRSFS AP+PS DS T P           NF AKKT+LG S+IQS
Subjt:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSP----------ENFGAKKTVLGDSSIQS

Query:  GKKAVEETTDV-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIG
        GKK VEET DV +FWKLKELH WAD SLIMDIMEAVNN+F+EASTLL TMVSSDN E+NN+ S LGL+SSNDLSW+ GKSPGW  FN  Q NRGLQ E  
Subjt:  GKKAVEETTDV-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIG

Query:  PEPLPPISTALPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNN
        PE  PP+ T  P LPP E LH V+G  GRSF+S  L  ADSLT   NYGA+ TI  DSG +SGKKVVEE TD+  FWKLKE++SWADFSLI+DIM+AVNN
Subjt:  PEPLPPISTALPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNN

Query:  NFNEASTLLKIMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLK---DVHQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLA
        NF+EASTLLK MVSSD+FEINNE+STLGLH +ND LCNG NDVSIS E+ +  PILS +LK    +HQN+N+  ED TKL  N+Y+ERNF  + GN K+A
Subjt:  NFNEASTLLKIMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLK---DVHQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLA

Query:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEA
        LGC KSVPIEPEWEEDD+YLSHRKDAIA+MRSASQHSRAATNAY R DHASAKYHSSRAQEQWLAAKMLN+KAANEIL+TRN KNGLWKLDLHGLHAAEA
Subjt:  LGCPKSVPIEPEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEA

Query:  VQALQEHLLKIETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPG
        VQALQ+HLLKIETQNASNRSLSPKKAERKGF RASSLE LSC+DSKLDKE  SPSSRHRPTSLEVITGIGKHS+GEAALPKAVT+FL+ENGYRFEQ RPG
Subjt:  VQALQEHLLKIETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPG

Query:  TINVRPKFHR
        TI+VRPKF R
Subjt:  TINVRPKFHR

A0A6J1GN51 uncharacterized protein LOC111455928 isoform X12.1e-25578.67Show/hide
Query:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSPENFGAKKTVLGDSSIQSGKKAVEETTD
        MSW RGKSPGWAA NLKQQ+SGLQD  DP+PFPPMSTALS LPPREN+HRVNGRSGRSFSS PLPS DSL SPENFG KKT+ G+SSI+SGKK VEE+TD
Subjt:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSPENFGAKKTVLGDSSIQSGKKAVEETTD

Query:  V-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIGPEPLPPISTA
        V  FWKLKELHSWAD SLI+DIMEAVNNNF+EAS LLKTMVSSDNFE+NN+ STLGL+SSND+S VRGKSPGW  FN  QQNRGLQD I P+P PP+ +A
Subjt:  V-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIGPEPLPPISTA

Query:  LPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNNNFNEASTLLK
        L  LPP E LH V+GR GRS SS+ L  ADSLTL ENY A+K ILGDS  ++G+KVVEETTD+  FWKLKEL++WADFSLI+DIMEAV+NNFNEAST L 
Subjt:  LPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNNNFNEASTLLK

Query:  IMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLKDV---HQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLALGCPKSVPIE
         MVSSD+ EI NEMSTLGLHS++   CNGKNDV+ISL + V  PI SS+LKDV   HQN N       KL ENNY+ERNF H+VGNPK+AL C KS PIE
Subjt:  IMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLKDV---HQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLALGCPKSVPIE

Query:  PEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEAVQALQEHLLK
        PEWEEDDIYLSHRKDAIA+MRSASQHSRAATNAYLR DHASAKYHSSRAQEQWLAAKMLN KAANEIL+TRN +NGLWKLDLHGLHAAEAVQALQ+HLLK
Subjt:  PEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEAVQALQEHLLK

Query:  IETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPGTINVRPKFHR
        IET+NASNRSLSPKKAERKGF R SSLE LSC+  KLDKE QSP  RHRPTSLEVITG+GKHSRGEAALPKAVT+FLSENGYRFEQLRPGTI+VRPKF R
Subjt:  IETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPGTINVRPKFHR

A0A6J1GPE7 uncharacterized protein LOC111455928 isoform X23.6e-24777Show/hide
Query:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSPENFGAKKTVLGDSSIQSGKKAVEETTD
        MSW RGKSPGWAA NLKQQ+SGLQD  DP+PFPPMSTALS LPPREN+HRVNGRSGRS             SPENFG KKT+ G+SSI+SGKK VEE+TD
Subjt:  MSWVRGKSPGWAAFNLKQQSSGLQD--DPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSPENFGAKKTVLGDSSIQSGKKAVEETTD

Query:  V-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIGPEPLPPISTA
        V  FWKLKELHSWAD SLI+DIMEAVNNNF+EAS LLKTMVSSDNFE+NN+ STLGL+SSND+S VRGKSPGW  FN  QQNRGLQD I P+P PP+ +A
Subjt:  V-TFWKLKELHSWADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIGPEPLPPISTA

Query:  LPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNNNFNEASTLLK
        L  LPP E LH V+GR GRS SS+ L  ADSLTL ENY A+K ILGDS  ++G+KVVEETTD+  FWKLKEL++WADFSLI+DIMEAV+NNFNEAST L 
Subjt:  LPFLPPCEKLHRVHGRSGRSFSSASL--ADSLTLAENYGARKTILGDSGSKSGKKVVEETTDI-TFWKLKELYSWADFSLIMDIMEAVNNNFNEASTLLK

Query:  IMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLKDV---HQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLALGCPKSVPIE
         MVSSD+ EI NEMSTLGLHS++   CNGKNDV+ISL + V  PI SS+LKDV   HQN N       KL ENNY+ERNF H+VGNPK+AL C KS PIE
Subjt:  IMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLKDV---HQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLALGCPKSVPIE

Query:  PEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEAVQALQEHLLK
        PEWEEDDIYLSHRKDAIA+MRSASQHSRAATNAYLR DHASAKYHSSRAQEQWLAAKMLN KAANEIL+TRN +NGLWKLDLHGLHAAEAVQALQ+HLLK
Subjt:  PEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEAVQALQEHLLK

Query:  IETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPGTINVRPKFHR
        IET+NASNRSLSPKKAERKGF R SSLE LSC+  KLDKE QSP  RHRPTSLEVITG+GKHSRGEAALPKAVT+FLSENGYRFEQLRPGTI+VRPKF R
Subjt:  IETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGIGKHSRGEAALPKAVTNFLSENGYRFEQLRPGTINVRPKFHR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G23520.1 smr (Small MutS Related) domain-containing protein9.3e-7841.69Show/hide
Query:  LSWVRGKSPGWATFNPVQ-QNRGLQDEIGPEPLPPISTAL-PFLPPCEKLHRVHGRSGRSFSSASLADSL--TLAENYGARKTILGDSGSKSGK--KVVE
        +SW++GKS GW  F+  Q Q +GL+ E+  +P PP+ST++        +L R H  S +SFSS  L  S    L EN        G    +      +  
Subjt:  LSWVRGKSPGWATFNPVQ-QNRGLQDEIGPEPLPPISTAL-PFLPPCEKLHRVHGRSGRSFSSASLADSL--TLAENYGARKTILGDSGSKSGK--KVVE

Query:  ETTDITFWKLKELYSWADFSLIMDIMEAVNNNFNEASTLLKIMVSS--DDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLKDVHQNN
         + D+ F KLKE+ SWAD +LI D++ +  ++F  A   LK MVSS  +D E  +++      +   +    +  V+ S++   +     +   D+  + 
Subjt:  ETTDITFWKLKELYSWADFSLIMDIMEAVNNNFNEASTLLKIMVSS--DDFEINNEMSTLGLHSSNDQLCNGKNDVSISLEKPVKIPILSSSLKDVHQNN

Query:  NSCLEDDTKLIENNYYERNFLHDVGNPKLALGCPKSVPIEPEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLN
             D +  + N      F  D+      +   +S+PIEPEWEEDD+YLSHRKDA+ +MRSAS HSRAA NA+ R DHASAK HS +A+E WLAA+ LN
Subjt:  NSCLEDDTKLIENNYYERNFLHDVGNPKLALGCPKSVPIEPEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRAQEQWLAAKMLN

Query:  NKAANEILRTRNGKNGLWKLDLHGLHAAEAVQALQEHLLKIETQNASNRSLSPKKAERK-GFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGI
         +AA +I+   N  N +WKLDLHGLHA EAVQALQE L  IE     NRS+SP +   K   LR++S E    +D +     Q  SSR    SL+VITGI
Subjt:  NKAANEILRTRNGKNGLWKLDLHGLHAAEAVQALQEHLLKIETQNASNRSLSPKKAERK-GFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGI

Query:  GKHSRGEAALPKAVTNFLSENGYRFEQLRPGTINVRPKF
        GKHSRG+A+LP AV  F  +N YRF++ RPG I VRPKF
Subjt:  GKHSRGEAALPKAVTNFLSENGYRFEQLRPGTINVRPKF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTGGGTGAGGGGTAAGTCTCCTGGCTGGGCAGCATTTAACCTTAAGCAACAGAGTAGTGGCCTTCAAGATGACCCGGAACCATTCCCACCAATGTCAACTGCCCT
TTCCTCTCTGCCACCCCGTGAAAATTTGCACAGAGTTAATGGTCGTTCAGGGAGATCTTTCTCATCTGCACCCCTTCCATCTACTGATTCCTTAACTTCGCCAGAAAATT
TTGGTGCAAAAAAGACCGTACTTGGTGATTCTAGCATTCAAAGTGGCAAGAAGGCAGTCGAAGAAACCACTGATGTTACCTTTTGGAAGCTTAAAGAGCTTCATTCTTGG
GCTGATTTTAGCTTGATTATGGATATAATGGAAGCTGTAAATAATAACTTCGATGAGGCGTCTACTTTATTAAAAACAATGGTTTCTAGTGATAATTTTGAGGTCAATAA
TGATACGAGCACCTTAGGACTTTATTCCTCTAATGATCTATCGTGGGTGAGGGGTAAATCTCCTGGCTGGGCAACATTTAACCCTGTGCAACAGAATAGAGGCCTTCAAG
ATGAAATTGGCCCAGAACCACTTCCACCAATATCAACCGCCCTTCCCTTTCTGCCACCTTGTGAAAAGTTGCACAGAGTTCATGGTCGTTCAGGGAGATCTTTCTCATCT
GCATCCCTTGCCGATTCTCTAACTTTGGCAGAAAATTATGGTGCAAGAAAGACAATACTTGGTGATTCTGGCAGTAAAAGTGGCAAGAAGGTGGTTGAAGAAACCACTGA
TATTACCTTTTGGAAGCTTAAAGAGCTTTATTCTTGGGCTGATTTTAGCTTGATTATGGATATAATGGAAGCTGTAAATAATAACTTCAATGAGGCGTCTACTTTATTAA
AAATAATGGTTTCTAGTGATGATTTTGAGATCAATAATGAGATGAGTACCTTAGGACTGCATTCCTCTAATGATCAATTGTGCAATGGGAAGAATGATGTAAGTATATCA
TTAGAAAAACCGGTTAAAATTCCCATTCTTAGTTCTTCACTAAAGGACGTGCATCAAAACAATAATTCATGTCTAGAAGATGACACAAAATTGATTGAAAATAATTATTA
TGAAAGGAACTTCCTTCATGATGTTGGGAATCCTAAACTAGCTCTTGGTTGCCCAAAGTCTGTTCCTATTGAGCCTGAGTGGGAAGAAGATGATATTTACCTGAGCCATC
GGAAAGATGCTATAGCAATAATGAGGTCGGCATCTCAACATTCAAGGGCAGCCACTAATGCCTATCTTAGGAATGATCATGCTTCTGCCAAGTATCATTCATCAAGGGCT
CAAGAACAATGGTTAGCTGCAAAAATGTTAAATAATAAGGCGGCTAATGAAATTTTACGAACGAGGAATGGTAAAAATGGGCTCTGGAAGTTGGACTTACATGGACTTCA
TGCAGCAGAGGCTGTTCAAGCCTTGCAAGAACACTTGCTGAAAATTGAAACTCAGAATGCCTCCAACCGGTCGTTGTCACCAAAGAAAGCAGAAAGGAAAGGGTTCCTAC
GTGCTTCATCCCTTGAGTCTCTTAGTTGTATAGACTCAAAGTTGGACAAAGAATCACAATCACCATCATCAAGGCATAGGCCGACATCATTGGAAGTCATAACAGGCATA
GGTAAACATAGCAGGGGGGAAGCTGCTCTACCCAAGGCTGTGACAAATTTTCTTAGTGAAAACGGGTACCGTTTCGAACAGTTGAGGCCTGGGACGATCAACGTTCGACC
TAAGTTTCATAGGTAA
mRNA sequenceShow/hide mRNA sequence
CCGGAAATAGATTAAAGTAAGAAATTCGACAGAAAAGTAATGAATTTGAGTTCATCGCTTTGAAGGAATTCAGAAATCGAAGTCAACAATCTGAGTATGTGGGGTAGCGT
TGAACTGTGCCCTCAGAAGATTTTCCGATTTCTCTGTTTCCTTTTCCGATCTGGAATTCCATCTATGTTGCAAGCTCTCACTTTCAATCTCCCCATCGCATCGATTCCGA
AGACGCACGAACGACGGACTCGGGGATTTGACATAGGAAGAAGACGAACTTAGCGGATTTGGGATTCGAGGCTTTTCAACTTTTTCATCCTCTCCTGCTGCTCCTCCAGG
TTTGAACCCTAATTTTCTCCTCGAAGCTGGAAGGTTATGAACGGGTCTGTTTCTTCAAGTTGTGCCTACCTCTTCGAGTTGCAGTTTCATTGGATTGTTGGAACTCCAAG
ACATTCATAGTAGGGGGAGTTTGAGTAGACCTGTGAAATTGTAATTTACTGGTGCTCAACTCCATATGCTTTATTTTCAGTTCTTTACATTGGAAAATGTCGTGGGTGAG
GGGTAAGTCTCCTGGCTGGGCAGCATTTAACCTTAAGCAACAGAGTAGTGGCCTTCAAGATGACCCGGAACCATTCCCACCAATGTCAACTGCCCTTTCCTCTCTGCCAC
CCCGTGAAAATTTGCACAGAGTTAATGGTCGTTCAGGGAGATCTTTCTCATCTGCACCCCTTCCATCTACTGATTCCTTAACTTCGCCAGAAAATTTTGGTGCAAAAAAG
ACCGTACTTGGTGATTCTAGCATTCAAAGTGGCAAGAAGGCAGTCGAAGAAACCACTGATGTTACCTTTTGGAAGCTTAAAGAGCTTCATTCTTGGGCTGATTTTAGCTT
GATTATGGATATAATGGAAGCTGTAAATAATAACTTCGATGAGGCGTCTACTTTATTAAAAACAATGGTTTCTAGTGATAATTTTGAGGTCAATAATGATACGAGCACCT
TAGGACTTTATTCCTCTAATGATCTATCGTGGGTGAGGGGTAAATCTCCTGGCTGGGCAACATTTAACCCTGTGCAACAGAATAGAGGCCTTCAAGATGAAATTGGCCCA
GAACCACTTCCACCAATATCAACCGCCCTTCCCTTTCTGCCACCTTGTGAAAAGTTGCACAGAGTTCATGGTCGTTCAGGGAGATCTTTCTCATCTGCATCCCTTGCCGA
TTCTCTAACTTTGGCAGAAAATTATGGTGCAAGAAAGACAATACTTGGTGATTCTGGCAGTAAAAGTGGCAAGAAGGTGGTTGAAGAAACCACTGATATTACCTTTTGGA
AGCTTAAAGAGCTTTATTCTTGGGCTGATTTTAGCTTGATTATGGATATAATGGAAGCTGTAAATAATAACTTCAATGAGGCGTCTACTTTATTAAAAATAATGGTTTCT
AGTGATGATTTTGAGATCAATAATGAGATGAGTACCTTAGGACTGCATTCCTCTAATGATCAATTGTGCAATGGGAAGAATGATGTAAGTATATCATTAGAAAAACCGGT
TAAAATTCCCATTCTTAGTTCTTCACTAAAGGACGTGCATCAAAACAATAATTCATGTCTAGAAGATGACACAAAATTGATTGAAAATAATTATTATGAAAGGAACTTCC
TTCATGATGTTGGGAATCCTAAACTAGCTCTTGGTTGCCCAAAGTCTGTTCCTATTGAGCCTGAGTGGGAAGAAGATGATATTTACCTGAGCCATCGGAAAGATGCTATA
GCAATAATGAGGTCGGCATCTCAACATTCAAGGGCAGCCACTAATGCCTATCTTAGGAATGATCATGCTTCTGCCAAGTATCATTCATCAAGGGCTCAAGAACAATGGTT
AGCTGCAAAAATGTTAAATAATAAGGCGGCTAATGAAATTTTACGAACGAGGAATGGTAAAAATGGGCTCTGGAAGTTGGACTTACATGGACTTCATGCAGCAGAGGCTG
TTCAAGCCTTGCAAGAACACTTGCTGAAAATTGAAACTCAGAATGCCTCCAACCGGTCGTTGTCACCAAAGAAAGCAGAAAGGAAAGGGTTCCTACGTGCTTCATCCCTT
GAGTCTCTTAGTTGTATAGACTCAAAGTTGGACAAAGAATCACAATCACCATCATCAAGGCATAGGCCGACATCATTGGAAGTCATAACAGGCATAGGTAAACATAGCAG
GGGGGAAGCTGCTCTACCCAAGGCTGTGACAAATTTTCTTAGTGAAAACGGGTACCGTTTCGAACAGTTGAGGCCTGGGACGATCAACGTTCGACCTAAGTTTCATAGGT
AAATGGCTAAAACAGTAATGGTTTTAAGAGAATGTAAGTTATTTAGTTAGATTAGAGTAGTTGGAGATTGTGCTAAGTGATACAGTGTAAAATAACTGAAAATGTTATTT
AGGAATTTGTCTATTGAATGATATCTTGAACCTTGGAAGGTTGTACAGGAGAGCAGAATTAACAGTGTGGTGGATTGATTGTTAACCACATTACTACTATATTCTTATGA
TGTTCTTTTTCTTGAAAAAAAAGATAGCAGTTTAGCATTTGGGAAAAA
Protein sequenceShow/hide protein sequence
MSWVRGKSPGWAAFNLKQQSSGLQDDPEPFPPMSTALSSLPPRENLHRVNGRSGRSFSSAPLPSTDSLTSPENFGAKKTVLGDSSIQSGKKAVEETTDVTFWKLKELHSW
ADFSLIMDIMEAVNNNFDEASTLLKTMVSSDNFEVNNDTSTLGLYSSNDLSWVRGKSPGWATFNPVQQNRGLQDEIGPEPLPPISTALPFLPPCEKLHRVHGRSGRSFSS
ASLADSLTLAENYGARKTILGDSGSKSGKKVVEETTDITFWKLKELYSWADFSLIMDIMEAVNNNFNEASTLLKIMVSSDDFEINNEMSTLGLHSSNDQLCNGKNDVSIS
LEKPVKIPILSSSLKDVHQNNNSCLEDDTKLIENNYYERNFLHDVGNPKLALGCPKSVPIEPEWEEDDIYLSHRKDAIAIMRSASQHSRAATNAYLRNDHASAKYHSSRA
QEQWLAAKMLNNKAANEILRTRNGKNGLWKLDLHGLHAAEAVQALQEHLLKIETQNASNRSLSPKKAERKGFLRASSLESLSCIDSKLDKESQSPSSRHRPTSLEVITGI
GKHSRGEAALPKAVTNFLSENGYRFEQLRPGTINVRPKFHR