CuGenDBv2

Sgr018213 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr018213
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Genome location: tig00153145:238986..255928
RNA-Seq Expression: Sgr018213
Synteny: Sgr018213
Gene Ontology terms:
GO:0098542 - defense response to other organism (biological process)
GO:0009506 - plasmodesma (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0046658 - anchored component of plasma membrane (cellular component)
InterPro domains:
IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
IPR044839 - Protein NDR1-like
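
The genome location above follows a `contig:start..end` pattern. As a minimal sketch, the string can be split into its parts and the genomic span computed as follows. The names `Location` and `parse_location` are hypothetical helpers introduced here for illustration, and the span assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval on a named contig (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("tig00153145:238986..255928")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene model spans 16943 bp of contig tig00153145.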


Homology
GenBank top hits | e value | % identity | Alignment
KAA0043819.1 NDR1/HIN1-like protein 12 [Cucumis melo var. makuwa] | 3.8e-90 | 78.95
Query:  LPQNSPAGSAMVVDCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDE
        L  +SP G+AM  DC+KHCKKKRK+L+K+IG  + +F+F+VLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLT+NFL+TI SRNPNRRIGIYYDE
Subjt:  LPQNSPAGSAMVVDCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDE

Query:  LDVYAVYRNQQITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFG-NPSPG
        L VYA+YRNQQITLRTIIP FYQGHKDVNVWSPF+SGTSVPVAPFIS+ALNQDR+AGAL++LVKIDG+VRWKVG FI+G YQFHANCP  INFG  P+ G
Subjt:  LDVYAVYRNQQITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFG-NPSPG

Query:  DGSITKLHV
        DGS+ + +V
Subjt:  DGSITKLHV

KAB5552942.1 hypothetical protein DKX38_010253 [Salix brachista] | 1.4e-100 | 39.79
Query:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQIT
        DC  H + +R+  ++ + GA++ FL IVL+ ILIVWA+LRP+KP F LQD TVYAFN + P+ LT+NF VTI+SR+P+ R+GIYYD+LD+YA YRNQQIT
Subjt:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQIT

Query:  LRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITKLHVDVVDS
        LRT IPP YQGH+D +VWSPFI G +VPV+P+ S AL+QD+ AG ++L++KIDGRVR+KVG FIS  Y+ HA CPA I FG+ +               +
Subjt:  LRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITKLHVDVVDS

Query:  GEILGVE-LSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVVDSDSLVGVSDDQHSKTNDDDEGDDDLENGAEEAAGATAA
        G I+G   + ++        L  + S E+       +ER    G     F+    F G    +  L  V+ + ++  N+          G +  +  + +
Subjt:  GEILGVE-LSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVVDSDSLVGVSDDQHSKTNDDDEGDDDLENGAEEAAGATAA

Query:  VAVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQPHLNAAFYGPAVPPPHHGR--------GC
        + +++                            T +      + N   + MG                 KQ  L  A  G ++PPP HGR        GC
Subjt:  VAVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQPHLNAAFYGPAVPPPHHGR--------GC

Query:  ACCLLSTILKIVVALVVAAGILVLISWLVFRP-HKIKFHVTDAHLTQFDFTGDQLHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRRIETQWLMPFYQ
         CCLL  +L ++  + +  G+ +LI WL+FRP +++K HVTD  LTQF+ T + L +NLA NI+IRNPNK+ G++YD IEA A Y D+R   Q L PFYQ
Subjt:  ACCLLSTILKIVVALVVAAGILVLISWLVFRP-HKIKFHVTDAHLTQFDFTGDQLHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRRIETQWLMPFYQ

Query:  GHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEV-IGKFKPKVSC-ELKVPLS
        GHK T+V+N  F GQQ + L G++L +F+ EK +G+Y I ++  +R+R K G+V   +F+PK+ C +LK+PL+
Subjt:  GHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEV-IGKFKPKVSC-ELKVPLS

KAF2323816.1 hypothetical protein GH714_042401 [Hevea brasiliensis] | 9.5e-110 | 40.41
Query:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQIT
        DC  H  K+RK L++ I   +L+FLF+VL+TIL++WA+LRP+KP+F LQDVTVYAFNA+VP +LT+NF VT  SRNPN +IGIYYD L+VYA Y  QQIT
Subjt:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQIT

Query:  LRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITKLHVDVVDS
        L + IPP YQGHK++NVWSP + GT++PVAP+   +L QD+  GA+LL++K+DGRVR+KVG FI+G Y  +  CPA I FG+ +                
Subjt:  LRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITKLHVDVVDS

Query:  GEILGVELSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVVDSDSLVGVSDDQHSKTNDDDEGDDDLENGAEEAAGATAAV
                                                                G++V  +S                                   V
Subjt:  GEILGVELSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVVDSDSLVGVSDDQHSKTNDDDEGDDDLENGAEEAAGATAAV

Query:  AVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQPHLNAAFYGPAVPPPH----HGR--GCACC
          ++VG        +  G +  GLF+ H      +  ++  +   K                      KQ +LN A+YGPA+PPP+     GR  GC CC
Subjt:  AVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQPHLNAAFYGPAVPPPH----HGR--GCACC

Query:  LLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDF--TGDQLHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRRIETQWLMPFYQGH
        LLS +LKI++A+VV AG+ V I WLV RP+K+KFHVT+A L++FD+    + L+YNL++ I++RNPNK+ G+YYD IEA A Y+D+R     L PFYQGH
Subjt:  LLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDF--TGDQLHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRRIETQWLMPFYQGH

Query:  KTTTVINSTFDGQQLV-LLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEV-IGKFKPKVSCELKVPLSPNAYSFTWFQTTACDFDF
        K T+++   F+G+Q++  L G+EL +F+ EK +G+Y IDVK  L++RLK G++ +GKFKPK+ C+LKVPLS N  +    +TT CD+D+
Subjt:  KTTTVINSTFDGQQLV-LLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEV-IGKFKPKVSCELKVPLSPNAYSFTWFQTTACDFDF

KAG5398828.1 hypothetical protein IGI04_020642 [Brassica rapa subsp. trilocularis] | 1.1e-92 | 36.01
Query:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNAT--VPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQ
        DC  H   +RK + +II  ++++ LFI+ LTIL++WA+L+PTKP F LQD TVYAFN +   P+ LT+NF VT+ SRNPN +IG+YYD L+VYA Y+NQQ
Subjt:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNAT--VPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQ

Query:  ITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITKLHVDVV
        IT RT IPP YQGHK+VN+WSPF+ GTSVP+APF   +L+ D++ G + L ++ DGRVRWKVG FI+G Y  +  C A INFGN +              
Subjt:  ITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITKLHVDVV

Query:  DSGEILGVELSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVVDSDSLVGVSDDQHSKTNDDDEGDDDLENGAEEAAGATA
         +G I+G    K   A+  +   +   G D                EP      GG+DG                            L  G         
Subjt:  DSGEILGVELSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVVDSDSLVGVSDDQHSKTNDDDEGDDDLENGAEEAAGATA

Query:  AVAVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQPHLNAAFYGPAVPPPHHG----------
                          +G  Q  L  G+ G                          P  H +      +QPHLN  +YGP +PPP             
Subjt:  AVAVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQPHLNAAFYGPAVPPPHHG----------

Query:  --------------RGCACCLLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDF---TGDQLHYNLALNITIRNPNKRYGVYYDSIEA
                      R C CC+LS +  I++A+ V  G+  LI WL+FRP+ +KF+V DA+L +F         L Y+L LN TIRNPN+R G+YYD ++ 
Subjt:  --------------RGCACCLLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDF---TGDQLHYNLALNITIRNPNKRYGVYYDSIEA

Query:  TAVYKDRRIETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEVIG-KFKPKVSC-ELKVPLSPNAYSFTW
        +  Y D+R  +  + PFYQGHK TTV+ +  +GQ LV+LG    A+   +   G+Y ID++  + +R +   +   K KPK+ C +LK+PL  ++ +  +
Subjt:  TAVYKDRRIETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEVIG-KFKPKVSC-ELKVPLSPNAYSFTW

Query:  -FQTTACDFDF
         FQT  C+FDF
Subjt:  -FQTTACDFDF

TXG58153.1 hypothetical protein EZV62_015982 [Acer yangbiense] | 1.3e-98 | 38.64
Query:  DCQKH--------CKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYA
        DC  H        CK  R RL + I   +L+FL   L+TIL+VWA+LRPTKP F LQD TVYAFN + P+FLT+NF VTI SRNPN ++GIYYD LD+YA
Subjt:  DCQKH--------CKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYA

Query:  VYRNQQITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITK
        +YRNQQIT RT IP  YQGHK VNVWSP++ GT+VPVAP+ + +L +D++ G + L+ KIDGRVRWKVG  I+  Y  +  C A IN GN          
Subjt:  VYRNQQITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITK

Query:  LHVDVVDSGEILGVELSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVVDSDSLVGVSDDQHSKTNDDDEGDDDLENGAEE
                                                                        G+VV                                
Subjt:  LHVDVVDSGEILGVELSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVVDSDSLVGVSDDQHSKTNDDDEGDDDLENGAEE

Query:  AAGATAAVAVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQPHLNAAFYGPAVPP--PHH---
                                                                                AMADKQ HLN A+YG  +PP   +H   
Subjt:  AAGATAAVAVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQPHLNAAFYGPAVPP--PHH---

Query:  ------GRGCACCLLSTILKIVVALVVAAGILVLISWLVFRP-HKIKFHVTDAHLTQFDFTGDQ-LHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRR
              G GC CC    ILKIVV +VV  G+  LI WL+FRP ++IKFH  D  LTQF+ T +Q L+Y LALNI IRNPNK+ GVYYD IEA A Y+D+R
Subjt:  ------GRGCACCLLSTILKIVVALVVAAGILVLISWLVFRP-HKIKFHVTDAHLTQFDFTGDQ-LHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRR

Query:  IETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEVIG-KFKPKVSCELKVPL--SPNAYSFTWFQTTACD
        +++  + PFYQGHK TTV+N  F+GQQL++  G+ + +F  EK +GIY IDVK  LR+R K G+V   K +P++ CELKVP+  S    +     TT C 
Subjt:  IETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEVIG-KFKPKVSCELKVPL--SPNAYSFTWFQTTACD

Query:  FDF
        FDF
Subjt:  FDF

TrEMBL top hits | e value | % identity | Alignment
A0A5A7TRJ4 NDR1/HIN1-like protein 12 | 1.8e-90 | 78.95
Query:  LPQNSPAGSAMVVDCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDE
        L  +SP G+AM  DC+KHCKKKRK+L+K+IG  + +F+F+VLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLT+NFL+TI SRNPNRRIGIYYDE
Subjt:  LPQNSPAGSAMVVDCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDE

Query:  LDVYAVYRNQQITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFG-NPSPG
        L VYA+YRNQQITLRTIIP FYQGHKDVNVWSPF+SGTSVPVAPFIS+ALNQDR+AGAL++LVKIDG+VRWKVG FI+G YQFHANCP  INFG  P+ G
Subjt:  LDVYAVYRNQQITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFG-NPSPG

Query:  DGSITKLHV
        DGS+ + +V
Subjt:  DGSITKLHV

A0A5C7HME2 Uncharacterized protein | 6.2e-99 | 38.64
Query:  DCQKH--------CKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYA
        DC  H        CK  R RL + I   +L+FL   L+TIL+VWA+LRPTKP F LQD TVYAFN + P+FLT+NF VTI SRNPN ++GIYYD LD+YA
Subjt:  DCQKH--------CKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYA

Query:  VYRNQQITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITK
        +YRNQQIT RT IP  YQGHK VNVWSP++ GT+VPVAP+ + +L +D++ G + L+ KIDGRVRWKVG  I+  Y  +  C A IN GN          
Subjt:  VYRNQQITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITK

Query:  LHVDVVDSGEILGVELSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVVDSDSLVGVSDDQHSKTNDDDEGDDDLENGAEE
                                                                        G+VV                                
Subjt:  LHVDVVDSGEILGVELSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVVDSDSLVGVSDDQHSKTNDDDEGDDDLENGAEE

Query:  AAGATAAVAVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQPHLNAAFYGPAVPP--PHH---
                                                                                AMADKQ HLN A+YG  +PP   +H   
Subjt:  AAGATAAVAVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQPHLNAAFYGPAVPP--PHH---

Query:  ------GRGCACCLLSTILKIVVALVVAAGILVLISWLVFRP-HKIKFHVTDAHLTQFDFTGDQ-LHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRR
              G GC CC    ILKIVV +VV  G+  LI WL+FRP ++IKFH  D  LTQF+ T +Q L+Y LALNI IRNPNK+ GVYYD IEA A Y+D+R
Subjt:  ------GRGCACCLLSTILKIVVALVVAAGILVLISWLVFRP-HKIKFHVTDAHLTQFDFTGDQ-LHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRR

Query:  IETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEVIG-KFKPKVSCELKVPL--SPNAYSFTWFQTTACD
        +++  + PFYQGHK TTV+N  F+GQQL++  G+ + +F  EK +GIY IDVK  LR+R K G+V   K +P++ CELKVP+  S    +     TT C 
Subjt:  IETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEVIG-KFKPKVSCELKVPL--SPNAYSFTWFQTTACD

Query:  FDF
        FDF
Subjt:  FDF

A0A5D3DNV1 NDR1/HIN1-like protein 12 | 1.9e-87 | 80.4
Query:  MVVDCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQ
        M  DC+KHCKKKRK+L+K+IG  + +F+F+VLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLT+NFL+TI SRNPNRRIGIYYDEL VYA+YRNQ
Subjt:  MVVDCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQ

Query:  QITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFG-NPSPGDGSITKLHV
        QITLRTIIP FYQGHKDVNVWSPF+SGTSVPVAPFIS+ALNQDR+AGAL++LVKIDG+VRWKVG FI+G YQFHANCP  INFG  P+ GDGS+ + +V
Subjt:  QITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFG-NPSPGDGSITKLHV

A0A5N5MD96 Uncharacterized protein | 6.7e-101 | 39.79
Query:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQIT
        DC  H + +R+  ++ + GA++ FL IVL+ ILIVWA+LRP+KP F LQD TVYAFN + P+ LT+NF VTI+SR+P+ R+GIYYD+LD+YA YRNQQIT
Subjt:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQIT

Query:  LRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITKLHVDVVDS
        LRT IPP YQGH+D +VWSPFI G +VPV+P+ S AL+QD+ AG ++L++KIDGRVR+KVG FIS  Y+ HA CPA I FG+ +               +
Subjt:  LRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITKLHVDVVDS

Query:  GEILGVE-LSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVVDSDSLVGVSDDQHSKTNDDDEGDDDLENGAEEAAGATAA
        G I+G   + ++        L  + S E+       +ER    G     F+    F G    +  L  V+ + ++  N+          G +  +  + +
Subjt:  GEILGVE-LSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVVDSDSLVGVSDDQHSKTNDDDEGDDDLENGAEEAAGATAA

Query:  VAVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQPHLNAAFYGPAVPPPHHGR--------GC
        + +++                            T +      + N   + MG                 KQ  L  A  G ++PPP HGR        GC
Subjt:  VAVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQPHLNAAFYGPAVPPPHHGR--------GC

Query:  ACCLLSTILKIVVALVVAAGILVLISWLVFRP-HKIKFHVTDAHLTQFDFTGDQLHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRRIETQWLMPFYQ
         CCLL  +L ++  + +  G+ +LI WL+FRP +++K HVTD  LTQF+ T + L +NLA NI+IRNPNK+ G++YD IEA A Y D+R   Q L PFYQ
Subjt:  ACCLLSTILKIVVALVVAAGILVLISWLVFRP-HKIKFHVTDAHLTQFDFTGDQLHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRRIETQWLMPFYQ

Query:  GHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEV-IGKFKPKVSC-ELKVPLS
        GHK T+V+N  F GQQ + L G++L +F+ EK +G+Y I ++  +R+R K G+V   +F+PK+ C +LK+PL+
Subjt:  GHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEV-IGKFKPKVSC-ELKVPLS

A0A6A6NF99 Uncharacterized protein | 4.6e-110 | 40.41
Query:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQIT
        DC  H  K+RK L++ I   +L+FLF+VL+TIL++WA+LRP+KP+F LQDVTVYAFNA+VP +LT+NF VT  SRNPN +IGIYYD L+VYA Y  QQIT
Subjt:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQIT

Query:  LRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITKLHVDVVDS
        L + IPP YQGHK++NVWSP + GT++PVAP+   +L QD+  GA+LL++K+DGRVR+KVG FI+G Y  +  CPA I FG+ +                
Subjt:  LRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITKLHVDVVDS

Query:  GEILGVELSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVVDSDSLVGVSDDQHSKTNDDDEGDDDLENGAEEAAGATAAV
                                                                G++V  +S                                   V
Subjt:  GEILGVELSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVVDSDSLVGVSDDQHSKTNDDDEGDDDLENGAEEAAGATAAV

Query:  AVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQPHLNAAFYGPAVPPPH----HGR--GCACC
          ++VG        +  G +  GLF+ H      +  ++  +   K                      KQ +LN A+YGPA+PPP+     GR  GC CC
Subjt:  AVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQPHLNAAFYGPAVPPPH----HGR--GCACC

Query:  LLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDF--TGDQLHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRRIETQWLMPFYQGH
        LLS +LKI++A+VV AG+ V I WLV RP+K+KFHVT+A L++FD+    + L+YNL++ I++RNPNK+ G+YYD IEA A Y+D+R     L PFYQGH
Subjt:  LLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDF--TGDQLHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRRIETQWLMPFYQGH

Query:  KTTTVINSTFDGQQLV-LLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEV-IGKFKPKVSCELKVPLSPNAYSFTWFQTTACDFDF
        K T+++   F+G+Q++  L G+EL +F+ EK +G+Y IDVK  L++RLK G++ +GKFKPK+ C+LKVPLS N  +    +TT CD+D+
Subjt:  KTTTVINSTFDGQQLV-LLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEV-IGKFKPKVSCELKVPLSPNAYSFTWFQTTACDFDF

SwissProt top hits | e value | % identity | Alignment
Q9FNH6 NDR1/HIN1-like protein 3 | 4.7e-51 | 48.25
Query:  LNAAFYGPAVPPP-----HHGRG-------------CACCLLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDF-TGDQLHYNLALNI
        LN A+YGP++PPP      HGR              C CC+LS I  I++ + V  GI  LI WL+FRP+ IKFHVTDA LT+F     + L YNL LN 
Subjt:  LNAAFYGPAVPPP-----HHGRG-------------CACCLLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDF-TGDQLHYNLALNI

Query:  TIRNPNKRYGVYYDSIEATAVYKDRRI-ETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEVIG-KFKPK
        TIRNPN+R GVYYD IE    Y D+R   +  +  FYQGHK TTV+ +   GQQLVLL G E  + + +  + IY ID K RL++R K G +   +FKPK
Subjt:  TIRNPNKRYGVYYDSIEATAVYKDRRI-ETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEVIG-KFKPK

Query:  VSCELKVPLSPNAYSFTWFQTTACDFDF
        + C+LKVPL+ N+ S   FQ T CD DF
Subjt:  VSCELKVPLSPNAYSFTWFQTTACDFDF

Q9SJ52 NDR1/HIN1-like protein 10 | 1.3e-56 | 50.88
Query:  MADKQPHLNAAFYGPAVPPP--------HHGRGCACCLLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDFTGDQ--LHYNLALNITI
        MA +QP LN AFYGP+VPPP         HGRGC CCLLS  +K++++L+V  G+  LI WL+ RP  IKFHVTDA LT+FD T     L YNLAL + +
Subjt:  MADKQPHLNAAFYGPAVPPP--------HHGRGCACCLLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDFTGDQ--LHYNLALNITI

Query:  RNPNKRYGVYYDSIEATAVYKDRRIETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEV-IGKFKPKVSC
        RNPNKR G+YYD IEA A Y+ +R  T  L PFYQGHK TTV+  TF GQ LV+    +    +AE+ +G+Y I++KFRLR+R K G++   + KPKV C
Subjt:  RNPNKRYGVYYDSIEATAVYKDRRIETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEV-IGKFKPKVSC

Query:  -ELKVPLSPNAYSFTWFQT--TACDFDF
         +L++PLS +  + T        CDFDF
Subjt:  -ELKVPLSPNAYSFTWFQT--TACDFDF

Q9SJ54 NDR1/HIN1-like protein 12 | 1.0e-53 | 55.38
Query:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQIT
        DC  H           I G ++ F+ IVL+TI +VW +L+PTKP F LQD TVYAFN + P+ LT+NF +TI SRN N RIGIYYD L VYA YRNQQIT
Subjt:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQIT

Query:  LRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPG
        LRT IPP YQGHK+ NVWSPF+ G SVP+APF + AL  +++ G + L+++ DGRVRWKVG  I+G Y  H  C A IN  + + G
Subjt:  LRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPG

Q9SRN0 NDR1/HIN1-like protein 1 | 1.9e-60 | 58.51
Query:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNAT--VPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQ
        DC+ H   +RK L++ I  +++  LFI+ LTIL++WA+L+P+KP F LQD TVYAFN +   P+ LT+NF +T+ SRNPN +IGIYYD LDVYA YR+QQ
Subjt:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNAT--VPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQ

Query:  ITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPG
        IT  T IPP YQGHKDV++WSPF+ GTSVP+APF   +L+ D+D G +LL+++ DGRVRWKVG FI+G Y  H  CPA INFGN + G
Subjt:  ITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPG

Q9SRN1 NDR1/HIN1-like protein 2 | 9.1e-47 | 43.28
Query:  MADKQPHLNAAFYGPAVPPP---HHG------------------RGCACCLLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDF-TGD
        M  KQP+LN A+YGP++PPP   H                    R C CC+LS I  I++A+ V  G+  LI WL+FRP+ +KF+V DA+L +F F   +
Subjt:  MADKQPHLNAAFYGPAVPPP---HHG------------------RGCACCLLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDF-TGD

Query:  QLHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRRIETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGE
         LHY+L LN TIRNPN+R GVYYD    +  Y D+R  +  +  FYQGHK TTVI +  +GQ LV+LG     +   ++++GIY I+ K RL +R K   
Subjt:  QLHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRRIETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGE

Query:  VIG-KFKPKVSC-ELKVPL-SPNAYSFTWFQTTACDFD
        +   K KPK+ C +LK+PL S N+     FQ   CDFD
Subjt:  VIG-KFKPKVSC-ELKVPL-SPNAYSFTWFQTTACDFD

Arabidopsis top hits | e value | % identity | Alignment
AT2G35960.1 NDR1/HIN1-like 12 | 7.1e-55 | 55.38
Query:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQIT
        DC  H           I G ++ F+ IVL+TI +VW +L+PTKP F LQD TVYAFN + P+ LT+NF +TI SRN N RIGIYYD L VYA YRNQQIT
Subjt:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQIT

Query:  LRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPG
        LRT IPP YQGHK+ NVWSPF+ G SVP+APF + AL  +++ G + L+++ DGRVRWKVG  I+G Y  H  C A IN  + + G
Subjt:  LRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPG

AT2G35980.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 9.0e-58 | 50.88
Query:  MADKQPHLNAAFYGPAVPPP--------HHGRGCACCLLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDFTGDQ--LHYNLALNITI
        MA +QP LN AFYGP+VPPP         HGRGC CCLLS  +K++++L+V  G+  LI WL+ RP  IKFHVTDA LT+FD T     L YNLAL + +
Subjt:  MADKQPHLNAAFYGPAVPPP--------HHGRGCACCLLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDFTGDQ--LHYNLALNITI

Query:  RNPNKRYGVYYDSIEATAVYKDRRIETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEV-IGKFKPKVSC
        RNPNKR G+YYD IEA A Y+ +R  T  L PFYQGHK TTV+  TF GQ LV+    +    +AE+ +G+Y I++KFRLR+R K G++   + KPKV C
Subjt:  RNPNKRYGVYYDSIEATAVYKDRRIETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEV-IGKFKPKVSC

Query:  -ELKVPLSPNAYSFTWFQT--TACDFDF
         +L++PLS +  + T        CDFDF
Subjt:  -ELKVPLSPNAYSFTWFQT--TACDFDF

AT3G11660.1 NDR1/HIN1-like 1 | 1.3e-61 | 58.51
Query:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNAT--VPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQ
        DC+ H   +RK L++ I  +++  LFI+ LTIL++WA+L+P+KP F LQD TVYAFN +   P+ LT+NF +T+ SRNPN +IGIYYD LDVYA YR+QQ
Subjt:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNAT--VPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQ

Query:  ITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPG
        IT  T IPP YQGHKDV++WSPF+ GTSVP+APF   +L+ D+D G +LL+++ DGRVRWKVG FI+G Y  H  CPA INFGN + G
Subjt:  ITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPG

AT3G44220.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 2.2e-56 | 54.69
Query:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQIT
        +C+ H  +  K + K IG  VL FL  VL  + +VWA+L P  P F LQD T+YAFN + P++LT+N  VT+ SRNPN +IGI+YD LD+YA YRNQQ+T
Subjt:  DCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQIT

Query:  LRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITK
        L T++P  YQGH DV +WSPF+ GT+VPVAP+ S AL+QD  AG +LL +KIDG VRWKVG ++SG Y+ H NCPA I       GDG   K
Subjt:  LRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITK

AT3G52470.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 2.6e-57 | 57.14
Query:  MVVDCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQ
        M  DC  H   K   + K+   A++ F+ IVL+TI +VW +LRPTKP F LQD TVYAFN + P+ LT+NF VTI SRNPN +IGIYYD L VYA Y NQ
Subjt:  MVVDCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLRPTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQ

Query:  QITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPG
        QITLRT IPP YQGHK+VNVWSPF+ GT+VP+AP+ S AL +++D G + L+++ DG VRWKV   I+G Y  H  C A IN GN + G
Subjt:  QITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLVKIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPG
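
In the pairwise alignments above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, percent identity for one displayed segment can be counted from that line as below. The function name `identity_stats` is a hypothetical helper, and the figure it returns covers only the segment passed in, so it will generally differ from the %identity column, which the database reports for the alignment as a whole.

```python
from typing import Tuple


def identity_stats(query: str, midline: str) -> Tuple[int, int, float]:
    """Return (identical columns, total columns, percent identity) for one segment.

    The midline is padded with spaces to the query length, because trailing
    mismatch columns are often lost when whitespace is trimmed.
    """
    columns = len(query)
    padded = midline.ljust(columns)[:columns]
    # Letters on the midline mark identities; '+' and ' ' do not count.
    identical = sum(1 for ch in padded if ch.isalpha())
    percent = 100.0 * identical / columns if columns else 0.0
    return identical, columns, percent


# Final segment of the first GenBank hit (KAA0043819.1) as displayed above.
identical, columns, percent = identity_stats("DGSITKLHV", "DGS+ + +V")
print(identical, columns, round(percent, 2))
```

For this nine-column segment, four columns are identical (D, G, S and the final V).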


Sequences
CDS sequence
ATGATAAGGATGGCAAGTAACAAGACAACAGACGGCGTCACGGCCAACTTAAGAGGCTGCTACGATACGGTGTCGTATTATGTGGACGCCGACAACTATTGCGTCGGTGC
ATTGCATCGCATGGTGAAGAGCTCCATATCCAATCCTTCTCCTTTCCTTCTTCTTCTTCTTCTACCGCAGAACTCGCCGGCCGGCTCCGCCATGGTAGTCGACTGCCAGA
AACACTGCAAGAAGAAGCGAAAGAGGCTCTTAAAGATAATCGGCGGCGCCGTCCTGCTCTTCCTCTTCATCGTCCTCCTCACAATCCTCATTGTCTGGGCCGTCCTCCGC
CCCACCAAGCCCACTTTCTTCCTCCAAGACGTCACCGTCTACGCCTTCAACGCCACCGTCCCCAGCTTCCTCACCACCAACTTCCTCGTCACCATCATCTCCCGCAACCC
GAATCGCCGGATCGGAATCTACTACGACGAGCTCGACGTCTACGCCGTCTACCGCAACCAGCAGATCACTCTCCGGACCATCATCCCCCCCTTCTACCAGGGCCACAAGG
ACGTCAACGTCTGGTCCCCCTTCATCTCCGGCACCTCCGTACCCGTCGCGCCCTTCATCAGTACGGCGCTGAACCAGGACCGAGACGCTGGAGCCTTGCTGCTGCTGGTC
AAGATTGACGGGCGAGTCCGGTGGAAGGTCGGAAAGTTCATCAGTGGCCACTACCAGTTCCACGCAAACTGCCCGGCGGCCATAAACTTCGGGAATCCGTCGCCGGGAGA
TGGATCCATCACGAAACTTCACGTCGATGTCGTAGACTCCGGCGAGATTCTCGGCGTTGAACTCAGCAAGCTCCTCGCCGCTGAGCAACACCAGCTGCTGGCCGTCGAAC
TTAGTGGTGAGGACGGTGGTGGTCTTGTGGCCTTGGTAGAACGGCGGCAGCCACTGGGTGTCGAACCTCTGGTCTTTATAAACGGCGGTGGCTTCGACGGTGTCGTAGTA
GACTCCGATTCGCTTGTTGGGGTTTCTGATGATCAACACAGCAAGACCAACGACGACGACGAGGGAGACGACGATCTTGAGAATGGTGCTGAGGAGGCAGCAGGCGCAAC
CGCGGCCGTGGCCGTGGCGGTGGTAGGTGTTTGCAGGAGGAGGGACGGCGGGGCCGTAGAAGGCGCCGTTCAGGTGGGGTTGTTTACCGGCCATGGAGGAGAAAGAACGG
CGATCAACAGATATATTTGCAGAATAGAGAATGCGAAGGGAAATGATATGGGAAGTGGAAGCAAATATCCGATCGATCACCGTTCTTTCTCTGCCATGGCCGACAAACAA
CCCCACCTGAACGCCGCCTTCTACGGCCCCGCCGTCCCTCCTCCCCACCACGGCCGCGGTTGCGCCTGCTGCCTCCTCAGCACCATTCTCAAGATCGTCGTTGCCCTCGT
GGTTGCCGCTGGTATTCTTGTGTTGATCTCATGGCTGGTTTTCCGTCCCCATAAGATCAAGTTCCACGTCACCGACGCCCATCTCACCCAGTTCGATTTCACCGGCGACC
AGCTCCATTACAACTTGGCTTTGAACATTACCATCAGAAACCCCAACAAGAGATACGGAGTCTACTACGACTCCATCGAAGCCACCGCCGTTTATAAGGACCGGAGGATC
GAGACGCAGTGGCTGATGCCGTTCTACCAAGGCCACAAGACCACCACCGTCATCAACTCCACCTTCGACGGCCAGCAGCTGGTGTTGCTCGGCGGCGACGAGCTTGCTGA
ATTCGACGCCGAGAAACGTGCCGGCATCTACGGCATCGACGTGAAGTTCCGTCTCCGGTTAAGGTTGAAGTCCGGAGAAGTGATCGGAAAATTCAAGCCCAAGGTCAGCT
GTGAATTGAAGGTTCCATTGAGTCCCAATGCATACTCTTTTACTTGGTTTCAGACCACTGCATGCGACTTCGATTTCTGA
mRNA sequence
ATGATAAGGATGGCAAGTAACAAGACAACAGACGGCGTCACGGCCAACTTAAGAGGCTGCTACGATACGGTGTCGTATTATGTGGACGCCGACAACTATTGCGTCGGTGC
ATTGCATCGCATGGTGAAGAGCTCCATATCCAATCCTTCTCCTTTCCTTCTTCTTCTTCTTCTACCGCAGAACTCGCCGGCCGGCTCCGCCATGGTAGTCGACTGCCAGA
AACACTGCAAGAAGAAGCGAAAGAGGCTCTTAAAGATAATCGGCGGCGCCGTCCTGCTCTTCCTCTTCATCGTCCTCCTCACAATCCTCATTGTCTGGGCCGTCCTCCGC
CCCACCAAGCCCACTTTCTTCCTCCAAGACGTCACCGTCTACGCCTTCAACGCCACCGTCCCCAGCTTCCTCACCACCAACTTCCTCGTCACCATCATCTCCCGCAACCC
GAATCGCCGGATCGGAATCTACTACGACGAGCTCGACGTCTACGCCGTCTACCGCAACCAGCAGATCACTCTCCGGACCATCATCCCCCCCTTCTACCAGGGCCACAAGG
ACGTCAACGTCTGGTCCCCCTTCATCTCCGGCACCTCCGTACCCGTCGCGCCCTTCATCAGTACGGCGCTGAACCAGGACCGAGACGCTGGAGCCTTGCTGCTGCTGGTC
AAGATTGACGGGCGAGTCCGGTGGAAGGTCGGAAAGTTCATCAGTGGCCACTACCAGTTCCACGCAAACTGCCCGGCGGCCATAAACTTCGGGAATCCGTCGCCGGGAGA
TGGATCCATCACGAAACTTCACGTCGATGTCGTAGACTCCGGCGAGATTCTCGGCGTTGAACTCAGCAAGCTCCTCGCCGCTGAGCAACACCAGCTGCTGGCCGTCGAAC
TTAGTGGTGAGGACGGTGGTGGTCTTGTGGCCTTGGTAGAACGGCGGCAGCCACTGGGTGTCGAACCTCTGGTCTTTATAAACGGCGGTGGCTTCGACGGTGTCGTAGTA
GACTCCGATTCGCTTGTTGGGGTTTCTGATGATCAACACAGCAAGACCAACGACGACGACGAGGGAGACGACGATCTTGAGAATGGTGCTGAGGAGGCAGCAGGCGCAAC
CGCGGCCGTGGCCGTGGCGGTGGTAGGTGTTTGCAGGAGGAGGGACGGCGGGGCCGTAGAAGGCGCCGTTCAGGTGGGGTTGTTTACCGGCCATGGAGGAGAAAGAACGG
CGATCAACAGATATATTTGCAGAATAGAGAATGCGAAGGGAAATGATATGGGAAGTGGAAGCAAATATCCGATCGATCACCGTTCTTTCTCTGCCATGGCCGACAAACAA
CCCCACCTGAACGCCGCCTTCTACGGCCCCGCCGTCCCTCCTCCCCACCACGGCCGCGGTTGCGCCTGCTGCCTCCTCAGCACCATTCTCAAGATCGTCGTTGCCCTCGT
GGTTGCCGCTGGTATTCTTGTGTTGATCTCATGGCTGGTTTTCCGTCCCCATAAGATCAAGTTCCACGTCACCGACGCCCATCTCACCCAGTTCGATTTCACCGGCGACC
AGCTCCATTACAACTTGGCTTTGAACATTACCATCAGAAACCCCAACAAGAGATACGGAGTCTACTACGACTCCATCGAAGCCACCGCCGTTTATAAGGACCGGAGGATC
GAGACGCAGTGGCTGATGCCGTTCTACCAAGGCCACAAGACCACCACCGTCATCAACTCCACCTTCGACGGCCAGCAGCTGGTGTTGCTCGGCGGCGACGAGCTTGCTGA
ATTCGACGCCGAGAAACGTGCCGGCATCTACGGCATCGACGTGAAGTTCCGTCTCCGGTTAAGGTTGAAGTCCGGAGAAGTGATCGGAAAATTCAAGCCCAAGGTCAGCT
GTGAATTGAAGGTTCCATTGAGTCCCAATGCATACTCTTTTACTTGGTTTCAGACCACTGCATGCGACTTCGATTTCTGA
Protein sequence
MIRMASNKTTDGVTANLRGCYDTVSYYVDADNYCVGALHRMVKSSISNPSPFLLLLLLPQNSPAGSAMVVDCQKHCKKKRKRLLKIIGGAVLLFLFIVLLTILIVWAVLR
PTKPTFFLQDVTVYAFNATVPSFLTTNFLVTIISRNPNRRIGIYYDELDVYAVYRNQQITLRTIIPPFYQGHKDVNVWSPFISGTSVPVAPFISTALNQDRDAGALLLLV
KIDGRVRWKVGKFISGHYQFHANCPAAINFGNPSPGDGSITKLHVDVVDSGEILGVELSKLLAAEQHQLLAVELSGEDGGGLVALVERRQPLGVEPLVFINGGGFDGVVV
DSDSLVGVSDDQHSKTNDDDEGDDDLENGAEEAAGATAAVAVAVVGVCRRRDGGAVEGAVQVGLFTGHGGERTAINRYICRIENAKGNDMGSGSKYPIDHRSFSAMADKQ
PHLNAAFYGPAVPPPHHGRGCACCLLSTILKIVVALVVAAGILVLISWLVFRPHKIKFHVTDAHLTQFDFTGDQLHYNLALNITIRNPNKRYGVYYDSIEATAVYKDRRI
ETQWLMPFYQGHKTTTVINSTFDGQQLVLLGGDELAEFDAEKRAGIYGIDVKFRLRLRLKSGEVIGKFKPKVSCELKVPLSPNAYSFTWFQTTACDFDF
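
The protein sequence above corresponds to the CDS read codon by codon. A minimal sketch of that translation is given below, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. `CODON_TABLE` and `translate` are names introduced here for illustration, and only the first and last few codons of the CDS are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last six codons of the CDS shown above.
print(translate("ATGATAAGGATGGCAAGTAACAAG"))
print(translate("TGCGACTTCGATTTCTGA"))
```

The two fragments translate to MIRMASNK and CDFDF, which match the start and end of the protein sequence listed for Sgr018213.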