; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022087 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022087
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionMuDRA-like transposase
Genome locationchr7:17840352..17844864
RNA-Seq ExpressionLag0022087
SyntenyLag0022087
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR004332 - Transposase, MuDR, plant
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR018289 - MULE transposase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050209.1 MuDRA-like transposase [Cucumis melo var. makuwa]2.6e-12732.38Show/hide
Query:  SGQYSNYKDIEVGDIFLSKKDLQMRLSVLGMKENFEFRVKKSNKKIFKVWCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVG
        S + S  ++++VG IF  K+DL MRLSVL MK+NF+F VKKS K++  V C+ ++C WR+RA  L+  N+  + KY  +H+C    L   HRQ KSWVVG
Subjt:  SGQYSNYKDIEVGDIFLSKKDLQMRLSVLGMKENFEFRVKKSNKKIFKVWCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVG

Query:  NIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLN
         ++K+ F   GR Y+P+DII D+RQ +G+N+SY+KAWRARE A    RGSPEESY  L R+GEALK  N G+ F +E+E+  +FK++FMA+G+ +RGFLN
Subjt:  NIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLN

Query:  SIHPILVVDGTHMRGKFSGKLLTAVGIDGNSQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVHILGTAQNVDTPAMAVNMAPGPSTTTRASNSAV
         I P++V+DGT ++ K+ G+L+ AV +DGN+QIY + F +   ETD+S  +F  +++ AI  V      LG   +  T    V          R     V
Subjt:  SIHPILVVDGTHMRGKFSGKLLTAVGIDGNSQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVHILGTAQNVDTPAMAVNMAPGPSTTTRASNSAV

Query:  MSGQYSNYKDIEVGDIFLSKKDLQMRLSVLGMKENFEFRVKKSNKKIFKTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHAL
        + G     K+I       ++KDLQ  L  L   +  +F +K                 V+  +   G    P   +S+ R          K +   E+ L
Subjt:  MSGQYSNYKDIEVGDIFLSKKDLQMRLSVLGMKENFEFRVKKSNKKIFKTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHAL

Query:  VLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNSQIYPVGFDIGRGE
         +      E     P    ++K+ N               K      GS     LN  HP++ +D         G++   VG+  N     +G +    E
Subjt:  VLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNSQIYPVGFDIGRGE

Query:  TDESW-----TYFFRQIERAIGHVNPLVHILGTAQNVDTPAMAVNMAPGPSTTTRASNSAVMSGQYSNYKDIEVGDIFLSKKNLQMRLSVLGMKENFEFR
        + ES+     T+ +  +E           +     ++     A     G      +S S  +  + S + +                       E  +F 
Subjt:  TDESW-----TYFFRQIERAIGHVNPLVHILGTAQNVDTPAMAVNMAPGPSTTTRASNSAVMSGQYSNYKDIEVGDIFLSKKNLQMRLSVLGMKENFEFR

Query:  VKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWR
        VKKS K++  VRC+ ++C WR+R   L+  N+  + KY  +H+C    L   +RQ KSWVVG ++K+ F   GR Y+P+DII D+RQ +G+N+SY+KAWR
Subjt:  VKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWR

Query:  AREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGF
        ARE+A    RGSPEESY  L R+GEALK  NP                          GFLN I  ++V+DGT ++ K+ G+L+ AV +DGNNQIYP+ F
Subjt:  AREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGF

Query:  DIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENLKTEGWLQATYYDHRTDVVKWDAPISKYGEEIVLAAVNN
         +   ETD+S  +F  +++ AIG VN                                                                          
Subjt:  DIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENLKTEGWLQATYYDHRTDVVKWDAPISKYGEEIVLAAVNN

Query:  ARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAAYA
               PID Y+F V D      VN+ T+ CTC++F   QLPCSHAI      N+    LC+  Y  E +LAAYA
Subjt:  ARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAAYA

XP_022131652.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia]2.2e-10237.19Show/hide
Query:  IGRGETDESW--TYFFRQIERAIGHVNPLVHILGTAQNVDT-PAMAVNMAPGPSTTTRASNSAVMSGQYSNYKDIEVGDIFLSKKNLQMRLSVLGMKENF
        +G G+ +E +   Y   +++    H    +H       VDT   MA N   G S   R         Q +   D++  D+F SKK L M++ +L +++NF
Subjt:  IGRGETDESW--TYFFRQIERAIGHVNPLVHILGTAQNVDT-PAMAVNMAPGPSTTTRASNSAVMSGQYSNYKDIEVGDIFLSKKNLQMRLSVLGMKENF

Query:  EFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHT-CKHLMLTHKHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYD
        +F+VKKS  K++ +RC+   C WR+R T +R CN+  + KY  +H+ C   ++   HRQ KSWVVG++V++ F +V R+YRPKDI+ DIR+++GVN+SYD
Subjt:  EFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHT-CKHLMLTHKHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYD

Query:  KAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIY
        KAWR+ E AL L RG P  SY  LP +GEA+KI NPG+ F+LE+++  YFK+VFMA+G +IRGF+  I P+LV+DG H++GK+ G LLTA  +D NNQIY
Subjt:  KAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIY

Query:  PVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENL----KT--------------------EGW-----
        PV F +   E+  SW +F  Q++  +  V  LV +SDRH +I KA+  VF  A H +CIHHL+ NL    KT                    E W     
Subjt:  PVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENL----KT--------------------EGW-----

Query:  -----------------------------------------------------------LQATYYDHRTDVVKWDAPISKYGEEIVLAAVNNARKHVVRP
                                                                   LQ  +Y+HRT      + +S Y EE++  A++NAR+H+V  
Subjt:  -----------------------------------------------------------LQATYYDHRTDVVKWDAPISKYGEEIVLAAVNNARKHVVRP

Query:  IDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEA
        ID++ F+V DG+L G V++ ++TCTCR+FD F++PCSHAI      ++    LC  +Y V +
Subjt:  IDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEA

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia]4.9e-10241.43Show/hide
Query:  QYSNYKDIEVGDIFLSKKNLQMRLSVLGMKENFEFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNI
        Q S   D++ G++F +KK L +R+ ++ M+ NF+F+VKKS  +++ + CV   C WR+RAT LR CN+  + KY +IHTC   +L   HRQ KSWVVG++
Subjt:  QYSNYKDIEVGDIFLSKKNLQMRLSVLGMKENFEFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNI

Query:  VKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSI
        V+  F +V R+YRPKDII D+R+++GVNLSYDKAWR+ E AL L RG P  SY  LP +GEALKI NPG+ F+LE++ G YFK+VFMA+G +IRGFL  I
Subjt:  VKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSI

Query:  HPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENL--
         P+LVVDG H++GKF G LL A G D NNQIYPV F I  GET  SW +F  Q++   G VN LV VS+RH +I KA+  VF  A H +CI H++ NL  
Subjt:  HPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENL--

Query:  ------------------------------------------------------KTE----------------------------------GWLQATYYD
                                                              +TE                                  G LQ  +YD
Subjt:  ------------------------------------------------------KTE----------------------------------GWLQATYYD

Query:  HRTDVVKWDAPISKYGEEIVLAAVNNARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAA
         RT        +S Y E  +    +NAR+HVV  ID++  QV DG+L G V+ ++RTC CR+FD F++PCSHAI   +  N+    LC  +Y   + + A
Subjt:  HRTDVVKWDAPISKYGEEIVLAAVNNARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAA

Query:  YA
        YA
Subjt:  YA

XP_022154930.1 uncharacterized protein LOC111022077 [Momordica charantia]1.7e-9939.68Show/hide
Query:  DIEVGDIFLSKKNLQMRLSVLGMKENFEFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNIVKTTFL
        DI VG IF SK  L+ +L VL MK NFEFRVKKS K ++ V C+   CKW + A  +RG +  T++KY+++H C   +L H HRQ +SWVVG ++K    
Subjt:  DIEVGDIFLSKKNLQMRLSVLGMKENFEFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNIVKTTFL

Query:  EVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVV
        +V R YRPKDI++D+R+++GVN+ Y+KAWRA+E AL +  GSP+ESY  L R+ EALK+ N G+ F++E+EE  YFK+ FMA+G  IRGF + I P+LV+
Subjt:  EVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVV

Query:  DGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENLKT------
        DG H++GK+ G +LTA  +DGNNQIYP+ F I   E++ SW++F  +++  IG V+ L+ VSDRH SI K+V+ VF +A H  C+HHL  NLK       
Subjt:  DGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENLKT------

Query:  ---------------------------------------EGW-------------------------------------------LQATYYDHRTDVVKW
                                               E W                                           +Q  +Y+ RT     
Subjt:  ---------------------------------------EGW-------------------------------------------LQATYYDHRTDVVKW

Query:  DAPISKYGEEIVLAAVNNARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAAYA
        +  ++ YGE  + AA N +R + + PID +E +V DG    RVN++ RTC C++FD ++LPCSHAI    + NV    LCSP+Y ++ ++ AYA
Subjt:  DAPISKYGEEIVLAAVNNARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAAYA

XP_038887209.1 uncharacterized protein LOC120077397 [Benincasa hispida]5.2e-11246.59Show/hide
Query:  APGPSTTTRASNSAVMSGQYSNYKDIEVGDIFLSKKNLQMRLSVLGMKENFEFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKH
        A G   +  +S++  M GQ  + +D++VGDIF+SKK+L+M LS+L ++ NF+FRVKKS   ++KV  +V+ECK ++RA  L+ C +  ++KY N HTC++
Subjt:  APGPSTTTRASNSAVMSGQYSNYKDIEVGDIFLSKKNLQMRLSVLGMKENFEFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKH

Query:  LMLTHKHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYF
         +LT+ HRQ  SWV+  ++ + F +V  SYRPKDI+ DI+Q++ V+LSYDKAWRARE  LVL  GS EESY++LP+FGEAL+IENPGS F  +++E  YF
Subjt:  LMLTHKHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYF

Query:  KHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVF
        K+VFMA+G++IR FLN I P+L+VDGT +R K+SGKLL A+ +D NNQIY V F I   ETDESW +F +Q+   IG V  LVIVSDRH SI K +  VF
Subjt:  KHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVF

Query:  LDATHVYCIHHLQENL-----------------------------------------------------KTEGWLQATYYDHRTDVVKWDAPISKYGEEI
          A +  CIHHL+ NL                                                        GWLQ  Y+  R +     + +S Y   I
Subjt:  LDATHVYCIHHLQENL-----------------------------------------------------KTEGWLQATYYDHRTDVVKWDAPISKYGEEI

Query:  VLAAVNNARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAI
        +  A + A ++ VRPID++EF+V DG LGGRVNIH R CTCRKFD +++PCSHAI
Subjt:  VLAAVNNARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAI

TrEMBL top hitse value%identityAlignment
A0A5A7U9N8 MuDRA-like transposase1.3e-12732.38Show/hide
Query:  SGQYSNYKDIEVGDIFLSKKDLQMRLSVLGMKENFEFRVKKSNKKIFKVWCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVG
        S + S  ++++VG IF  K+DL MRLSVL MK+NF+F VKKS K++  V C+ ++C WR+RA  L+  N+  + KY  +H+C    L   HRQ KSWVVG
Subjt:  SGQYSNYKDIEVGDIFLSKKDLQMRLSVLGMKENFEFRVKKSNKKIFKVWCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVG

Query:  NIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLN
         ++K+ F   GR Y+P+DII D+RQ +G+N+SY+KAWRARE A    RGSPEESY  L R+GEALK  N G+ F +E+E+  +FK++FMA+G+ +RGFLN
Subjt:  NIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLN

Query:  SIHPILVVDGTHMRGKFSGKLLTAVGIDGNSQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVHILGTAQNVDTPAMAVNMAPGPSTTTRASNSAV
         I P++V+DGT ++ K+ G+L+ AV +DGN+QIY + F +   ETD+S  +F  +++ AI  V      LG   +  T    V          R     V
Subjt:  SIHPILVVDGTHMRGKFSGKLLTAVGIDGNSQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVHILGTAQNVDTPAMAVNMAPGPSTTTRASNSAV

Query:  MSGQYSNYKDIEVGDIFLSKKDLQMRLSVLGMKENFEFRVKKSNKKIFKTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHAL
        + G     K+I       ++KDLQ  L  L   +  +F +K                 V+  +   G    P   +S+ R          K +   E+ L
Subjt:  MSGQYSNYKDIEVGDIFLSKKDLQMRLSVLGMKENFEFRVKKSNKKIFKTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHAL

Query:  VLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNSQIYPVGFDIGRGE
         +      E     P    ++K+ N               K      GS     LN  HP++ +D         G++   VG+  N     +G +    E
Subjt:  VLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNSQIYPVGFDIGRGE

Query:  TDESW-----TYFFRQIERAIGHVNPLVHILGTAQNVDTPAMAVNMAPGPSTTTRASNSAVMSGQYSNYKDIEVGDIFLSKKNLQMRLSVLGMKENFEFR
        + ES+     T+ +  +E           +     ++     A     G      +S S  +  + S + +                       E  +F 
Subjt:  TDESW-----TYFFRQIERAIGHVNPLVHILGTAQNVDTPAMAVNMAPGPSTTTRASNSAVMSGQYSNYKDIEVGDIFLSKKNLQMRLSVLGMKENFEFR

Query:  VKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWR
        VKKS K++  VRC+ ++C WR+R   L+  N+  + KY  +H+C    L   +RQ KSWVVG ++K+ F   GR Y+P+DII D+RQ +G+N+SY+KAWR
Subjt:  VKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWR

Query:  AREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGF
        ARE+A    RGSPEESY  L R+GEALK  NP                          GFLN I  ++V+DGT ++ K+ G+L+ AV +DGNNQIYP+ F
Subjt:  AREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGF

Query:  DIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENLKTEGWLQATYYDHRTDVVKWDAPISKYGEEIVLAAVNN
         +   ETD+S  +F  +++ AIG VN                                                                          
Subjt:  DIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENLKTEGWLQATYYDHRTDVVKWDAPISKYGEEIVLAAVNN

Query:  ARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAAYA
               PID Y+F V D      VN+ T+ CTC++F   QLPCSHAI      N+    LC+  Y  E +LAAYA
Subjt:  ARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAAYA

A0A6J1BRM2 protein FAR1-RELATED SEQUENCE 4-like1.1e-10237.19Show/hide
Query:  IGRGETDESW--TYFFRQIERAIGHVNPLVHILGTAQNVDT-PAMAVNMAPGPSTTTRASNSAVMSGQYSNYKDIEVGDIFLSKKNLQMRLSVLGMKENF
        +G G+ +E +   Y   +++    H    +H       VDT   MA N   G S   R         Q +   D++  D+F SKK L M++ +L +++NF
Subjt:  IGRGETDESW--TYFFRQIERAIGHVNPLVHILGTAQNVDT-PAMAVNMAPGPSTTTRASNSAVMSGQYSNYKDIEVGDIFLSKKNLQMRLSVLGMKENF

Query:  EFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHT-CKHLMLTHKHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYD
        +F+VKKS  K++ +RC+   C WR+R T +R CN+  + KY  +H+ C   ++   HRQ KSWVVG++V++ F +V R+YRPKDI+ DIR+++GVN+SYD
Subjt:  EFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHT-CKHLMLTHKHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYD

Query:  KAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIY
        KAWR+ E AL L RG P  SY  LP +GEA+KI NPG+ F+LE+++  YFK+VFMA+G +IRGF+  I P+LV+DG H++GK+ G LLTA  +D NNQIY
Subjt:  KAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIY

Query:  PVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENL----KT--------------------EGW-----
        PV F +   E+  SW +F  Q++  +  V  LV +SDRH +I KA+  VF  A H +CIHHL+ NL    KT                    E W     
Subjt:  PVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENL----KT--------------------EGW-----

Query:  -----------------------------------------------------------LQATYYDHRTDVVKWDAPISKYGEEIVLAAVNNARKHVVRP
                                                                   LQ  +Y+HRT      + +S Y EE++  A++NAR+H+V  
Subjt:  -----------------------------------------------------------LQATYYDHRTDVVKWDAPISKYGEEIVLAAVNNARKHVVRP

Query:  IDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEA
        ID++ F+V DG+L G V++ ++TCTCR+FD F++PCSHAI      ++    LC  +Y V +
Subjt:  IDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEA

A0A6J1DJT1 uncharacterized protein LOC1110207152.4e-10241.43Show/hide
Query:  QYSNYKDIEVGDIFLSKKNLQMRLSVLGMKENFEFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNI
        Q S   D++ G++F +KK L +R+ ++ M+ NF+F+VKKS  +++ + CV   C WR+RAT LR CN+  + KY +IHTC   +L   HRQ KSWVVG++
Subjt:  QYSNYKDIEVGDIFLSKKNLQMRLSVLGMKENFEFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNI

Query:  VKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSI
        V+  F +V R+YRPKDII D+R+++GVNLSYDKAWR+ E AL L RG P  SY  LP +GEALKI NPG+ F+LE++ G YFK+VFMA+G +IRGFL  I
Subjt:  VKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSI

Query:  HPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENL--
         P+LVVDG H++GKF G LL A G D NNQIYPV F I  GET  SW +F  Q++   G VN LV VS+RH +I KA+  VF  A H +CI H++ NL  
Subjt:  HPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENL--

Query:  ------------------------------------------------------KTE----------------------------------GWLQATYYD
                                                              +TE                                  G LQ  +YD
Subjt:  ------------------------------------------------------KTE----------------------------------GWLQATYYD

Query:  HRTDVVKWDAPISKYGEEIVLAAVNNARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAA
         RT        +S Y E  +    +NAR+HVV  ID++  QV DG+L G V+ ++RTC CR+FD F++PCSHAI   +  N+    LC  +Y   + + A
Subjt:  HRTDVVKWDAPISKYGEEIVLAAVNNARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAA

Query:  YA
        YA
Subjt:  YA

A0A6J1DL12 uncharacterized protein LOC1110220775.0e-10039.88Show/hide
Query:  DIEVGDIFLSKKNLQMRLSVLGMKENFEFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNIVKTTFL
        DI VG IF SK  L+ +L VL MK NFEFRVKKS K ++ V C+   CKW + A  +RG +  T++KY+++H C   +L H HRQ +SWVVG ++K    
Subjt:  DIEVGDIFLSKKNLQMRLSVLGMKENFEFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNIVKTTFL

Query:  EVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVV
        +V R YRPKDI++D+R+++GVN+ Y+KAWRA+E AL +  GSP+ESY  L R+ EALK+ N G+ F++E+EE  YFK+ FMA+G  IRGF + I P+LV+
Subjt:  EVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVV

Query:  DGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENLKT------
        DG H++GK+ G +LTA  +DGNNQIYP+ F I   E+D SW++F  +++  IG V+ L+ VSDRH SI K+V+ VF +A H  C+HHL  NLK       
Subjt:  DGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENLKT------

Query:  ---------------------------------------EGW-------------------------------------------LQATYYDHRTDVVKW
                                               E W                                           +Q  +Y+ RT     
Subjt:  ---------------------------------------EGW-------------------------------------------LQATYYDHRTDVVKW

Query:  DAPISKYGEEIVLAAVNNARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAAYA
        +  ++ YGE  + AA N +R + + PID +E +V DG    RVN++ RTC C++FD ++LPCSHAI    + NV    LCSP+Y ++ ++ AYA
Subjt:  DAPISKYGEEIVLAAVNNARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAAYA

A0A6J1DYC4 uncharacterized protein LOC1110256787.9e-9843.21Show/hide
Query:  MKENFEFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVN
        M++NF+F+VKKS  +++ +RCV  +C WR+RAT L+ C +  + KY   HTC    L H HRQ KSWVVG++V+  F +V R+YRPKDII D+R+ +GVN
Subjt:  MKENFEFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVN

Query:  LSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGN
        LSYD+AWR+ E AL L RG P  SY  LP +GEALKI NPG+ F+L++E G YFK+VFM +G +IRGFL    P+LVVDG H++GKF G LL+A G+D N
Subjt:  LSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGN

Query:  NQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENL------------------------------
        NQIY V F I  GET  SW +F  QI+RA+  V+ LV VSDRH++I KA+  VF  A H +C  H++ NL                              
Subjt:  NQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENL------------------------------

Query:  ----------------------------KTEGWLQATYYDHRTDVVKWDAPISKYGEEIVLAAVNNARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTC
                                       G LQ  +YD RT        +S Y E +     ++AR+HVV  ID++ FQV DG+L G V+++   C+C
Subjt:  ----------------------------KTEGWLQATYYDHRTDVVKWDAPISKYGEEIVLAAVNNARKHVVRPIDRYEFQVDDGHLGGRVNIHTRTCTC

Query:  RKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAAYA
        R+FD F++PCSHAI      N+    LC  +Y   + + AYA
Subjt:  RKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAAYA

SwissProt top hitse value%identityAlignment
P19775 Transposase for insertion sequence element IS256 in transposon Tn40019.9e-0528.97Show/hide
Query:  HPILVVDGTHMRGKFSGKLLT-----AVGI--DGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHH
        +P L+ D  +++ +   ++L+     A+GI  DG+ +I  +GF I  GE++E+WT FF  ++   G     +++SD HK ++ A++  F + +   C  H
Subjt:  HPILVVDGTHMRGKFSGKLLT-----AVGI--DGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHH

Query:  LQENLKT
           N+ T
Subjt:  LQENLKT

P59787 Transposase for insertion sequence element IS256 in transposon Tn40019.9e-0528.97Show/hide
Query:  HPILVVDGTHMRGKFSGKLLT-----AVGI--DGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHH
        +P L+ D  +++ +   ++L+     A+GI  DG+ +I  +GF I  GE++E+WT FF  ++   G     +++SD HK ++ A++  F + +   C  H
Subjt:  HPILVVDGTHMRGKFSGKLLT-----AVGI--DGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHH

Query:  LQENLKT
           N+ T
Subjt:  LQENLKT

Q3EBQ3 Protein FAR1-RELATED SEQUENCE 22.0e-0520.59Show/hide
Query:  KHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFM
        K ++ + WV+ N VK    E      P D    +R K       +K   A      L     EE  + L      ++ + PG F+ ++ +     ++VF 
Subjt:  KHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFM

Query:  AIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATH
                +  S   +++ D  ++R  +       +G+  + Q   +G  +    ++ ++++ FR   +A+G   P V+++D+ K +   V  VF D  H
Subjt:  AIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATH

Query:  VYCI
        ++C+
Subjt:  VYCI

Q9SWG3 Protein FAR-RED IMPAIRED RESPONSE 11.2e-0524.83Show/hide
Query:  LPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTA--VGIDGNNQIYPVGFDIGRGETDESWTYFFRQ
        L  + + +K ENP  F+ +++ E    +++F A   +   +L S + ++  D T++  KF+ KL  A  +G++ ++Q   +G  +   E+ E++ +  + 
Subjt:  LPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTA--VGIDGNNQIYPVGFDIGRGETDESWTYFFRQ

Query:  IERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENL
          RA+G   P VI++D+ K ++ AV  +  +  H + + H+ E +
Subjt:  IERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENL

Arabidopsis top hitse value%identityAlignment
AT2G32250.1 FAR1-related sequence 21.4e-0620.59Show/hide
Query:  KHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFM
        K ++ + WV+ N VK    E      P D    +R K       +K   A      L     EE  + L      ++ + PG F+ ++ +     ++VF 
Subjt:  KHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFM

Query:  AIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATH
                +  S   +++ D  ++R  +       +G+  + Q   +G  +    ++ ++++ FR   +A+G   P V+++D+ K +   V  VF D  H
Subjt:  AIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATH

Query:  VYCI
        ++C+
Subjt:  VYCI

AT2G32250.2 FAR1-related sequence 21.4e-0620.59Show/hide
Query:  KHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFM
        K ++ + WV+ N VK    E      P D    +R K       +K   A      L     EE  + L      ++ + PG F+ ++ +     ++VF 
Subjt:  KHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFM

Query:  AIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATH
                +  S   +++ D  ++R  +       +G+  + Q   +G  +    ++ ++++ FR   +A+G   P V+++D+ K +   V  VF D  H
Subjt:  AIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATH

Query:  VYCI
        ++C+
Subjt:  VYCI

AT2G32250.3 FAR1-related sequence 21.4e-0620.59Show/hide
Query:  KHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFM
        K ++ + WV+ N VK    E      P D    +R K       +K   A      L     EE  + L      ++ + PG F+ ++ +     ++VF 
Subjt:  KHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFM

Query:  AIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATH
                +  S   +++ D  ++R  +       +G+  + Q   +G  +    ++ ++++ FR   +A+G   P V+++D+ K +   V  VF D  H
Subjt:  AIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATH

Query:  VYCI
        ++C+
Subjt:  VYCI

AT2G32250.4 FAR1-related sequence 21.4e-0620.59Show/hide
Query:  KHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFM
        K ++ + WV+ N VK    E      P D    +R K       +K   A      L     EE  + L      ++ + PG F+ ++ +     ++VF 
Subjt:  KHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFM

Query:  AIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATH
                +  S   +++ D  ++R  +       +G+  + Q   +G  +    ++ ++++ FR   +A+G   P V+++D+ K +   V  VF D  H
Subjt:  AIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATH

Query:  VYCI
        ++C+
Subjt:  VYCI

AT4G15090.1 FRS (FAR1 Related Sequences) transcription factor family8.3e-0724.83Show/hide
Query:  LPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTA--VGIDGNNQIYPVGFDIGRGETDESWTYFFRQ
        L  + + +K ENP  F+ +++ E    +++F A   +   +L S + ++  D T++  KF+ KL  A  +G++ ++Q   +G  +   E+ E++ +  + 
Subjt:  LPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTA--VGIDGNNQIYPVGFDIGRGETDESWTYFFRQ

Query:  IERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENL
          RA+G   P VI++D+ K ++ AV  +  +  H + + H+ E +
Subjt:  IERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCGTGTTTGGGTATGTTTTGGTGGTGTGTGGAAGGATAGTGAAAAGGAGTACGATGGTGGAGAGGTGAGGGCCTTAGACGTGGATGTGGGAATTTCTTATAATGA
GTTCTTAGGTCAGGTGTACGAAATAAGTAGTACAAGTCCGAATGATAACGACATTGTTTTGAGGTGTCTGCTTCAACTGAGGTCTAAATCTCCCGCTTTTGTGATTAGGA
ACGATGTAGATTTGCATAGTTTTTTGACTTGGGAGTTAGTGTTTGCAAGTCCCCTTCATGTATCCACTGTGCCGAAACCATCCTCCCTGTCTAGTAACCAAGGGTTTCGT
CCATATCTCATTCCATATACTGTACCAAATACTCCATATATGGGAGTGCAATACTTTTCATATATGGATCCAAATCATCAACCATACCATTACACGTCTGTCCCACCATA
TATTCCACATTACATGAGTCAAGAAATACCATCCCGAGCCACACCCTTTCCACACAATGAAGATCATCACAATCCCTCGACCTCCACATCCTTCCCATACAAAGAAGAGC
AAGAGCAACACATACCCTCCGTTTCCACACACATGCCATTTACACCCATTTATAATGAGCCAATGTCACTGAGTCCAATACGTGACGACGAGGGGGCATATAACCTTGGG
GATGATGATTGGGGAGAGTACGGAGAAGAGGAGCCAGGGACATATACAGATGGAGCACACGAGAGTGGCGAGGAAGATCTGTACAAAGTTGATGTAACTGCCCAGGAGAT
GGAAGAGAACATCGGGGTTCATCAGGATGTACTTGGAACTGCACAAAATGTCGATACTCCTACTATGGCCGTTAATATGGCACCTGGCCCTTCTACGACAACTCGTGCAT
CTAATTCAGCAGTAATGTCAGGTCAGTACTCTAATTACAAGGATATAGAAGTGGGAGATATATTCTTGTCTAAGAAGGATTTGCAGATGAGACTGTCGGTTTTAGGGATG
AAAGAGAACTTTGAATTTAGGGTTAAGAAGTCTAACAAGAAAATATTCAAGGTTTGGTGTGTTGTTGATGAATGTAAATGGAGGGTCCGAGCTACGGTCTTGAGAGGCTG
CAATGTCCTTACAGTTACCAAGTATTCTAACATTCACACGTGCAAACACTTGATGCTAACCCACAAACATAGGCAGACCAAAAGTTGGGTTGTTGGCAATATTGTGAAGA
CAACATTTTTGGAGGTTGGTCGTTCCTACAGACCAAAGGACATCATAAGTGATATCCGACAAAAGTTCGGGGTGAATCTGAGTTATGACAAGGCATGGAGGGCTAGAGAA
CATGCCTTGGTCCTTGCCAGAGGCTCGCCTGAAGAGTCCTACAGACAGTTGCCACGGTTTGGAGAAGCGTTGAAAATTGAAAATCCTGGTTCATTTTTCGACTTGGAAGT
GGAAGAGGGTGGTTACTTCAAGCATGTGTTCATGGCAATTGGATCCACGATTAGAGGGTTCTTGAACTCTATTCATCCGATTTTGGTTGTAGATGGGACCCACATGAGGG
GAAAGTTCAGTGGAAAGCTCCTAACTGCAGTAGGCATCGATGGAAACAGCCAAATATACCCCGTAGGGTTTGACATTGGGAGGGGGGAAACCGATGAATCATGGACCTAC
TTTTTCCGGCAAATCGAGCGTGCTATTGGTCACGTTAATCCCCTGGTTCATATACTTGGAACTGCACAAAATGTCGATACTCCTGCTATGGCCGTTAATATGGCACCTGG
CCCTTCTACGACAACTCGTGCATCTAATTCAGCAGTAATGTCAGGTCAGTACTCTAATTACAAGGATATAGAAGTGGGAGATATATTCTTGTCTAAGAAGGATTTGCAGA
TGAGACTGTCGGTTTTAGGGATGAAAGAGAACTTTGAATTTAGGGTTAAGAAGTCTAACAAGAAAATATTCAAGACCAAAAGTTGGGTTGTTGGCAATATTGTGAAGACA
ACATTTTTGGAGGTTGGTCGTTCCTACAGACCAAAGGACATCATAAGTGATATCCGACAAAAGTTCGGGGTGAATCTGAGTTATGACAAGGCATGGAGGGCTAGAGAACA
TGCCTTGGTCCTTGCCAGAGGCTCGCCTGAAGAGTCCTACAGACAGTTACCACGGTTTGGAGAAGCGTTGAAAATTGAAAATCCTGGTTCATTTTTCGACTTGGAAGTGG
AAGAGGGTGGTTACTTCAAGCATGTGTTCATGGCAATTGGATCCACGATTAGAGGGTTCTTGAACTCTATTCATCCGATTTTGGTTGTAGATGGGACCCACATGAGGGGA
AAGTTCAGTGGAAAGCTCCTAACTGCAGTAGGCATCGATGGAAACAGCCAAATATACCCCGTAGGGTTTGACATTGGGAGGGGGGAAACCGATGAATCATGGACCTACTT
TTTCCGGCAAATCGAGCGTGCTATTGGTCACGTTAATCCCCTGGTTCATATACTTGGAACTGCACAAAATGTCGATACTCCTGCTATGGCCGTTAATATGGCACCTGGCC
CTTCTACGACAACTCGTGCATCTAATTCAGCAGTAATGTCAGGTCAGTACTCTAATTACAAGGATATAGAAGTGGGAGATATATTCTTGTCTAAGAAGAATTTGCAGATG
AGACTGTCGGTTTTAGGGATGAAAGAGAACTTTGAATTTAGGGTTAAGAAGTCTAACAAGAAAATATTCAAGGTTCGGTGTGTTGTTGATGAATGTAAATGGAGGGTCCG
AGCTACGGTCTTGAGAGGCTGCAATGTCCTTACAGTTACCAAGTATTCTAACATTCACACGTGCAAACACTTGATGCTAACCCACAAACATAGGCAGACCAAAAGTTGGG
TTGTTGGCAATATTGTGAAGACAACATTTTTGGAGGTTGGTCGTTCCTACAGACCAAAGGACATCATAAGTGATATCCGACAAAAGTTCGGGGTGAATCTGAGTTATGAC
AAGGCATGGAGGGCTAGAGAACATGCATTGGTCCTTGCCAGAGGCTCGCCTGAAGAGTCCTACAGACAGTTACCACGGTTTGGAGAAGCGTTGAAAATTGAAAATCCTGG
TTCATTTTTCGACTTGGAAGTGGAAGAGGGTGGTTACTTCAAGCATGTGTTCATGGCAATTGGATCCACGATTAGAGGGTTCTTGAACTCTATTCATCCGATTTTGGTTG
TAGATGGGACCCACATGAGGGGAAAGTTCAGTGGAAAGCTCCTAACTGCAGTAGGCATCGATGGAAACAACCAAATATACCCCGTAGGGTTTGACATTGGGAGGGGGGAA
ACCGATGAATCATGGACCTACTTTTTCCGGCAAATCGAGCGTGCTATTGGTCACGTTAATCCCCTGGTGATTGTATCTGACAGACACAAGAGCATATTGAAGGCAGTACA
GATTGTGTTTCTTGATGCAACCCACGTTTACTGTATCCACCATCTACAAGAAAACTTGAAAACAGAGGGATGGTTGCAAGCCACTTACTACGACCATCGTACCGATGTAG
TGAAATGGGATGCACCAATATCAAAATATGGTGAAGAAATTGTTCTAGCAGCGGTGAATAATGCAAGAAAGCACGTGGTTAGGCCAATTGACAGGTACGAATTCCAAGTC
GACGATGGACACTTGGGTGGTCGCGTCAACATCCACACGAGGACATGTACTTGTCGTAAATTTGACGTCTTCCAACTTCCGTGCTCCCATGCAATCACAACCTGCATATA
CCTGAATGTGCCGTACTTGGATCTTTGTTCTCCATCCTACCACGTTGAAGCAGTCCTAGCTGCGTACGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGCCTCGTGTTTGGGTATGTTTTGGTGGTGTGTGGAAGGATAGTGAAAAGGAGTACGATGGTGGAGAGGTGAGGGCCTTAGACGTGGATGTGGGAATTTCTTATAATGA
GTTCTTAGGTCAGGTGTACGAAATAAGTAGTACAAGTCCGAATGATAACGACATTGTTTTGAGGTGTCTGCTTCAACTGAGGTCTAAATCTCCCGCTTTTGTGATTAGGA
ACGATGTAGATTTGCATAGTTTTTTGACTTGGGAGTTAGTGTTTGCAAGTCCCCTTCATGTATCCACTGTGCCGAAACCATCCTCCCTGTCTAGTAACCAAGGGTTTCGT
CCATATCTCATTCCATATACTGTACCAAATACTCCATATATGGGAGTGCAATACTTTTCATATATGGATCCAAATCATCAACCATACCATTACACGTCTGTCCCACCATA
TATTCCACATTACATGAGTCAAGAAATACCATCCCGAGCCACACCCTTTCCACACAATGAAGATCATCACAATCCCTCGACCTCCACATCCTTCCCATACAAAGAAGAGC
AAGAGCAACACATACCCTCCGTTTCCACACACATGCCATTTACACCCATTTATAATGAGCCAATGTCACTGAGTCCAATACGTGACGACGAGGGGGCATATAACCTTGGG
GATGATGATTGGGGAGAGTACGGAGAAGAGGAGCCAGGGACATATACAGATGGAGCACACGAGAGTGGCGAGGAAGATCTGTACAAAGTTGATGTAACTGCCCAGGAGAT
GGAAGAGAACATCGGGGTTCATCAGGATGTACTTGGAACTGCACAAAATGTCGATACTCCTACTATGGCCGTTAATATGGCACCTGGCCCTTCTACGACAACTCGTGCAT
CTAATTCAGCAGTAATGTCAGGTCAGTACTCTAATTACAAGGATATAGAAGTGGGAGATATATTCTTGTCTAAGAAGGATTTGCAGATGAGACTGTCGGTTTTAGGGATG
AAAGAGAACTTTGAATTTAGGGTTAAGAAGTCTAACAAGAAAATATTCAAGGTTTGGTGTGTTGTTGATGAATGTAAATGGAGGGTCCGAGCTACGGTCTTGAGAGGCTG
CAATGTCCTTACAGTTACCAAGTATTCTAACATTCACACGTGCAAACACTTGATGCTAACCCACAAACATAGGCAGACCAAAAGTTGGGTTGTTGGCAATATTGTGAAGA
CAACATTTTTGGAGGTTGGTCGTTCCTACAGACCAAAGGACATCATAAGTGATATCCGACAAAAGTTCGGGGTGAATCTGAGTTATGACAAGGCATGGAGGGCTAGAGAA
CATGCCTTGGTCCTTGCCAGAGGCTCGCCTGAAGAGTCCTACAGACAGTTGCCACGGTTTGGAGAAGCGTTGAAAATTGAAAATCCTGGTTCATTTTTCGACTTGGAAGT
GGAAGAGGGTGGTTACTTCAAGCATGTGTTCATGGCAATTGGATCCACGATTAGAGGGTTCTTGAACTCTATTCATCCGATTTTGGTTGTAGATGGGACCCACATGAGGG
GAAAGTTCAGTGGAAAGCTCCTAACTGCAGTAGGCATCGATGGAAACAGCCAAATATACCCCGTAGGGTTTGACATTGGGAGGGGGGAAACCGATGAATCATGGACCTAC
TTTTTCCGGCAAATCGAGCGTGCTATTGGTCACGTTAATCCCCTGGTTCATATACTTGGAACTGCACAAAATGTCGATACTCCTGCTATGGCCGTTAATATGGCACCTGG
CCCTTCTACGACAACTCGTGCATCTAATTCAGCAGTAATGTCAGGTCAGTACTCTAATTACAAGGATATAGAAGTGGGAGATATATTCTTGTCTAAGAAGGATTTGCAGA
TGAGACTGTCGGTTTTAGGGATGAAAGAGAACTTTGAATTTAGGGTTAAGAAGTCTAACAAGAAAATATTCAAGACCAAAAGTTGGGTTGTTGGCAATATTGTGAAGACA
ACATTTTTGGAGGTTGGTCGTTCCTACAGACCAAAGGACATCATAAGTGATATCCGACAAAAGTTCGGGGTGAATCTGAGTTATGACAAGGCATGGAGGGCTAGAGAACA
TGCCTTGGTCCTTGCCAGAGGCTCGCCTGAAGAGTCCTACAGACAGTTACCACGGTTTGGAGAAGCGTTGAAAATTGAAAATCCTGGTTCATTTTTCGACTTGGAAGTGG
AAGAGGGTGGTTACTTCAAGCATGTGTTCATGGCAATTGGATCCACGATTAGAGGGTTCTTGAACTCTATTCATCCGATTTTGGTTGTAGATGGGACCCACATGAGGGGA
AAGTTCAGTGGAAAGCTCCTAACTGCAGTAGGCATCGATGGAAACAGCCAAATATACCCCGTAGGGTTTGACATTGGGAGGGGGGAAACCGATGAATCATGGACCTACTT
TTTCCGGCAAATCGAGCGTGCTATTGGTCACGTTAATCCCCTGGTTCATATACTTGGAACTGCACAAAATGTCGATACTCCTGCTATGGCCGTTAATATGGCACCTGGCC
CTTCTACGACAACTCGTGCATCTAATTCAGCAGTAATGTCAGGTCAGTACTCTAATTACAAGGATATAGAAGTGGGAGATATATTCTTGTCTAAGAAGAATTTGCAGATG
AGACTGTCGGTTTTAGGGATGAAAGAGAACTTTGAATTTAGGGTTAAGAAGTCTAACAAGAAAATATTCAAGGTTCGGTGTGTTGTTGATGAATGTAAATGGAGGGTCCG
AGCTACGGTCTTGAGAGGCTGCAATGTCCTTACAGTTACCAAGTATTCTAACATTCACACGTGCAAACACTTGATGCTAACCCACAAACATAGGCAGACCAAAAGTTGGG
TTGTTGGCAATATTGTGAAGACAACATTTTTGGAGGTTGGTCGTTCCTACAGACCAAAGGACATCATAAGTGATATCCGACAAAAGTTCGGGGTGAATCTGAGTTATGAC
AAGGCATGGAGGGCTAGAGAACATGCATTGGTCCTTGCCAGAGGCTCGCCTGAAGAGTCCTACAGACAGTTACCACGGTTTGGAGAAGCGTTGAAAATTGAAAATCCTGG
TTCATTTTTCGACTTGGAAGTGGAAGAGGGTGGTTACTTCAAGCATGTGTTCATGGCAATTGGATCCACGATTAGAGGGTTCTTGAACTCTATTCATCCGATTTTGGTTG
TAGATGGGACCCACATGAGGGGAAAGTTCAGTGGAAAGCTCCTAACTGCAGTAGGCATCGATGGAAACAACCAAATATACCCCGTAGGGTTTGACATTGGGAGGGGGGAA
ACCGATGAATCATGGACCTACTTTTTCCGGCAAATCGAGCGTGCTATTGGTCACGTTAATCCCCTGGTGATTGTATCTGACAGACACAAGAGCATATTGAAGGCAGTACA
GATTGTGTTTCTTGATGCAACCCACGTTTACTGTATCCACCATCTACAAGAAAACTTGAAAACAGAGGGATGGTTGCAAGCCACTTACTACGACCATCGTACCGATGTAG
TGAAATGGGATGCACCAATATCAAAATATGGTGAAGAAATTGTTCTAGCAGCGGTGAATAATGCAAGAAAGCACGTGGTTAGGCCAATTGACAGGTACGAATTCCAAGTC
GACGATGGACACTTGGGTGGTCGCGTCAACATCCACACGAGGACATGTACTTGTCGTAAATTTGACGTCTTCCAACTTCCGTGCTCCCATGCAATCACAACCTGCATATA
CCTGAATGTGCCGTACTTGGATCTTTGTTCTCCATCCTACCACGTTGAAGCAGTCCTAGCTGCGTACGCATAA
Protein sequenceShow/hide protein sequence
MPRVWVCFGGVWKDSEKEYDGGEVRALDVDVGISYNEFLGQVYEISSTSPNDNDIVLRCLLQLRSKSPAFVIRNDVDLHSFLTWELVFASPLHVSTVPKPSSLSSNQGFR
PYLIPYTVPNTPYMGVQYFSYMDPNHQPYHYTSVPPYIPHYMSQEIPSRATPFPHNEDHHNPSTSTSFPYKEEQEQHIPSVSTHMPFTPIYNEPMSLSPIRDDEGAYNLG
DDDWGEYGEEEPGTYTDGAHESGEEDLYKVDVTAQEMEENIGVHQDVLGTAQNVDTPTMAVNMAPGPSTTTRASNSAVMSGQYSNYKDIEVGDIFLSKKDLQMRLSVLGM
KENFEFRVKKSNKKIFKVWCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRARE
HALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNSQIYPVGFDIGRGETDESWTY
FFRQIERAIGHVNPLVHILGTAQNVDTPAMAVNMAPGPSTTTRASNSAVMSGQYSNYKDIEVGDIFLSKKDLQMRLSVLGMKENFEFRVKKSNKKIFKTKSWVVGNIVKT
TFLEVGRSYRPKDIISDIRQKFGVNLSYDKAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRG
KFSGKLLTAVGIDGNSQIYPVGFDIGRGETDESWTYFFRQIERAIGHVNPLVHILGTAQNVDTPAMAVNMAPGPSTTTRASNSAVMSGQYSNYKDIEVGDIFLSKKNLQM
RLSVLGMKENFEFRVKKSNKKIFKVRCVVDECKWRVRATVLRGCNVLTVTKYSNIHTCKHLMLTHKHRQTKSWVVGNIVKTTFLEVGRSYRPKDIISDIRQKFGVNLSYD
KAWRAREHALVLARGSPEESYRQLPRFGEALKIENPGSFFDLEVEEGGYFKHVFMAIGSTIRGFLNSIHPILVVDGTHMRGKFSGKLLTAVGIDGNNQIYPVGFDIGRGE
TDESWTYFFRQIERAIGHVNPLVIVSDRHKSILKAVQIVFLDATHVYCIHHLQENLKTEGWLQATYYDHRTDVVKWDAPISKYGEEIVLAAVNNARKHVVRPIDRYEFQV
DDGHLGGRVNIHTRTCTCRKFDVFQLPCSHAITTCIYLNVPYLDLCSPSYHVEAVLAAYA