; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039075 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039075
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr2:35322346..35328073
RNA-Seq ExpressionLag0039075
SyntenyLag0039075
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]6.1e-11936.7Show/hide
Query:  GQFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETLSE
        GQFGG   EDP SHLKSF+ + N F +PG++ + L++ LFP+S+   A  WLN  P+++I TW+ + +KFL KYFPPT N+ +R+EI++F Q ++E ++ 
Subjt:  GQFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETLSE

Query:  AWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKIEA
        AWE+FK L   CP+ G+P C+Q+E F+   +  ++++++ +ANG    K+++E  EILD+++ +NY+W   + R +      +G   ++ MTS+  +I+ 
Subjt:  AWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKIEA

Query:  LTLKMD----------ALTASTAPL-IVQLNTFGCRICGTF-PAQH----------------------------GWRK-PKSIILKRIRKHDEGIHDQSK
        +T  +           +  A+T P  + Q+    C  CG   P+++                            GW++ P      +   +  G + Q K
Subjt:  LTLKMD----------ALTASTAPL-IVQLNTFGCRICGTF-PAQH----------------------------GWRK-PKSIILKRIRKHDEGIHDQSK

Query:  RIYG-----------------QERRSSSEPSR-----------------------------------------------IPSHLEIQVGQLASELKNRLA
          Y                   ++++  +P++                                                   LE+Q+GQL +E++ R  
Subjt:  RIYG-----------------QERRSSSEPSR-----------------------------------------------IPSHLEIQVGQLASELKNRLA

Query:  GSLPSNTEDPK--GKQHCHAVTLRSGRQLHEPTPMQDDPCKNASTEGHSQPQEEKATQEEESEKKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLN
        GSLPS+TE+P+  GK+HC+++  RSG +   P  M D+   + S E  +Q   +K  +   S              +P  PFPQRL +KN +  F+KFL+
Subjt:  GSLPSNTEDPK--GKQHCHAVTLRSGRQLHEPTPMQDDPCKNASTEGHSQPQEEKATQEEESEKKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLN

Query:  VLRELHINVPFVEALEKMSSYAKFLKDILMNKRPWKEFETVALTRI----------------------------DVGRALCDLGASINLMSLSVYKRLGI
        +L++LHIN+PFVEALE+M +YAKF+KDI+  K+   E+ETVALT                              DVGRALCDLGASINLM LS++K+  I
Subjt:  VLRELHINVPFVEALEKMSSYAKFLKDILMNKRPWKEFETVALTRI----------------------------DVGRALCDLGASINLMSLSVYKRLGI

Query:  GEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIILDYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKEC
        G+A PTT+TLQLADRS+  PEGKIEDVLVKVDKFIF  DFIILD EADK++PIILGRPFLATG+T IDV+KGELTMR++DQKV FN+L ++KY DD +EC
Subjt:  GEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIILDYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKEC

Query:  NMVARIDICSPQELLSNAASSTEGGTQEDQERG
         +++     +  EL     +  E   +E ++ G
Subjt:  NMVARIDICSPQELLSNAASSTEGGTQEDQERG

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]3.8e-12142.27Show/hide
Query:  TAGQFGGSASEDPHSHLKSFLDI-------CNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFT
        T GQF G   EDPH HLKSFL +        + F   GV  + +++ LFPY LRD AK WLN     +I +WN LAE FL KYFPPT N+R + EI+TF 
Subjt:  TAGQFGGSASEDPHSHLKSFLDI-------CNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFT

Query:  QLDDETLSEAWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPM
        Q +DETLSEA E+FK + ++CPHHGLP CIQ+ETFY  LN  ++ +VD SANGA+L KTY+EA+EIL+RIA NN +W    D R  P +   G  EV+ +
Subjt:  QLDDETLSEAWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPM

Query:  TSVNAKIEALT-------LKMDALTAS---TAPLIVQLNTFGCRICG---TF---------------PAQHGWRK--PKSIILKRIRKHDEGIHDQSKRI
        +S+NA++ ++T       L  D++  +   TA  I Q     C  CG   TF                A  G  K  P S       ++      + + +
Subjt:  TSVNAKIEALT-------LKMDALTAS---TAPLIVQLNTFGCRICG---TF---------------PAQHGWRK--PKSIILKRIRKHDEGIHDQSKRI

Query:  YGQER-RSSSEPS--RIPSHLEIQVGQLASELKNRLAGSLPSNTEDP---KGKQHCHAVTLRSGRQLHEPTPMQDDPCKNAST-EGHSQP-------QEE
        Y Q+    ++ PS  R+ + L     Q+ ++ K        S T      K     +   ++S +       +Q    KNA   + HSQ         EE
Subjt:  YGQER-RSSSEPS--RIPSHLEIQVGQLASELKNRLAGSLPSNTEDP---KGKQHCHAVTLRSGRQLHEPTPMQDDPCKNAST-EGHSQP-------QEE

Query:  KATQEEESE-----------KKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKFLKDILMNKRPWKEFETVAL
         A Q+E S+           +  A++++E   Y P+ PFPQR+++K +E  F+KF+++L+E+HIN+P VEAL++M +Y KFLKD+L+N+R ++EF+ V+L
Subjt:  KATQEEESE-----------KKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKFLKDILMNKRPWKEFETVAL

Query:  TR----------------------------IDVGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIIL
                                       ++GRALCDLGA+INLM LS+YK+LGIGEARPTT+TLQLADRS+ +PEGKIED+L++VDKFIFLADFIIL
Subjt:  TR----------------------------IDVGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIIL

Query:  DYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMV
        DYE D ++PIILGRPFL  G+T +DV KG +T+RM+ QKV FNI  S+KYP   +EC+ V
Subjt:  DYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMV

XP_024949903.1 uncharacterized protein LOC112496715 [Citrus sinensis]8.7e-12642.47Show/hide
Query:  TAGQFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETL
        T GQF G  S+D H HLK FL++ + F I G + E L++ LF +SLRD A+ WLN  P +SI TW+ LA+KFL KYFPPT N++LR EI +F QL+DE+L
Subjt:  TAGQFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETL

Query:  SEAWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKI
         +AWE+FK L +RCPHHG+P CIQLET Y  LN++++++VD SANGALL K+Y+EA+EIL+RIA NNY+W      R+   +  +G   V+ +T+++A++
Subjt:  SEAWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKI

Query:  EALTLKMDALTASTAPLIV-QLNTFGCRIC-------------------GTFPAQH-----------GWRKPKSIILKRIRKHDEGIHDQSK-----RIY
         +LT  + A+T  TAP  V Q++   C  C                   G F  Q+           GWR+  +       +    +  Q++       Y
Subjt:  EALTLKMDALTASTAPLIV-QLNTFGCRIC-------------------GTFPAQH-----------GWRKPKSIILKRIRKHDEGIHDQSK-----RIY

Query:  GQERRSSS-------------------EPSRIPSH------LEIQVGQLASELKNRLAGSLPSNTEDPK--GKQHCHAVTLRSGRQL--------HEPTP
         Q +   S                     + + SH      LE Q+GQLA+ L NR  GSLPSNTE+P+  GK+HC  + LRSGR+L         E +P
Subjt:  GQERRSSS-------------------EPSRIPSH------LEIQVGQLASELKNRLAGSLPSNTEDPK--GKQHCHAVTLRSGRQL--------HEPTP

Query:  MQDDPCKN---ASTEGHS-QPQEEKATQEEESEKKEANADKEP------DAYKPTLPFPQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKF
        ++     N   AS E +   P  E+AT    ++   A  +K+P        ++   PF QR +K+  + QF K L VL++LHIN+ F+E LE+M +YAKF
Subjt:  MQDDPCKN---ASTEGHS-QPQEEKATQEEESEKKEANADKEP------DAYKPTLPFPQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKF

Query:  LKDILMNKRPWKEFETVALTRIDVGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIILDYEADKEIP
        LKDIL  KR  +EFETVA+T+            S  ++   + +++   +  PTT++LQLADRS  +PEG+I+DVLVKVDKFIF  DFI+LD+EADKE+P
Subjt:  LKDILMNKRPWKEFETVALTRIDVGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIILDYEADKEIP

Query:  IILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMVARIDICSPQELLS
        IILGRPFLATG+T IDV+KGELTMR+NDQ+V FN+L  +K PD+ ++CN +  +D    + L S
Subjt:  IILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMVARIDICSPQELLS

XP_024965798.1 uncharacterized protein LOC112506000 [Cynara cardunculus var. scolymus]7.9e-11943.03Show/hide
Query:  FIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETLSEAWEQFKSLQQRCPHHGLPLCIQLE
        F IPGVT E L++ LF Y+L+D A+ WLN Q  NSI TWN LAEKFL+KYFPPT N ++R EIM F Q +DE +SEAWE+FK L ++C HHG+P C+QLE
Subjt:  FIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETLSEAWEQFKSLQQRCPHHGLPLCIQLE

Query:  TFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKIEALT-LKMDALTASTAPLIVQ-----
        TFY  L+  ++I++D +A GA   KTY+E ++IL+RI+ NN EW    + R    K+ SG  E++ ++S+NA+I ALT L  + L  +     VQ     
Subjt:  TFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKIEALT-LKMDALTASTAPLIVQ-----

Query:  -LNTFGCRICGTFPAQHGWR----KPKSIILKRIRKHDEGIHDQSKRIYGQERRSSSEPSRIPSHLEIQVGQLASELKNRLAGSLPSNTEDPKGKQHCHA
           T  C  CG     H +      P S+                  +    R S   P+      E Q GQ A++LKNR  G+LP +TE P  K+H  A
Subjt:  -LNTFGCRICGTFPAQHGWR----KPKSIILKRIRKHDEGIHDQSKRIYGQERRSSSEPSRIPSHLEIQVGQLASELKNRLAGSLPSNTEDPKGKQHCHA

Query:  VTLRSGRQLHEPTPMQDDPCKNASTEGHSQ-----------PQEEKATQEEESEKKEANADKEPDAYKPT--------------------------LPFP
        VTLRS + L +    ++ P +    E  S            P  E+A Q   S  + A+         PT                          LPFP
Subjt:  VTLRSGRQLHEPTPMQDDPCKNASTEGHSQ-----------PQEEKATQEEESEKKEANADKEPDAYKPT--------------------------LPFP

Query:  QRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKFLKDILMNKRPWKEFETVALTR----------------------------IDVGRALCDL
         R++ KN +VQFKKFL++ ++L+IN+P VEALE+MSSY KFLKDIL  KR   E ETV+LT+                             +VG ALCDL
Subjt:  QRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKFLKDILMNKRPWKEFETVALTR----------------------------IDVGRALCDL

Query:  GASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIILDYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKV
        GASINLM LS++ +LGIG+ R T +TLQLADRS+ +P+ KIED+LVKVDKFIF ADF++LDYEA+K +PIILGRPFLATG+T IDVQKGELTMR+NDQ+V
Subjt:  GASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIILDYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKV

Query:  VFNILSSLKYPDDPKECNMVARIDICSPQELLSNAASSTEGGTQED
         FN+  +LK+  + ++C+ +  +       L+    S    GT ED
Subjt:  VFNILSSLKYPDDPKECNMVARIDICSPQELLSNAASSTEGGTQED

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]6.3e-12441.7Show/hide
Query:  TAGQFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETL
        T GQF    +EDPH HL+SFL++ + F I GV+ E  ++ LFP+SLRD A+ WLN    +S+  WN  AEKFL KYFPPT N++ R EIM+F QL+DE+ 
Subjt:  TAGQFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETL

Query:  SEAWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKI
        S+AWE+FK L ++CPHHG+P CIQ+ETFY  LN  SQ+++D SANGA+L K+Y+EA EIL+ IA NNY+W        R +   +G  EV+ +T++  ++
Subjt:  SEAWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKI

Query:  EALTLKMDALTASTAPLI-----VQLNTFGCRIC----------------------------GTFPAQH--GWRKPKSI---ILKRIRKHDEGIHDQ---
         ++T  +  L+   +  I     +Q +   C  C                            G F   +   W+   ++      +++  D+G       
Subjt:  EALTLKMDALTASTAPLI-----VQLNTFGCRIC----------------------------GTFPAQH--GWRKPKSI---ILKRIRKHDEGIHDQ---

Query:  -SKRIYGQERRSSSEPSRIPS---------------------HLEIQVGQLASELKNRLAGSLPSNTEDPK--GKQHCHAVTLRSGRQLH----------
         S+++   +   +S+PS + S                     +LE+Q+G LA+ELK R  GSLPS+TE+P+  GK+ C ++ LRSG+ L           
Subjt:  -SKRIYGQERRSSSEPSRIPS---------------------HLEIQVGQLASELKNRLAGSLPSNTEDPK--GKQHCHAVTLRSGRQLH----------

Query:  EPTPMQDDP--CKNASTEGHSQPQEEKATQEEESEKKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKFLKDI
        EPT +Q D    K  + E     + + AT ++ + ++ A       + KP LPFPQR +K+  + QFKKFL+VL++LHIN+P VEALE+M +Y KFLKDI
Subjt:  EPTPMQDDP--CKNASTEGHSQPQEEKATQEEESEKKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKFLKDI

Query:  LMNKRPWKEFET------VALTRIDVGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIILDYEADKE
        L  KR   EFE+       A+ +  +   L D G+    +S+   + LGIGEARPTT+TLQLADRS+ HP+GKIEDVLV+VDKFIF ADFIILDYE D+E
Subjt:  LMNKRPWKEFET------VALTRIDVGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIILDYEADKE

Query:  IPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMVARID
        +PIIL RPFLATG+T IDV+KGELTMR  D++  F +   ++ PD   EC  +  +D
Subjt:  IPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMVARID

TrEMBL top hitse value%identityAlignment
A0A6A2WLX1 Reverse transcriptase4.4e-10737.32Show/hide
Query:  GQFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETLSE
        GQFGG  +ED   H+++FL++C+ F   GV  + L++ LFPYSLRD A+ WL+  P  S+ +W  L + FL +Y PP  N++LR EI +F Q DDE++ E
Subjt:  GQFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETLSE

Query:  AWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAE--DRRRRPIKANSGSFEVEPMTSVNAKI
         W+++KSL Q+C +HG     Q+  FY  +N  +++L+D SANG LL K+  EA  ILDRIA N+Y++  +     RR P     G+FE+E   SV+A++
Subjt:  AWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAE--DRRRRPIKANSGSFEVEPMTSVNAKI

Query:  EALTLKMDALTASTAPLIVQLNTFGCRIC------GTFPAQH-----------------------GWRKPKSIILK-----------RIRKHDEG-----
          +T  +  L  ST    V+  +  C +C         P  H                       GWR+  +   +           R + H+E      
Subjt:  EALTLKMDALTASTAPLIVQLNTFGCRIC------GTFPAQH-----------------------GWRKPKSIILK-----------RIRKHDEG-----

Query:  ----------------------------------IHDQSKRIYGQERRSSSEPSRIPSH------LEIQVGQLASELKNRLAGSLPSNTEDPK--GKQHC
                                          + D S  I  Q     S+ + I SH      LE QVGQ+A+ L+ R  G LPS+TE  K  GK+HC
Subjt:  ----------------------------------IHDQSKRIYGQERRSSSEPSRIPSH------LEIQVGQLASELKNRLAGSLPSNTEDPK--GKQHC

Query:  HAVTLRSGRQLHEPTPMQDDPCKNASTEGHSQPQEEKATQEEESEKKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKM
        + +TLRSG Q++                     +E+ A   +   K + N        +P  PFPQRL+K N+EVQFKKF+++L +LHIN+P +EA+E+M
Subjt:  HAVTLRSGRQLHEPTPMQDDPCKNASTEGHSQPQEEKATQEEESEKKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKM

Query:  SSYAKFLKDILMNKRPWKEFETVALT-------------RID--------------VGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVH
          YAKF+KDI   KR   + ETVA               R D              VG+ALCDLG+S+NL+  S++ +LGIG+ARPT++ LQLAD+S V 
Subjt:  SSYAKFLKDILMNKRPWKEFETVALT-------------RID--------------VGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVH

Query:  PEGKIEDVLVKVDKFIFLADFIILDYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMVARIDICSPQE
         EG++EDV+V+VDKF+F  DF+ILD E D + PIILGRPFLATG+  ID +KGELTMR+ DQ V  N+  +LKY DD +EC  ++ ++    +E
Subjt:  PEGKIEDVLVKVDKFIFLADFIILDYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMVARIDICSPQE

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129451.4e-11639.42Show/hide
Query:  QFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETLSEA
        QF G  S+DP+SHL +FL+IC+ F   GVT + +++ LFP+SLRD AK WLN  P  SI TW  LA+KFL K+FPP   +++R +I +F Q D E+L EA
Subjt:  QFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETLSEA

Query:  WEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKIEAL
        WE+FK L +RCPHHG+P  +Q++TFY  L  + + ++D +A GAL+ K   +A+ +L+ +A NNY+W       R+ +    G++E++ + ++  ++ AL
Subjt:  WEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKIEAL

Query:  TLKMDALTASTAPLIVQLNTFGCRIC-------------------GTFPAQH----------GWR--------------KPKSII--------LKRIRKH
        + K+D L        VQ +   C +C                   G F  Q           GWR               PK I+          +I + 
Subjt:  TLKMDALTASTAPLIVQLNTFGCRIC-------------------GTFPAQH----------GWR--------------KPKSII--------LKRIRKH

Query:  DEGIHDQSKRIYGQERRSSSEPSRIPSHLEIQVGQLASELKNRLAGSLPSNTE-DPKGKQHCHAVTLRSGRQLHEPTPMQ-DDPCKNASTEGHSQPQEEK
           + +   +   +             +LE QVGQLA+ + NR  GSLPS+T+ +PKGK+ C A+TLRSG+++        +   ++   EG  + + E 
Subjt:  DEGIHDQSKRIYGQERRSSSEPSRIPSHLEIQVGQLASELKNRLAGSLPSNTE-DPKGKQHCHAVTLRSGRQLHEPTPMQ-DDPCKNASTEGHSQPQEEK

Query:  ATQEEESEKKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKFLKDILMNKRPWKEFETVALT-----------
          Q+++ +K E     +     P  PFPQRL+K+  E QF+KFLNV ++LHIN+PF EALE+M SY KFLKDIL  KR   EFETV LT           
Subjt:  ATQEEESEKKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKFLKDILMNKRPWKEFETVALT-----------

Query:  -----------------RIDVGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIILDYEADKEIPIIL
                          +   +AL DLGASINLM  S++++LG+GE +PT++TLQLADRS V+P G IEDVLVKVDKFIF  DF+ILD E D++IPIIL
Subjt:  -----------------RIDVGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIILDYEADKEIPIIL

Query:  GRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMVARID
        GRPFLAT    IDV++G+++ ++ ++ V FNI ++ K+P     C+ V  ID
Subjt:  GRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMVARID

A0A6J1CPJ3 uncharacterized protein LOC1110129471.5e-11037.56Show/hide
Query:  GQFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETLSE
        GQF G   EDP SHLKSF+ + N F +PG++ + L++ LFP+SL   A  WLN  P+++I T + + +KFL KYFPPT N+ +R+EI++F Q ++E ++ 
Subjt:  GQFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETLSE

Query:  AWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKIEA
        AWE+FK L + CP+ G+P C+Q+E F+   +  + ++++ +ANG    K+++E  EILD+++ +N +W   + R +      +    ++ MTS+  +I+ 
Subjt:  AWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKIEA

Query:  LT-----LKMDALTASTAPL------IVQLNTFGCRI------------------------------CGT------------------------------
        +T     ++ +   A+ AP       + Q+    C++                               GT                              
Subjt:  LT-----LKMDALTASTAPL------IVQLNTFGCRI------------------------------CGT------------------------------

Query:  -----FPAQHGWRKPKSIILKRIRKHDEGIH------DQSKRIYGQERRSSSEPSRIPSHLEIQVGQLASELKNRLAGSLPSNTEDPKGKQHCHAVTLRS
              PAQ      + ++ + I K+D  +       D + +   + +           +LE+Q+GQLA+E++ R  GSLPS+TE+P             
Subjt:  -----FPAQHGWRKPKSIILKRIRKHDEGIH------DQSKRIYGQERRSSSEPSRIPSHLEIQVGQLASELKNRLAGSLPSNTEDPKGKQHCHAVTLRS

Query:  GRQLHEPTPMQDDPCKNASTEGHSQPQEEKATQEEESEKKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKFL
         R++  P+P         S E  +Q   +K  + E S     +   +    +   PFPQRL +KN +  F+KFL++L++LHIN+PFVEALE+M +YAKFL
Subjt:  GRQLHEPTPMQDDPCKNASTEGHSQPQEEKATQEEESEKKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKFL

Query:  KDILMNKRPWKEFETVALTRI----------------------------DVGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHPEGKIE
        KDI+  K+   E+ETVALT                              DVGRALCDLGA INLM LS++K+L IG+A PTT+TL LADRS+  PEGKIE
Subjt:  KDILMNKRPWKEFETVALTRI----------------------------DVGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHPEGKIE

Query:  DVLVKVDKFIFLADFIILDYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMVA
        DVLVKVDKFIF ADFIILD EADK++PIILGRPFLATG+T IDV+KGELTMR++DQKV FN+L ++KYPDD +EC +++
Subjt:  DVLVKVDKFIFLADFIILDYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMVA

A0A6J1DY39 uncharacterized protein LOC1110256532.9e-11936.7Show/hide
Query:  GQFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETLSE
        GQFGG   EDP SHLKSF+ + N F +PG++ + L++ LFP+S+   A  WLN  P+++I TW+ + +KFL KYFPPT N+ +R+EI++F Q ++E ++ 
Subjt:  GQFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETLSE

Query:  AWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKIEA
        AWE+FK L   CP+ G+P C+Q+E F+   +  ++++++ +ANG    K+++E  EILD+++ +NY+W   + R +      +G   ++ MTS+  +I+ 
Subjt:  AWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKIEA

Query:  LTLKMD----------ALTASTAPL-IVQLNTFGCRICGTF-PAQH----------------------------GWRK-PKSIILKRIRKHDEGIHDQSK
        +T  +           +  A+T P  + Q+    C  CG   P+++                            GW++ P      +   +  G + Q K
Subjt:  LTLKMD----------ALTASTAPL-IVQLNTFGCRICGTF-PAQH----------------------------GWRK-PKSIILKRIRKHDEGIHDQSK

Query:  RIYG-----------------QERRSSSEPSR-----------------------------------------------IPSHLEIQVGQLASELKNRLA
          Y                   ++++  +P++                                                   LE+Q+GQL +E++ R  
Subjt:  RIYG-----------------QERRSSSEPSR-----------------------------------------------IPSHLEIQVGQLASELKNRLA

Query:  GSLPSNTEDPK--GKQHCHAVTLRSGRQLHEPTPMQDDPCKNASTEGHSQPQEEKATQEEESEKKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLN
        GSLPS+TE+P+  GK+HC+++  RSG +   P  M D+   + S E  +Q   +K  +   S              +P  PFPQRL +KN +  F+KFL+
Subjt:  GSLPSNTEDPK--GKQHCHAVTLRSGRQLHEPTPMQDDPCKNASTEGHSQPQEEKATQEEESEKKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLN

Query:  VLRELHINVPFVEALEKMSSYAKFLKDILMNKRPWKEFETVALTRI----------------------------DVGRALCDLGASINLMSLSVYKRLGI
        +L++LHIN+PFVEALE+M +YAKF+KDI+  K+   E+ETVALT                              DVGRALCDLGASINLM LS++K+  I
Subjt:  VLRELHINVPFVEALEKMSSYAKFLKDILMNKRPWKEFETVALTRI----------------------------DVGRALCDLGASINLMSLSVYKRLGI

Query:  GEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIILDYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKEC
        G+A PTT+TLQLADRS+  PEGKIEDVLVKVDKFIF  DFIILD EADK++PIILGRPFLATG+T IDV+KGELTMR++DQKV FN+L ++KY DD +EC
Subjt:  GEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIILDYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKEC

Query:  NMVARIDICSPQELLSNAASSTEGGTQEDQERG
         +++     +  EL     +  E   +E ++ G
Subjt:  NMVARIDICSPQELLSNAASSTEGGTQEDQERG

A0A6J1EQ90 uncharacterized protein LOC1114364111.8e-12142.27Show/hide
Query:  TAGQFGGSASEDPHSHLKSFLDI-------CNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFT
        T GQF G   EDPH HLKSFL +        + F   GV  + +++ LFPY LRD AK WLN     +I +WN LAE FL KYFPPT N+R + EI+TF 
Subjt:  TAGQFGGSASEDPHSHLKSFLDI-------CNKFIIPGVTPEQLQVILFPYSLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFT

Query:  QLDDETLSEAWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPM
        Q +DETLSEA E+FK + ++CPHHGLP CIQ+ETFY  LN  ++ +VD SANGA+L KTY+EA+EIL+RIA NN +W    D R  P +   G  EV+ +
Subjt:  QLDDETLSEAWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAHEILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPM

Query:  TSVNAKIEALT-------LKMDALTAS---TAPLIVQLNTFGCRICG---TF---------------PAQHGWRK--PKSIILKRIRKHDEGIHDQSKRI
        +S+NA++ ++T       L  D++  +   TA  I Q     C  CG   TF                A  G  K  P S       ++      + + +
Subjt:  TSVNAKIEALT-------LKMDALTAS---TAPLIVQLNTFGCRICG---TF---------------PAQHGWRK--PKSIILKRIRKHDEGIHDQSKRI

Query:  YGQER-RSSSEPS--RIPSHLEIQVGQLASELKNRLAGSLPSNTEDP---KGKQHCHAVTLRSGRQLHEPTPMQDDPCKNAST-EGHSQP-------QEE
        Y Q+    ++ PS  R+ + L     Q+ ++ K        S T      K     +   ++S +       +Q    KNA   + HSQ         EE
Subjt:  YGQER-RSSSEPS--RIPSHLEIQVGQLASELKNRLAGSLPSNTEDP---KGKQHCHAVTLRSGRQLHEPTPMQDDPCKNAST-EGHSQP-------QEE

Query:  KATQEEESE-----------KKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKFLKDILMNKRPWKEFETVAL
         A Q+E S+           +  A++++E   Y P+ PFPQR+++K +E  F+KF+++L+E+HIN+P VEAL++M +Y KFLKD+L+N+R ++EF+ V+L
Subjt:  KATQEEESE-----------KKEANADKEPDAYKPTLPFPQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKFLKDILMNKRPWKEFETVAL

Query:  TR----------------------------IDVGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIIL
                                       ++GRALCDLGA+INLM LS+YK+LGIGEARPTT+TLQLADRS+ +PEGKIED+L++VDKFIFLADFIIL
Subjt:  TR----------------------------IDVGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFLADFIIL

Query:  DYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMV
        DYE D ++PIILGRPFL  G+T +DV KG +T+RM+ QKV FNI  S+KYP   +EC+ V
Subjt:  DYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCAAATCACTCCCAAATTGTTGTCAATTTTTGGTGCAATTCTTGTGTATGATTTTCCTCTATATATAGAGAATGAGCTTATGAAAATGGTGGAATTCTTCAACCT
AAAATGTTGCCAAAATTCTCCCAAGAACTTCCATATTCATACTTTCAAGTTGCTTATACCCATAAGCTTTCCTTTCCAATTTCTATCAATTGAGGATCCCACAATTCGTT
CCAAGGCCTGGGATAGTAGAGAAGATCCAAGTGGTTGTCCAAAAGTTGTTCGTGTGAATTATAAAGCATCGGATTATACGAATTGGGAGCTTCAAGAGGTAGTCATTAGA
GTCCATTTTGTAATTGCTCGAAAGAAGAAGAAGAACTCAAGCTGGCCAAAGAACAGCAGGCTGACCATTGGGCTGACAGATGATTGGCGGCTGACCATTCGGGCTGTCAG
ATGCACAAGCACACTTTCCAAGATGATCAAGAGACAGAAAAAGATGGATGAAAAGACAAGCGCCGGCGCAGCCAGAGCAACTGCAGGACAATTTGGAGGAAGTGCATCAG
AAGATCCTCACTCTCACCTCAAGTCATTCCTTGACATCTGCAACAAGTTTATCATCCCAGGAGTTACCCCAGAACAACTACAGGTCATATTATTTCCTTACTCTCTGAGA
GATGCGGCGAAAGTATGGTTAAATCAGCAGCCAGCAAACTCCATCGCAACATGGAATGTTTTGGCAGAAAAATTCCTAGAAAAATACTTTCCACCCACCGCAAACTCTAG
ATTGAGAAAGGAAATAATGACATTCACGCAACTGGATGATGAAACGTTGAGCGAAGCTTGGGAACAATTCAAGAGCTTGCAACAAAGATGCCCGCATCATGGTTTACCAC
TGTGTATTCAGCTAGAGACCTTCTATGTAGATCTTAACAAGAATTCGCAAATTTTGGTAGACTTATCCGCAAATGGTGCACTACTCAGAAAGACTTATGACGAAGCCCAC
GAAATTCTAGATCGAATTGCACGCAATAACTATGAGTGGGGCTTTGCAGAAGATAGGCGACGAAGACCCATTAAGGCCAATTCAGGAAGTTTTGAAGTGGAACCTATGAC
TTCAGTAAACGCAAAAATTGAGGCGTTGACCTTAAAAATGGATGCGCTCACAGCCAGCACCGCACCCCTGATCGTGCAACTCAACACATTTGGTTGCAGAATTTGTGGGA
CATTTCCAGCGCAACATGGATGGAGGAAGCCAAAGAGCATCATCCTCAAACGCATTAGAAAACATGATGAGGGAATTCATGACCAGTCAAAAAGAATATATGGCCAAGAA
CGACGCAGTAGTTCAGAGCCAAGCCGCATCCCTTCGCATTTAGAGATTCAAGTTGGACAGCTTGCGTCAGAGTTAAAAAACAGACTTGCGGGATCACTGCCCAGCAATAC
AGAGGATCCAAAAGGGAAACAACACTGCCATGCGGTGACTCTACGAAGTGGAAGACAATTACATGAACCTACACCAATGCAAGATGACCCATGTAAAAATGCATCCACAG
AAGGGCATAGCCAGCCGCAAGAAGAAAAAGCAACTCAGGAAGAAGAATCTGAAAAGAAGGAAGCTAACGCAGATAAAGAGCCTGACGCATATAAGCCCACACTACCTTTT
CCGCAGAGGTTGCGAAAGAAGAATGATGAGGTTCAATTTAAGAAGTTTTTGAATGTTTTGCGAGAATTGCATATCAATGTCCCATTTGTGGAGGCATTAGAAAAGATGTC
GAGCTATGCCAAATTCTTGAAAGATATTTTGATGAACAAACGACCATGGAAGGAGTTTGAAACTGTTGCGTTAACTAGAATTGATGTGGGGAGAGCCCTATGCGACCTAG
GCGCAAGCATAAATTTGATGTCACTGTCTGTCTACAAGCGCTTGGGGATTGGAGAAGCGAGACCAACAACTATTACACTTCAGTTAGCAGATAGATCTGTAGTCCACCCT
GAGGGAAAGATCGAAGACGTATTAGTCAAAGTTGACAAATTCATTTTTCTTGCAGATTTCATCATTTTGGACTATGAAGCTGACAAGGAGATTCCCATTATTTTGGGAAG
ACCATTCCTTGCCACAGGACAAACCCGGATTGACGTACAAAAGGGAGAACTGACAATGCGCATGAACGATCAGAAGGTTGTCTTTAACATTCTCTCTTCGCTAAAATATC
CTGACGATCCCAAGGAATGCAACATGGTAGCCCGCATAGATATATGCTCTCCGCAGGAATTACTATCAAATGCGGCCAGTTCCACAGAAGGAGGAACCCAGGAAGATCAG
GAAAGAGGAAACCGCATCTTATCGCCGCATGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGACTCAAATCACTCCCAAATTGTTGTCAATTTTTGGTGCAATTCTTGTGTATGATTTTCCTCTATATATAGAGAATGAGCTTATGAAAATGGTGGAATTCTTCAACCT
AAAATGTTGCCAAAATTCTCCCAAGAACTTCCATATTCATACTTTCAAGTTGCTTATACCCATAAGCTTTCCTTTCCAATTTCTATCAATTGAGGATCCCACAATTCGTT
CCAAGGCCTGGGATAGTAGAGAAGATCCAAGTGGTTGTCCAAAAGTTGTTCGTGTGAATTATAAAGCATCGGATTATACGAATTGGGAGCTTCAAGAGGTAGTCATTAGA
GTCCATTTTGTAATTGCTCGAAAGAAGAAGAAGAACTCAAGCTGGCCAAAGAACAGCAGGCTGACCATTGGGCTGACAGATGATTGGCGGCTGACCATTCGGGCTGTCAG
ATGCACAAGCACACTTTCCAAGATGATCAAGAGACAGAAAAAGATGGATGAAAAGACAAGCGCCGGCGCAGCCAGAGCAACTGCAGGACAATTTGGAGGAAGTGCATCAG
AAGATCCTCACTCTCACCTCAAGTCATTCCTTGACATCTGCAACAAGTTTATCATCCCAGGAGTTACCCCAGAACAACTACAGGTCATATTATTTCCTTACTCTCTGAGA
GATGCGGCGAAAGTATGGTTAAATCAGCAGCCAGCAAACTCCATCGCAACATGGAATGTTTTGGCAGAAAAATTCCTAGAAAAATACTTTCCACCCACCGCAAACTCTAG
ATTGAGAAAGGAAATAATGACATTCACGCAACTGGATGATGAAACGTTGAGCGAAGCTTGGGAACAATTCAAGAGCTTGCAACAAAGATGCCCGCATCATGGTTTACCAC
TGTGTATTCAGCTAGAGACCTTCTATGTAGATCTTAACAAGAATTCGCAAATTTTGGTAGACTTATCCGCAAATGGTGCACTACTCAGAAAGACTTATGACGAAGCCCAC
GAAATTCTAGATCGAATTGCACGCAATAACTATGAGTGGGGCTTTGCAGAAGATAGGCGACGAAGACCCATTAAGGCCAATTCAGGAAGTTTTGAAGTGGAACCTATGAC
TTCAGTAAACGCAAAAATTGAGGCGTTGACCTTAAAAATGGATGCGCTCACAGCCAGCACCGCACCCCTGATCGTGCAACTCAACACATTTGGTTGCAGAATTTGTGGGA
CATTTCCAGCGCAACATGGATGGAGGAAGCCAAAGAGCATCATCCTCAAACGCATTAGAAAACATGATGAGGGAATTCATGACCAGTCAAAAAGAATATATGGCCAAGAA
CGACGCAGTAGTTCAGAGCCAAGCCGCATCCCTTCGCATTTAGAGATTCAAGTTGGACAGCTTGCGTCAGAGTTAAAAAACAGACTTGCGGGATCACTGCCCAGCAATAC
AGAGGATCCAAAAGGGAAACAACACTGCCATGCGGTGACTCTACGAAGTGGAAGACAATTACATGAACCTACACCAATGCAAGATGACCCATGTAAAAATGCATCCACAG
AAGGGCATAGCCAGCCGCAAGAAGAAAAAGCAACTCAGGAAGAAGAATCTGAAAAGAAGGAAGCTAACGCAGATAAAGAGCCTGACGCATATAAGCCCACACTACCTTTT
CCGCAGAGGTTGCGAAAGAAGAATGATGAGGTTCAATTTAAGAAGTTTTTGAATGTTTTGCGAGAATTGCATATCAATGTCCCATTTGTGGAGGCATTAGAAAAGATGTC
GAGCTATGCCAAATTCTTGAAAGATATTTTGATGAACAAACGACCATGGAAGGAGTTTGAAACTGTTGCGTTAACTAGAATTGATGTGGGGAGAGCCCTATGCGACCTAG
GCGCAAGCATAAATTTGATGTCACTGTCTGTCTACAAGCGCTTGGGGATTGGAGAAGCGAGACCAACAACTATTACACTTCAGTTAGCAGATAGATCTGTAGTCCACCCT
GAGGGAAAGATCGAAGACGTATTAGTCAAAGTTGACAAATTCATTTTTCTTGCAGATTTCATCATTTTGGACTATGAAGCTGACAAGGAGATTCCCATTATTTTGGGAAG
ACCATTCCTTGCCACAGGACAAACCCGGATTGACGTACAAAAGGGAGAACTGACAATGCGCATGAACGATCAGAAGGTTGTCTTTAACATTCTCTCTTCGCTAAAATATC
CTGACGATCCCAAGGAATGCAACATGGTAGCCCGCATAGATATATGCTCTCCGCAGGAATTACTATCAAATGCGGCCAGTTCCACAGAAGGAGGAACCCAGGAAGATCAG
GAAAGAGGAAACCGCATCTTATCGCCGCATGAATAA
Protein sequenceShow/hide protein sequence
MTQITPKLLSIFGAILVYDFPLYIENELMKMVEFFNLKCCQNSPKNFHIHTFKLLIPISFPFQFLSIEDPTIRSKAWDSREDPSGCPKVVRVNYKASDYTNWELQEVVIR
VHFVIARKKKKNSSWPKNSRLTIGLTDDWRLTIRAVRCTSTLSKMIKRQKKMDEKTSAGAARATAGQFGGSASEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYSLR
DAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTANSRLRKEIMTFTQLDDETLSEAWEQFKSLQQRCPHHGLPLCIQLETFYVDLNKNSQILVDLSANGALLRKTYDEAH
EILDRIARNNYEWGFAEDRRRRPIKANSGSFEVEPMTSVNAKIEALTLKMDALTASTAPLIVQLNTFGCRICGTFPAQHGWRKPKSIILKRIRKHDEGIHDQSKRIYGQE
RRSSSEPSRIPSHLEIQVGQLASELKNRLAGSLPSNTEDPKGKQHCHAVTLRSGRQLHEPTPMQDDPCKNASTEGHSQPQEEKATQEEESEKKEANADKEPDAYKPTLPF
PQRLRKKNDEVQFKKFLNVLRELHINVPFVEALEKMSSYAKFLKDILMNKRPWKEFETVALTRIDVGRALCDLGASINLMSLSVYKRLGIGEARPTTITLQLADRSVVHP
EGKIEDVLVKVDKFIFLADFIILDYEADKEIPIILGRPFLATGQTRIDVQKGELTMRMNDQKVVFNILSSLKYPDDPKECNMVARIDICSPQELLSNAASSTEGGTQEDQ
ERGNRILSPHE