; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002036 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002036
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold10:200231..209986
RNA-Seq ExpressionSpg002036
SyntenySpg002036
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056838.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.5e-6424.57Show/hide
Query:  FVEDTCNKRLIPLSIS--FLQWFEKVLVEILQNPVSS-FFHEKIKEEFGV-IRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDF
        ++ + C  +   + I+   L W      ++L    +  FF E+  E+  + +R  K  S      E     + G +  I VP G +  GW  F  +    
Subjt:  FVEDTCNKRLIPLSIS--FLQWFEKVLVEILQNPVSS-FFHEKIKEEFGV-IRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDF

Query:  ILKIHSNENQPIRSLLSKEESLPVFDKVSAGHASS-NSYAEVVKRGG-----SLKSSVSLNDSIRNAKGINEEAYWVRKNCDVLKLDLERSIVVSRLMAQ
        I    S   + IRS + KE      D  S+   SS  SYA+V+             + S + S R +  I  + + +  N        E++++++R    
Subjt:  ILKIHSNENQPIRSLLSKEESLPVFDKVSAGHASS-NSYAEVVKRGG-----SLKSSVSLNDSIRNAKGINEEAYWVRKNCDVLKLDLERSIVVSRLMAQ

Query:  YSWKDVKIALENFFKTFVLVNPFMDDKALI-----HAADGGLEFSANGKWKKFGNLHLKLDFWSSEIHSQPKSIKSYGGWLAIRNIPLNLWHRDSFEAIG
          W  +  +L    +      PF  DKA++     HA        ANG W   GN  +K + W S +HS    I SYGGWL  R IPL+LW+ ++F+ IG
Subjt:  YSWKDVKIALENFFKTFVLVNPFMDDKALI-----HAADGGLEFSANGKWKKFGNLHLKLDFWSSEIHSQPKSIKSYGGWLAIRNIPLNLWHRDSFEAIG

Query:  KNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKYEF---SLRYGDINSLENRNL------------NFDSRKQLDANDFSNSLDLIRV
           GG + ++  T+ +    +A I+V  N+ GF+PA I +       F   +++  +   L  RN+             FD    L      N    I  
Subjt:  KNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKYEF---SLRYGDINSLENRNL------------NFDSRKQLDANDFSNSLDLIRV

Query:  RQVILDEESDIVNKEDRMNELPAFSRHEEAFNEDLDISKDVSAQDKYLNGELVLSMDTSVQDQNLKERVQVNEMLGSPKGASLHDRCINNAGCKGFNAR-
               +  I N +       + S H +A        K+ S++ +Y              DQ L +R          KG ++    IN+     ++ R 
Subjt:  RQVILDEESDIVNKEDRMNELPAFSRHEEAFNEDLDISKDVSAQDKYLNGELVLSMDTSVQDQNLKERVQVNEMLGSPKGASLHDRCINNAGCKGFNAR-

Query:  ---INEPTLALSPSLNDNEFNESGPQEAQQFQVFELSYKNDNAVNGILNHDVQQVALKTYSRKKCSLSSAVMTNFKTNFNADHLESDCTHLIAGNKASGS
            N     LSP     + N S  +   + +  E+S  ND                     K+ S      T        D  ES   H +        
Subjt:  ---INEPTLALSPSLNDNEFNESGPQEAQQFQVFELSYKNDNAVNGILNHDVQQVALKTYSRKKCSLSSAVMTNFKTNFNADHLESDCTHLIAGNKASGS

Query:  AIINAGNGLSQAKVFKESSIQIPGGSNVFVRGIGSSFNHSIHSPVDSDDESMVSVSSEDSDQLLDKEDNVEQFSDDQIGESLESLFCEKVDG-LGSQIIH
        ++   G G  Q  +    S+ + G  +     I S  NH + +  +   +   +  S DS +  +   +V++ +D     S  +      D   GS++  
Subjt:  AIINAGNGLSQAKVFKESSIQIPGGSNVFVRGIGSSFNHSIHSPVDSDDESMVSVSSEDSDQLLDKEDNVEQFSDDQIGESLESLFCEKVDG-LGSQIIH

Query:  ESLLSPSQIPNQF-SSIVDTCGFQLCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLC
               +I   F   +V        K+SP+      T  V     F   + S + +  +     G  GG+L++WD++K  V +   G YS+S+  L   
Subjt:  ESLLSPSQIPNQF-SSIVDTCGFQLCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLC

Query:  KKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRF
            W+++VYGP  Y +R  LW EL  L   C   W I GDFNI RW  E        + M  FN FI  + L++ PL N  FTWS      ++S +DRF
Subjt:  KKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRF

Query:  LVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSK
        L++K W+  F         R  SDHFP+LLE+    WGP PFR  NS L   +  +  ++  +  +  G+ G+       +L   IK+W     +   + 
Subjt:  LVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSK

Query:  EKNLLFELEFFDAKAEESLLSDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEIL
        +K LL E++  D    +  +S       +++K +++ +  +  +   ++ +  W  LGDEN S+FHR     +RKNLI  +    G SL S  +I +  +
Subjt:  EKNLLFELEFFDAKAEESLLSDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEIL

Query:  DFF
          F
Subjt:  DFF

TYJ98683.1 hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa]3.8e-8460.94Show/hide
Query:  IDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDF
        ID+  IKSLWSSK+IGW  VE++G+ GG+L MWD SK+ V+E LKGGYSLS+  +T CKK CW++NVYGP DY+ERRF+W  L SLS YCT  WCIGG  
Subjt:  IDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDF

Query:  NITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPF
        NITRW HE FP+ +QT+GMR+FN  I+   + E+PL NG+ TWSR+G++ S SL+D F + KEWD + +NSRV RKA   SDHFPLLLEAGS  WGPSPF
Subjt:  NITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPF

Query:  RFYNSWLSQAECDRIILDSLSIDRSQGWAGFVI
        RF NSWL  +EC+RII +  +I     WAGFV+
Subjt:  RFYNSWLSQAECDRIILDSLSIDRSQGWAGFVI

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.8e-6824.48Show/hide
Query:  IPLSISFLQWFEKVLVEILQNPVSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFI-LKIHSNENQPIR
        I +S   L W    L  ++  P ++ F  + ++    I + K  +      E         +  I VP G +K GW  F  MI   + +K  +      R
Subjt:  IPLSISFLQWFEKVLVEILQNPVSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFI-LKIHSNENQPIR

Query:  SLLSKEESLPVFDKVSAGHASSNSYAEVVKRGGSLKSSVSLNDSIRNAKGINEEAYWVRKNCDVLKLD-LERSIVVSRLMAQYSWKDVKIALENFFKTFV
        +      S P+            SYA+ V  G    +S S +DS  ++   +  +      CD    D LE ++V+ R      W  +   L    +   
Subjt:  SLLSKEESLPVFDKVSAGHASSNSYAEVVKRGGSLKSSVSLNDSIRNAKGINEEAYWVRKNCDVLKLD-LERSIVVSRLMAQYSWKDVKIALENFFKTFV

Query:  LVNPFMDDKALIHAADG--GLEFSANGKWKKFGNLHLKLDFWSSEIHSQPKSIKSYGGWLAIRNIPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSE
          N F  +KAL+H +          N  W   G   ++ + WS   H+ PK I SYGGW   R IPL+LW+  +F+ IGK   GL+ ++  T +  +  E
Subjt:  LVNPFMDDKALIHAADG--GLEFSANGKWKKFGNLHLKLDFWSSEIHSQPKSIKSYGGWLAIRNIPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSE

Query:  AFIEVEKNFCGFIPADINV--KIGNKY--------------------------EFSLRYGDINSLENRNLNFDSRKQLDANDFSNSLDLIRVRQVILDEE
        A I+V  N+ GF+PA++ +    GNK+                          + +  + D N  E+    F+  + +  +  S S D    +    D+ 
Subjt:  AFIEVEKNFCGFIPADINV--KIGNKY--------------------------EFSLRYGDINSLENRNLNFDSRKQLDANDFSNSLDLIRVRQVILDEE

Query:  S---DIVNKEDRMNELPAFSRHEEAFNEDLDISKDVSAQDKYLNGELVLSMDTSVQDQNLKERVQVNEMLGSPKGASLHDRCINNAGCKGFNARINEPTL
        S    ++ K DR   LP+F       NE+L    ++ A       E++  +         K++V +     S        R ++      FN        
Subjt:  S---DIVNKEDRMNELPAFSRHEEAFNEDLDISKDVSAQDKYLNGELVLSMDTSVQDQNLKERVQVNEMLGSPKGASLHDRCINNAGCKGFNARINEPTL

Query:  ALSPSLNDNEFN-ESGPQEAQQFQVFELSYKNDNAVNGILNHDVQQVALKTYSRKKCSLSSAVMTNFKTNFNADHLESDCTHLIAGNKASGSAIINAGNG
          SPS   N FN +S P                N    + + + +Q   +  S KK   SS+   N K N N     +    ++A ++ +      A  G
Subjt:  ALSPSLNDNEFN-ESGPQEAQQFQVFELSYKNDNAVNGILNHDVQQVALKTYSRKKCSLSSAVMTNFKTNFNADHLESDCTHLIAGNKASGSAIINAGNG

Query:  LSQAKVFKESSIQIPGGSNVFVRGIGSSFNHSIHSPVDSDDESMVSVSSEDSDQLLDKEDNVEQFSDDQIGESLESLFCEKVDGLGSQIIHESLLSPSQI
        LS      +     P              N S+    +SD+  +V ++   + +++ +   ++   ++    S E+ + +       +  +       + 
Subjt:  LSQAKVFKESSIQIPGGSNVFVRGIGSSFNHSIHSPVDSDDESMVSVSSEDSDQLLDKEDNVEQFSDDQIGESLESLFCEKVDGLGSQIIHESLLSPSQI

Query:  PN------QFSSIVDTCGFQLCKISPQSSKVAETKQVAIDL---------KFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVK
        P+      Q  S +   G +L   +  S     T  +   +         + IKSLW S  I W    A G SGG+LI+WD    S+L   +G +SLS  
Subjt:  PN------QFSSIVDTCGFQLCKISPQSSKVAETKQVAIDL---------KFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVK

Query:  CLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHS
         L       W++ +YGP   +ER   W EL +L +  + PW +GGD N+ R   E   V   +   R  N FI ++ L++ PL+N +FTWS   N  + S
Subjt:  CLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHS

Query:  LIDRFLVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGS--FMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEF
         IDRFL    W+ LF         R  SDHFPL+ E  +    WGP PFR  +  LS  E  R +          G+ GF    + ++L   IK W  E 
Subjt:  LIDRFLVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGS--FMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEF

Query:  EDSRKSKEKNLLFELEFFDAKAEESLLSDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFR
          S    ++ ++ E++  D K  ++ L+ EE +  LA+K ++  L + + +   ++ K  WL+ GDEN+SFFHR  +++++++ I ++    G    +  
Subjt:  EDSRKSKEKNLLFELEFFDAKAEESLLSDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFR

Query:  EIEQEILDFFS
         I    + FFS
Subjt:  EIEQEILDFFS

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]5.1e-6535.98Show/hide
Query:  LCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFE
        + +++P    + ETK   +D+  +KSLWS+  I WS ++A G + G+LI+W++  L   E ++G +SL++        + WVS +YGP+  +     W E
Subjt:  LCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFE

Query:  LRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKARIFSD
        L  LS  C + W + GDFN+TRW  E+      TK M  FN FIEDS L+++PL+NG+ TWSR+    S SLID FL+T             R  R  SD
Subjt:  LRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKARIFSD

Query:  HFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDAKAEESLLSDEE
        HFP+LL+ G   WG +PFRF N WLS       +          GW G  +  K ++LK AIK W  E      S++++L   +   D       ++ ++
Subjt:  HFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDAKAEESLLSDEE

Query:  LDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEILDF
            +  K +++ +   +E    ++CK  WL  GDENT FFHRFLA K+R+++IT+++S  G+ L   ++IE+E +DF
Subjt:  LDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEILDF

XP_038904301.1 uncharacterized protein LOC120090656 [Benincasa hispida]1.0e-7350.88Show/hide
Query:  LWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKAR
        +W EL SL+    DPWCIG +FN  R  HERFPVGR T+ M  FNKFI  + L+E PLSNG+FTWSR+G+  S SL+D FLV+  W+ +FDNSRV+R+AR
Subjt:  LWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKAR

Query:  IFSDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDAKAEESLL
          SDHFPL LEAG+F WGPS FRF NSWL+  E  ++I  SL    +  WA   +S+  R  K A+KKWF EF    K KE++LL EL+  D+   +   
Subjt:  IFSDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDAKAEESLL

Query:  SDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEILDFFSL
             D   ++K +++ LY  +E++LI+KCKL WLK GDENTSFFHRFL+ +KRKNL   L++   +     R+IE  IL F+SL
Subjt:  SDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEILDFFSL

TrEMBL top hitse value%identityAlignment
A0A5D3BHE3 Uncharacterized protein1.8e-8460.94Show/hide
Query:  IDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDF
        ID+  IKSLWSSK+IGW  VE++G+ GG+L MWD SK+ V+E LKGGYSLS+  +T CKK CW++NVYGP DY+ERRF+W  L SLS YCT  WCIGG  
Subjt:  IDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDF

Query:  NITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPF
        NITRW HE FP+ +QT+GMR+FN  I+   + E+PL NG+ TWSR+G++ S SL+D F + KEWD + +NSRV RKA   SDHFPLLLEAGS  WGPSPF
Subjt:  NITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPF

Query:  RFYNSWLSQAECDRIILDSLSIDRSQGWAGFVI
        RF NSWL  +EC+RII +  +I     WAGFV+
Subjt:  RFYNSWLSQAECDRIILDSLSIDRSQGWAGFVI

A0A5D3BKT8 LINE-1 retrotransposable element ORF2 protein1.2e-6424.57Show/hide
Query:  FVEDTCNKRLIPLSIS--FLQWFEKVLVEILQNPVSS-FFHEKIKEEFGV-IRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDF
        ++ + C  +   + I+   L W      ++L    +  FF E+  E+  + +R  K  S      E     + G +  I VP G +  GW  F  +    
Subjt:  FVEDTCNKRLIPLSIS--FLQWFEKVLVEILQNPVSS-FFHEKIKEEFGV-IRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDF

Query:  ILKIHSNENQPIRSLLSKEESLPVFDKVSAGHASS-NSYAEVVKRGG-----SLKSSVSLNDSIRNAKGINEEAYWVRKNCDVLKLDLERSIVVSRLMAQ
        I    S   + IRS + KE      D  S+   SS  SYA+V+             + S + S R +  I  + + +  N        E++++++R    
Subjt:  ILKIHSNENQPIRSLLSKEESLPVFDKVSAGHASS-NSYAEVVKRGG-----SLKSSVSLNDSIRNAKGINEEAYWVRKNCDVLKLDLERSIVVSRLMAQ

Query:  YSWKDVKIALENFFKTFVLVNPFMDDKALI-----HAADGGLEFSANGKWKKFGNLHLKLDFWSSEIHSQPKSIKSYGGWLAIRNIPLNLWHRDSFEAIG
          W  +  +L    +      PF  DKA++     HA        ANG W   GN  +K + W S +HS    I SYGGWL  R IPL+LW+ ++F+ IG
Subjt:  YSWKDVKIALENFFKTFVLVNPFMDDKALI-----HAADGGLEFSANGKWKKFGNLHLKLDFWSSEIHSQPKSIKSYGGWLAIRNIPLNLWHRDSFEAIG

Query:  KNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKYEF---SLRYGDINSLENRNL------------NFDSRKQLDANDFSNSLDLIRV
           GG + ++  T+ +    +A I+V  N+ GF+PA I +       F   +++  +   L  RN+             FD    L      N    I  
Subjt:  KNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKYEF---SLRYGDINSLENRNL------------NFDSRKQLDANDFSNSLDLIRV

Query:  RQVILDEESDIVNKEDRMNELPAFSRHEEAFNEDLDISKDVSAQDKYLNGELVLSMDTSVQDQNLKERVQVNEMLGSPKGASLHDRCINNAGCKGFNAR-
               +  I N +       + S H +A        K+ S++ +Y              DQ L +R          KG ++    IN+     ++ R 
Subjt:  RQVILDEESDIVNKEDRMNELPAFSRHEEAFNEDLDISKDVSAQDKYLNGELVLSMDTSVQDQNLKERVQVNEMLGSPKGASLHDRCINNAGCKGFNAR-

Query:  ---INEPTLALSPSLNDNEFNESGPQEAQQFQVFELSYKNDNAVNGILNHDVQQVALKTYSRKKCSLSSAVMTNFKTNFNADHLESDCTHLIAGNKASGS
            N     LSP     + N S  +   + +  E+S  ND                     K+ S      T        D  ES   H +        
Subjt:  ---INEPTLALSPSLNDNEFNESGPQEAQQFQVFELSYKNDNAVNGILNHDVQQVALKTYSRKKCSLSSAVMTNFKTNFNADHLESDCTHLIAGNKASGS

Query:  AIINAGNGLSQAKVFKESSIQIPGGSNVFVRGIGSSFNHSIHSPVDSDDESMVSVSSEDSDQLLDKEDNVEQFSDDQIGESLESLFCEKVDG-LGSQIIH
        ++   G G  Q  +    S+ + G  +     I S  NH + +  +   +   +  S DS +  +   +V++ +D     S  +      D   GS++  
Subjt:  AIINAGNGLSQAKVFKESSIQIPGGSNVFVRGIGSSFNHSIHSPVDSDDESMVSVSSEDSDQLLDKEDNVEQFSDDQIGESLESLFCEKVDG-LGSQIIH

Query:  ESLLSPSQIPNQF-SSIVDTCGFQLCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLC
               +I   F   +V        K+SP+      T  V     F   + S + +  +     G  GG+L++WD++K  V +   G YS+S+  L   
Subjt:  ESLLSPSQIPNQF-SSIVDTCGFQLCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLC

Query:  KKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRF
            W+++VYGP  Y +R  LW EL  L   C   W I GDFNI RW  E        + M  FN FI  + L++ PL N  FTWS      ++S +DRF
Subjt:  KKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRF

Query:  LVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSK
        L++K W+  F         R  SDHFP+LLE+    WGP PFR  NS L   +  +  ++  +  +  G+ G+       +L   IK+W     +   + 
Subjt:  LVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSK

Query:  EKNLLFELEFFDAKAEESLLSDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEIL
        +K LL E++  D    +  +S       +++K +++ +  +  +   ++ +  W  LGDEN S+FHR     +RKNLI  +    G SL S  +I +  +
Subjt:  EKNLLFELEFFDAKAEESLLSDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEIL

Query:  DFF
          F
Subjt:  DFF

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein1.8e-6824.48Show/hide
Query:  IPLSISFLQWFEKVLVEILQNPVSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFI-LKIHSNENQPIR
        I +S   L W    L  ++  P ++ F  + ++    I + K  +      E         +  I VP G +K GW  F  MI   + +K  +      R
Subjt:  IPLSISFLQWFEKVLVEILQNPVSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFI-LKIHSNENQPIR

Query:  SLLSKEESLPVFDKVSAGHASSNSYAEVVKRGGSLKSSVSLNDSIRNAKGINEEAYWVRKNCDVLKLD-LERSIVVSRLMAQYSWKDVKIALENFFKTFV
        +      S P+            SYA+ V  G    +S S +DS  ++   +  +      CD    D LE ++V+ R      W  +   L    +   
Subjt:  SLLSKEESLPVFDKVSAGHASSNSYAEVVKRGGSLKSSVSLNDSIRNAKGINEEAYWVRKNCDVLKLD-LERSIVVSRLMAQYSWKDVKIALENFFKTFV

Query:  LVNPFMDDKALIHAADG--GLEFSANGKWKKFGNLHLKLDFWSSEIHSQPKSIKSYGGWLAIRNIPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSE
          N F  +KAL+H +          N  W   G   ++ + WS   H+ PK I SYGGW   R IPL+LW+  +F+ IGK   GL+ ++  T +  +  E
Subjt:  LVNPFMDDKALIHAADG--GLEFSANGKWKKFGNLHLKLDFWSSEIHSQPKSIKSYGGWLAIRNIPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSE

Query:  AFIEVEKNFCGFIPADINV--KIGNKY--------------------------EFSLRYGDINSLENRNLNFDSRKQLDANDFSNSLDLIRVRQVILDEE
        A I+V  N+ GF+PA++ +    GNK+                          + +  + D N  E+    F+  + +  +  S S D    +    D+ 
Subjt:  AFIEVEKNFCGFIPADINV--KIGNKY--------------------------EFSLRYGDINSLENRNLNFDSRKQLDANDFSNSLDLIRVRQVILDEE

Query:  S---DIVNKEDRMNELPAFSRHEEAFNEDLDISKDVSAQDKYLNGELVLSMDTSVQDQNLKERVQVNEMLGSPKGASLHDRCINNAGCKGFNARINEPTL
        S    ++ K DR   LP+F       NE+L    ++ A       E++  +         K++V +     S        R ++      FN        
Subjt:  S---DIVNKEDRMNELPAFSRHEEAFNEDLDISKDVSAQDKYLNGELVLSMDTSVQDQNLKERVQVNEMLGSPKGASLHDRCINNAGCKGFNARINEPTL

Query:  ALSPSLNDNEFN-ESGPQEAQQFQVFELSYKNDNAVNGILNHDVQQVALKTYSRKKCSLSSAVMTNFKTNFNADHLESDCTHLIAGNKASGSAIINAGNG
          SPS   N FN +S P                N    + + + +Q   +  S KK   SS+   N K N N     +    ++A ++ +      A  G
Subjt:  ALSPSLNDNEFN-ESGPQEAQQFQVFELSYKNDNAVNGILNHDVQQVALKTYSRKKCSLSSAVMTNFKTNFNADHLESDCTHLIAGNKASGSAIINAGNG

Query:  LSQAKVFKESSIQIPGGSNVFVRGIGSSFNHSIHSPVDSDDESMVSVSSEDSDQLLDKEDNVEQFSDDQIGESLESLFCEKVDGLGSQIIHESLLSPSQI
        LS      +     P              N S+    +SD+  +V ++   + +++ +   ++   ++    S E+ + +       +  +       + 
Subjt:  LSQAKVFKESSIQIPGGSNVFVRGIGSSFNHSIHSPVDSDDESMVSVSSEDSDQLLDKEDNVEQFSDDQIGESLESLFCEKVDGLGSQIIHESLLSPSQI

Query:  PN------QFSSIVDTCGFQLCKISPQSSKVAETKQVAIDL---------KFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVK
        P+      Q  S +   G +L   +  S     T  +   +         + IKSLW S  I W    A G SGG+LI+WD    S+L   +G +SLS  
Subjt:  PN------QFSSIVDTCGFQLCKISPQSSKVAETKQVAIDL---------KFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVK

Query:  CLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHS
         L       W++ +YGP   +ER   W EL +L +  + PW +GGD N+ R   E   V   +   R  N FI ++ L++ PL+N +FTWS   N  + S
Subjt:  CLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHS

Query:  LIDRFLVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGS--FMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEF
         IDRFL    W+ LF         R  SDHFPL+ E  +    WGP PFR  +  LS  E  R +          G+ GF    + ++L   IK W  E 
Subjt:  LIDRFLVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGS--FMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEF

Query:  EDSRKSKEKNLLFELEFFDAKAEESLLSDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFR
          S    ++ ++ E++  D K  ++ L+ EE +  LA+K ++  L + + +   ++ K  WL+ GDEN+SFFHR  +++++++ I ++    G    +  
Subjt:  EDSRKSKEKNLLFELEFFDAKAEESLLSDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFR

Query:  EIEQEILDFFS
         I    + FFS
Subjt:  EIEQEILDFFS

A0A6J1E2G6 uncharacterized protein LOC1110254052.5e-6535.98Show/hide
Query:  LCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFE
        + +++P    + ETK   +D+  +KSLWS+  I WS ++A G + G+LI+W++  L   E ++G +SL++        + WVS +YGP+  +     W E
Subjt:  LCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFE

Query:  LRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKARIFSD
        L  LS  C + W + GDFN+TRW  E+      TK M  FN FIEDS L+++PL+NG+ TWSR+    S SLID FL+T             R  R  SD
Subjt:  LRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKARIFSD

Query:  HFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDAKAEESLLSDEE
        HFP+LL+ G   WG +PFRF N WLS       +          GW G  +  K ++LK AIK W  E      S++++L   +   D       ++ ++
Subjt:  HFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDAKAEESLLSDEE

Query:  LDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEILDF
            +  K +++ +   +E    ++CK  WL  GDENT FFHRFLA K+R+++IT+++S  G+ L   ++IE+E +DF
Subjt:  LDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEILDF

A0A803QQM3 Uncharacterized protein1.1e-6034.47Show/hide
Query:  LCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFE
        +CK +P    + E K+  +D +FI S+W S+   W  + A G+SGG L++WD   +SVL+ L G +S+SV      K+  W S VYGP  YK R   W E
Subjt:  LCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFE

Query:  LRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKARIFSD
        L  LS  C + WC+GGDFN+TR V E+      T+ M+ F+  I +  L++  L NG FTWS    +   S +DRFL +  W+V++   R     R+ SD
Subjt:  LRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKARIFSD

Query:  HFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDAKAEESLLSDEE
        H P+++++    WGP PFRF N WL      +        + + GW G     K + L+  +K+W        K+ +  L   L   D     S  +   
Subjt:  HFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDAKAEESLLSDEE

Query:  LDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEILDFFS
        LD    +K E   L+  +ER +  K K  W + GD N+  FH  L A+K KN I+ +   NG  + + +EI +E++ FFS
Subjt:  LDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEILDFFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTTATCAGCTGTTGTATTCAGAATAGGTATTTTTGTACTTGGAGGGAAGGAAATATCCATTTTGTTGAAGATACTTGCAACAAGCGTTTGATTCCATTGTCCAT
TTCCTTCTTACAGTGGTTTGAAAAAGTGTTAGTTGAGATTTTGCAAAATCCCGTTTCTTCATTCTTTCATGAGAAAATCAAGGAAGAATTTGGAGTCATTAGGTTGATTA
AGTTCTTCTCAGATAATGAATGGTTCTTTGAATGTGCTGTTTGGCCTTCCACGGGTGGAAGAAGGATTATTCAAGTTCCTGCTGGCTTGAATAAGAAAGGATGGTATGTT
TTTTGGGAAATGATTAGGGATTTCATCCTTAAAATTCATTCTAATGAGAATCAACCTATTCGGTCATTGTTAAGCAAAGAGGAGAGTCTTCCGGTTTTTGATAAAGTTTC
AGCAGGTCATGCCTCTTCCAATTCATATGCTGAGGTGGTAAAGCGAGGTGGTTCTTTAAAAAGTTCAGTTTCTTTGAATGATTCAATAAGAAATGCCAAGGGTATTAACG
AAGAAGCTTACTGGGTTCGCAAGAATTGTGATGTGCTGAAATTAGATTTGGAAAGATCAATTGTTGTTTCTAGATTGATGGCCCAATATTCTTGGAAGGATGTTAAGATT
GCCCTTGAGAATTTCTTTAAAACTTTTGTCTTAGTTAACCCCTTCATGGATGATAAAGCTCTGATTCATGCAGCAGATGGTGGATTGGAATTTTCTGCAAATGGCAAGTG
GAAGAAATTTGGAAACTTACATTTGAAATTGGATTTTTGGTCCTCTGAAATTCATTCACAGCCGAAGTCTATAAAAAGTTATGGAGGCTGGCTTGCAATTAGAAATATTC
CATTGAATCTATGGCATCGTGATTCCTTTGAAGCTATCGGAAAGAACCTTGGAGGGTTGGTTAGTATTTCTTCCAATACGCTTAATTTGTTAGATTGTTCTGAAGCCTTC
ATTGAAGTAGAAAAGAATTTTTGTGGATTTATTCCTGCTGATATTAATGTTAAGATTGGTAATAAGTATGAATTTTCATTAAGATATGGTGATATTAATTCTTTGGAGAA
CAGAAATTTGAATTTTGATTCAAGAAAACAGCTAGATGCCAATGACTTTTCAAATTCCCTGGATTTAATTAGGGTAAGGCAGGTGATTTTGGATGAAGAATCTGATATTG
TTAATAAAGAGGATAGGATGAATGAGTTGCCTGCTTTCTCTAGGCATGAGGAGGCATTTAATGAGGATTTGGATATTTCAAAGGATGTCTCGGCACAAGATAAATATTTG
AATGGGGAATTGGTTCTTTCAATGGATACCTCGGTGCAAGATCAGAATTTAAAAGAGAGAGTCCAAGTTAATGAGATGTTGGGTTCTCCAAAAGGTGCTTCACTGCATGA
CAGGTGTATTAATAATGCTGGTTGTAAAGGTTTTAATGCCAGAATTAATGAGCCGACATTAGCTCTCTCTCCTTCATTAAATGACAATGAATTTAATGAGTCCGGTCCTC
AGGAAGCCCAACAGTTTCAGGTTTTTGAACTTTCTTATAAGAATGATAATGCCGTTAATGGTATCTTAAATCATGATGTCCAGCAAGTAGCATTAAAGACCTATTCTCGG
AAAAAATGTTCTCTCTCATCGGCTGTTATGACCAACTTTAAGACCAACTTTAATGCTGATCATTTAGAGTCTGACTGTACTCATTTAATTGCTGGAAATAAGGCTTCGGG
ATCTGCTATAATCAATGCTGGAAACGGGTTGAGTCAGGCCAAGGTATTTAAGGAATCTTCTATTCAAATTCCAGGGGGAAGTAATGTTTTTGTCAGAGGTATTGGTAGTT
CCTTCAATCATAGTATTCATTCCCCGGTGGATTCAGATGATGAGTCTATGGTTAGTGTTAGCAGTGAAGATTCTGATCAATTGTTAGATAAAGAGGATAATGTGGAACAA
TTTTCAGATGATCAAATTGGTGAGTCTTTAGAATCTCTTTTTTGTGAGAAGGTTGATGGTTTAGGTTCTCAAATTATTCATGAGTCTTTATTATCACCTTCTCAAATTCC
TAACCAATTCTCTTCGATAGTTGATACTTGTGGATTTCAGTTGTGTAAAATTTCGCCTCAGTCTTCTAAAGTGGCTGAGACTAAACAAGTAGCAATTGATTTGAAATTCA
TTAAATCCTTATGGAGTTCCAAGGAAATCGGCTGGTCGTTTGTGGAAGCTTATGGAAAATCAGGTGGACTTCTTATTATGTGGGATGAAAGTAAATTGTCAGTGCTGGAA
TTTTTAAAGGGTGGTTATTCTCTTTCAGTCAAATGTCTCACTCTTTGTAAAAAAGTTTGTTGGGTTTCAAATGTTTATGGTCCAAATGACTACAAAGAAAGGAGATTCCT
TTGGTTTGAATTACGCTCTCTCTCTTATTATTGCACGGATCCTTGGTGTATTGGAGGAGACTTTAATATTACTCGATGGGTTCATGAACGATTTCCAGTAGGAAGGCAAA
CGAAAGGGATGCGTAGATTTAACAAATTCATTGAAGACTCGGGTCTTATGGAAATTCCTTTATCAAATGGTAAATTTACATGGTCTAGGGATGGAAACGCTTATTCTCAC
TCTCTTATTGATAGATTTTTGGTGACAAAAGAATGGGATGTGTTATTTGATAATTCCAGAGTATCAAGGAAGGCACGCATATTTTCTGATCATTTTCCTCTTTTATTAGA
AGCTGGTTCTTTTATGTGGGGACCAAGTCCTTTCAGGTTTTATAATAGTTGGCTTTCTCAAGCGGAATGTGATAGGATTATTTTGGATTCTCTTTCCATTGATCGATCAC
AAGGATGGGCTGGTTTTGTTATTAGCTCCAAATTCAGAAATTTAAAAGTTGCCATTAAGAAGTGGTTTGCAGAATTTGAAGATAGCAGAAAAAGTAAAGAGAAAAATTTG
CTTTTTGAACTTGAATTCTTTGATGCAAAGGCTGAAGAATCTCTTTTATCTGATGAAGAGTTGGATATTCTCTTGGCTATAAAAGGCGAAATTATGGGTTTATACATGTC
TGATGAAAGAAATTTAATTAAAAAATGTAAGCTTAATTGGCTTAAGCTTGGTGATGAGAATACAAGTTTTTTCCATCGATTTTTAGCAGCCAAGAAGAGGAAGAACTTGA
TTACTGATTTAATTTCCAGCAATGGTGTTTCTTTAGTTTCCTTCAGGGAAATTGAACAAGAAATTCTGGATTTCTTTTCTTTATCAGAAAATTCCAGAATTGCCTTCTGG
CTTGATTCTTGGGTTGATGATCTTCCTTTTTGTTCAAAGTATCCTAGTTTATTTCGGATTGCTTCTCTTCCTAATGCCTCCGTTTTGGATCATTGGGATGGGGAGACTCT
CTCATGGAATATTTCCTTTCGTCGGCTTCTCAAAGAGGAAGAAATTTCTGATTTTCAGCAGTTGTTGGTCTGTTTAAATGATGCCATTGTATCTGAATTTTCAGATTCTC
GTATTTGGTCTCTTGAGAATTCGGGACTATATTCGTGTTGGGGAAAGCTATTGTCTATCTTCAAGCTTCAATGGGTTCTGGATCAGTCATTCAAAGAAAATGTGCAGCAA
CTTTTAAGTGGTCCATCAGTTAAGCCGCATCCTAGTCATTTTCCATTTTTTAAGCATGCTCGAGATGAATCAGCAGCAACACGTCTGTCTATGAGTTTGCATCAAGAGGA
TAAGCTACACAGTATGGGAGGTGAAAAGTTTTATGCGGAAGTGGTAAAGATGAATCCTATGGAAAATCTCAGTACCAAAGACTCTTCAGTACAAAAAGTTGTTATAAAGA
AGTCTTCTTCCATTAGCTCTTATTGGGTTCGTAATGATCATGAGGTGCTAAGTTTAGATTTTGATAATTTATGGGCAGTGACTAGGTTATTCGCCCATAATGATTGGAAT
AAGATTAAAGCTTCACTAGAAGATTATTTCCAATCAAAAGTAATGATTAATCCACTTTTTGATGATAAAGCCTCGATCAAATTTGGTGAAGATATCCAGGATAGTCCAAA
GGTGCAACGAAACTTATGTGGATTTATACCCGTCTCAATTGAAGTCAAAGATAAAAAAAGAGGGAATATCCTTCTTCACTTTGGAGACATTGAAGCATTGGACCCTCCAA
ACATCATTGATAGAGAGCTTCATGTGAATGGTTTTCAGAATCCAATGGATCTTTTCCGGCTTAATAAGGTAATGGATGATGAAGGTTTTGGCGATTCTCAAGTTTGGAAT
TCAAAGGCTGGTGTAGTTTGTAATGAAGTACCAAAGAAAGTTAATGATGAGATTGCTTTAAACATGAAGGTCATGCAAGAAGAGGGCATTAATTTGGGGATAAATCACGA
TGTGAATATTGAAAAATTGAATGAAGTGGTGCCAACCAGATTTTATGGGTCCCAATGTGAAAAGTTGTCAACCCTTTCTCCTTCCAATCCTTGTACTTACCAAAAATCTT
GGTCTCTGTGTGAGGATAATCCAGTTTCAGCAGAAGTGTTGTGTGAAAAGATTAATATCTGTAGCCAAAGAATCAAGACTTCTCAGATGCCTTCATCTCACAAATCTTCT
TTAAGGAGCATGAATCATTATCCTCTTTATTATACCAGGAAGAAAGACATTAGCTTAAGCAGCGAGGAGATAGAAGATCAAGTAGTTGAAGCAGATAGTGATGGAATTGT
TGCTGAAGAGTCTTTCACGGAAGCTTTTGAAACTCTGTTTATGGATGCTAATAATGAACAAGTCAATGACTCTTCTCTTGGTATAGTTTCAGAAGTAAATTATTCTTCAA
GCCCTTCTAAATTTTCCTCTCTTATTGAGGTGTGTGGTATACAGTTACGTGAAATTCCCCCATTGTTACCTCAGACAAACAAAGGGAATGAAAGGTTTAATAAGTTGATT
AAAGACCTTGATCTTATGGAAATTCTTTTGTCTAACGGAAAGTTCACTTGGTCACGAATTGGTAATGAGTCATCTTACTCTCTGATGGATAGGTTCCTTGTTTCAAAGGA
ATGTGATAATTTGTTTGATAATTCTAGAGTTTCAAGGCAAGCCTGTACACTCTCAGATCATTTCCTCTTATTGCTAGAAGCTGGAAATTTTATTTGGAGACCTTCTCCAT
TTCGGTTTTATAATAGTTGGTTACCTTTGCCAGATTGTGTGTCTATTATTGAGAATTCTGTTACTCAAGATCTTTCTTATGGATGGGCTGGGTTTGTAATTGCTTCTAAA
CTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTTATCAGCTGTTGTATTCAGAATAGGTATTTTTGTACTTGGAGGGAAGGAAATATCCATTTTGTTGAAGATACTTGCAACAAGCGTTTGATTCCATTGTCCAT
TTCCTTCTTACAGTGGTTTGAAAAAGTGTTAGTTGAGATTTTGCAAAATCCCGTTTCTTCATTCTTTCATGAGAAAATCAAGGAAGAATTTGGAGTCATTAGGTTGATTA
AGTTCTTCTCAGATAATGAATGGTTCTTTGAATGTGCTGTTTGGCCTTCCACGGGTGGAAGAAGGATTATTCAAGTTCCTGCTGGCTTGAATAAGAAAGGATGGTATGTT
TTTTGGGAAATGATTAGGGATTTCATCCTTAAAATTCATTCTAATGAGAATCAACCTATTCGGTCATTGTTAAGCAAAGAGGAGAGTCTTCCGGTTTTTGATAAAGTTTC
AGCAGGTCATGCCTCTTCCAATTCATATGCTGAGGTGGTAAAGCGAGGTGGTTCTTTAAAAAGTTCAGTTTCTTTGAATGATTCAATAAGAAATGCCAAGGGTATTAACG
AAGAAGCTTACTGGGTTCGCAAGAATTGTGATGTGCTGAAATTAGATTTGGAAAGATCAATTGTTGTTTCTAGATTGATGGCCCAATATTCTTGGAAGGATGTTAAGATT
GCCCTTGAGAATTTCTTTAAAACTTTTGTCTTAGTTAACCCCTTCATGGATGATAAAGCTCTGATTCATGCAGCAGATGGTGGATTGGAATTTTCTGCAAATGGCAAGTG
GAAGAAATTTGGAAACTTACATTTGAAATTGGATTTTTGGTCCTCTGAAATTCATTCACAGCCGAAGTCTATAAAAAGTTATGGAGGCTGGCTTGCAATTAGAAATATTC
CATTGAATCTATGGCATCGTGATTCCTTTGAAGCTATCGGAAAGAACCTTGGAGGGTTGGTTAGTATTTCTTCCAATACGCTTAATTTGTTAGATTGTTCTGAAGCCTTC
ATTGAAGTAGAAAAGAATTTTTGTGGATTTATTCCTGCTGATATTAATGTTAAGATTGGTAATAAGTATGAATTTTCATTAAGATATGGTGATATTAATTCTTTGGAGAA
CAGAAATTTGAATTTTGATTCAAGAAAACAGCTAGATGCCAATGACTTTTCAAATTCCCTGGATTTAATTAGGGTAAGGCAGGTGATTTTGGATGAAGAATCTGATATTG
TTAATAAAGAGGATAGGATGAATGAGTTGCCTGCTTTCTCTAGGCATGAGGAGGCATTTAATGAGGATTTGGATATTTCAAAGGATGTCTCGGCACAAGATAAATATTTG
AATGGGGAATTGGTTCTTTCAATGGATACCTCGGTGCAAGATCAGAATTTAAAAGAGAGAGTCCAAGTTAATGAGATGTTGGGTTCTCCAAAAGGTGCTTCACTGCATGA
CAGGTGTATTAATAATGCTGGTTGTAAAGGTTTTAATGCCAGAATTAATGAGCCGACATTAGCTCTCTCTCCTTCATTAAATGACAATGAATTTAATGAGTCCGGTCCTC
AGGAAGCCCAACAGTTTCAGGTTTTTGAACTTTCTTATAAGAATGATAATGCCGTTAATGGTATCTTAAATCATGATGTCCAGCAAGTAGCATTAAAGACCTATTCTCGG
AAAAAATGTTCTCTCTCATCGGCTGTTATGACCAACTTTAAGACCAACTTTAATGCTGATCATTTAGAGTCTGACTGTACTCATTTAATTGCTGGAAATAAGGCTTCGGG
ATCTGCTATAATCAATGCTGGAAACGGGTTGAGTCAGGCCAAGGTATTTAAGGAATCTTCTATTCAAATTCCAGGGGGAAGTAATGTTTTTGTCAGAGGTATTGGTAGTT
CCTTCAATCATAGTATTCATTCCCCGGTGGATTCAGATGATGAGTCTATGGTTAGTGTTAGCAGTGAAGATTCTGATCAATTGTTAGATAAAGAGGATAATGTGGAACAA
TTTTCAGATGATCAAATTGGTGAGTCTTTAGAATCTCTTTTTTGTGAGAAGGTTGATGGTTTAGGTTCTCAAATTATTCATGAGTCTTTATTATCACCTTCTCAAATTCC
TAACCAATTCTCTTCGATAGTTGATACTTGTGGATTTCAGTTGTGTAAAATTTCGCCTCAGTCTTCTAAAGTGGCTGAGACTAAACAAGTAGCAATTGATTTGAAATTCA
TTAAATCCTTATGGAGTTCCAAGGAAATCGGCTGGTCGTTTGTGGAAGCTTATGGAAAATCAGGTGGACTTCTTATTATGTGGGATGAAAGTAAATTGTCAGTGCTGGAA
TTTTTAAAGGGTGGTTATTCTCTTTCAGTCAAATGTCTCACTCTTTGTAAAAAAGTTTGTTGGGTTTCAAATGTTTATGGTCCAAATGACTACAAAGAAAGGAGATTCCT
TTGGTTTGAATTACGCTCTCTCTCTTATTATTGCACGGATCCTTGGTGTATTGGAGGAGACTTTAATATTACTCGATGGGTTCATGAACGATTTCCAGTAGGAAGGCAAA
CGAAAGGGATGCGTAGATTTAACAAATTCATTGAAGACTCGGGTCTTATGGAAATTCCTTTATCAAATGGTAAATTTACATGGTCTAGGGATGGAAACGCTTATTCTCAC
TCTCTTATTGATAGATTTTTGGTGACAAAAGAATGGGATGTGTTATTTGATAATTCCAGAGTATCAAGGAAGGCACGCATATTTTCTGATCATTTTCCTCTTTTATTAGA
AGCTGGTTCTTTTATGTGGGGACCAAGTCCTTTCAGGTTTTATAATAGTTGGCTTTCTCAAGCGGAATGTGATAGGATTATTTTGGATTCTCTTTCCATTGATCGATCAC
AAGGATGGGCTGGTTTTGTTATTAGCTCCAAATTCAGAAATTTAAAAGTTGCCATTAAGAAGTGGTTTGCAGAATTTGAAGATAGCAGAAAAAGTAAAGAGAAAAATTTG
CTTTTTGAACTTGAATTCTTTGATGCAAAGGCTGAAGAATCTCTTTTATCTGATGAAGAGTTGGATATTCTCTTGGCTATAAAAGGCGAAATTATGGGTTTATACATGTC
TGATGAAAGAAATTTAATTAAAAAATGTAAGCTTAATTGGCTTAAGCTTGGTGATGAGAATACAAGTTTTTTCCATCGATTTTTAGCAGCCAAGAAGAGGAAGAACTTGA
TTACTGATTTAATTTCCAGCAATGGTGTTTCTTTAGTTTCCTTCAGGGAAATTGAACAAGAAATTCTGGATTTCTTTTCTTTATCAGAAAATTCCAGAATTGCCTTCTGG
CTTGATTCTTGGGTTGATGATCTTCCTTTTTGTTCAAAGTATCCTAGTTTATTTCGGATTGCTTCTCTTCCTAATGCCTCCGTTTTGGATCATTGGGATGGGGAGACTCT
CTCATGGAATATTTCCTTTCGTCGGCTTCTCAAAGAGGAAGAAATTTCTGATTTTCAGCAGTTGTTGGTCTGTTTAAATGATGCCATTGTATCTGAATTTTCAGATTCTC
GTATTTGGTCTCTTGAGAATTCGGGACTATATTCGTGTTGGGGAAAGCTATTGTCTATCTTCAAGCTTCAATGGGTTCTGGATCAGTCATTCAAAGAAAATGTGCAGCAA
CTTTTAAGTGGTCCATCAGTTAAGCCGCATCCTAGTCATTTTCCATTTTTTAAGCATGCTCGAGATGAATCAGCAGCAACACGTCTGTCTATGAGTTTGCATCAAGAGGA
TAAGCTACACAGTATGGGAGGTGAAAAGTTTTATGCGGAAGTGGTAAAGATGAATCCTATGGAAAATCTCAGTACCAAAGACTCTTCAGTACAAAAAGTTGTTATAAAGA
AGTCTTCTTCCATTAGCTCTTATTGGGTTCGTAATGATCATGAGGTGCTAAGTTTAGATTTTGATAATTTATGGGCAGTGACTAGGTTATTCGCCCATAATGATTGGAAT
AAGATTAAAGCTTCACTAGAAGATTATTTCCAATCAAAAGTAATGATTAATCCACTTTTTGATGATAAAGCCTCGATCAAATTTGGTGAAGATATCCAGGATAGTCCAAA
GGTGCAACGAAACTTATGTGGATTTATACCCGTCTCAATTGAAGTCAAAGATAAAAAAAGAGGGAATATCCTTCTTCACTTTGGAGACATTGAAGCATTGGACCCTCCAA
ACATCATTGATAGAGAGCTTCATGTGAATGGTTTTCAGAATCCAATGGATCTTTTCCGGCTTAATAAGGTAATGGATGATGAAGGTTTTGGCGATTCTCAAGTTTGGAAT
TCAAAGGCTGGTGTAGTTTGTAATGAAGTACCAAAGAAAGTTAATGATGAGATTGCTTTAAACATGAAGGTCATGCAAGAAGAGGGCATTAATTTGGGGATAAATCACGA
TGTGAATATTGAAAAATTGAATGAAGTGGTGCCAACCAGATTTTATGGGTCCCAATGTGAAAAGTTGTCAACCCTTTCTCCTTCCAATCCTTGTACTTACCAAAAATCTT
GGTCTCTGTGTGAGGATAATCCAGTTTCAGCAGAAGTGTTGTGTGAAAAGATTAATATCTGTAGCCAAAGAATCAAGACTTCTCAGATGCCTTCATCTCACAAATCTTCT
TTAAGGAGCATGAATCATTATCCTCTTTATTATACCAGGAAGAAAGACATTAGCTTAAGCAGCGAGGAGATAGAAGATCAAGTAGTTGAAGCAGATAGTGATGGAATTGT
TGCTGAAGAGTCTTTCACGGAAGCTTTTGAAACTCTGTTTATGGATGCTAATAATGAACAAGTCAATGACTCTTCTCTTGGTATAGTTTCAGAAGTAAATTATTCTTCAA
GCCCTTCTAAATTTTCCTCTCTTATTGAGGTGTGTGGTATACAGTTACGTGAAATTCCCCCATTGTTACCTCAGACAAACAAAGGGAATGAAAGGTTTAATAAGTTGATT
AAAGACCTTGATCTTATGGAAATTCTTTTGTCTAACGGAAAGTTCACTTGGTCACGAATTGGTAATGAGTCATCTTACTCTCTGATGGATAGGTTCCTTGTTTCAAAGGA
ATGTGATAATTTGTTTGATAATTCTAGAGTTTCAAGGCAAGCCTGTACACTCTCAGATCATTTCCTCTTATTGCTAGAAGCTGGAAATTTTATTTGGAGACCTTCTCCAT
TTCGGTTTTATAATAGTTGGTTACCTTTGCCAGATTGTGTGTCTATTATTGAGAATTCTGTTACTCAAGATCTTTCTTATGGATGGGCTGGGTTTGTAATTGCTTCTAAA
CTCTAG
Protein sequenceShow/hide protein sequence
MEVISCCIQNRYFCTWREGNIHFVEDTCNKRLIPLSISFLQWFEKVLVEILQNPVSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYV
FWEMIRDFILKIHSNENQPIRSLLSKEESLPVFDKVSAGHASSNSYAEVVKRGGSLKSSVSLNDSIRNAKGINEEAYWVRKNCDVLKLDLERSIVVSRLMAQYSWKDVKI
ALENFFKTFVLVNPFMDDKALIHAADGGLEFSANGKWKKFGNLHLKLDFWSSEIHSQPKSIKSYGGWLAIRNIPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAF
IEVEKNFCGFIPADINVKIGNKYEFSLRYGDINSLENRNLNFDSRKQLDANDFSNSLDLIRVRQVILDEESDIVNKEDRMNELPAFSRHEEAFNEDLDISKDVSAQDKYL
NGELVLSMDTSVQDQNLKERVQVNEMLGSPKGASLHDRCINNAGCKGFNARINEPTLALSPSLNDNEFNESGPQEAQQFQVFELSYKNDNAVNGILNHDVQQVALKTYSR
KKCSLSSAVMTNFKTNFNADHLESDCTHLIAGNKASGSAIINAGNGLSQAKVFKESSIQIPGGSNVFVRGIGSSFNHSIHSPVDSDDESMVSVSSEDSDQLLDKEDNVEQ
FSDDQIGESLESLFCEKVDGLGSQIIHESLLSPSQIPNQFSSIVDTCGFQLCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLE
FLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSH
SLIDRFLVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNL
LFELEFFDAKAEESLLSDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEILDFFSLSENSRIAFW
LDSWVDDLPFCSKYPSLFRIASLPNASVLDHWDGETLSWNISFRRLLKEEEISDFQQLLVCLNDAIVSEFSDSRIWSLENSGLYSCWGKLLSIFKLQWVLDQSFKENVQQ
LLSGPSVKPHPSHFPFFKHARDESAATRLSMSLHQEDKLHSMGGEKFYAEVVKMNPMENLSTKDSSVQKVVIKKSSSISSYWVRNDHEVLSLDFDNLWAVTRLFAHNDWN
KIKASLEDYFQSKVMINPLFDDKASIKFGEDIQDSPKVQRNLCGFIPVSIEVKDKKRGNILLHFGDIEALDPPNIIDRELHVNGFQNPMDLFRLNKVMDDEGFGDSQVWN
SKAGVVCNEVPKKVNDEIALNMKVMQEEGINLGINHDVNIEKLNEVVPTRFYGSQCEKLSTLSPSNPCTYQKSWSLCEDNPVSAEVLCEKINICSQRIKTSQMPSSHKSS
LRSMNHYPLYYTRKKDISLSSEEIEDQVVEADSDGIVAEESFTEAFETLFMDANNEQVNDSSLGIVSEVNYSSSPSKFSSLIEVCGIQLREIPPLLPQTNKGNERFNKLI
KDLDLMEILLSNGKFTWSRIGNESSYSLMDRFLVSKECDNLFDNSRVSRQACTLSDHFLLLLEAGNFIWRPSPFRFYNSWLPLPDCVSIIENSVTQDLSYGWAGFVIASK
L