; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036612 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036612
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionFlap endonuclease 1
Genome locationchr3:49320791..49336761
RNA-Seq ExpressionLag0036612
SyntenyLag0036612
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR006084 - XPG/Rad2 endonuclease
IPR006085 - XPG, N-terminal
IPR006086 - XPG-I domain
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR019974 - XPG conserved site
IPR023426 - Flap endonuclease 1
IPR025724 - GAG-pre-integrase domain
IPR029060 - PIN-like domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN67064.1 hypothetical protein VITISV_017541 [Vitis vinifera]4.7e-17243.45Show/hide
Query:  EQIDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSC-LNSSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAGKGTISLTKNITLQSVLH
        ++++ LL LLK   +SG  SVSLA TGN   ALS    S+PW+IDSGASDHMT+SS +F SYSP   N+++RIADG+F+PIAGKG I +++ I L+SVLH
Subjt:  EQIDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSC-LNSSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAGKGTISLTKNITLQSVLH

Query:  VPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSSSVKETIMLWHRRLGHPK-DFLKDK-GIFH
        VPKL CNLLSVSK+S+D+NC V F+E+H IFQDQ SG+ IG ARM++GLYYF+D+  S+K  QGLSS+SS  V++ I+ WH RLGHP   +LK    +  
Subjt:  VPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSSSVKETIMLWHRRLGHPK-DFLKDK-GIFH

Query:  QSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAV--------LTAAFF---INRMPSKILAFKTPLDQFRKFYPT----------------
        Q     + Q       K++    +++   +Y   P YL+   V        L    F   +  +PS IL      +  ++  PT                
Subjt:  QSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAV--------LTAAFF---INRMPSKILAFKTPLDQFRKFYPT----------------

Query:  ----------VYTRRTLHQKNGDQIVDLSQYQSNAPTNDTED-SGN------------QSLSDIS-----------------------------DLDIPI
                  VY+R+ +  ++ +Q +  +  Q  A  N + + SGN             S+ D+S                             DLD+PI
Subjt:  ----------VYTRRTLHQKNGDQIVDLSQYQSNAPTNDTED-SGN------------QSLSDIS-----------------------------DLDIPI

Query:  AHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTIKCKADGDIERY
        A RKG   CTK+PIA Y+SY  L DN+ AF + I+ L   RNIQEA ++P+W LAV EEMNALK+N TW+VV+LP++KK VGCKWVFTIK KADG +ERY
Subjt:  AHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTIKCKADGDIERY

Query:  KARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT----------------YHQSQADHTIFYKHTGNDK
        KARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLS++VNS+WPL+QLDVKNAFLNGDLEEEVF +                Y QSQADHT+FYKH+   K
Subjt:  KARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT----------------YHQSQADHTIFYKHTGNDK

Query:  MVVLIVYVDDIILT--------------------------------------------------------------------------------------
        + +LIVYVDDI+LT                                                                                      
Subjt:  MVVLIVYVDDIILT--------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------GSTTDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRA
                                                                  GS  DRRSTSGYCSFVGGNLV W+SKKQ+V+A+SS EAEFRA
Subjt:  ----------------------------------------------------------GSTTDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRA

Query:  LAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIP
        +AHGICE +WI+RLLE LK   + PM++YCDNKA IS+A N VLHDRTKH+EVDKHFIKEKID G++ +P
Subjt:  LAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIP

CAN72141.1 hypothetical protein VITISV_017108 [Vitis vinifera]4.1e-19237.17Show/hide
Query:  TANVVDSHQFNQEQIDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSC-LNSSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAGKGTIS
        TAN  ++  F  EQ++ LL LLK   +SG  SVSLA TGN   ALSC   S+PW+IDSGASDHMT+SS +F SYSP   N++++IADG+F+PIAGKG I 
Subjt:  TANVVDSHQFNQEQIDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSC-LNSSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAGKGTIS

Query:  LTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSSSVKETIMLWHRRLGHP-
        +++ I L+ VLHVPKL CNLL VSK+S+D NC V F+E+HCIFQD+ S + IG ARM++GLYYF+D+  S+K  QGLSS+SS  V++ IM+WH +LG P 
Subjt:  LTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSSSVKETIMLWHRRLGHP-

Query:  ----------------------------------------------------------------------------------------------KDFLK-
                                                                                                      K+F K 
Subjt:  ----------------------------------------------------------------------------------------------KDFLK-

Query:  ------------------------------DKGIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAVLTAAFFINRMPSKILAFKTPL
                                       KGI HQS+C DTPQQNGIA+RKN+HLL+VARA+MFYM++P YLW DA+LTA++ INRMP+KIL + TPL
Subjt:  ------------------------------DKGIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAVLTAAFFINRMPSKILAFKTPL

Query:  DQFRKFYP--------------------------------------------------------------------------------------------
           +K +P                                                                                            
Subjt:  DQFRKFYP--------------------------------------------------------------------------------------------

Query:  -----------------------------------------------TVYTRRTLHQKNGDQIVDLSQYQSNAPTNDTED-SGN------------QSLS
                                                        VY+R+ +  ++ DQ +  +  Q  A  N + + SGN             S++
Subjt:  -----------------------------------------------TVYTRRTLHQKNGDQIVDLSQYQSNAPTNDTED-SGN------------QSLS

Query:  DIS-----------------------------DLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNA
        D+S                             DLD+PIA RKG + CTK+ IA Y+SY  LSDNH+AF + I+ L +PRNIQEA ++P+W LAV +EMNA
Subjt:  DIS-----------------------------DLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNA

Query:  LKQNCTWDVVELPKDKKTVGCKWVFTIKCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEV
        LK+N TW+ V+LP++KK VGCKWVFTIK KADG +ERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLS++VNS+WPL+QLDVKNAFLNGDLEEEV
Subjt:  LKQNCTWDVVELPKDKKTVGCKWVFTIKCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEV

Query:  FRT-------------------------------------------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILT---------------------
        F +                                           Y QSQA+HT+FYKH+   K+V+LIVYVDDI+LT                     
Subjt:  FRT-------------------------------------------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILT---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------GSTTDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEEL
                                                GS  DRRSTSGYCSFV GNLV WRSKKQ+V+A+SSAEAEFR +AHG CE +WI+RLLEEL
Subjt:  ----------------------------------------GSTTDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEEL

Query:  KFVQTQPMRVYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTKGLPKLRFDKLINKLTMEDIFKPA
        K   + PM++YCDNKAAIS+AHNPVLHDRTKH+EVDK FIKEKID   +C+ Y+PT EQ A+V TK L K +FD L+ KL MEDIFK A
Subjt:  KFVQTQPMRVYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTKGLPKLRFDKLINKLTMEDIFKPA

GAU39772.1 hypothetical protein TSUD_220160 [Trifolium subterraneum]1.6e-20440.67Show/hide
Query:  NQSTSTANVVDSHQFNQEQIDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSCLN-SSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAG
        N+  ++AN   S  F +EQ+D L KLL+  SS   P  ++AQTG    ALS  N S+PW+IDSGAS+HMT+ S LF+SY     +E++RIADGS++ IAG
Subjt:  NQSTSTANVVDSHQFNQEQIDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSCLN-SSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAG

Query:  KGTISLTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSS-SVKETIMLWHR
        KG I ++++ITLQSVLHVPK ACNLLSV K+SKD NC V F  + C+FQDQ+SG+MIG AR ++GLYY D++P  +KK   L S S   SV + +MLWHR
Subjt:  KGTISLTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSS-SVKETIMLWHR

Query:  RLGHP-----------------------------KD----------------------------------------------------------------
        RLGHP                             KD                                                                
Subjt:  RLGHP-----------------------------KD----------------------------------------------------------------

Query:  ---------------------------------FLKDKGIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAVLTAAFFINRMPSKIL
                                         FL  KGI HQSTCRDTPQQNGIAERKNRHLL+V RAIM  M+VP YLW +A+LTA + INRMP+++L
Subjt:  ---------------------------------FLKDKGIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAVLTAAFFINRMPSKIL

Query:  AFKTPLDQFRKFYPT----------------------------------------------------------------------------------VYT
         ++TPL   +K +PT                                                                                  VY 
Subjt:  AFKTPLDQFRKFYPT----------------------------------------------------------------------------------VYT

Query:  RRTLHQKNGDQIVDLSQYQSNAPTND-------TEDSGNQSLS--------------------DISDLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHK
        R+  H+     I+  +  QS++P+         T   GN S S                    +I DLD+PIA RK  R CTK+PI+NYLSY +LS  HK
Subjt:  RRTLHQKNGDQIVDLSQYQSNAPTND-------TEDSGNQSLS--------------------DISDLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHK

Query:  AFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVE-LPKDKKTVGCKWVFTIKCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKI
        A+ SRI+NLF+PR +QEA  DPNW LAV EEM+AL++N TW + + LPK KK VGCKWVFT+KCKADG +ERYKARLVAKGFTQT+GIDYQETFAPVAKI
Subjt:  AFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVE-LPKDKKTVGCKWVFTIKCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKI

Query:  NSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT------------------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILT-----------
        NSIR+LLS++VN +W L+Q DVKNAFLNG+L EEV+ +                  + QSQADHT+F+KH+   K+ +LIVYVDDII+T           
Subjt:  NSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT------------------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILT-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------GSTTDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAA
                               G+  DRRSTSGYC+FVGGN V WRSKKQSV+A+SSAEAEFR++AHG CE +WIK+ +EELK     P++VYCDNKAA
Subjt:  -----------------------GSTTDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAA

Query:  ISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTKGLPKLRFDKLINKLTMEDIFKPA
        ISIAHNPV HDRTKH+EVDKHFIKEKID+G IC+ Y+PTT Q ADVLTK LPK +FD ++ KL M DIF PA
Subjt:  ISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTKGLPKLRFDKLINKLTMEDIFKPA

XP_024044151.1 uncharacterized protein LOC18046468 isoform X1 [Citrus clementina]2.3e-17134.68Show/hide
Query:  HSPGIDYFLFRSNNQSTSTANVVDSHQFNQEQIDQLLKLLKVTSSSGNPS--VSLAQTGNYPLALSCL--NSSPWVIDSGASDHMTSSSLLFTSYSPLYC
        HS G   F     NQ T+     +S  F +EQ++QL + L  + S  NPS   SLAQ GN   AL  +     PW+IDSGA+DHMTS S LF+SY P   
Subjt:  HSPGIDYFLFRSNNQSTSTANVVDSHQFNQEQIDQLLKLLKVTSSSGNPS--VSLAQTGNYPLALSCL--NSSPWVIDSGASDHMTSSSLLFTSYSPLYC

Query:  NEQIRIADGSFTPIAGKGTISLTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSS
        +++I+IADGS + +AGKG+I ++ N+ L SVLHVP L+CNLLSVSKI+KD +C   F  ++C FQD  SG+ IG AR +DGLYYF++  +   + Q  ++
Subjt:  NEQIRIADGSFTPIAGKGTISLTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSS

Query:  VSSSSVKETIMLWHRRLGHPK-------------------------------------------------------------------------------
          + S+++ IMLWH RLGHP                                                                                
Subjt:  VSSSSVKETIMLWHRRLGHPK-------------------------------------------------------------------------------

Query:  -----------------------------------------------DFLKDKGIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAV
                                                        +  + GI HQS+C DTPQQNG+AERKNRHLL+VAR++MF   VP   W +A+
Subjt:  -----------------------------------------------DFLKDKGIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAV

Query:  LTAAFFINRMPSKILAFKTPLDQFRKFYP----------------------------------------------------------------TVYTRRT
        LTA++ INRMP++I  F++PL+ F K YP                                                                T +  R+
Subjt:  LTAAFFINRMPSKILAFKTPLDQFRKFYP----------------------------------------------------------------TVYTRRT

Query:  LHQKNGDQIVDLSQ-------------------------------------------------------------YQSNAPTNDTEDSGNQSLSDIS---
           K   Q  D +Q                                                               S+  T + E +GN   + IS   
Subjt:  LHQKNGDQIVDLSQ-------------------------------------------------------------YQSNAPTNDTEDSGNQSLSDIS---

Query:  -----DLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTI
             DLD+PIA RKG R+CT +PI+ Y+SYHRLS   +AF + ++ + IP+++Q+A + P W  AV  EM AL++N TW++V+LP++KK VGCKW+FT+
Subjt:  -----DLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTI

Query:  KCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT------------------------
        K +ADG +ERYKARLVAKGFTQTYGIDYQETFAPVAK+NSIRVLLS++ +  W L QLDVKNAFLNG+LEEEV+                          
Subjt:  KCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT------------------------

Query:  -------------------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILTGSTT--------------------------------------------
                           Y QSQADHT+F+KH  +  +V++IVYVDDIILTGS                                              
Subjt:  -------------------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILTGSTT--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------DRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAAISIAHNPVLH
                         DRRSTSGYC+FVGGNLV WRSKKQSV+A+SSAEAEFRA+AHGICE +W+K+LLEELK +   PM++YCDNKAAI+IAHNPV H
Subjt:  -----------------DRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAAISIAHNPVLH

Query:  DRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTKGLPKLRFDKLINKLTMEDIFKPA
        DRTKH+EVD+HFIKEKI++G+IC+ ++PTT+Q AD+LTKGL K  F+ L +KL M DIF+PA
Subjt:  DRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTKGLPKLRFDKLINKLTMEDIFKPA

XP_024044152.1 uncharacterized protein LOC18046468 isoform X2 [Citrus clementina]2.3e-17134.68Show/hide
Query:  HSPGIDYFLFRSNNQSTSTANVVDSHQFNQEQIDQLLKLLKVTSSSGNPS--VSLAQTGNYPLALSCL--NSSPWVIDSGASDHMTSSSLLFTSYSPLYC
        HS G   F     NQ T+     +S  F +EQ++QL + L  + S  NPS   SLAQ GN   AL  +     PW+IDSGA+DHMTS S LF+SY P   
Subjt:  HSPGIDYFLFRSNNQSTSTANVVDSHQFNQEQIDQLLKLLKVTSSSGNPS--VSLAQTGNYPLALSCL--NSSPWVIDSGASDHMTSSSLLFTSYSPLYC

Query:  NEQIRIADGSFTPIAGKGTISLTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSS
        +++I+IADGS + +AGKG+I ++ N+ L SVLHVP L+CNLLSVSKI+KD +C   F  ++C FQD  SG+ IG AR +DGLYYF++  +   + Q  ++
Subjt:  NEQIRIADGSFTPIAGKGTISLTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSS

Query:  VSSSSVKETIMLWHRRLGHPK-------------------------------------------------------------------------------
          + S+++ IMLWH RLGHP                                                                                
Subjt:  VSSSSVKETIMLWHRRLGHPK-------------------------------------------------------------------------------

Query:  -----------------------------------------------DFLKDKGIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAV
                                                        +  + GI HQS+C DTPQQNG+AERKNRHLL+VAR++MF   VP   W +A+
Subjt:  -----------------------------------------------DFLKDKGIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAV

Query:  LTAAFFINRMPSKILAFKTPLDQFRKFYP----------------------------------------------------------------TVYTRRT
        LTA++ INRMP++I  F++PL+ F K YP                                                                T +  R+
Subjt:  LTAAFFINRMPSKILAFKTPLDQFRKFYP----------------------------------------------------------------TVYTRRT

Query:  LHQKNGDQIVDLSQ-------------------------------------------------------------YQSNAPTNDTEDSGNQSLSDIS---
           K   Q  D +Q                                                               S+  T + E +GN   + IS   
Subjt:  LHQKNGDQIVDLSQ-------------------------------------------------------------YQSNAPTNDTEDSGNQSLSDIS---

Query:  -----DLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTI
             DLD+PIA RKG R+CT +PI+ Y+SYHRLS   +AF + ++ + IP+++Q+A + P W  AV  EM AL++N TW++V+LP++KK VGCKW+FT+
Subjt:  -----DLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTI

Query:  KCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT------------------------
        K +ADG +ERYKARLVAKGFTQTYGIDYQETFAPVAK+NSIRVLLS++ +  W L QLDVKNAFLNG+LEEEV+                          
Subjt:  KCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT------------------------

Query:  -------------------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILTGSTT--------------------------------------------
                           Y QSQADHT+F+KH  +  +V++IVYVDDIILTGS                                              
Subjt:  -------------------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILTGSTT--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------DRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAAISIAHNPVLH
                         DRRSTSGYC+FVGGNLV WRSKKQSV+A+SSAEAEFRA+AHGICE +W+K+LLEELK +   PM++YCDNKAAI+IAHNPV H
Subjt:  -----------------DRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAAISIAHNPVLH

Query:  DRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTKGLPKLRFDKLINKLTMEDIFKPA
        DRTKH+EVD+HFIKEKI++G+IC+ ++PTT+Q AD+LTKGL K  F+ L +KL M DIF+PA
Subjt:  DRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTKGLPKLRFDKLINKLTMEDIFKPA

TrEMBL top hitse value%identityAlignment
A0A2Z6NTX3 Integrase catalytic domain-containing protein7.8e-20540.67Show/hide
Query:  NQSTSTANVVDSHQFNQEQIDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSCLN-SSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAG
        N+  ++AN   S  F +EQ+D L KLL+  SS   P  ++AQTG    ALS  N S+PW+IDSGAS+HMT+ S LF+SY     +E++RIADGS++ IAG
Subjt:  NQSTSTANVVDSHQFNQEQIDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSCLN-SSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAG

Query:  KGTISLTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSS-SVKETIMLWHR
        KG I ++++ITLQSVLHVPK ACNLLSV K+SKD NC V F  + C+FQDQ+SG+MIG AR ++GLYY D++P  +KK   L S S   SV + +MLWHR
Subjt:  KGTISLTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSS-SVKETIMLWHR

Query:  RLGHP-----------------------------KD----------------------------------------------------------------
        RLGHP                             KD                                                                
Subjt:  RLGHP-----------------------------KD----------------------------------------------------------------

Query:  ---------------------------------FLKDKGIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAVLTAAFFINRMPSKIL
                                         FL  KGI HQSTCRDTPQQNGIAERKNRHLL+V RAIM  M+VP YLW +A+LTA + INRMP+++L
Subjt:  ---------------------------------FLKDKGIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAVLTAAFFINRMPSKIL

Query:  AFKTPLDQFRKFYPT----------------------------------------------------------------------------------VYT
         ++TPL   +K +PT                                                                                  VY 
Subjt:  AFKTPLDQFRKFYPT----------------------------------------------------------------------------------VYT

Query:  RRTLHQKNGDQIVDLSQYQSNAPTND-------TEDSGNQSLS--------------------DISDLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHK
        R+  H+     I+  +  QS++P+         T   GN S S                    +I DLD+PIA RK  R CTK+PI+NYLSY +LS  HK
Subjt:  RRTLHQKNGDQIVDLSQYQSNAPTND-------TEDSGNQSLS--------------------DISDLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHK

Query:  AFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVE-LPKDKKTVGCKWVFTIKCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKI
        A+ SRI+NLF+PR +QEA  DPNW LAV EEM+AL++N TW + + LPK KK VGCKWVFT+KCKADG +ERYKARLVAKGFTQT+GIDYQETFAPVAKI
Subjt:  AFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVE-LPKDKKTVGCKWVFTIKCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKI

Query:  NSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT------------------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILT-----------
        NSIR+LLS++VN +W L+Q DVKNAFLNG+L EEV+ +                  + QSQADHT+F+KH+   K+ +LIVYVDDII+T           
Subjt:  NSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT------------------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILT-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------GSTTDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAA
                               G+  DRRSTSGYC+FVGGN V WRSKKQSV+A+SSAEAEFR++AHG CE +WIK+ +EELK     P++VYCDNKAA
Subjt:  -----------------------GSTTDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAA

Query:  ISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTKGLPKLRFDKLINKLTMEDIFKPA
        ISIAHNPV HDRTKH+EVDKHFIKEKID+G IC+ Y+PTT Q ADVLTK LPK +FD ++ KL M DIF PA
Subjt:  ISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTKGLPKLRFDKLINKLTMEDIFKPA

A0A438FY12 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-16939.44Show/hide
Query:  DSHQFNQEQIDQLLKLL-------KVTSSSGNP---SVSLAQTGNYPLALSC--LNSSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIA
        ++  FN+E+++ L KLL         T SS N    S +LA  GN+  A +       PW++DSGAS+HMT  + +F +YS    N  +RIADGS + +A
Subjt:  DSHQFNQEQIDQLLKLL-------KVTSSSGNP---SVSLAQTGNYPLALSC--LNSSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIA

Query:  GKGTISLTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQ-----GLSSVSSSSVKETI
          G++ L++++TL SVL VP L CNLLS+SK++K+  C   F  THC FQD DSG+ IG A    GLY   +     ++ Q        SVS  +    I
Subjt:  GKGTISLTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQ-----GLSSVSSSSVKETI

Query:  MLWHRRLGHPKDFLKDKGIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAVLTAAFFINRMPSK------------ILAFK------
         LWH RLG   +FL  +GI H S+C DTPQQNGIAERKNRHLL+VAR++MF M+VP   W  AVLTAA+ INRMPS+            + +F       
Subjt:  MLWHRRLGHPKDFLKDKGIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAVLTAAFFINRMPSK------------ILAFK------

Query:  ---TPLDQFRKFYPTV--YTRRTLHQKNGDQIVDLSQYQSNAPTNDTEDSGNQSLSDISD-------LDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHK
            PL+ F +    V  + +  + ++  +  +    +++ +  N ++  GN +     D       L++PIA RKGVR+CT++PI N++SY +LS   +
Subjt:  ---TPLDQFRKFYPTV--YTRRTLHQKNGDQIVDLSQYQSNAPTNDTEDSGNQSLSDISD-------LDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHK

Query:  AFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTIKCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKIN
        AF S I  + +P+NIQEA   P W  AV EE+ AL++N TW++ +LP+ KK VGCKW+FT+K KADG+++RYKARLVAKGFTQ+YGIDYQETFAP+AK+N
Subjt:  AFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTIKCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKIN

Query:  SIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRTYH-------------------------------------------QSQADHTIFYKHTGNDKM
        ++RVLLS++ N DW L+QLDVKNAFLNGDLEEEV+   H                                           Q Q+DHT+F KH    K+
Subjt:  SIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRTYH-------------------------------------------QSQADHTIFYKHTGNDKM

Query:  VVLIVYVDDIILT---------------------------------------------------------------------------------------
         ++IVYVDDIILT                                                                                       
Subjt:  VVLIVYVDDIILT---------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------GSTTDRRSTSGYCSFVGGNLVNWRSK
                                                                                  GS TDRRST GYCSFV GNLV WRSK
Subjt:  --------------------------------------------------------------------------GSTTDRRSTSGYCSFVGGNLVNWRSK

Query:  KQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTK
        KQSV+A+SSAEAEF A+A GICEGIW+ RLL+EL+     PM +YCDN+AAISIA NPV HDRTKH+E+D+HFIKEKI+ GV  + Y PT  QTAD+LTK
Subjt:  KQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTK

Query:  GLPKLRFDKLINKLTMEDIFKPA
         L ++ F+ L  KL M +I+  A
Subjt:  GLPKLRFDKLINKLTMEDIFKPA

A5ARX9 ABC transporter domain-containing protein2.3e-17243.45Show/hide
Query:  EQIDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSC-LNSSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAGKGTISLTKNITLQSVLH
        ++++ LL LLK   +SG  SVSLA TGN   ALS    S+PW+IDSGASDHMT+SS +F SYSP   N+++RIADG+F+PIAGKG I +++ I L+SVLH
Subjt:  EQIDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSC-LNSSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAGKGTISLTKNITLQSVLH

Query:  VPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSSSVKETIMLWHRRLGHPK-DFLKDK-GIFH
        VPKL CNLLSVSK+S+D+NC V F+E+H IFQDQ SG+ IG ARM++GLYYF+D+  S+K  QGLSS+SS  V++ I+ WH RLGHP   +LK    +  
Subjt:  VPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSSSVKETIMLWHRRLGHPK-DFLKDK-GIFH

Query:  QSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAV--------LTAAFF---INRMPSKILAFKTPLDQFRKFYPT----------------
        Q     + Q       K++    +++   +Y   P YL+   V        L    F   +  +PS IL      +  ++  PT                
Subjt:  QSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAV--------LTAAFF---INRMPSKILAFKTPLDQFRKFYPT----------------

Query:  ----------VYTRRTLHQKNGDQIVDLSQYQSNAPTNDTED-SGN------------QSLSDIS-----------------------------DLDIPI
                  VY+R+ +  ++ +Q +  +  Q  A  N + + SGN             S+ D+S                             DLD+PI
Subjt:  ----------VYTRRTLHQKNGDQIVDLSQYQSNAPTNDTED-SGN------------QSLSDIS-----------------------------DLDIPI

Query:  AHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTIKCKADGDIERY
        A RKG   CTK+PIA Y+SY  L DN+ AF + I+ L   RNIQEA ++P+W LAV EEMNALK+N TW+VV+LP++KK VGCKWVFTIK KADG +ERY
Subjt:  AHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTIKCKADGDIERY

Query:  KARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT----------------YHQSQADHTIFYKHTGNDK
        KARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLS++VNS+WPL+QLDVKNAFLNGDLEEEVF +                Y QSQADHT+FYKH+   K
Subjt:  KARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT----------------YHQSQADHTIFYKHTGNDK

Query:  MVVLIVYVDDIILT--------------------------------------------------------------------------------------
        + +LIVYVDDI+LT                                                                                      
Subjt:  MVVLIVYVDDIILT--------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------GSTTDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRA
                                                                  GS  DRRSTSGYCSFVGGNLV W+SKKQ+V+A+SS EAEFRA
Subjt:  ----------------------------------------------------------GSTTDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRA

Query:  LAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIP
        +AHGICE +WI+RLLE LK   + PM++YCDNKA IS+A N VLHDRTKH+EVDKHFIKEKID G++ +P
Subjt:  LAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIP

A5B683 Uncharacterized protein2.4e-16942.73Show/hide
Query:  IDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSC-LNSSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAGKGTISLTKNITLQSVLHVP
        I+ LL LLK   ++   SVSLA TGN   ALSC   S+PW+IDS ASDHMT+SS +F SYSP   N+++RIA G+F+PIA KG I +++ I L+SVLHVP
Subjt:  IDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSC-LNSSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAGKGTISLTKNITLQSVLHVP

Query:  KLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSSSVKETIMLWHRRLGHPKDFLKDKGIFHQSTC
        KL CNLLSVSK+S+D+NC V F+E+HCIFQDQ SG+ IG A+M++GLYYF+D+  S+K  QGLSS+SS SV++ IM        P +   + G+  +   
Subjt:  KLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSSSVKETIMLWHRRLGHPKDFLKDKGIFHQSTC

Query:  RDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAVLTAAFFINRMPSKILAFKTPLDQFRKFYPTVYTRRTLHQKNGDQIVDLSQYQSNAPTND-T
        R          +KNR+ L+    +     VP    + +++ A    +  P  +      +       PT      +H  +   + DLS      P+ + +
Subjt:  RDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAVLTAAFFINRMPSKILAFKTPLDQFRKFYPTVYTRRTLHQKNGDQIVDLSQYQSNAPTND-T

Query:  EDSGNQSLSDI-----SDLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKD
               L  I      DLD+PIA RKG R CTK+PI  Y+SY  LSDN++AF + I+ L +PRNIQEA ++P+W LAV EEMNALK+N TW+VV+LP++
Subjt:  EDSGNQSLSDI-----SDLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKD

Query:  KKTVGCKWVFTIKCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT------------
        KK VGCKWVFTIK KADG +ERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLS++VNS+WPL+QLDVKN FLNGDLEEEVF +            
Subjt:  KKTVGCKWVFTIKCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT------------

Query:  -------------------------------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILT------------------------------------
                                       Y QSQADHT+FYKH+   K+ +LIVYVDDI+LT                                    
Subjt:  -------------------------------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILT------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------GSTTDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNK
                                 GS  DRRST GYCSFVGGNLV WRSKKQ+V+A+SSA+AEFRA+AHGICE +WI+RLLEELK   +          
Subjt:  -------------------------GSTTDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNK

Query:  AAISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTKGLPKLRFDKLINKLTMEDIFKP
             +   VLHDRTKH+EVDKHFIKEKID G++C+ Y+PT EQ  DV TKGL K +FD L+ KL MEDIFKP
Subjt:  AAISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTKGLPKLRFDKLINKLTMEDIFKP

A5B9Y8 Integrase catalytic domain-containing protein2.0e-19237.17Show/hide
Query:  TANVVDSHQFNQEQIDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSC-LNSSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAGKGTIS
        TAN  ++  F  EQ++ LL LLK   +SG  SVSLA TGN   ALSC   S+PW+IDSGASDHMT+SS +F SYSP   N++++IADG+F+PIAGKG I 
Subjt:  TANVVDSHQFNQEQIDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSC-LNSSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAGKGTIS

Query:  LTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSSSVKETIMLWHRRLGHP-
        +++ I L+ VLHVPKL CNLL VSK+S+D NC V F+E+HCIFQD+ S + IG ARM++GLYYF+D+  S+K  QGLSS+SS  V++ IM+WH +LG P 
Subjt:  LTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSSSVKETIMLWHRRLGHP-

Query:  ----------------------------------------------------------------------------------------------KDFLK-
                                                                                                      K+F K 
Subjt:  ----------------------------------------------------------------------------------------------KDFLK-

Query:  ------------------------------DKGIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAVLTAAFFINRMPSKILAFKTPL
                                       KGI HQS+C DTPQQNGIA+RKN+HLL+VARA+MFYM++P YLW DA+LTA++ INRMP+KIL + TPL
Subjt:  ------------------------------DKGIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAVLTAAFFINRMPSKILAFKTPL

Query:  DQFRKFYP--------------------------------------------------------------------------------------------
           +K +P                                                                                            
Subjt:  DQFRKFYP--------------------------------------------------------------------------------------------

Query:  -----------------------------------------------TVYTRRTLHQKNGDQIVDLSQYQSNAPTNDTED-SGN------------QSLS
                                                        VY+R+ +  ++ DQ +  +  Q  A  N + + SGN             S++
Subjt:  -----------------------------------------------TVYTRRTLHQKNGDQIVDLSQYQSNAPTNDTED-SGN------------QSLS

Query:  DIS-----------------------------DLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNA
        D+S                             DLD+PIA RKG + CTK+ IA Y+SY  LSDNH+AF + I+ L +PRNIQEA ++P+W LAV +EMNA
Subjt:  DIS-----------------------------DLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNA

Query:  LKQNCTWDVVELPKDKKTVGCKWVFTIKCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEV
        LK+N TW+ V+LP++KK VGCKWVFTIK KADG +ERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLS++VNS+WPL+QLDVKNAFLNGDLEEEV
Subjt:  LKQNCTWDVVELPKDKKTVGCKWVFTIKCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEV

Query:  FRT-------------------------------------------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILT---------------------
        F +                                           Y QSQA+HT+FYKH+   K+V+LIVYVDDI+LT                     
Subjt:  FRT-------------------------------------------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILT---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------GSTTDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEEL
                                                GS  DRRSTSGYCSFV GNLV WRSKKQ+V+A+SSAEAEFR +AHG CE +WI+RLLEEL
Subjt:  ----------------------------------------GSTTDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEEL

Query:  KFVQTQPMRVYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTKGLPKLRFDKLINKLTMEDIFKPA
        K   + PM++YCDNKAAIS+AHNPVLHDRTKH+EVDK FIKEKID   +C+ Y+PT EQ A+V TK L K +FD L+ KL MEDIFK A
Subjt:  KFVQTQPMRVYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIPYLPTTEQTADVLTKGLPKLRFDKLINKLTMEDIFKPA

SwissProt top hitse value%identityAlignment
B4FHY0 Flap endonuclease 13.5e-11786.25Show/hide
Query:  MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK
        MGIKGLTKLLADNAPK MKEQKFESYFGRKIA+DASMSIYQFLIVVGR+G E LTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPD+KKQELAK
Subjt:  MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK

Query:  RYSKRADATEDLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSS
        RYSKR DAT+DL +A+EVG+K+ IEK SKRTVKVT+QHN+DCKRLLRLMGVPV+EAPSEAEA+CAALC   KV+AVASEDMDSLTFG+P+FLRHLMDPSS
Subjt:  RYSKRADATEDLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSS

Query:  RKIPVMEFEVGKILEELNLTMDQFVDLCILSGCDYCDSIR
        +KIPVMEF+V K+LEEL LTMDQF+DLCIL GCDYCDSI+
Subjt:  RKIPVMEFEVGKILEELNLTMDQFVDLCILSGCDYCDSIR

B8AW67 Flap endonuclease 1-A1.0e-11686.25Show/hide
Query:  MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK
        MGIKGLTKLLADNAPK MKEQKFESYFGR+IA+DASMSIYQFLIVVGR+G E LTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK
Subjt:  MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK

Query:  RYSKRADATEDLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSS
        RYSKR DAT++L +A+E G+K+ IEKFSKRTVKVTKQHN++CKRLLRLMGVPV+EAP EAEA+CAALC    VYAVASEDMDSLTFG+P+FLRHLMDPSS
Subjt:  RYSKRADATEDLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSS

Query:  RKIPVMEFEVGKILEELNLTMDQFVDLCILSGCDYCDSIR
        +KIPVMEFEV K+LEEL LTMDQF+DLCILSGCDYCDSI+
Subjt:  RKIPVMEFEVGKILEELNLTMDQFVDLCILSGCDYCDSIR

C5YUK3 Flap endonuclease 1-A7.7e-11785.83Show/hide
Query:  MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK
        MGIKGLTKLLADNAPK MKEQKFESYFGRKIAIDASMSIYQFLIVVGR+G E LTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPD+KK+ELAK
Subjt:  MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK

Query:  RYSKRADATEDLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSS
        R+SKR DAT DL +A+E G+K+ +EK SKRTVKVT QHNDDCKRLLRLMGVPV+EAPSEAEA+CAALCK  KV+AVASEDMDSLTFG+P+FLRHLMDPSS
Subjt:  RYSKRADATEDLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSS

Query:  RKIPVMEFEVGKILEELNLTMDQFVDLCILSGCDYCDSIR
        +KIPVMEF+V K+LEEL LTMDQF+DLCIL GCDYCDSI+
Subjt:  RKIPVMEFEVGKILEELNLTMDQFVDLCILSGCDYCDSIR

C6TEX6 Flap endonuclease 14.5e-12592.5Show/hide
Query:  MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK
        MGIKGLTKLLADNAPK MKE KFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMF+RTIRLLEAGIKPVYVFDGKPPDLKKQELAK
Subjt:  MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK

Query:  RYSKRADATEDLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSS
        RYSKRA+ATEDL++A+E  NKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPV+EAPSEAEAQCAALCK GKVY V SEDMDSLTFG+PKFLRHLMDPSS
Subjt:  RYSKRADATEDLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSS

Query:  RKIPVMEFEVGKILEELNLTMDQFVDLCILSGCDYCDSIR
        +KIPVMEFEV KILEELN+TMDQF+DLCILSGCDYCDSIR
Subjt:  RKIPVMEFEVGKILEELNLTMDQFVDLCILSGCDYCDSIR

O65251 Flap endonuclease 13.2e-12392.08Show/hide
Query:  MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK
        MGIKGLTKLLADNAP  MKEQKFESYFGRKIA+DASMSIYQFLIVVGR+GTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPP+LK+QELAK
Subjt:  MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK

Query:  RYSKRADATEDLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSS
        RYSKRADAT DL  AIE GNKEDIEK+SKRTVKVTKQHNDDCKRLLRLMGVPV+EA SEAEAQCAALCK GKVY VASEDMDSLTFG+PKFLRHLMDPSS
Subjt:  RYSKRADATEDLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSS

Query:  RKIPVMEFEVGKILEELNLTMDQFVDLCILSGCDYCDSIR
        RKIPVMEFEV KILEEL LTMDQF+DLCILSGCDYCDSIR
Subjt:  RKIPVMEFEVGKILEELNLTMDQFVDLCILSGCDYCDSIR

Arabidopsis top hitse value%identityAlignment
AT1G29630.2 5'-3' exonuclease family protein2.3e-1529.44Show/hide
Query:  MGIKGLTKLLAD-NAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELA
        MGI+GL  LL     P  +KE +     G  +A+D    +++  +   R   + L  +      H+Q   +R   L   G+KP+ VFDG P  +K ++  
Subjt:  MGIKGLTKLLAD-NAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELA

Query:  KRYSKRADATEDLADAIE---VGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLM
        KR   R    E+LA A+E    GN     +   + V ++     +  ++LR   V  + AP EA+AQ A L    +V A+ +ED D + FG  + +   M
Subjt:  KRYSKRADATEDLADAIE---VGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLM

Query:  DPSSRKIPVMEFEVGKILEELNLTMDQF-----VDLCILSGCDYCDSI
        D     +   EF+  K+ +  +L++  F     +++CILSGCDY  S+
Subjt:  DPSSRKIPVMEFEVGKILEELNLTMDQF-----VDLCILSGCDYCDSI

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.4e-4927.78Show/hide
Query:  VRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTIKCKADGDIERYKARLV
        V + T + I+ +LSY ++S  + +F   I     P    EA+    W  A+ +E+ A++   TW++  LP +KK +GCKWV+ IK  +DG IERYKARLV
Subjt:  VRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTIKCKADGDIERYKARLV

Query:  AKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT----------------------------------------
        AKG+TQ  GID+ ETF+PV K+ S++++L+IS   ++ L+QLD+ NAFLNGDL+EE++                                          
Subjt:  AKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRT----------------------------------------

Query:  -------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILTGST---------------------------------------------------------
               + QS +DHT F K T    + VL VYVDDII+  +                                                          
Subjt:  -------YHQSQADHTIFYKHTGNDKMVVLIVYVDDIILTGST---------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----TDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAAISIAHNPVLHDRTKHIEVDKHF
              RRST+GYC F+G +L++W+SKKQ V++KSSAEAE+RAL+    E +W+ +   EL+   ++P  ++CDN AAI IA N V H+RTKHIE D H 
Subjt:  ----TDRRSTSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAAISIAHNPVLHDRTKHIEVDKHF

Query:  IKEK
        ++E+
Subjt:  IKEK

AT5G26680.1 5'-3' exonuclease family protein2.3e-12492.08Show/hide
Query:  MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK
        MGIKGLTKLLADNAP  MKEQKFESYFGRKIA+DASMSIYQFLIVVGR+GTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPP+LK+QELAK
Subjt:  MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK

Query:  RYSKRADATEDLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSS
        RYSKRADAT DL  AIE GNKEDIEK+SKRTVKVTKQHNDDCKRLLRLMGVPV+EA SEAEAQCAALCK GKVY VASEDMDSLTFG+PKFLRHLMDPSS
Subjt:  RYSKRADATEDLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSS

Query:  RKIPVMEFEVGKILEELNLTMDQFVDLCILSGCDYCDSIR
        RKIPVMEFEV KILEEL LTMDQF+DLCILSGCDYCDSIR
Subjt:  RKIPVMEFEVGKILEELNLTMDQFVDLCILSGCDYCDSIR

AT5G26680.2 5'-3' exonuclease family protein2.3e-12492.08Show/hide
Query:  MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK
        MGIKGLTKLLADNAP  MKEQKFESYFGRKIA+DASMSIYQFLIVVGR+GTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPP+LK+QELAK
Subjt:  MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAK

Query:  RYSKRADATEDLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSS
        RYSKRADAT DL  AIE GNKEDIEK+SKRTVKVTKQHNDDCKRLLRLMGVPV+EA SEAEAQCAALCK GKVY VASEDMDSLTFG+PKFLRHLMDPSS
Subjt:  RYSKRADATEDLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSS

Query:  RKIPVMEFEVGKILEELNLTMDQFVDLCILSGCDYCDSIR
        RKIPVMEFEV KILEEL LTMDQF+DLCILSGCDYCDSIR
Subjt:  RKIPVMEFEVGKILEELNLTMDQFVDLCILSGCDYCDSIR

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.5e-1944.9Show/hide
Query:  PRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTIKCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSIS
        P+++  A  DP W  A+ EE++AL +N TW +V  P ++  +GCKWVF  K  +DG ++R KARLVAKGF Q  GI + ET++PV +  +IR +L+++
Subjt:  PRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTIKCKADGDIERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCATTAAGGGTTTAACCAAGCTTTTAGCTGACAATGCTCCCAAAGGCATGAAGGAGCAGAAGTTCGAAAGCTATTTCGGCCGCAAAATCGCAATCGATGCCAGCAT
GAGTATTTATCAATTTCTTATTGTCGTGGGAAGAAGTGGGACGGAGATGCTGACCAACGAAGCTGGTGAAGTCACAAGCCATTTGCAAGGGATGTTTAACCGGACAATAA
GGCTTCTCGAAGCTGGAATTAAGCCAGTCTATGTCTTTGATGGAAAGCCACCCGATTTGAAAAAACAAGAACTTGCAAAACGTTATTCAAAGAGAGCAGATGCTACTGAG
GACCTGGCAGATGCAATCGAGGTTGGCAACAAGGAGGACATCGAGAAATTCAGTAAAAGGACGGTGAAGGTTACAAAACAGCACAATGATGACTGTAAAAGACTCTTGAG
ACTCATGGGAGTGCCTGTGATTGAGGCTCCCTCGGAAGCTGAGGCACAATGTGCTGCGCTTTGCAAGTTAGGAAAGGTTTATGCTGTGGCGTCGGAAGACATGGATTCAT
TAACATTTGGGTCTCCCAAATTTCTTCGTCATTTAATGGATCCCAGCTCAAGGAAGATCCCAGTTATGGAATTTGAAGTTGGGAAGATTTTGGAGGAGTTGAACCTTACC
ATGGATCAATTTGTCGATTTGTGCATTCTTTCTGGATGTGATTATTGTGATAGTATTCGAGAGGATTTTAATAAGAAATACCTTTTGCACATGGTATCAGAGCAAAGGCA
ACCGCCGCCGTTACCTGAGTGCAAGGAGATCCGCAGCAGCCGTCTGATCCAATCGCGACCCACGACTAGGACTCGACCCGCGATTTTGCGCCGTGACCCGTCACCTCTGC
AAGTCGTTCCTTTCTGTCCACTCCGTGCTCGCGATTTTGCGCCGCCGCCGTTTGCGGTTTTGCTCCGTCAACTCCGACCGCTCGTTTTCGTCAACTCCGGCCACTCGCCT
GGGATAGATTACTTTCTTTTTCGGAGTAACAATCAGTCCACCTCCACTGCTAATGTTGTTGACTCTCATCAATTCAATCAAGAGCAAATTGATCAACTCCTGAAGCTGCT
AAAGGTCACTTCGTCATCTGGTAATCCTAGTGTTTCCTTGGCACAAACAGGTAATTATCCTCTAGCTTTATCTTGTCTTAATTCATCTCCGTGGGTTATAGACTCCGGAG
CATCTGACCATATGACTAGTTCCTCCCTTCTGTTTACCTCATACTCTCCCTTGTATTGCAATGAACAAATTCGAATTGCCGATGGTAGTTTTACTCCTATTGCAGGAAAA
GGAACTATCTCGTTGACTAAAAATATTACCTTACAATCTGTTCTCCATGTTCCTAAATTAGCCTGTAATTTGTTATCTGTCAGTAAAATTTCTAAGGATGCTAATTGTCG
TGTTACTTTCTTTGAAACTCACTGCATCTTTCAAGATCAGGACTCGGGGGAGATGATTGGGCGTGCTAGGATGCTGGATGGTCTCTACTATTTTGATGATTCTCCAACTA
GTGATAAAAAAGTTCAGGGCCTAAGTAGTGTTAGTTCTTCTTCTGTTAAAGAAACAATCATGCTTTGGCATCGTAGACTAGGACATCCCAAGGATTTTTTGAAGGATAAA
GGAATTTTCCACCAATCTACCTGTCGTGATACTCCACAACAAAATGGGATTGCTGAGCGAAAAAATAGACATTTACTCGATGTTGCTCGTGCTATTATGTTTTATATGCA
TGTTCCTCACTATTTGTGGGAAGACGCAGTCCTTACCGCTGCCTTTTTCATAAACCGGATGCCTTCTAAGATTTTGGCCTTTAAAACTCCTCTTGATCAATTTAGAAAAT
TTTACCCCACTGTTTATACTAGAAGGACATTACATCAAAAGAACGGGGATCAGATAGTGGACTTGTCACAATACCAATCTAATGCTCCGACAAATGATACTGAAGATTCA
GGTAACCAATCTTTATCTGATATCTCTGACCTTGATATCCCAATAGCCCATAGGAAAGGTGTCCGTAATTGCACCAAATACCCTATTGCAAATTACCTTTCTTATCATAG
ATTGTCTGATAATCATAAAGCCTTCAACTCTAGGATAACCAACCTATTTATTCCAAGGAACATACAGGAGGCCCAAAATGATCCGAATTGGAATTTAGCAGTTATGGAAG
AGATGAATGCGCTAAAACAGAATTGTACATGGGATGTAGTTGAACTACCAAAAGATAAGAAAACGGTAGGGTGCAAATGGGTATTCACTATAAAGTGCAAAGCTGATGGT
GATATTGAACGATACAAAGCCAGATTAGTTGCTAAAGGCTTTACCCAGACCTATGGAATTGATTATCAAGAAACATTTGCTCCAGTGGCTAAGATTAACTCTATAAGAGT
ACTTTTATCTATTTCTGTTAATTCAGATTGGCCTCTTTATCAGCTTGATGTAAAAAATGCATTCCTCAATGGTGATCTTGAAGAGGAAGTATTTAGGACTTACCACCAGA
GTCAAGCAGATCATACCATTTTCTATAAGCACACAGGAAATGACAAGATGGTTGTTTTAATAGTGTATGTTGATGATATCATCCTTACAGGTAGCACAACAGATAGGAGA
TCTACTTCTGGTTACTGTTCATTTGTTGGAGGAAATCTAGTTAACTGGCGTAGTAAAAAACAAAGTGTGCTAGCCAAAAGTAGTGCAGAAGCAGAGTTCAGAGCTTTAGC
TCATGGCATTTGTGAAGGTATTTGGATAAAACGACTGCTTGAAGAATTGAAATTTGTCCAAACACAACCTATGCGTGTCTACTGTGACAATAAGGCAGCCATCTCCATAG
CTCACAATCCAGTCCTTCACGATAGGACAAAGCATATTGAAGTTGATAAACACTTTATAAAGGAGAAGATTGATACTGGAGTAATATGTATCCCCTATCTCCCTACAACG
GAGCAAACTGCTGATGTGCTAACTAAGGGACTTCCAAAATTACGATTTGACAAGTTGATTAACAAGCTGACAATGGAAGATATCTTCAAACCAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCATTAAGGGTTTAACCAAGCTTTTAGCTGACAATGCTCCCAAAGGCATGAAGGAGCAGAAGTTCGAAAGCTATTTCGGCCGCAAAATCGCAATCGATGCCAGCAT
GAGTATTTATCAATTTCTTATTGTCGTGGGAAGAAGTGGGACGGAGATGCTGACCAACGAAGCTGGTGAAGTCACAAGCCATTTGCAAGGGATGTTTAACCGGACAATAA
GGCTTCTCGAAGCTGGAATTAAGCCAGTCTATGTCTTTGATGGAAAGCCACCCGATTTGAAAAAACAAGAACTTGCAAAACGTTATTCAAAGAGAGCAGATGCTACTGAG
GACCTGGCAGATGCAATCGAGGTTGGCAACAAGGAGGACATCGAGAAATTCAGTAAAAGGACGGTGAAGGTTACAAAACAGCACAATGATGACTGTAAAAGACTCTTGAG
ACTCATGGGAGTGCCTGTGATTGAGGCTCCCTCGGAAGCTGAGGCACAATGTGCTGCGCTTTGCAAGTTAGGAAAGGTTTATGCTGTGGCGTCGGAAGACATGGATTCAT
TAACATTTGGGTCTCCCAAATTTCTTCGTCATTTAATGGATCCCAGCTCAAGGAAGATCCCAGTTATGGAATTTGAAGTTGGGAAGATTTTGGAGGAGTTGAACCTTACC
ATGGATCAATTTGTCGATTTGTGCATTCTTTCTGGATGTGATTATTGTGATAGTATTCGAGAGGATTTTAATAAGAAATACCTTTTGCACATGGTATCAGAGCAAAGGCA
ACCGCCGCCGTTACCTGAGTGCAAGGAGATCCGCAGCAGCCGTCTGATCCAATCGCGACCCACGACTAGGACTCGACCCGCGATTTTGCGCCGTGACCCGTCACCTCTGC
AAGTCGTTCCTTTCTGTCCACTCCGTGCTCGCGATTTTGCGCCGCCGCCGTTTGCGGTTTTGCTCCGTCAACTCCGACCGCTCGTTTTCGTCAACTCCGGCCACTCGCCT
GGGATAGATTACTTTCTTTTTCGGAGTAACAATCAGTCCACCTCCACTGCTAATGTTGTTGACTCTCATCAATTCAATCAAGAGCAAATTGATCAACTCCTGAAGCTGCT
AAAGGTCACTTCGTCATCTGGTAATCCTAGTGTTTCCTTGGCACAAACAGGTAATTATCCTCTAGCTTTATCTTGTCTTAATTCATCTCCGTGGGTTATAGACTCCGGAG
CATCTGACCATATGACTAGTTCCTCCCTTCTGTTTACCTCATACTCTCCCTTGTATTGCAATGAACAAATTCGAATTGCCGATGGTAGTTTTACTCCTATTGCAGGAAAA
GGAACTATCTCGTTGACTAAAAATATTACCTTACAATCTGTTCTCCATGTTCCTAAATTAGCCTGTAATTTGTTATCTGTCAGTAAAATTTCTAAGGATGCTAATTGTCG
TGTTACTTTCTTTGAAACTCACTGCATCTTTCAAGATCAGGACTCGGGGGAGATGATTGGGCGTGCTAGGATGCTGGATGGTCTCTACTATTTTGATGATTCTCCAACTA
GTGATAAAAAAGTTCAGGGCCTAAGTAGTGTTAGTTCTTCTTCTGTTAAAGAAACAATCATGCTTTGGCATCGTAGACTAGGACATCCCAAGGATTTTTTGAAGGATAAA
GGAATTTTCCACCAATCTACCTGTCGTGATACTCCACAACAAAATGGGATTGCTGAGCGAAAAAATAGACATTTACTCGATGTTGCTCGTGCTATTATGTTTTATATGCA
TGTTCCTCACTATTTGTGGGAAGACGCAGTCCTTACCGCTGCCTTTTTCATAAACCGGATGCCTTCTAAGATTTTGGCCTTTAAAACTCCTCTTGATCAATTTAGAAAAT
TTTACCCCACTGTTTATACTAGAAGGACATTACATCAAAAGAACGGGGATCAGATAGTGGACTTGTCACAATACCAATCTAATGCTCCGACAAATGATACTGAAGATTCA
GGTAACCAATCTTTATCTGATATCTCTGACCTTGATATCCCAATAGCCCATAGGAAAGGTGTCCGTAATTGCACCAAATACCCTATTGCAAATTACCTTTCTTATCATAG
ATTGTCTGATAATCATAAAGCCTTCAACTCTAGGATAACCAACCTATTTATTCCAAGGAACATACAGGAGGCCCAAAATGATCCGAATTGGAATTTAGCAGTTATGGAAG
AGATGAATGCGCTAAAACAGAATTGTACATGGGATGTAGTTGAACTACCAAAAGATAAGAAAACGGTAGGGTGCAAATGGGTATTCACTATAAAGTGCAAAGCTGATGGT
GATATTGAACGATACAAAGCCAGATTAGTTGCTAAAGGCTTTACCCAGACCTATGGAATTGATTATCAAGAAACATTTGCTCCAGTGGCTAAGATTAACTCTATAAGAGT
ACTTTTATCTATTTCTGTTAATTCAGATTGGCCTCTTTATCAGCTTGATGTAAAAAATGCATTCCTCAATGGTGATCTTGAAGAGGAAGTATTTAGGACTTACCACCAGA
GTCAAGCAGATCATACCATTTTCTATAAGCACACAGGAAATGACAAGATGGTTGTTTTAATAGTGTATGTTGATGATATCATCCTTACAGGTAGCACAACAGATAGGAGA
TCTACTTCTGGTTACTGTTCATTTGTTGGAGGAAATCTAGTTAACTGGCGTAGTAAAAAACAAAGTGTGCTAGCCAAAAGTAGTGCAGAAGCAGAGTTCAGAGCTTTAGC
TCATGGCATTTGTGAAGGTATTTGGATAAAACGACTGCTTGAAGAATTGAAATTTGTCCAAACACAACCTATGCGTGTCTACTGTGACAATAAGGCAGCCATCTCCATAG
CTCACAATCCAGTCCTTCACGATAGGACAAAGCATATTGAAGTTGATAAACACTTTATAAAGGAGAAGATTGATACTGGAGTAATATGTATCCCCTATCTCCCTACAACG
GAGCAAACTGCTGATGTGCTAACTAAGGGACTTCCAAAATTACGATTTGACAAGTTGATTAACAAGCTGACAATGGAAGATATCTTCAAACCAGCTTGA
Protein sequenceShow/hide protein sequence
MGIKGLTKLLADNAPKGMKEQKFESYFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFNRTIRLLEAGIKPVYVFDGKPPDLKKQELAKRYSKRADATE
DLADAIEVGNKEDIEKFSKRTVKVTKQHNDDCKRLLRLMGVPVIEAPSEAEAQCAALCKLGKVYAVASEDMDSLTFGSPKFLRHLMDPSSRKIPVMEFEVGKILEELNLT
MDQFVDLCILSGCDYCDSIREDFNKKYLLHMVSEQRQPPPLPECKEIRSSRLIQSRPTTRTRPAILRRDPSPLQVVPFCPLRARDFAPPPFAVLLRQLRPLVFVNSGHSP
GIDYFLFRSNNQSTSTANVVDSHQFNQEQIDQLLKLLKVTSSSGNPSVSLAQTGNYPLALSCLNSSPWVIDSGASDHMTSSSLLFTSYSPLYCNEQIRIADGSFTPIAGK
GTISLTKNITLQSVLHVPKLACNLLSVSKISKDANCRVTFFETHCIFQDQDSGEMIGRARMLDGLYYFDDSPTSDKKVQGLSSVSSSSVKETIMLWHRRLGHPKDFLKDK
GIFHQSTCRDTPQQNGIAERKNRHLLDVARAIMFYMHVPHYLWEDAVLTAAFFINRMPSKILAFKTPLDQFRKFYPTVYTRRTLHQKNGDQIVDLSQYQSNAPTNDTEDS
GNQSLSDISDLDIPIAHRKGVRNCTKYPIANYLSYHRLSDNHKAFNSRITNLFIPRNIQEAQNDPNWNLAVMEEMNALKQNCTWDVVELPKDKKTVGCKWVFTIKCKADG
DIERYKARLVAKGFTQTYGIDYQETFAPVAKINSIRVLLSISVNSDWPLYQLDVKNAFLNGDLEEEVFRTYHQSQADHTIFYKHTGNDKMVVLIVYVDDIILTGSTTDRR
STSGYCSFVGGNLVNWRSKKQSVLAKSSAEAEFRALAHGICEGIWIKRLLEELKFVQTQPMRVYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIDTGVICIPYLPTT
EQTADVLTKGLPKLRFDKLINKLTMEDIFKPA