; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018108 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018108
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionADP-ribosylation factor 1
Genome locationscaffold9:29770389..29787705
RNA-Seq ExpressionSpg018108
SyntenySpg018108
Gene Ontology termsGO:0003924 - GTPase activity (molecular function)
GO:0005525 - GTP binding (molecular function)
InterPro domainsIPR006689 - Small GTPase superfamily, ARF/SAR type
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR027417 - P-loop containing nucleoside triphosphate hydrolase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]4.7e-1333.1Show/hide
Query:  VDQWWNSNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVE
        ++ WW +    G+ G+  M++LK L   +K W +         K A  +E+  ID  E     +E    +R+ +KA+L  I+  E  +W QKCK  W  E
Subjt:  VDQWWNSNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVE

Query:  GDVNSAFFHRIVTANRRKNAIFEVLSASGINQQLAASLASSF
        GD NS+FFH+I TA ++K  I ++++ SG N    + +A +F
Subjt:  GDVNSAFFHRIVTANRRKNAIFEVLSASGINQQLAASLASSF

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]6.0e-8538.57Show/hide
Query:  NRRKNAIFEVLSASGINQQLAASLASSFGCKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAP
        N  K+ IF +     +    A S+  S+G   G  P TYLG+PL G P S NFW  +++KI+K+L +W  + +SKGGR TLI +TL +LP+Y LS+F  P
Subjt:  NRRKNAIFEVLSASGINQQLAASLASSFGCKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAP

Query:  MKVTKAIEKLFRNYLWRGAHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKYGAAKNNLKPGISSLHSSKGPWK
          + + IE  +RN+LW G        L+ W+ + SP E+GGLG++ +H  N +LL KW+W+F  EKE +W+++I++KY   K    P      S+  PWK
Subjt:  MKVTKAIEKLFRNYLWRGAHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKYGAAKNNLKPGISSLHSSKGPWK

Query:  AIYKLQNLIYRSISIKVGNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKW
        A+    +  Y++I  KV +G    FW + W G S L +  P L+ LS  K+G + D WNPS K WN+ + R L + E + W ++   L +        K 
Subjt:  AIYKLQNLIYRSISIKVGNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKW

Query:  IWSLDKSGVFSTGSLTKEL---ASSKNTSNVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVT
        +W L+ + +F T S+ K+L   ++S    +  LY+ LWK   PKK KFFIW L H CINTAD +Q+++P+ +LSP+  C +C ++ E   HLF  C Y  
Subjt:  IWSLDKSGVFSTGSLTKEL---ASSKNTSNVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVT

Query:  SFWDTLQ
          W   Q
Subjt:  SFWDTLQ

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.3e-1234.35Show/hide
Query:  VDQWWNSNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVE
        ++ WW +    G+ G+  M++LK L  ++K W ++        K A  +E+++ID  E     +E    +R  +KA+L  I+  E  +W QKCK  W  E
Subjt:  VDQWWNSNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVE

Query:  GDVNSAFFHRIVTANRRKNAIFEVLSASGIN
        GD NS+FFH+I TA ++K  I +V++  G N
Subjt:  GDVNSAFFHRIVTANRRKNAIFEVLSASGIN

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.0e-8438.57Show/hide
Query:  NRRKNAIFEVLSASGINQQLAASLASSFGCKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAP
        N  K+ IF +     +    A S+  S+G   G  P TYLG+PL G P S NFW  +++KI+K+L +W  + +SKGGR TLI +TL +LP+Y LS+F  P
Subjt:  NRRKNAIFEVLSASGINQQLAASLASSFGCKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAP

Query:  MKVTKAIEKLFRNYLWRGAHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKYGAAKNNLKPGISSLHSSKGPWK
          + + IE  +RN+LW G        L+ W+ + SP E+GGLG++ +H  N +LL KW+W+F  EKE +W+++I++KY   K    P      S+  PWK
Subjt:  MKVTKAIEKLFRNYLWRGAHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKYGAAKNNLKPGISSLHSSKGPWK

Query:  AIYKLQNLIYRSISIKVGNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKW
        A+    +  Y++I  KV +G    FW + W G S L +  P L+ LS  K+G + D WNPS K WN+ + R L + E + W ++   L +        K 
Subjt:  AIYKLQNLIYRSISIKVGNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKW

Query:  IWSLDKSGVFSTGSLTKEL---ASSKNTSNVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVT
        +W L+ + +F T S+ K+L   ++S    +  LY+ LWK   PKK KFFIW L H CINTAD +Q+++P+ +LSP+  C +C ++ E   HLF  C Y  
Subjt:  IWSLDKSGVFSTGSLTKEL---ASSKNTSNVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVT

Query:  SFWDTLQ
          W   Q
Subjt:  SFWDTLQ

KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.0e-0734.51Show/hide
Query:  MQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVEGDVNSAFFHRIVTANRRK
        M++LK L  ++K W ++        K A  +E+ +ID  E     +E   ++R  +KA+L  I+  +  +W QKCK  W  EGD NS+FFH+I T  ++K
Subjt:  MQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVEGDVNSAFFHRIVTANRRK

Query:  NAIFEVLSASGIN
          I +V++  G N
Subjt:  NAIFEVLSASGIN

KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.3e-8335.44Show/hide
Query:  LSRELSVIDNKEEIDHLS-EQDINRRSIIKAE--LMNISAKEEIVWRQKCKLKWFVEGDVNSAFFHRIVTANRRKNAIFEVLSASGINQQLAASLASSFG
        LSR L+ + +K +I+ +    ++N   I+ A+  L+ +  +++ V   K  L  F             +  N  K+ IF +     +    A S+A S+G
Subjt:  LSRELSVIDNKEEIDHLS-EQDINRRSIIKAE--LMNISAKEEIVWRQKCKLKWFVEGDVNSAFFHRIVTANRRKNAIFEVLSASGINQQLAASLASSFG

Query:  CKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAPMKVTKAIEKLFRNYLWRGAHRNRGSHLLN
           G  P +YLG+PL G P S NFW  V++KI+K+L NW  + +SKGGR TLI +TL +LP+Y +S+F  P  + + IE  +RN+LW GA       L+ 
Subjt:  CKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAPMKVTKAIEKLFRNYLWRGAHRNRGSHLLN

Query:  WSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKYGAAKNNLKPGISSLHSSKGPWKAIYKLQNLIYRSISIKVGNGNSTLFWEEV
        W+ + SP E+GGLG++ ++  N +LL KW+W+F  EK+ +W+++I++KY   K    P      S+  PWKA+ +  +  Y++IS KV +G    FW + 
Subjt:  WSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKYGAAKNNLKPGISSLHSSKGPWKAIYKLQNLIYRSISIKVGNGNSTLFWEEV

Query:  WLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKWIWSLDKSGVFSTGSLTKELAS---SKNTS
        W G + L +  P L+ LS  K+G + +FWNPS   W+L + R L + E + W ++   L +        K +W+L+ + +F T S+ + +A    S    
Subjt:  WLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKWIWSLDKSGVFSTGSLTKELAS---SKNTS

Query:  NVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVTSFW
        + +LY+ LWK   PKK KFFIW L H CINTAD +Q+++P+ +LSP+  C +C ++ E   HLF  C Y    W
Subjt:  NVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVTSFW

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]9.2e-8622.34Show/hide
Query:  PISFRIERKSFSITADPRNPRC-YRLTEATRDKSFSLTLSKEALYWIKASFSKLCDLPLNQKFFTEKRVEDATLWVEKITNKKDHSAEIAKLGNNGGLNK
        P S ++ERK F +  D  +    Y LTE    K+FS+ +S   L WI+ +   L   P   +FF E R  +  +W+ K  N K  +AEI ++      + 
Subjt:  PISFRIERKSFSITADPRNPRC-YRLTEATRDKSFSLTLSKEALYWIKASFSKLCDLPLNQKFFTEKRVEDATLWVEKITNKKDHSAEIAKLGNNGGLNK

Query:  IIIPVGDERKGWKSFTECINSLT-------------NNPSASLPPPIYDVKDNTSYKAALKAPPREPRNQPSAVPILPEESASLNPF----ATLFLSSAV
        I++P G ++ GW SF   I                  +P   L PPI D    +  KA  +  P    +   +        +S N F    ++  L + V
Subjt:  IIIPVGDERKGWKSFTECINSLT-------------NNPSASLPPPIYDVKDNTSYKAALKAPPREPRNQPSAVPILPEESASLNPF----ATLFLSSAV

Query:  IVQRRHFHDNWLDIMRALQQHLSAYASVSPLQPDKALLACENEEQAQVLAKIKDWYKVGKFQVRFFPWNSEIMNGDRKVPSYGGWIRVRNLPIDKWSIEA
        ++ RR FHD+W  I++ L++      + +    +KAL+   +   A +L + K W  VGK+ VRF  W+       + +PSYGGW   R +P+  W++  
Subjt:  IVQRRHFHDNWLDIMRALQQHLSAYASVSPLQPDKALLACENEEQAQVLAKIKDWYKVGKFQVRFFPWNSEIMNGDRKVPSYGGWIRVRNLPIDKWSIEA

Query:  FKLIGDECGGYLETTTKTLTRLDMMEIMIKVKTNHTGFIPAEVHIPSTSSSPSKVSI-----------------------------------DPFFME--
        F+ IG  C G ++   +T +  +++E  IKV+ N++GF+PA V I     +   V +                                   + FF E  
Subjt:  FKLIGDECGGYLETTTKTLTRLDMMEIMIKVKTNHTGFIPAEVHIPSTSSSPSKVSI-----------------------------------DPFFME--

Query:  -----DYYIGYIAGIHGKIPTGPKEY-----------------------DDSIIGKSGKS-------MDNRGASQPRAAQAKEIEKGDNPQISYSCALQG
             D+      G     P  P                          D ++   + KS       + N G    +  Q  +I+   N  ++   + + 
Subjt:  -----DYYIGYIAGIHGKIPTGPKEY-----------------------DDSIIGKSGKS-------MDNRGASQPRAAQAKEIEKGDNPQISYSCALQG

Query:  TPFQ--VCHTEDYQTQSEWAPHLPE-DHVIPQSIVATTGGSKKPCPLTLPQSEAHPNQ-----QP------PRQAPKSPL--------YPDGPPQ-----
          F      T  +   S  A H P  +    +  V+     KK    T P S+A+ N+     QP       R A K  L         P   P      
Subjt:  TPFQ--VCHTEDYQTQSEWAPHLPE-DHVIPQSIVATTGGSKKPCPLTLPQSEAHPNQ-----QP------PRQAPKSPL--------YPDGPPQ-----

Query:  ----------IYTNQPVCP-NPEAPIDINPSTISPPASPIPSIKNHPRRK------------------KPIIIN--NKETFLLSGTVHSTGSTFH-----
                    TN  V P  PE  + +N ++ S   +     K+  +RK                  K  +++   K    LS    S+G+T       
Subjt:  ----------IYTNQPVCP-NPEAPIDINPSTISPPASPIPSIKNHPRRK------------------KPIIIN--NKETFLLSGTVHSTGSTFH-----

Query:  -------------------------------------------------LSDSEGALSSPCSPNLDES---------------------PPIHHQKSSNQ
                                                         LS  EG  S   +  L+ +                       +H+ +  N 
Subjt:  -------------------------------------------------LSDSEGALSSPCSPNLDES---------------------PPIHHQKSSNQ

Query:  AD------------SPPSISFLFESSEDQALDIDNPIPLMIEKPSAAG--IQQNIEMETT-ALVDIDVEELSGED--SVSKTHFQHRDAAAYLPFLFPWL
                         S S L  S   + L+      L+I+ P         N+    T + +D  +   S E+  S   T    R  + + P +    
Subjt:  AD------------SPPSISFLFESSEDQALDIDNPIPLMIEKPSAAG--IQQNIEMETT-ALVDIDVEELSGED--SVSKTHFQHRDAAAYLPFLFPWL

Query:  ADHGMCIMPMPNRNKLSTAAKKKTQWMVDQWWNSNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSI
        ++  +   P+P R    T +  + +  + +WW +++  G+PG   +Q+LK L   +K W ++  +S    K A+ RE+  ID KE    L++++ NRR  
Subjt:  ADHGMCIMPMPNRNKLSTAAKKKTQWMVDQWWNSNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSI

Query:  IKAELMNISAKEEIVWRQKCKLKWFVEGDVNSAFFHRIVTANRRKNAIFEV-------------------------------------------------
        +KA+L  +S KE   W Q+ K  W  EGD NS+FFHRI ++ ++++ I E+                                                 
Subjt:  IKAELMNISAKEEIVWRQKCKLKWFVEGDVNSAFFHRIVTANRRKNAIFEV-------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------LSASGI------------------------
                                                                              L +SG                         
Subjt:  ----------------------------------------------------------------------LSASGI------------------------

Query:  ---------NQQLAASL-----------------------------ASSFGCKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGR
                 N ++A SL                             AS +G    S P++YLG+PL GNP+S  FW  V +KI+K+L NW    ISKGGR
Subjt:  ---------NQQLAASL-----------------------------ASSFGCKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGR

Query:  HTLIQATLANLPVYYLSIFLAPMKVTKAIEKLFRNYLWRGAHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKY
         TLI++TL++LP+Y LS+F AP    K IEKL+R +LW+G + + GSHL+NW+ +    EEGGLG+  ++  N +LL+KW+WR+  E  A+WR++I  KY
Subjt:  HTLIQATLANLPVYYLSIFLAPMKVTKAIEKLFRNYLWRGAHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKY

Query:  -GAAKNNLKPGISSLHSSKGPWKAIYKLQNLIYRSISIKVGNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVE
         G    ++   ISS  +SK PW++I    +    + S  + NG+   FW   W     L   YP L+ L+L KE  + D WN     WN+  RR LN+ E
Subjt:  -GAAKNNLKPGISSLHSSKGPWKAIYKLQNLIYRSISIKVGNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVE

Query:  SSEWASMSHYLFSFTIKEEVDKWIWSLDKSGVFSTGS----LTKELASSKNTSNVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPS
           W  +   L +        K  W  D +  FS  S    ++++L  +       L  ++WK S+P K+KFF+W L    INT +V+Q+KMP+  L P+
Subjt:  SSEWASMSHYLFSFTIKEEVDKWIWSLDKSGVFSTGS----LTKELASSKNTSNVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPS

Query:  C
        C
Subjt:  C

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.9e-8722.92Show/hide
Query:  PISFRIERKSFSITADPRNPRC-YRLTEATRDKSFSLTLSKEALYWIKASFSKLCDLPLNQKFFTEKRVEDATLWVEKITNKKDHSAEIAKLGNNGGLNK
        P S +IERK F +  D       Y LTE    K+FS+ +S   L WI+++   L + P + +FF E R  +  +W+ K  N K  +AEI ++ +    + 
Subjt:  PISFRIERKSFSITADPRNPRC-YRLTEATRDKSFSLTLSKEALYWIKASFSKLCDLPLNQKFFTEKRVEDATLWVEKITNKKDHSAEIAKLGNNGGLNK

Query:  IIIPVGDERKGWKSFTECINSLT-------------NNPSASLPPPIYDVKDNTSYKAALKAPPREPRNQPSAVPILPEESASLN-----PFATLFLSSA
        I++P G E+  W SF   I                 ++P   L PPI D    +  KA  +       +   +        +S N     PF  L L + 
Subjt:  IIIPVGDERKGWKSFTECINSLT-------------NNPSASLPPPIYDVKDNTSYKAALKAPPREPRNQPSAVPILPEESASLN-----PFATLFLSSA

Query:  VIVQRRHFHDNWLDIMRALQQHLSAYASVSPLQPDKALLACENEEQAQVLAKIKDWYKVGKFQVRFFPWNSEIMNGDRKVPSYGGWIRVRNLPIDKWSIE
        V++ RR FHD+W  I++ L++      + +    +K L+   +   A +L + K W  VGK+ VRF  W        + +PSYGGW   R +P+  W++ 
Subjt:  VIVQRRHFHDNWLDIMRALQQHLSAYASVSPLQPDKALLACENEEQAQVLAKIKDWYKVGKFQVRFFPWNSEIMNGDRKVPSYGGWIRVRNLPIDKWSIE

Query:  AFKLIGDECGGYLETTTKTLTRLDMMEIMIKVKTNHTGFIPAEVHIPSTSSSPSKVSIDPFFMEDYYIGYIAGIHGKIP-TGPKEYDD------------
         F+ IG  CGG ++   +T T  +++E  +K++ N++GF+PA V I     +   V +       + +     +HG         +DD            
Subjt:  AFKLIGDECGGYLETTTKTLTRLDMMEIMIKVKTNHTGFIPAEVHIPSTSSSPSKVSIDPFFMEDYYIGYIAGIHGKIP-TGPKEYDD------------

Query:  ------SIIGKSGKSMDNRGASQPRAAQAKEIEKGD--------NPQISYSCALQGT---------------------------PFQVC-----------
               ++     S  +    QP A ++  I+           N ++    +L  T                           P Q+            
Subjt:  ------SIIGKSGKSMDNRGASQPRAAQAKEIEKGD--------NPQISYSCALQGT---------------------------PFQVC-----------

Query:  --------HTEDYQTQSEWAPHLPE--------------DHVIPQSIVATTG-----------------GSKKPCPLTLPQSEAHPNQQPPRQAPKSPLY
                 T  +   S  A H PE                 I   + A  G                  SKK   LT+      P   P +        
Subjt:  --------HTEDYQTQSEWAPHLPE--------------DHVIPQSIVATTG-----------------GSKKPCPLTLPQSEAHPNQQPPRQAPKSPLY

Query:  PDGPPQIYTNQPVCP-NPEAPI-DINPSTISPPASPIPSIKNHPRR------------------KKPIIINNKETFL-LSGTVHSTGSTFHLSDSEGALS
         +      TN  V P  PE  + D   S  SP  +      +H RR                  K  ++   KE  L LS    S+G+T   +     L 
Subjt:  PDGPPQIYTNQPVCP-NPEAPI-DINPSTISPPASPIPSIKNHPRR------------------KKPIIINNKETFL-LSGTVHSTGSTFHLSDSEGALS

Query:  SPCSPNLDESPPIHHQKSSNQADSPPSISFLFES---------------------------------SEDQALDIDNPIPLMIEKPSAAGIQQ------N
        S     L      HH   S Q +   S+S  F S                                 S    +  D  +  M E+ +A           N
Subjt:  SPCSPNLDESPPIHHQKSSNQADSPPSISFLFES---------------------------------SEDQALDIDNPIPLMIEKPSAAGIQQ------N

Query:  IEMETTALVDIDV-------EELSGEDSVSKTH---FQHRDAAAYLPFL-----------FPWLADHGMCIM---PMPNRNKLSTAAKKKTQWMVDQWWN
          +    L+D  +         L    + S+     +  R    + P +           FP + +     +   P P R         + +  +++WW 
Subjt:  IEMETTALVDIDV-------EELSGEDSVSKTH---FQHRDAAAYLPFL-----------FPWLADHGMCIM---PMPNRNKLSTAAKKKTQWMVDQWWN

Query:  SNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVEGDVNSA
         ++  G PG   +Q+LK L   +K W ++ F S    K  + RE+  ID  E    LS ++ NRR  +KAEL ++S KE   W Q+ K  W  EGD NSA
Subjt:  SNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVEGDVNSA

Query:  FFHRIVTANRRKNAIFEVLSASG-----------------------------------------------------------------------------
        FFHRI ++ +++N I E+    G                                                                             
Subjt:  FFHRIVTANRRKNAIFEVLSASG-----------------------------------------------------------------------------

Query:  -----------------------------------------------------------INQQLAASLAS------------------------------
                                                                   I + +A +L++                              
Subjt:  -----------------------------------------------------------INQQLAASLAS------------------------------

Query:  -------------SFGCKL------------------------GSW------------------------------------------------------
                      F  KL                         SW                                                      
Subjt:  -------------SFGCKL------------------------GSW------------------------------------------------------

Query:  -----------------PITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAPMKVTKAIEKLFRNYLWRG
                         P+TYLG+PL GNP+S  FW  + ++I+K+L NW   HISKGGR TLI++TL++LP+Y LS+F AP    K IEKL+RN+LW+G
Subjt:  -----------------PITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAPMKVTKAIEKLFRNYLWRG

Query:  AHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKY-GAAKNNLKPGISSLHSSKGPWKAIYKLQNLIYRSISIKV
        +   +GSHL+NWS++  P EEGGLG+  +   N +LL+KW+WR++ E  ++WR++I  KY G    +L   ISS  SSK PW++I    +    +    +
Subjt:  AHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKY-GAAKNNLKPGISSLHSSKGPWKAIYKLQNLIYRSISIKV

Query:  GNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKWIWSLDKSGVFSTGSLTK
         NG+   FW   W     L   YP L+ LS+ KE  I D WN +   W +  RR LN+ E S W  +   L          K  W  D    FS  S   
Subjt:  GNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKWIWSLDKSGVFSTGSLTK

Query:  ELASSKNTS----NVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVTSFWDTLQQA
         ++   + S     V L  L+WK  +P K+KFF+W L    +NT +V     P+        C LC + SE+  HLF  C  V   W  L ++
Subjt:  ELASSKNTS----NVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVTSFWDTLQQA

TrEMBL top hitse value%identityAlignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein2.3e-1333.1Show/hide
Query:  VDQWWNSNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVE
        ++ WW +    G+ G+  M++LK L   +K W +         K A  +E+  ID  E     +E    +R+ +KA+L  I+  E  +W QKCK  W  E
Subjt:  VDQWWNSNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVE

Query:  GDVNSAFFHRIVTANRRKNAIFEVLSASGINQQLAASLASSF
        GD NS+FFH+I TA ++K  I ++++ SG N    + +A +F
Subjt:  GDVNSAFFHRIVTANRRKNAIFEVLSASGINQQLAASLASSF

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein2.9e-8538.57Show/hide
Query:  NRRKNAIFEVLSASGINQQLAASLASSFGCKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAP
        N  K+ IF +     +    A S+  S+G   G  P TYLG+PL G P S NFW  +++KI+K+L +W  + +SKGGR TLI +TL +LP+Y LS+F  P
Subjt:  NRRKNAIFEVLSASGINQQLAASLASSFGCKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAP

Query:  MKVTKAIEKLFRNYLWRGAHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKYGAAKNNLKPGISSLHSSKGPWK
          + + IE  +RN+LW G        L+ W+ + SP E+GGLG++ +H  N +LL KW+W+F  EKE +W+++I++KY   K    P      S+  PWK
Subjt:  MKVTKAIEKLFRNYLWRGAHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKYGAAKNNLKPGISSLHSSKGPWK

Query:  AIYKLQNLIYRSISIKVGNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKW
        A+    +  Y++I  KV +G    FW + W G S L +  P L+ LS  K+G + D WNPS K WN+ + R L + E + W ++   L +        K 
Subjt:  AIYKLQNLIYRSISIKVGNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKW

Query:  IWSLDKSGVFSTGSLTKEL---ASSKNTSNVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVT
        +W L+ + +F T S+ K+L   ++S    +  LY+ LWK   PKK KFFIW L H CINTAD +Q+++P+ +LSP+  C +C ++ E   HLF  C Y  
Subjt:  IWSLDKSGVFSTGSLTKEL---ASSKNTSNVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVT

Query:  SFWDTLQ
          W   Q
Subjt:  SFWDTLQ

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein1.1e-1234.35Show/hide
Query:  VDQWWNSNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVE
        ++ WW +    G+ G+  M++LK L  ++K W ++        K A  +E+++ID  E     +E    +R  +KA+L  I+  E  +W QKCK  W  E
Subjt:  VDQWWNSNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVE

Query:  GDVNSAFFHRIVTANRRKNAIFEVLSASGIN
        GD NS+FFH+I TA ++K  I +V++  G N
Subjt:  GDVNSAFFHRIVTANRRKNAIFEVLSASGIN

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein4.9e-8538.57Show/hide
Query:  NRRKNAIFEVLSASGINQQLAASLASSFGCKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAP
        N  K+ IF +     +    A S+  S+G   G  P TYLG+PL G P S NFW  +++KI+K+L +W  + +SKGGR TLI +TL +LP+Y LS+F  P
Subjt:  NRRKNAIFEVLSASGINQQLAASLASSFGCKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAP

Query:  MKVTKAIEKLFRNYLWRGAHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKYGAAKNNLKPGISSLHSSKGPWK
          + + IE  +RN+LW G        L+ W+ + SP E+GGLG++ +H  N +LL KW+W+F  EKE +W+++I++KY   K    P      S+  PWK
Subjt:  MKVTKAIEKLFRNYLWRGAHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKYGAAKNNLKPGISSLHSSKGPWK

Query:  AIYKLQNLIYRSISIKVGNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKW
        A+    +  Y++I  KV +G    FW + W G S L +  P L+ LS  K+G + D WNPS K WN+ + R L + E + W ++   L +        K 
Subjt:  AIYKLQNLIYRSISIKVGNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKW

Query:  IWSLDKSGVFSTGSLTKEL---ASSKNTSNVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVT
        +W L+ + +F T S+ K+L   ++S    +  LY+ LWK   PKK KFFIW L H CINTAD +Q+++P+ +LSP+  C +C ++ E   HLF  C Y  
Subjt:  IWSLDKSGVFSTGSLTKEL---ASSKNTSNVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVT

Query:  SFWDTLQ
          W   Q
Subjt:  SFWDTLQ

A0A5A7TR15 LINE-1 retrotransposable element ORF2 protein4.9e-0834.51Show/hide
Query:  MQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVEGDVNSAFFHRIVTANRRK
        M++LK L  ++K W ++        K A  +E+ +ID  E     +E   ++R  +KA+L  I+  +  +W QKCK  W  EGD NS+FFH+I T  ++K
Subjt:  MQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVEGDVNSAFFHRIVTANRRK

Query:  NAIFEVLSASGIN
          I +V++  G N
Subjt:  NAIFEVLSASGIN

A0A5A7TR15 LINE-1 retrotransposable element ORF2 protein1.6e-8335.44Show/hide
Query:  LSRELSVIDNKEEIDHLS-EQDINRRSIIKAE--LMNISAKEEIVWRQKCKLKWFVEGDVNSAFFHRIVTANRRKNAIFEVLSASGINQQLAASLASSFG
        LSR L+ + +K +I+ +    ++N   I+ A+  L+ +  +++ V   K  L  F             +  N  K+ IF +     +    A S+A S+G
Subjt:  LSRELSVIDNKEEIDHLS-EQDINRRSIIKAE--LMNISAKEEIVWRQKCKLKWFVEGDVNSAFFHRIVTANRRKNAIFEVLSASGINQQLAASLASSFG

Query:  CKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAPMKVTKAIEKLFRNYLWRGAHRNRGSHLLN
           G  P +YLG+PL G P S NFW  V++KI+K+L NW  + +SKGGR TLI +TL +LP+Y +S+F  P  + + IE  +RN+LW GA       L+ 
Subjt:  CKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAPMKVTKAIEKLFRNYLWRGAHRNRGSHLLN

Query:  WSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKYGAAKNNLKPGISSLHSSKGPWKAIYKLQNLIYRSISIKVGNGNSTLFWEEV
        W+ + SP E+GGLG++ ++  N +LL KW+W+F  EK+ +W+++I++KY   K    P      S+  PWKA+ +  +  Y++IS KV +G    FW + 
Subjt:  WSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKYGAAKNNLKPGISSLHSSKGPWKAIYKLQNLIYRSISIKVGNGNSTLFWEEV

Query:  WLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKWIWSLDKSGVFSTGSLTKELAS---SKNTS
        W G + L +  P L+ LS  K+G + +FWNPS   W+L + R L + E + W ++   L +        K +W+L+ + +F T S+ + +A    S    
Subjt:  WLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKWIWSLDKSGVFSTGSLTKELAS---SKNTS

Query:  NVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVTSFW
        + +LY+ LWK   PKK KFFIW L H CINTAD +Q+++P+ +LSP+  C +C ++ E   HLF  C Y    W
Subjt:  NVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVTSFW

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein1.4e-8722.92Show/hide
Query:  PISFRIERKSFSITADPRNPRC-YRLTEATRDKSFSLTLSKEALYWIKASFSKLCDLPLNQKFFTEKRVEDATLWVEKITNKKDHSAEIAKLGNNGGLNK
        P S +IERK F +  D       Y LTE    K+FS+ +S   L WI+++   L + P + +FF E R  +  +W+ K  N K  +AEI ++ +    + 
Subjt:  PISFRIERKSFSITADPRNPRC-YRLTEATRDKSFSLTLSKEALYWIKASFSKLCDLPLNQKFFTEKRVEDATLWVEKITNKKDHSAEIAKLGNNGGLNK

Query:  IIIPVGDERKGWKSFTECINSLT-------------NNPSASLPPPIYDVKDNTSYKAALKAPPREPRNQPSAVPILPEESASLN-----PFATLFLSSA
        I++P G E+  W SF   I                 ++P   L PPI D    +  KA  +       +   +        +S N     PF  L L + 
Subjt:  IIIPVGDERKGWKSFTECINSLT-------------NNPSASLPPPIYDVKDNTSYKAALKAPPREPRNQPSAVPILPEESASLN-----PFATLFLSSA

Query:  VIVQRRHFHDNWLDIMRALQQHLSAYASVSPLQPDKALLACENEEQAQVLAKIKDWYKVGKFQVRFFPWNSEIMNGDRKVPSYGGWIRVRNLPIDKWSIE
        V++ RR FHD+W  I++ L++      + +    +K L+   +   A +L + K W  VGK+ VRF  W        + +PSYGGW   R +P+  W++ 
Subjt:  VIVQRRHFHDNWLDIMRALQQHLSAYASVSPLQPDKALLACENEEQAQVLAKIKDWYKVGKFQVRFFPWNSEIMNGDRKVPSYGGWIRVRNLPIDKWSIE

Query:  AFKLIGDECGGYLETTTKTLTRLDMMEIMIKVKTNHTGFIPAEVHIPSTSSSPSKVSIDPFFMEDYYIGYIAGIHGKIP-TGPKEYDD------------
         F+ IG  CGG ++   +T T  +++E  +K++ N++GF+PA V I     +   V +       + +     +HG         +DD            
Subjt:  AFKLIGDECGGYLETTTKTLTRLDMMEIMIKVKTNHTGFIPAEVHIPSTSSSPSKVSIDPFFMEDYYIGYIAGIHGKIP-TGPKEYDD------------

Query:  ------SIIGKSGKSMDNRGASQPRAAQAKEIEKGD--------NPQISYSCALQGT---------------------------PFQVC-----------
               ++     S  +    QP A ++  I+           N ++    +L  T                           P Q+            
Subjt:  ------SIIGKSGKSMDNRGASQPRAAQAKEIEKGD--------NPQISYSCALQGT---------------------------PFQVC-----------

Query:  --------HTEDYQTQSEWAPHLPE--------------DHVIPQSIVATTG-----------------GSKKPCPLTLPQSEAHPNQQPPRQAPKSPLY
                 T  +   S  A H PE                 I   + A  G                  SKK   LT+      P   P +        
Subjt:  --------HTEDYQTQSEWAPHLPE--------------DHVIPQSIVATTG-----------------GSKKPCPLTLPQSEAHPNQQPPRQAPKSPLY

Query:  PDGPPQIYTNQPVCP-NPEAPI-DINPSTISPPASPIPSIKNHPRR------------------KKPIIINNKETFL-LSGTVHSTGSTFHLSDSEGALS
         +      TN  V P  PE  + D   S  SP  +      +H RR                  K  ++   KE  L LS    S+G+T   +     L 
Subjt:  PDGPPQIYTNQPVCP-NPEAPI-DINPSTISPPASPIPSIKNHPRR------------------KKPIIINNKETFL-LSGTVHSTGSTFHLSDSEGALS

Query:  SPCSPNLDESPPIHHQKSSNQADSPPSISFLFES---------------------------------SEDQALDIDNPIPLMIEKPSAAGIQQ------N
        S     L      HH   S Q +   S+S  F S                                 S    +  D  +  M E+ +A           N
Subjt:  SPCSPNLDESPPIHHQKSSNQADSPPSISFLFES---------------------------------SEDQALDIDNPIPLMIEKPSAAGIQQ------N

Query:  IEMETTALVDIDV-------EELSGEDSVSKTH---FQHRDAAAYLPFL-----------FPWLADHGMCIM---PMPNRNKLSTAAKKKTQWMVDQWWN
          +    L+D  +         L    + S+     +  R    + P +           FP + +     +   P P R         + +  +++WW 
Subjt:  IEMETTALVDIDV-------EELSGEDSVSKTH---FQHRDAAAYLPFL-----------FPWLADHGMCIM---PMPNRNKLSTAAKKKTQWMVDQWWN

Query:  SNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVEGDVNSA
         ++  G PG   +Q+LK L   +K W ++ F S    K  + RE+  ID  E    LS ++ NRR  +KAEL ++S KE   W Q+ K  W  EGD NSA
Subjt:  SNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVEGDVNSA

Query:  FFHRIVTANRRKNAIFEVLSASG-----------------------------------------------------------------------------
        FFHRI ++ +++N I E+    G                                                                             
Subjt:  FFHRIVTANRRKNAIFEVLSASG-----------------------------------------------------------------------------

Query:  -----------------------------------------------------------INQQLAASLAS------------------------------
                                                                   I + +A +L++                              
Subjt:  -----------------------------------------------------------INQQLAASLAS------------------------------

Query:  -------------SFGCKL------------------------GSW------------------------------------------------------
                      F  KL                         SW                                                      
Subjt:  -------------SFGCKL------------------------GSW------------------------------------------------------

Query:  -----------------PITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAPMKVTKAIEKLFRNYLWRG
                         P+TYLG+PL GNP+S  FW  + ++I+K+L NW   HISKGGR TLI++TL++LP+Y LS+F AP    K IEKL+RN+LW+G
Subjt:  -----------------PITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAPMKVTKAIEKLFRNYLWRG

Query:  AHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKY-GAAKNNLKPGISSLHSSKGPWKAIYKLQNLIYRSISIKV
        +   +GSHL+NWS++  P EEGGLG+  +   N +LL+KW+WR++ E  ++WR++I  KY G    +L   ISS  SSK PW++I    +    +    +
Subjt:  AHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKY-GAAKNNLKPGISSLHSSKGPWKAIYKLQNLIYRSISIKV

Query:  GNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKWIWSLDKSGVFSTGSLTK
         NG+   FW   W     L   YP L+ LS+ KE  I D WN +   W +  RR LN+ E S W  +   L          K  W  D    FS  S   
Subjt:  GNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKWIWSLDKSGVFSTGSLTK

Query:  ELASSKNTS----NVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVTSFWDTLQQA
         ++   + S     V L  L+WK  +P K+KFF+W L    +NT +V     P+        C LC + SE+  HLF  C  V   W  L ++
Subjt:  ELASSKNTS----NVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVTSFWDTLQQA

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein4.5e-8622.34Show/hide
Query:  PISFRIERKSFSITADPRNPRC-YRLTEATRDKSFSLTLSKEALYWIKASFSKLCDLPLNQKFFTEKRVEDATLWVEKITNKKDHSAEIAKLGNNGGLNK
        P S ++ERK F +  D  +    Y LTE    K+FS+ +S   L WI+ +   L   P   +FF E R  +  +W+ K  N K  +AEI ++      + 
Subjt:  PISFRIERKSFSITADPRNPRC-YRLTEATRDKSFSLTLSKEALYWIKASFSKLCDLPLNQKFFTEKRVEDATLWVEKITNKKDHSAEIAKLGNNGGLNK

Query:  IIIPVGDERKGWKSFTECINSLT-------------NNPSASLPPPIYDVKDNTSYKAALKAPPREPRNQPSAVPILPEESASLNPF----ATLFLSSAV
        I++P G ++ GW SF   I                  +P   L PPI D    +  KA  +  P    +   +        +S N F    ++  L + V
Subjt:  IIIPVGDERKGWKSFTECINSLT-------------NNPSASLPPPIYDVKDNTSYKAALKAPPREPRNQPSAVPILPEESASLNPF----ATLFLSSAV

Query:  IVQRRHFHDNWLDIMRALQQHLSAYASVSPLQPDKALLACENEEQAQVLAKIKDWYKVGKFQVRFFPWNSEIMNGDRKVPSYGGWIRVRNLPIDKWSIEA
        ++ RR FHD+W  I++ L++      + +    +KAL+   +   A +L + K W  VGK+ VRF  W+       + +PSYGGW   R +P+  W++  
Subjt:  IVQRRHFHDNWLDIMRALQQHLSAYASVSPLQPDKALLACENEEQAQVLAKIKDWYKVGKFQVRFFPWNSEIMNGDRKVPSYGGWIRVRNLPIDKWSIEA

Query:  FKLIGDECGGYLETTTKTLTRLDMMEIMIKVKTNHTGFIPAEVHIPSTSSSPSKVSI-----------------------------------DPFFME--
        F+ IG  C G ++   +T +  +++E  IKV+ N++GF+PA V I     +   V +                                   + FF E  
Subjt:  FKLIGDECGGYLETTTKTLTRLDMMEIMIKVKTNHTGFIPAEVHIPSTSSSPSKVSI-----------------------------------DPFFME--

Query:  -----DYYIGYIAGIHGKIPTGPKEY-----------------------DDSIIGKSGKS-------MDNRGASQPRAAQAKEIEKGDNPQISYSCALQG
             D+      G     P  P                          D ++   + KS       + N G    +  Q  +I+   N  ++   + + 
Subjt:  -----DYYIGYIAGIHGKIPTGPKEY-----------------------DDSIIGKSGKS-------MDNRGASQPRAAQAKEIEKGDNPQISYSCALQG

Query:  TPFQ--VCHTEDYQTQSEWAPHLPE-DHVIPQSIVATTGGSKKPCPLTLPQSEAHPNQ-----QP------PRQAPKSPL--------YPDGPPQ-----
          F      T  +   S  A H P  +    +  V+     KK    T P S+A+ N+     QP       R A K  L         P   P      
Subjt:  TPFQ--VCHTEDYQTQSEWAPHLPE-DHVIPQSIVATTGGSKKPCPLTLPQSEAHPNQ-----QP------PRQAPKSPL--------YPDGPPQ-----

Query:  ----------IYTNQPVCP-NPEAPIDINPSTISPPASPIPSIKNHPRRK------------------KPIIIN--NKETFLLSGTVHSTGSTFH-----
                    TN  V P  PE  + +N ++ S   +     K+  +RK                  K  +++   K    LS    S+G+T       
Subjt:  ----------IYTNQPVCP-NPEAPIDINPSTISPPASPIPSIKNHPRRK------------------KPIIIN--NKETFLLSGTVHSTGSTFH-----

Query:  -------------------------------------------------LSDSEGALSSPCSPNLDES---------------------PPIHHQKSSNQ
                                                         LS  EG  S   +  L+ +                       +H+ +  N 
Subjt:  -------------------------------------------------LSDSEGALSSPCSPNLDES---------------------PPIHHQKSSNQ

Query:  AD------------SPPSISFLFESSEDQALDIDNPIPLMIEKPSAAG--IQQNIEMETT-ALVDIDVEELSGED--SVSKTHFQHRDAAAYLPFLFPWL
                         S S L  S   + L+      L+I+ P         N+    T + +D  +   S E+  S   T    R  + + P +    
Subjt:  AD------------SPPSISFLFESSEDQALDIDNPIPLMIEKPSAAG--IQQNIEMETT-ALVDIDVEELSGED--SVSKTHFQHRDAAAYLPFLFPWL

Query:  ADHGMCIMPMPNRNKLSTAAKKKTQWMVDQWWNSNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSI
        ++  +   P+P R    T +  + +  + +WW +++  G+PG   +Q+LK L   +K W ++  +S    K A+ RE+  ID KE    L++++ NRR  
Subjt:  ADHGMCIMPMPNRNKLSTAAKKKTQWMVDQWWNSNLLQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSI

Query:  IKAELMNISAKEEIVWRQKCKLKWFVEGDVNSAFFHRIVTANRRKNAIFEV-------------------------------------------------
        +KA+L  +S KE   W Q+ K  W  EGD NS+FFHRI ++ ++++ I E+                                                 
Subjt:  IKAELMNISAKEEIVWRQKCKLKWFVEGDVNSAFFHRIVTANRRKNAIFEV-------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------LSASGI------------------------
                                                                              L +SG                         
Subjt:  ----------------------------------------------------------------------LSASGI------------------------

Query:  ---------NQQLAASL-----------------------------ASSFGCKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGR
                 N ++A SL                             AS +G    S P++YLG+PL GNP+S  FW  V +KI+K+L NW    ISKGGR
Subjt:  ---------NQQLAASL-----------------------------ASSFGCKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGR

Query:  HTLIQATLANLPVYYLSIFLAPMKVTKAIEKLFRNYLWRGAHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKY
         TLI++TL++LP+Y LS+F AP    K IEKL+R +LW+G + + GSHL+NW+ +    EEGGLG+  ++  N +LL+KW+WR+  E  A+WR++I  KY
Subjt:  HTLIQATLANLPVYYLSIFLAPMKVTKAIEKLFRNYLWRGAHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKY

Query:  -GAAKNNLKPGISSLHSSKGPWKAIYKLQNLIYRSISIKVGNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVE
         G    ++   ISS  +SK PW++I    +    + S  + NG+   FW   W     L   YP L+ L+L KE  + D WN     WN+  RR LN+ E
Subjt:  -GAAKNNLKPGISSLHSSKGPWKAIYKLQNLIYRSISIKVGNGNSTLFWEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVE

Query:  SSEWASMSHYLFSFTIKEEVDKWIWSLDKSGVFSTGS----LTKELASSKNTSNVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPS
           W  +   L +        K  W  D +  FS  S    ++++L  +       L  ++WK S+P K+KFF+W L    INT +V+Q+KMP+  L P+
Subjt:  SSEWASMSHYLFSFTIKEEVDKWIWSLDKSGVFSTGS----LTKELASSKNTSNVDLYRLLWKGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPS

Query:  C
        C
Subjt:  C

SwissProt top hitse value%identityAlignment
P34727 ADP-ribosylation factor2.7e-4045.95Show/hide
Query:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF
        MG AF KLFD  +G  EMR++M+GLDAAGKTTILYKL +GE+++T+PTI                                                 GF
Subjt:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF

Query:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDMIQQPSP
        NVE V+YKN+ FTVWDVGGQ+K+RPLWRHYF NT G+I+VVDS DR+R+ +A+ E Q ++N+  + ++++LVFANKQD+    SP
Subjt:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDMIQQPSP

P61209 ADP-ribosylation factor 16.7e-3946.37Show/hide
Query:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF
        MG  F  LF   FG  EMR++M+GLDAAGKTTILYKL +GE+++T+PTI                                                 GF
Subjt:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF

Query:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDM
        NVE V+YKN+ FTVWDVGGQ+K+RPLWRHYF NT GLI+VVDS DRERIG+A+ E   ++ +  + ++V+L+FANKQD+
Subjt:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDM

P61210 ADP-ribosylation factor 16.7e-3946.37Show/hide
Query:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF
        MG  F  LF   FG  EMR++M+GLDAAGKTTILYKL +GE+++T+PTI                                                 GF
Subjt:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF

Query:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDM
        NVE V+YKN+ FTVWDVGGQ+K+RPLWRHYF NT GLI+VVDS DRERIG+A+ E   ++ +  + ++V+L+FANKQD+
Subjt:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDM

Q61LA8 ADP-ribosylation factor 1-like 26.7e-3946.41Show/hide
Query:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF
        MG  F  LF   FG  EMR++M+GLDAAGKTTILYKL +GE+++T+PTI                                                 GF
Subjt:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF

Query:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDMIQ
        NVE V+YKN+ FTVWDVGGQ+K+RPLWRHYF NT GLI+VVDS DRER+G+A+ E   ++ +  + ++V+LVFANKQD+ Q
Subjt:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDMIQ

Q75A26 ADP-ribosylation factor2.3e-3946.37Show/hide
Query:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF
        MG +F KLF   FG+ EMR++M+GLD AGKTT+LYKL +GEV++T+PTI                                                 GF
Subjt:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF

Query:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDM
        NVE V+YKN+ FTVWDVGGQ+K+RPLWRHYF NT+G+I+VVDS DR RI +A+   Q ++N+  + N+V+LVFANKQD+
Subjt:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDM

Arabidopsis top hitse value%identityAlignment
AT1G10630.1 ADP-ribosylation factor A1F1.4e-3945.81Show/hide
Query:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF
        MG +F KLF   F   EMR++M+GLDAAGKTTILYKL +GE+++T+PTI                                                 GF
Subjt:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF

Query:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDM
        NVE V+YKN+ FTVWDVGGQ+K+RPLWRHYF NT GLI+VVDS DR+R+ +A+ E   ++N+  + ++V+LVFANKQD+
Subjt:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDM

AT1G23490.1 ADP-ribosylation factor 11.4e-3945.81Show/hide
Query:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF
        MG +F KLF   F   EMR++M+GLDAAGKTTILYKL +GE+++T+PTI                                                 GF
Subjt:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF

Query:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDM
        NVE V+YKN+ FTVWDVGGQ+K+RPLWRHYF NT GLI+VVDS DR+R+ +A+ E   ++N+  + ++V+LVFANKQD+
Subjt:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDM

AT1G70490.1 Ras-related small GTP-binding family protein1.4e-3945.81Show/hide
Query:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF
        MG +F KLF   F   EMR++M+GLDAAGKTTILYKL +GE+++T+PTI                                                 GF
Subjt:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF

Query:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDM
        NVE V+YKN+ FTVWDVGGQ+K+RPLWRHYF NT GLI+VVDS DR+R+ +A+ E   ++N+  + ++V+LVFANKQD+
Subjt:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDM

AT3G03120.1 ADP-ribosylation factor B1C4.9e-6168.11Show/hide
Query:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF
        MGQ FRKLFDTFFGN EMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTI                                                 GF
Subjt:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF

Query:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDMIQQPSP
        NVEKVQYKNV+FTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAK EFQ II DPFMLNSVILVFANKQDM    SP
Subjt:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDMIQQPSP

AT5G17060.1 ADP-ribosylation factor B1B2.9e-6168.11Show/hide
Query:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF
        MGQAFRKLFDTFFGN EMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTI                                                 GF
Subjt:  MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGF

Query:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDMIQQPSP
        NVEKVQYKNV+FTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAK EFQ II DPFMLNS+ILVFANKQDM    SP
Subjt:  NVEKVQYKNVVFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDMIQQPSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCAAGCGTTTCGCAAACTCTTCGACACATTTTTCGGCAACAGTGAGATGCGGGTTGTTATGCTTGGCTTGGATGCAGCTGGCAAGACAACTATATTGTACAAGCT
GCACATTGGGGAGGTTTTATCGACAGTTCCTACAATTGCTCTAGAGAAGTTTCGCCAAAGGATTCTTGCTTTGTCCTTAAAAACACCCCAGGTGTTTGGGGTTGGCCTTG
GCTCTCATTCTGAGTTGTCAGGAGGTCTTCTCCATCCCTCTGTCTTTTGCACGATGAATTCTTATCTATTTGTTGGTTTCAATGTGGAGAAAGTGCAGTATAAAAATGTT
GTGTTCACTGTTTGGGATGTTGGAGGACAAGAGAAACTGAGGCCATTGTGGAGGCACTACTTTAATAACACAGATGGACTGATTTATGTGGTAGACTCCTTGGACCGTGA
GAGAATTGGGAAGGCAAAAACAGAATTTCAGGCCATCATTAACGATCCATTTATGCTGAATAGTGTCATATTGGTCTTTGCGAATAAGCAGGATATGATCCAACAGCCAT
CACCCATATCCTTCAGGATTGAGCGAAAATCCTTCTCCATAACAGCTGATCCCAGGAACCCCAGATGTTATCGCCTCACAGAAGCCACTCGCGACAAAAGCTTCTCTTTG
ACGCTTTCCAAAGAAGCTTTATATTGGATAAAAGCTAGCTTCTCCAAGCTTTGTGACCTTCCCCTGAACCAGAAGTTCTTTACCGAGAAAAGAGTTGAAGACGCGACCCT
ATGGGTGGAAAAGATTACAAACAAAAAGGACCACTCAGCAGAGATAGCAAAGTTAGGAAATAATGGGGGTCTAAATAAGATTATTATACCTGTGGGGGATGAGAGAAAAG
GCTGGAAAAGTTTTACTGAGTGCATTAACTCTCTCACCAACAACCCATCAGCTAGCCTCCCCCCTCCCATTTATGATGTTAAGGATAACACATCTTATAAGGCTGCCTTA
AAAGCTCCTCCCAGAGAACCCCGCAATCAGCCCTCTGCAGTGCCAATTCTCCCTGAGGAATCTGCCTCCCTCAATCCGTTTGCGACCTTGTTTTTATCTTCGGCGGTCAT
AGTGCAGAGAAGACATTTTCATGACAACTGGTTAGACATAATGAGGGCCCTTCAACAACACTTATCAGCTTATGCCTCAGTTAGTCCCCTGCAACCCGATAAAGCTTTAC
TTGCTTGTGAGAATGAGGAACAAGCACAGGTTTTAGCCAAAATCAAAGATTGGTATAAGGTTGGTAAATTCCAGGTTAGATTCTTCCCATGGAATTCTGAAATCATGAAT
GGAGATAGAAAGGTCCCTTCATATGGGGGATGGATTAGGGTCCGCAACTTGCCTATAGACAAATGGTCGATTGAAGCCTTCAAATTGATTGGAGATGAATGTGGGGGATA
CCTGGAAACAACGACCAAGACTCTTACCAGATTAGATATGATGGAGATTATGATAAAAGTGAAAACCAATCATACTGGCTTCATCCCAGCCGAGGTTCACATTCCATCAA
CATCGAGTAGCCCCAGCAAGGTCAGTATCGACCCTTTCTTTATGGAAGACTACTATATTGGATATATAGCCGGAATCCATGGAAAAATCCCAACAGGTCCGAAGGAATAT
GACGATTCAATCATCGGAAAATCTGGAAAATCCATGGACAATCGGGGTGCATCTCAGCCACGCGCCGCCCAAGCAAAGGAAATTGAGAAAGGGGATAATCCCCAAATATC
GTACAGTTGTGCCCTCCAAGGAACCCCTTTTCAGGTCTGTCACACAGAGGATTACCAGACGCAATCAGAATGGGCCCCACACCTTCCAGAAGATCACGTGATTCCCCAAT
CCATAGTTGCCACCACCGGCGGGTCCAAAAAACCCTGCCCCCTTACATTGCCCCAATCAGAGGCCCACCCAAATCAACAGCCCCCTCGTCAAGCCCCAAAAAGCCCACTT
TATCCAGATGGCCCACCCCAAATCTACACCAATCAACCTGTTTGCCCTAATCCCGAAGCCCCTATAGACATAAACCCCTCGACAATCTCCCCACCAGCCAGCCCAATCCC
ATCCATCAAAAATCACCCTCGTCGGAAAAAGCCCATCATCATTAATAATAAGGAAACCTTCCTTCTATCGGGCACTGTACACTCGACTGGATCGACATTCCACCTATCAG
ATTCGGAAGGAGCTCTTTCCTCCCCTTGCTCGCCGAATTTGGATGAATCTCCCCCTATTCACCATCAAAAGTCTTCCAACCAAGCGGATTCTCCTCCATCTATATCATTC
TTATTTGAATCCTCTGAAGATCAGGCCCTTGACATTGATAACCCCATTCCGTTGATGATTGAAAAGCCCTCTGCGGCAGGTATACAGCAAAACATCGAAATGGAGACCAC
TGCCTTGGTCGACATTGATGTAGAAGAATTGTCAGGAGAAGATTCCGTTTCCAAGACCCACTTCCAGCATAGAGATGCCGCTGCTTATCTTCCCTTTCTTTTCCCTTGGC
TTGCTGATCATGGCATGTGCATCATGCCAATGCCAAACAGAAATAAACTATCTACTGCAGCCAAAAAGAAGACTCAATGGATGGTGGATCAATGGTGGAATTCTAATCTT
TTACAAGGCTGGCCAGGTCATGGTCTTATGCAAAAGCTGAAAGGGTTGAAAGGAGAGCTCAAGCTTTGGAATCAGCAAACCTTTAATTCACAGAGGGAGCTGAAATTTGC
TTTGAGCCGTGAGCTATCTGTTATTGATAACAAGGAGGAAATTGACCATCTATCTGAGCAGGATATCAATAGAAGATCGATCATCAAAGCTGAATTGATGAATATTTCGG
CCAAAGAAGAAATTGTCTGGAGGCAAAAATGCAAACTCAAATGGTTTGTTGAGGGAGATGTAAATTCAGCTTTTTTCCACCGAATTGTTACAGCGAACAGAAGGAAAAAC
GCCATTTTTGAAGTCCTTTCAGCATCAGGTATCAACCAACAACTTGCTGCCTCCTTGGCCTCATCCTTCGGGTGCAAGTTGGGATCTTGGCCGATTACATATCTTGGGCT
CCCTCTAAATGGCAATCCCCGATCGTTGAATTTTTGGTCTCCGGTGATAGAAAAGATTGAAAAAAGATTGTTCAATTGGGGTTCCAATCACATATCCAAAGGAGGCCGTC
ACACTCTTATTCAAGCTACTTTAGCAAACCTCCCCGTCTACTACCTCTCCATTTTCCTTGCTCCCATGAAAGTCACAAAAGCTATTGAAAAGCTTTTCCGAAATTATCTA
TGGAGAGGGGCTCACAGAAACAGAGGCAGCCACCTCCTTAATTGGTCTCTTTTAAAATCTCCCATTGAGGAAGGAGGTTTGGGCTTATACGATATCCACAAAAAGAATAT
CTCCCTTTTGGCTAAGTGGATTTGGAGATTTCATATTGAAAAGGAAGCTATTTGGAGGAAGATTATTGTAGCTAAATATGGGGCTGCCAAAAACAACCTCAAACCTGGGA
TATCATCTTTACACTCTTCAAAAGGTCCATGGAAAGCTATTTACAAATTGCAGAACCTGATTTACAGAAGTATTTCTATCAAAGTTGGAAATGGAAATAGTACTTTGTTT
TGGGAGGAGGTTTGGCTTGGAACCTCTAGCCTGCGTGTGAATTATCCCCATCTTTATCATCTTTCTCTCAAAAAAGAAGGCCATATTGCCGATTTTTGGAACCCCTCCCA
AAAAGCATGGAACCTTCAGCTTAGAAGAAATTTGAATGAAGTCGAATCATCCGAATGGGCATCCATGTCACACTATCTATTTTCCTTCACCATCAAAGAGGAGGTCGATA
AATGGATATGGTCCCTTGACAAATCAGGAGTCTTTTCCACGGGCTCTCTCACTAAAGAGTTGGCCTCCAGTAAAAACACATCCAATGTGGACTTGTACAGGTTGCTTTGG
AAAGGTTCTATGCCCAAAAAGGTCAAGTTTTTCATTTGGGAGCTTAGCCATGCTTGCATTAACACAGCTGATGTTGTCCAACGCAAAATGCCTTCGATGTCCCTCTCCCC
TAGCTGCTGCTGCTGCCTGTGTTTTCAAGCATCAGAATCGCAAATTCATCTTTTCAGTTTGTGCTCTTATGTGACATCTTTTTGGGACACTCTTCAGCAAGCTTTTGGCT
GA
mRNA sequenceShow/hide mRNA sequence
ATGGGTCAAGCGTTTCGCAAACTCTTCGACACATTTTTCGGCAACAGTGAGATGCGGGTTGTTATGCTTGGCTTGGATGCAGCTGGCAAGACAACTATATTGTACAAGCT
GCACATTGGGGAGGTTTTATCGACAGTTCCTACAATTGCTCTAGAGAAGTTTCGCCAAAGGATTCTTGCTTTGTCCTTAAAAACACCCCAGGTGTTTGGGGTTGGCCTTG
GCTCTCATTCTGAGTTGTCAGGAGGTCTTCTCCATCCCTCTGTCTTTTGCACGATGAATTCTTATCTATTTGTTGGTTTCAATGTGGAGAAAGTGCAGTATAAAAATGTT
GTGTTCACTGTTTGGGATGTTGGAGGACAAGAGAAACTGAGGCCATTGTGGAGGCACTACTTTAATAACACAGATGGACTGATTTATGTGGTAGACTCCTTGGACCGTGA
GAGAATTGGGAAGGCAAAAACAGAATTTCAGGCCATCATTAACGATCCATTTATGCTGAATAGTGTCATATTGGTCTTTGCGAATAAGCAGGATATGATCCAACAGCCAT
CACCCATATCCTTCAGGATTGAGCGAAAATCCTTCTCCATAACAGCTGATCCCAGGAACCCCAGATGTTATCGCCTCACAGAAGCCACTCGCGACAAAAGCTTCTCTTTG
ACGCTTTCCAAAGAAGCTTTATATTGGATAAAAGCTAGCTTCTCCAAGCTTTGTGACCTTCCCCTGAACCAGAAGTTCTTTACCGAGAAAAGAGTTGAAGACGCGACCCT
ATGGGTGGAAAAGATTACAAACAAAAAGGACCACTCAGCAGAGATAGCAAAGTTAGGAAATAATGGGGGTCTAAATAAGATTATTATACCTGTGGGGGATGAGAGAAAAG
GCTGGAAAAGTTTTACTGAGTGCATTAACTCTCTCACCAACAACCCATCAGCTAGCCTCCCCCCTCCCATTTATGATGTTAAGGATAACACATCTTATAAGGCTGCCTTA
AAAGCTCCTCCCAGAGAACCCCGCAATCAGCCCTCTGCAGTGCCAATTCTCCCTGAGGAATCTGCCTCCCTCAATCCGTTTGCGACCTTGTTTTTATCTTCGGCGGTCAT
AGTGCAGAGAAGACATTTTCATGACAACTGGTTAGACATAATGAGGGCCCTTCAACAACACTTATCAGCTTATGCCTCAGTTAGTCCCCTGCAACCCGATAAAGCTTTAC
TTGCTTGTGAGAATGAGGAACAAGCACAGGTTTTAGCCAAAATCAAAGATTGGTATAAGGTTGGTAAATTCCAGGTTAGATTCTTCCCATGGAATTCTGAAATCATGAAT
GGAGATAGAAAGGTCCCTTCATATGGGGGATGGATTAGGGTCCGCAACTTGCCTATAGACAAATGGTCGATTGAAGCCTTCAAATTGATTGGAGATGAATGTGGGGGATA
CCTGGAAACAACGACCAAGACTCTTACCAGATTAGATATGATGGAGATTATGATAAAAGTGAAAACCAATCATACTGGCTTCATCCCAGCCGAGGTTCACATTCCATCAA
CATCGAGTAGCCCCAGCAAGGTCAGTATCGACCCTTTCTTTATGGAAGACTACTATATTGGATATATAGCCGGAATCCATGGAAAAATCCCAACAGGTCCGAAGGAATAT
GACGATTCAATCATCGGAAAATCTGGAAAATCCATGGACAATCGGGGTGCATCTCAGCCACGCGCCGCCCAAGCAAAGGAAATTGAGAAAGGGGATAATCCCCAAATATC
GTACAGTTGTGCCCTCCAAGGAACCCCTTTTCAGGTCTGTCACACAGAGGATTACCAGACGCAATCAGAATGGGCCCCACACCTTCCAGAAGATCACGTGATTCCCCAAT
CCATAGTTGCCACCACCGGCGGGTCCAAAAAACCCTGCCCCCTTACATTGCCCCAATCAGAGGCCCACCCAAATCAACAGCCCCCTCGTCAAGCCCCAAAAAGCCCACTT
TATCCAGATGGCCCACCCCAAATCTACACCAATCAACCTGTTTGCCCTAATCCCGAAGCCCCTATAGACATAAACCCCTCGACAATCTCCCCACCAGCCAGCCCAATCCC
ATCCATCAAAAATCACCCTCGTCGGAAAAAGCCCATCATCATTAATAATAAGGAAACCTTCCTTCTATCGGGCACTGTACACTCGACTGGATCGACATTCCACCTATCAG
ATTCGGAAGGAGCTCTTTCCTCCCCTTGCTCGCCGAATTTGGATGAATCTCCCCCTATTCACCATCAAAAGTCTTCCAACCAAGCGGATTCTCCTCCATCTATATCATTC
TTATTTGAATCCTCTGAAGATCAGGCCCTTGACATTGATAACCCCATTCCGTTGATGATTGAAAAGCCCTCTGCGGCAGGTATACAGCAAAACATCGAAATGGAGACCAC
TGCCTTGGTCGACATTGATGTAGAAGAATTGTCAGGAGAAGATTCCGTTTCCAAGACCCACTTCCAGCATAGAGATGCCGCTGCTTATCTTCCCTTTCTTTTCCCTTGGC
TTGCTGATCATGGCATGTGCATCATGCCAATGCCAAACAGAAATAAACTATCTACTGCAGCCAAAAAGAAGACTCAATGGATGGTGGATCAATGGTGGAATTCTAATCTT
TTACAAGGCTGGCCAGGTCATGGTCTTATGCAAAAGCTGAAAGGGTTGAAAGGAGAGCTCAAGCTTTGGAATCAGCAAACCTTTAATTCACAGAGGGAGCTGAAATTTGC
TTTGAGCCGTGAGCTATCTGTTATTGATAACAAGGAGGAAATTGACCATCTATCTGAGCAGGATATCAATAGAAGATCGATCATCAAAGCTGAATTGATGAATATTTCGG
CCAAAGAAGAAATTGTCTGGAGGCAAAAATGCAAACTCAAATGGTTTGTTGAGGGAGATGTAAATTCAGCTTTTTTCCACCGAATTGTTACAGCGAACAGAAGGAAAAAC
GCCATTTTTGAAGTCCTTTCAGCATCAGGTATCAACCAACAACTTGCTGCCTCCTTGGCCTCATCCTTCGGGTGCAAGTTGGGATCTTGGCCGATTACATATCTTGGGCT
CCCTCTAAATGGCAATCCCCGATCGTTGAATTTTTGGTCTCCGGTGATAGAAAAGATTGAAAAAAGATTGTTCAATTGGGGTTCCAATCACATATCCAAAGGAGGCCGTC
ACACTCTTATTCAAGCTACTTTAGCAAACCTCCCCGTCTACTACCTCTCCATTTTCCTTGCTCCCATGAAAGTCACAAAAGCTATTGAAAAGCTTTTCCGAAATTATCTA
TGGAGAGGGGCTCACAGAAACAGAGGCAGCCACCTCCTTAATTGGTCTCTTTTAAAATCTCCCATTGAGGAAGGAGGTTTGGGCTTATACGATATCCACAAAAAGAATAT
CTCCCTTTTGGCTAAGTGGATTTGGAGATTTCATATTGAAAAGGAAGCTATTTGGAGGAAGATTATTGTAGCTAAATATGGGGCTGCCAAAAACAACCTCAAACCTGGGA
TATCATCTTTACACTCTTCAAAAGGTCCATGGAAAGCTATTTACAAATTGCAGAACCTGATTTACAGAAGTATTTCTATCAAAGTTGGAAATGGAAATAGTACTTTGTTT
TGGGAGGAGGTTTGGCTTGGAACCTCTAGCCTGCGTGTGAATTATCCCCATCTTTATCATCTTTCTCTCAAAAAAGAAGGCCATATTGCCGATTTTTGGAACCCCTCCCA
AAAAGCATGGAACCTTCAGCTTAGAAGAAATTTGAATGAAGTCGAATCATCCGAATGGGCATCCATGTCACACTATCTATTTTCCTTCACCATCAAAGAGGAGGTCGATA
AATGGATATGGTCCCTTGACAAATCAGGAGTCTTTTCCACGGGCTCTCTCACTAAAGAGTTGGCCTCCAGTAAAAACACATCCAATGTGGACTTGTACAGGTTGCTTTGG
AAAGGTTCTATGCCCAAAAAGGTCAAGTTTTTCATTTGGGAGCTTAGCCATGCTTGCATTAACACAGCTGATGTTGTCCAACGCAAAATGCCTTCGATGTCCCTCTCCCC
TAGCTGCTGCTGCTGCCTGTGTTTTCAAGCATCAGAATCGCAAATTCATCTTTTCAGTTTGTGCTCTTATGTGACATCTTTTTGGGACACTCTTCAGCAAGCTTTTGGCT
GA
Protein sequenceShow/hide protein sequence
MGQAFRKLFDTFFGNSEMRVVMLGLDAAGKTTILYKLHIGEVLSTVPTIALEKFRQRILALSLKTPQVFGVGLGSHSELSGGLLHPSVFCTMNSYLFVGFNVEKVQYKNV
VFTVWDVGGQEKLRPLWRHYFNNTDGLIYVVDSLDRERIGKAKTEFQAIINDPFMLNSVILVFANKQDMIQQPSPISFRIERKSFSITADPRNPRCYRLTEATRDKSFSL
TLSKEALYWIKASFSKLCDLPLNQKFFTEKRVEDATLWVEKITNKKDHSAEIAKLGNNGGLNKIIIPVGDERKGWKSFTECINSLTNNPSASLPPPIYDVKDNTSYKAAL
KAPPREPRNQPSAVPILPEESASLNPFATLFLSSAVIVQRRHFHDNWLDIMRALQQHLSAYASVSPLQPDKALLACENEEQAQVLAKIKDWYKVGKFQVRFFPWNSEIMN
GDRKVPSYGGWIRVRNLPIDKWSIEAFKLIGDECGGYLETTTKTLTRLDMMEIMIKVKTNHTGFIPAEVHIPSTSSSPSKVSIDPFFMEDYYIGYIAGIHGKIPTGPKEY
DDSIIGKSGKSMDNRGASQPRAAQAKEIEKGDNPQISYSCALQGTPFQVCHTEDYQTQSEWAPHLPEDHVIPQSIVATTGGSKKPCPLTLPQSEAHPNQQPPRQAPKSPL
YPDGPPQIYTNQPVCPNPEAPIDINPSTISPPASPIPSIKNHPRRKKPIIINNKETFLLSGTVHSTGSTFHLSDSEGALSSPCSPNLDESPPIHHQKSSNQADSPPSISF
LFESSEDQALDIDNPIPLMIEKPSAAGIQQNIEMETTALVDIDVEELSGEDSVSKTHFQHRDAAAYLPFLFPWLADHGMCIMPMPNRNKLSTAAKKKTQWMVDQWWNSNL
LQGWPGHGLMQKLKGLKGELKLWNQQTFNSQRELKFALSRELSVIDNKEEIDHLSEQDINRRSIIKAELMNISAKEEIVWRQKCKLKWFVEGDVNSAFFHRIVTANRRKN
AIFEVLSASGINQQLAASLASSFGCKLGSWPITYLGLPLNGNPRSLNFWSPVIEKIEKRLFNWGSNHISKGGRHTLIQATLANLPVYYLSIFLAPMKVTKAIEKLFRNYL
WRGAHRNRGSHLLNWSLLKSPIEEGGLGLYDIHKKNISLLAKWIWRFHIEKEAIWRKIIVAKYGAAKNNLKPGISSLHSSKGPWKAIYKLQNLIYRSISIKVGNGNSTLF
WEEVWLGTSSLRVNYPHLYHLSLKKEGHIADFWNPSQKAWNLQLRRNLNEVESSEWASMSHYLFSFTIKEEVDKWIWSLDKSGVFSTGSLTKELASSKNTSNVDLYRLLW
KGSMPKKVKFFIWELSHACINTADVVQRKMPSMSLSPSCCCCLCFQASESQIHLFSLCSYVTSFWDTLQQAFG