; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G17040 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G17040
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationChr4:14472312..14476902
RNA-Seq ExpressionCSPI04G17040
SyntenyCSPI04G17040
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042451.1 reverse transcriptase [Cucumis melo var. makuwa]7.9e-10031.78Show/hide
Query:  LREGVEN--EENEDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA-------------------------NSN--
        LR+G ++  E   + ETL+      GR    I EDE+DTM+RFL G+N+E+ + VDR   ++M+++ H A                         +SN  
Subjt:  LREGVEN--EENEDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA-------------------------NSN--

Query:  ------SGNVNFKPKGKIEFEKKDKVESSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINKKVKLSPMEKKEFEDVF-QDEAPKGLPPI
              + N + + +GK    +K + ESS  K K++ SKEVREK+  I+CWKC+G GH+++DC NKR M+ +   +      EF D+F  ++AP GLPP+
Subjt:  ------SGNVNFKPKGKIEFEKKDKVESSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINKKVKLSPMEKKEFEDVF-QDEAPKGLPPI

Query:  RGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELMEK------------------------------RAINKITVRYRHPIPRLDDLLDELH----
        RGIEH+I+FI GAT+PN  AY+ NP   KEIQRQVEELM+K                              RAINKITV+YRHPIPRLDD+LDELH    
Subjt:  RGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELMEK------------------------------RAINKITVRYRHPIPRLDDLLDELH----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------GDVLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQ-------
                                                             G VLMQ+K+ +M+FSEKL+GA LNY TY+KEL ALV AL+       
Subjt:  -----------------------------------------------------GDVLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQ-------

Query:  ---------------LKGQSKLNRRNAKWIEFIE-------YKQDNDNVVADVLSRRYNLFTSLSAKILGFEHVKELY-TDDTDFGMIYSSCLDGKIIDD
                       LKGQ+KLN+R+AKW+EFIE       YK+  DN+VAD LSRRY L +SLSAK+LGF+H+ ELY  + ++F  +Y+ CL+GK + D
Subjt:  ---------------LKGQSKLNRRNAKWIEFIE-------YKQDNDNVVADVLSRRYNLFTSLSAKILGFEHVKELY-TDDTDFGMIYSSCLDGKIIDD

Query:  YYVFHDFLFKKSKLCVPK------------------------------------------------CFEDLAKKGK------------------------
        Y VF   LF+K KLC+PK                                                CF+    K K                        
Subjt:  YYVFHDFLFKKSKLCVPK------------------------------------------------CFEDLAKKGK------------------------

Query:  -----------DSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIAT-----------------------------------------
                   DSIFVVVDRFSKM HFIPCNKT+ A ++++LFFREVVRLH IP+TI +                                         
Subjt:  -----------DSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIAT-----------------------------------------

Query:  ---------ISKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVSD-----------------------------ATTNK
                 ISKN+KSW+++LPFV+FAYNR IHSTTHCS FE++ G N        S +C   + +                                 K
Subjt:  ---------ISKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVSD-----------------------------ATTNK

Query:  EQFPSQRKTKLHPRGDGPFQVM
        E+FP QRK+KL  RGD PFQV+
Subjt:  EQFPSQRKTKLHPRGDGPFQVM

KAA0057648.1 putative polyprotein [Cucumis melo var. makuwa]1.3e-9730.81Show/hide
Query:  IDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA-----------NSNSGNVNFKPKGKIEFEKKDKVESSKGKEKLDGSKEVREKSQYIKCWKC
        I EDE+DTM+RFL G+N+E+ + VDR   ++M+++ H A             +  N + + +GK    +K + ESS  K K++ SKEVREK+  I+CWKC
Subjt:  IDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA-----------NSNSGNVNFKPKGKIEFEKKDKVESSKGKEKLDGSKEVREKSQYIKCWKC

Query:  QGIGHIARDCPNKRTMI----------------------------------------------NKKVKLSPMEKK-------------------------
        +G GH+++DC NKR M+                                              N K+K   +E +                         
Subjt:  QGIGHIARDCPNKRTMI----------------------------------------------NKKVKLSPMEKK-------------------------

Query:  ---EFEDVF-QDEAPKGLPPIRGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELMEK------------------------------RAINKITV
           EF D+F  ++AP GLPP+RGIEH+I+FI GAT+PN  AY+ NP   KEIQRQVEELM+K                              RAINKI V
Subjt:  ---EFEDVF-QDEAPKGLPPIRGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELMEK------------------------------RAINKITV

Query:  RYRHPIPRLDDLLDELH-----------------------------------------------------------------------------------
        +YRHPIPRLDD+LDELH                                                                                   
Subjt:  RYRHPIPRLDDLLDELH-----------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------GDVLMQDKKSLMYFSE
                                                                                            G VLMQ+K+ +M+FSE
Subjt:  ------------------------------------------------------------------------------------GDVLMQDKKSLMYFSE

Query:  KLSGASLNYPTYNKELFALVHALQ----------------------LKGQSKLNRRNAKWIEFIE-------YKQDNDNVVADVLSRRYNLFTSLSAKIL
        KL+GA LNY TY+KEL ALV AL+                      LKGQ+KLN+R+AKW+EFIE       YK+  DN+VAD LSR Y LF+S SAK+L
Subjt:  KLSGASLNYPTYNKELFALVHALQ----------------------LKGQSKLNRRNAKWIEFIE-------YKQDNDNVVADVLSRRYNLFTSLSAKIL

Query:  GFEHVKELY-TDDTDFGMIYSSCLDGKIIDDYYVFHDFLFKKSKLCVPK------------------------------------------------CFE
        GF+H+ ELY  + ++F  +Y+ CL+GK I DY VF + LF+K KLC+PK                                                CF+
Subjt:  GFEHVKELY-TDDTDFGMIYSSCLDGKIIDDYYVFHDFLFKKSKLCVPK------------------------------------------------CFE

Query:  DLAKKGK-----------------------------------DSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVV------------RLHGIPKTI
            K K                                   DSIFVVVDRFSKM  FIPCNKT+ A ++++LFFREV+            +   + +T+
Subjt:  DLAKKGK-----------------------------------DSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVV------------RLHGIPKTI

Query:  -----ATISKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVSDATTN--------------------------------
             + ISKN+KSW+++LPFV+FAYNR IHS THCS FEVVYGFNPLTPL L  +  N+F SDA ++                                
Subjt:  -----ATISKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVSDATTN--------------------------------

Query:  ---------------KEQFPSQRKTKLHPRGDGPFQVM
                       KE+FP QRK+KL  RGD PFQV+
Subjt:  ---------------KEQFPSQRKTKLHPRGDGPFQVM

TYK22420.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]1.0e-9428.99Show/hide
Query:  LREGVEN--EENEDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA-------------------------NSN--
        LR+G ++  E   + ETL+      GR    I EDE+DTM+RFL G+N+E+ + VDR   ++M+++ H A                         +SN  
Subjt:  LREGVEN--EENEDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA-------------------------NSN--

Query:  ------SGNVNFKPKGKIEFEKKDKVESSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMI------------------------------
              + N + + +GK    +K + ESS  K K + SKEVREK+  I+CWKC+G GH+++DC NKR M+                              
Subjt:  ------SGNVNFKPKGKIEFEKKDKVESSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMI------------------------------

Query:  ----------------NKKVKLSPMEKK----------------------------EFEDVF-QDEAPKGLPPIRGIEHKIEFIHGATIPNRPAYKANPA
                        N K+K   +E +                            EF D+F  ++AP GLPP+RGIEH+I+FI GAT+PN  AY+ NP 
Subjt:  ----------------NKKVKLSPMEKK----------------------------EFEDVF-QDEAPKGLPPIRGIEHKIEFIHGATIPNRPAYKANPA

Query:  RIKEIQRQVEELMEK------------------------------RAINKITVRYRHPIPRLDDLLDELH------------------------------
          KEIQRQVEELM+K                              RAINKITV+YRHPIPRLDD+LDELH                              
Subjt:  RIKEIQRQVEELMEK------------------------------RAINKITVRYRHPIPRLDDLLDELH------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------GDVL
                                                                                                        G VL
Subjt:  ------------------------------------------------------------------------------------------------GDVL

Query:  MQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQ----------------------LKGQSKLNRRNAKWIEFIE-------YKQDNDNVVADVLSRR
        MQ+K+ +M+FSEKL+GA LNY TY+KEL ALV AL+                      LKGQ+KLN+R+AKW+EFIE       YK+  DN+VAD LSRR
Subjt:  MQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQ----------------------LKGQSKLNRRNAKWIEFIE-------YKQDNDNVVADVLSRR

Query:  YNLFTSLSAKILGFEHVKELY-TDDTDFGMIYSSCLDGKIIDDYYVFHDFLFKKSKLCVPK---------------------------------------
        Y LF+SLSAK+LGF+H+ ELY  + ++F  +Y+ CL+GK + DY VF   LF+K KLC+PK                                       
Subjt:  YNLFTSLSAKILGFEHVKELY-TDDTDFGMIYSSCLDGKIIDDYYVFHDFLFKKSKLCVPK---------------------------------------

Query:  ---------CFEDLAKKGK-----------------------------------DSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTI
                 CF+    K K                                   DSIFVVVDRFSKM HFIPCNKT+ A ++++LFFREVVRLHGIPKTI
Subjt:  ---------CFEDLAKKGK-----------------------------------DSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTI

Query:  AT--------------------------------------------------ISKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLS
         +                                                  ISKN+KSW+++LPFV+FAYNR IHSTTHCS FEVVYGFNPLTPL L  
Subjt:  AT--------------------------------------------------ISKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLS

Query:  IICNVFVSDATTN-----------------------------------------------KEQFPSQRKTKLHPRGDGPFQVM
        +  N+F SDA ++                                               KE+FP QRK+KL  RGDGPFQV+
Subjt:  IICNVFVSDATTN-----------------------------------------------KEQFPSQRKTKLHPRGDGPFQVM

XP_022157414.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024115 [Momordica charantia]2.0e-9529.97Show/hide
Query:  QLDRMKVALRNHHQG-----EVLREGVENEEN--EDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA--------
        +L R +   R+ H+      +VLR+G ++ E+  ++ +TL+           +++ED ++ MARFL G+N+E+ ++VD Q YF+++E+LHLA        
Subjt:  QLDRMKVALRNHHQG-----EVLREGVENEEN--EDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA--------

Query:  ------------------NSNSGNVNFKPKGKIEFEK-KDKVE----SSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINK--------
                           ++S + + K + +  +EK K+K E    S K KEK+    E   ++  IKCWKCQG+GH +RDCPN+RTMI +        
Subjt:  ------------------NSNSGNVNFKPKGKIEFEK-KDKVE----SSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINK--------

Query:  -------------------------------------------------------------------------------------------KVKLS----
                                                                                                   +VK+S    
Subjt:  -------------------------------------------------------------------------------------------KVKLS----

Query:  -----------------PM-------------EKKEFEDVFQDEAPKGLPPIRGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELM---------
                         PM             +++EFED+F +E PK LPP+RGIE+KI+FI GATIPNRPAY+ NP   KEIQRQV+EL+         
Subjt:  -----------------PM-------------EKKEFEDVFQDEAPKGLPPIRGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELM---------

Query:  ----------------------------------------------EKRAINKITVRYRHPIPRLDDLLDELH---------------------------
                                                      E R+ + +   YR  I     L   L+                           
Subjt:  ----------------------------------------------EKRAINKITVRYRHPIPRLDDLLDELH---------------------------

Query:  -------------------------GDVLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQ----------------------LKGQSKLNRRNAK
                                 G VLMQ  K LM+FSEKLSGASL YPTY+KEL+ALV ALQ                      L+GQSKLNRR+AK
Subjt:  -------------------------GDVLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQ----------------------LKGQSKLNRRNAK

Query:  WIEFIE-------YKQDNDNVVADVLSRRYNLFTSLSAKILGFEHVKELYTDDTDFGMIYSSCLDGKIIDDYYVFHDFLFKKSKLCVPKC----------
        W+EFIE       YKQ  +N+VAD LSRRY LF +L+AKILGFEH+K LY DD  F     SC  G ++DDY +F  FLF+K KLC+P C          
Subjt:  WIEFIE-------YKQDNDNVVADVLSRRYNLFTSLSAKILGFEHVKELYTDDTDFGMIYSSCLDGKIIDDYYVFHDFLFKKSKLCVPKC----------

Query:  --------------FEDLA-----------------------------------------------------------KKGKDSIFVVVDRFSKMIHFIP
                      ++ LA                                                           +KG+DSIFVVVDRFSKM HFIP
Subjt:  --------------FEDLA-----------------------------------------------------------KKGKDSIFVVVDRFSKMIHFIP

Query:  CNKTNTAKHVSDLFFREVVRLHGIPKTI--------------------------------------------------ATISKNVKSWKDSLPFVKFAYN
        C+KT+ AK ++DLFFREVVRLHGIP+TI                                                  A I KN+K+W+D LP ++FAYN
Subjt:  CNKTNTAKHVSDLFFREVVRLHGIPKTI--------------------------------------------------ATISKNVKSWKDSLPFVKFAYN

Query:  RVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVS-DATT----------------------------------------------NKEQFPSQRKTKL
        R +H TT C+ FE+VYGFNP++PL LL +  N FVS DA+T                                               KE+FP+QR++KL
Subjt:  RVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVS-DATT----------------------------------------------NKEQFPSQRKTKL

Query:  HPRGDGPFQVM
         PRGDGPFQV+
Subjt:  HPRGDGPFQVM

XP_022158803.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111025268 [Momordica charantia]5.0e-9433.04Show/hide
Query:  EVLREGVENEEN--EDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA--------------------------NS
        +VLR+G ++ E+  ++ +TL+           +++ED ++ MARFL G+N+E+ ++VD Q YF+++E+LHLA                           +
Subjt:  EVLREGVENEEN--EDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA--------------------------NS

Query:  NSGNVNFKPKGKIEFEKKDKVE----SSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINK-----------------------------
        +S + + K + +  ++ K+K E    S KGKEKL     VR +   IKCWK QG+GH +RDCPN+RTMI +                             
Subjt:  NSGNVNFKPKGKIEFEKKDKVE----SSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINK-----------------------------

Query:  ----------------------------------------------------------------------KVKLS---------------------PM--
                                                                              +VK+S                     PM  
Subjt:  ----------------------------------------------------------------------KVKLS---------------------PM--

Query:  -----------EKKEFEDVFQDEAPKGLPPIRGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELMEKRAI-------------------------
                   +++EFEDVFQ++ PKGLPPIRGIEH+I+FI GATIPNRPAY+ANP   KEI+RQVEELMEK  +                         
Subjt:  -----------EKKEFEDVFQDEAPKGLPPIRGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELMEKRAI-------------------------

Query:  ----------------NKITVRYRHPIPR----------------------------LDDLL----------------DELHGD----------------
                         K+      P P+                            L+DL+                +EL G                 
Subjt:  ----------------NKITVRYRHPIPR----------------------------LDDLL----------------DELHGD----------------

Query:  -------------VLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQ----------------------LKGQSKLNRRNAKWIEFIE-------Y
                     VLMQD K LMYFSEKL+GASLNYPTY+KELFALV  LQ                      LKGQSKLNRR+AKW+EFIE       Y
Subjt:  -------------VLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQ----------------------LKGQSKLNRRNAKWIEFIE-------Y

Query:  KQDNDNVVADVLSRRYNLFTSLSAKILGFE-HVKELYTDDTDFGMIYSSCLDGKIIDDYYVFHD-FLFKKSKLCVPK----CFE----------------
        KQ  DNVVAD LSRRYNLF    +K+   +  +++L   +   G +        I   Y + H+ F + K K  V K    CF+                
Subjt:  KQDNDNVVADVLSRRYNLFTSLSAKILGFE-HVKELYTDDTDFGMIYSSCLDGKIIDDYYVFHD-FLFKKSKLCVPK----CFE----------------

Query:  -------------------DLAKKGKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIAT--------------------------
                              KKG+DSIFVVVD+FSKM HFI CNKT+ AKHV+DLFFREVVRLHGIP+TI +                          
Subjt:  -------------------DLAKKGKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIAT--------------------------

Query:  ------------------------ISKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVSDATTNKEQFPSQRKTKLHPR
                                IS+N+ SW++SLPFV+FAYNR IHSTTHCS FEVVYGFNPLTPL LL I  N FV       E+FPS+RKTKLHPR
Subjt:  ------------------------ISKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVSDATTNKEQFPSQRKTKLHPR

Query:  GDGPFQVM
        G GP+QV+
Subjt:  GDGPFQVM

TrEMBL top hitse value%identityAlignment
A0A2N9J791 Uncharacterized protein7.0e-10233.67Show/hide
Query:  NIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA------NSNSGNVNFKPKG-----KIEFEK------KDKVESSKGKEKL----DGSKEVR
        N++ED + TMARFL G+NR++ N V+ Q+Y ++++++H+A          G  +F+  G     K  + K      K K E  K +E++     G  E +
Subjt:  NIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA------NSNSGNVNFKPKG-----KIEFEK------KDKVESSKGKEKL----DGSKEVR

Query:  EKSQYIKCWKCQGIGHIARDCPNKRTMI--------------------NKKVKL----SPMEKKEFEDVFQDEAPKGLPPIRGIEHKIEFIHGATIPNRP
         +++ IKC++C G+GHIA  CPNKRTMI                    NK+  +      +  KE+EDVF ++ P GLPPIRGIEH+I+F+ GATIPNRP
Subjt:  EKSQYIKCWKCQGIGHIARDCPNKRTMI--------------------NKKVKL----SPMEKKEFEDVFQDEAPKGLPPIRGIEHKIEFIHGATIPNRP

Query:  AYKANPARIKEIQRQVEELMEK------------------------------RAINKIT------------------------VRYRH------------
        AY++NP   KE+QRQVEEL+ K                              RAIN IT                        VR  H            
Subjt:  AYKANPARIKEIQRQVEELMEK------------------------------RAINKIT------------------------VRYRH------------

Query:  --------------------------------------PIPRLDDLLDELH----------GDVLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHAL
                                              P+  L D                G VLMQ+K+ + YFSEKL+GA+LNYPTY+KEL+    AL
Subjt:  --------------------------------------PIPRLDDLLDELH----------GDVLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHAL

Query:  QLKGQSKLNRRNAKWIEFIE-------YKQDNDNVVADVLSRRYNLFTSLSAKILGFEHVKELYTDDTDFGMIYSSCLDGKIIDDYYVFHDFLFKKSKLC
         LKGQ KLNRR+A+W+EFIE       YKQ  +N+VAD LSRRY L ++L+AK+LGFE+VKELY +D DF  ++++C +      +Y    +LF++++LC
Subjt:  QLKGQSKLNRRNAKWIEFIE-------YKQDNDNVVADVLSRRYNLFTSLSAKILGFEHVKELYTDDTDFGMIYSSCLDGKIIDDYYVFHDFLFKKSKLC

Query:  V-------------------------------------PKCFEDL----------------------------------------------AKKGKDSIF
        V                                     PK   D+                                              ++KG+DSIF
Subjt:  V-------------------------------------PKCFEDL----------------------------------------------AKKGKDSIF

Query:  VVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIAT--------------------------------------------------ISKNVK
        VVVDRFSKM HFI C+KT+ A H++DLFFRE+VRLHG+P++I +                                                  I KN+K
Subjt:  VVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIAT--------------------------------------------------ISKNVK

Query:  SWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVS------------------DATTNKEQFPSQRKTKLHPRGDGPFQVM
        +W+D LPF++FAYNR +HSTT  S FE+VYGFNPLTPL LL +  N   S                       KE+FP++R++KLHPRGDGPFQV+
Subjt:  SWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVS------------------DATTNKEQFPSQRKTKLHPRGDGPFQVM

A0A5A7TLJ3 Reverse transcriptase3.8e-10031.78Show/hide
Query:  LREGVEN--EENEDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA-------------------------NSN--
        LR+G ++  E   + ETL+      GR    I EDE+DTM+RFL G+N+E+ + VDR   ++M+++ H A                         +SN  
Subjt:  LREGVEN--EENEDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA-------------------------NSN--

Query:  ------SGNVNFKPKGKIEFEKKDKVESSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINKKVKLSPMEKKEFEDVF-QDEAPKGLPPI
              + N + + +GK    +K + ESS  K K++ SKEVREK+  I+CWKC+G GH+++DC NKR M+ +   +      EF D+F  ++AP GLPP+
Subjt:  ------SGNVNFKPKGKIEFEKKDKVESSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINKKVKLSPMEKKEFEDVF-QDEAPKGLPPI

Query:  RGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELMEK------------------------------RAINKITVRYRHPIPRLDDLLDELH----
        RGIEH+I+FI GAT+PN  AY+ NP   KEIQRQVEELM+K                              RAINKITV+YRHPIPRLDD+LDELH    
Subjt:  RGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELMEK------------------------------RAINKITVRYRHPIPRLDDLLDELH----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------GDVLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQ-------
                                                             G VLMQ+K+ +M+FSEKL+GA LNY TY+KEL ALV AL+       
Subjt:  -----------------------------------------------------GDVLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQ-------

Query:  ---------------LKGQSKLNRRNAKWIEFIE-------YKQDNDNVVADVLSRRYNLFTSLSAKILGFEHVKELY-TDDTDFGMIYSSCLDGKIIDD
                       LKGQ+KLN+R+AKW+EFIE       YK+  DN+VAD LSRRY L +SLSAK+LGF+H+ ELY  + ++F  +Y+ CL+GK + D
Subjt:  ---------------LKGQSKLNRRNAKWIEFIE-------YKQDNDNVVADVLSRRYNLFTSLSAKILGFEHVKELY-TDDTDFGMIYSSCLDGKIIDD

Query:  YYVFHDFLFKKSKLCVPK------------------------------------------------CFEDLAKKGK------------------------
        Y VF   LF+K KLC+PK                                                CF+    K K                        
Subjt:  YYVFHDFLFKKSKLCVPK------------------------------------------------CFEDLAKKGK------------------------

Query:  -----------DSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIAT-----------------------------------------
                   DSIFVVVDRFSKM HFIPCNKT+ A ++++LFFREVVRLH IP+TI +                                         
Subjt:  -----------DSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIAT-----------------------------------------

Query:  ---------ISKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVSD-----------------------------ATTNK
                 ISKN+KSW+++LPFV+FAYNR IHSTTHCS FE++ G N        S +C   + +                                 K
Subjt:  ---------ISKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVSD-----------------------------ATTNK

Query:  EQFPSQRKTKLHPRGDGPFQVM
        E+FP QRK+KL  RGD PFQV+
Subjt:  EQFPSQRKTKLHPRGDGPFQVM

A0A6J1DWE9 LOW QUALITY PROTEIN: uncharacterized protein LOC1110241159.8e-9629.97Show/hide
Query:  QLDRMKVALRNHHQG-----EVLREGVENEEN--EDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA--------
        +L R +   R+ H+      +VLR+G ++ E+  ++ +TL+           +++ED ++ MARFL G+N+E+ ++VD Q YF+++E+LHLA        
Subjt:  QLDRMKVALRNHHQG-----EVLREGVENEEN--EDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA--------

Query:  ------------------NSNSGNVNFKPKGKIEFEK-KDKVE----SSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINK--------
                           ++S + + K + +  +EK K+K E    S K KEK+    E   ++  IKCWKCQG+GH +RDCPN+RTMI +        
Subjt:  ------------------NSNSGNVNFKPKGKIEFEK-KDKVE----SSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINK--------

Query:  -------------------------------------------------------------------------------------------KVKLS----
                                                                                                   +VK+S    
Subjt:  -------------------------------------------------------------------------------------------KVKLS----

Query:  -----------------PM-------------EKKEFEDVFQDEAPKGLPPIRGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELM---------
                         PM             +++EFED+F +E PK LPP+RGIE+KI+FI GATIPNRPAY+ NP   KEIQRQV+EL+         
Subjt:  -----------------PM-------------EKKEFEDVFQDEAPKGLPPIRGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELM---------

Query:  ----------------------------------------------EKRAINKITVRYRHPIPRLDDLLDELH---------------------------
                                                      E R+ + +   YR  I     L   L+                           
Subjt:  ----------------------------------------------EKRAINKITVRYRHPIPRLDDLLDELH---------------------------

Query:  -------------------------GDVLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQ----------------------LKGQSKLNRRNAK
                                 G VLMQ  K LM+FSEKLSGASL YPTY+KEL+ALV ALQ                      L+GQSKLNRR+AK
Subjt:  -------------------------GDVLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQ----------------------LKGQSKLNRRNAK

Query:  WIEFIE-------YKQDNDNVVADVLSRRYNLFTSLSAKILGFEHVKELYTDDTDFGMIYSSCLDGKIIDDYYVFHDFLFKKSKLCVPKC----------
        W+EFIE       YKQ  +N+VAD LSRRY LF +L+AKILGFEH+K LY DD  F     SC  G ++DDY +F  FLF+K KLC+P C          
Subjt:  WIEFIE-------YKQDNDNVVADVLSRRYNLFTSLSAKILGFEHVKELYTDDTDFGMIYSSCLDGKIIDDYYVFHDFLFKKSKLCVPKC----------

Query:  --------------FEDLA-----------------------------------------------------------KKGKDSIFVVVDRFSKMIHFIP
                      ++ LA                                                           +KG+DSIFVVVDRFSKM HFIP
Subjt:  --------------FEDLA-----------------------------------------------------------KKGKDSIFVVVDRFSKMIHFIP

Query:  CNKTNTAKHVSDLFFREVVRLHGIPKTI--------------------------------------------------ATISKNVKSWKDSLPFVKFAYN
        C+KT+ AK ++DLFFREVVRLHGIP+TI                                                  A I KN+K+W+D LP ++FAYN
Subjt:  CNKTNTAKHVSDLFFREVVRLHGIPKTI--------------------------------------------------ATISKNVKSWKDSLPFVKFAYN

Query:  RVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVS-DATT----------------------------------------------NKEQFPSQRKTKL
        R +H TT C+ FE+VYGFNP++PL LL +  N FVS DA+T                                               KE+FP+QR++KL
Subjt:  RVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVS-DATT----------------------------------------------NKEQFPSQRKTKL

Query:  HPRGDGPFQVM
         PRGDGPFQV+
Subjt:  HPRGDGPFQVM

A0A6J1DX46 LOW QUALITY PROTEIN: uncharacterized protein LOC1110252682.4e-9433.04Show/hide
Query:  EVLREGVENEEN--EDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA--------------------------NS
        +VLR+G ++ E+  ++ +TL+           +++ED ++ MARFL G+N+E+ ++VD Q YF+++E+LHLA                           +
Subjt:  EVLREGVENEEN--EDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA--------------------------NS

Query:  NSGNVNFKPKGKIEFEKKDKVE----SSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINK-----------------------------
        +S + + K + +  ++ K+K E    S KGKEKL     VR +   IKCWK QG+GH +RDCPN+RTMI +                             
Subjt:  NSGNVNFKPKGKIEFEKKDKVE----SSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINK-----------------------------

Query:  ----------------------------------------------------------------------KVKLS---------------------PM--
                                                                              +VK+S                     PM  
Subjt:  ----------------------------------------------------------------------KVKLS---------------------PM--

Query:  -----------EKKEFEDVFQDEAPKGLPPIRGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELMEKRAI-------------------------
                   +++EFEDVFQ++ PKGLPPIRGIEH+I+FI GATIPNRPAY+ANP   KEI+RQVEELMEK  +                         
Subjt:  -----------EKKEFEDVFQDEAPKGLPPIRGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELMEKRAI-------------------------

Query:  ----------------NKITVRYRHPIPR----------------------------LDDLL----------------DELHGD----------------
                         K+      P P+                            L+DL+                +EL G                 
Subjt:  ----------------NKITVRYRHPIPR----------------------------LDDLL----------------DELHGD----------------

Query:  -------------VLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQ----------------------LKGQSKLNRRNAKWIEFIE-------Y
                     VLMQD K LMYFSEKL+GASLNYPTY+KELFALV  LQ                      LKGQSKLNRR+AKW+EFIE       Y
Subjt:  -------------VLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQ----------------------LKGQSKLNRRNAKWIEFIE-------Y

Query:  KQDNDNVVADVLSRRYNLFTSLSAKILGFE-HVKELYTDDTDFGMIYSSCLDGKIIDDYYVFHD-FLFKKSKLCVPK----CFE----------------
        KQ  DNVVAD LSRRYNLF    +K+   +  +++L   +   G +        I   Y + H+ F + K K  V K    CF+                
Subjt:  KQDNDNVVADVLSRRYNLFTSLSAKILGFE-HVKELYTDDTDFGMIYSSCLDGKIIDDYYVFHD-FLFKKSKLCVPK----CFE----------------

Query:  -------------------DLAKKGKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIAT--------------------------
                              KKG+DSIFVVVD+FSKM HFI CNKT+ AKHV+DLFFREVVRLHGIP+TI +                          
Subjt:  -------------------DLAKKGKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIAT--------------------------

Query:  ------------------------ISKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVSDATTNKEQFPSQRKTKLHPR
                                IS+N+ SW++SLPFV+FAYNR IHSTTHCS FEVVYGFNPLTPL LL I  N FV       E+FPS+RKTKLHPR
Subjt:  ------------------------ISKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVSDATTNKEQFPSQRKTKLHPR

Query:  GDGPFQVM
        G GP+QV+
Subjt:  GDGPFQVM

A0A6J1I4Q5 LOW QUALITY PROTEIN: uncharacterized protein LOC1114709741.3e-9232.81Show/hide
Query:  IDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA------------------------------NSNSGNVNFKPKGKIEFEKKDKVESSKGKEK
        IDEDE DTMA FL G+NR+L +QVDRQ YFDM+ELLHLA                              N+N   ++FK  GK + +KK+K E+SK KEK
Subjt:  IDEDEDDTMARFLEGMNRELDNQVDRQNYFDMKELLHLA------------------------------NSNSGNVNFKPKGKIEFEKKDKVESSKGKEK

Query:  LDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINKK-------------------------------------------------------------
         +  KE++E+++ IKCWK QGIGH+ RDCPNKR MI K                                                              
Subjt:  LDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINKK-------------------------------------------------------------

Query:  --VKLSP---------------------------------------------------------------------------------MEKKEFEDVFQD
          VK +P                                                                                  +++EF+DVFQD
Subjt:  --VKLSP---------------------------------------------------------------------------------MEKKEFEDVFQD

Query:  EAPKGLPPIRGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELMEKRAI-----------------------------------------------
        E PKGLPPIRGIEHKI+FI G  IPN+PAY A+P   KEI+R++EELMEK  I                                               
Subjt:  EAPKGLPPIRGIEHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELMEKRAI-----------------------------------------------

Query:  ---------NKITVRYRHPIPR---------------LDDLL----------------DELH-----------------------------GDVLMQDKK
                 N   VR  H +                 L+DL+                +EL                              G VLMQD K
Subjt:  ---------NKITVRYRHPIPR---------------LDDLL----------------DELH-----------------------------GDVLMQDKK

Query:  SLMYFSEKLSGASLNYPTYNKELFALVHALQ----------------------LKGQSKLNRRNAKWIEFIE-------YKQDNDNVVADVLSRRYNLFT
         +MYFSEKL+GA+LNYPTY+KELFALV  LQ                      L GQSKLNRR+ KW+EFIE       YK   DN+VAD LS  +    
Subjt:  SLMYFSEKLSGASLNYPTYNKELFALVHALQ----------------------LKGQSKLNRRNAKWIEFIE-------YKQDNDNVVADVLSRRYNLFT

Query:  SLSAKILGFEHVKELYTDDT-------------DFGMIYSSCLDGKIIDDYYVFHDFLFK----KSKLCVPKCFEDL--------------------AKK
          S   +    +++L   +               + M++      K+  D + F    FK    KS+      +  L                     KK
Subjt:  SLSAKILGFEHVKELYTDDT-------------DFGMIYSSCLDGKIIDDYYVFHDFLFK----KSKLCVPKCFEDL--------------------AKK

Query:  GKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTI-------------------------------------------------ATI
        GKDSIFVV DRFSKM HFI CNKT+ AKHV DLFFREVVRLHGIP+ I                                                 A I
Subjt:  GKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTI-------------------------------------------------ATI

Query:  SKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVSDATTNKEQFPSQRKTKL----HPRGDGPFQVM
        SKN+KSW + LPFV+FAYNR IHSTTHCS FEVVYGFNPLTPL LL     V++      KE+FPSQRKTKL    HPRGD P++V+
Subjt:  SKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVSDATTNKEQFPSQRKTKL----HPRGDGPFQVM

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.5e-0829.09Show/hide
Query:  GKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIATISKNV---KSWKD-----------SLPF----------------------
        G +++FVVVDRFSKM   +PC K+ TA+  + +F + V+   G PK I   + ++   ++WKD           SLP+                      
Subjt:  GKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIATISKNV---KSWKD-----------SLPF----------------------

Query:  --------------VKFAYNRVIHSTTHCSSFEVVYGFNP-LTPLYLLSIICNVFVSDATTNKEQ
                      V+ +YN  IHS T  + FE+V+ ++P L+PL L S       SD T    Q
Subjt:  --------------VKFAYNRVIHSTTHCSSFEVVYGFNP-LTPLYLLSIICNVFVSDATTNKEQ

P0CT35 Transposon Tf2-2 polyprotein1.5e-0829.09Show/hide
Query:  GKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIATISKNV---KSWKD-----------SLPF----------------------
        G +++FVVVDRFSKM   +PC K+ TA+  + +F + V+   G PK I   + ++   ++WKD           SLP+                      
Subjt:  GKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIATISKNV---KSWKD-----------SLPF----------------------

Query:  --------------VKFAYNRVIHSTTHCSSFEVVYGFNP-LTPLYLLSIICNVFVSDATTNKEQ
                      V+ +YN  IHS T  + FE+V+ ++P L+PL L S       SD T    Q
Subjt:  --------------VKFAYNRVIHSTTHCSSFEVVYGFNP-LTPLYLLSIICNVFVSDATTNKEQ

P0CT36 Transposon Tf2-3 polyprotein1.5e-0829.09Show/hide
Query:  GKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIATISKNV---KSWKD-----------SLPF----------------------
        G +++FVVVDRFSKM   +PC K+ TA+  + +F + V+   G PK I   + ++   ++WKD           SLP+                      
Subjt:  GKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIATISKNV---KSWKD-----------SLPF----------------------

Query:  --------------VKFAYNRVIHSTTHCSSFEVVYGFNP-LTPLYLLSIICNVFVSDATTNKEQ
                      V+ +YN  IHS T  + FE+V+ ++P L+PL L S       SD T    Q
Subjt:  --------------VKFAYNRVIHSTTHCSSFEVVYGFNP-LTPLYLLSIICNVFVSDATTNKEQ

P0CT41 Transposon Tf2-12 polyprotein1.5e-0829.09Show/hide
Query:  GKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIATISKNV---KSWKD-----------SLPF----------------------
        G +++FVVVDRFSKM   +PC K+ TA+  + +F + V+   G PK I   + ++   ++WKD           SLP+                      
Subjt:  GKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIATISKNV---KSWKD-----------SLPF----------------------

Query:  --------------VKFAYNRVIHSTTHCSSFEVVYGFNP-LTPLYLLSIICNVFVSDATTNKEQ
                      V+ +YN  IHS T  + FE+V+ ++P L+PL L S       SD T    Q
Subjt:  --------------VKFAYNRVIHSTTHCSSFEVVYGFNP-LTPLYLLSIICNVFVSDATTNKEQ

Q9UR07 Transposon Tf2-11 polyprotein1.5e-0829.09Show/hide
Query:  GKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIATISKNV---KSWKD-----------SLPF----------------------
        G +++FVVVDRFSKM   +PC K+ TA+  + +F + V+   G PK I   + ++   ++WKD           SLP+                      
Subjt:  GKDSIFVVVDRFSKMIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIATISKNV---KSWKD-----------SLPF----------------------

Query:  --------------VKFAYNRVIHSTTHCSSFEVVYGFNP-LTPLYLLSIICNVFVSDATTNKEQ
                      V+ +YN  IHS T  + FE+V+ ++P L+PL L S       SD T    Q
Subjt:  --------------VKFAYNRVIHSTTHCSSFEVVYGFNP-LTPLYLLSIICNVFVSDATTNKEQ

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCAAAAAGTGGAAAATCGTGGGGAGATTAATGAAGCTAGGAAGATAGAAGCACAAGGAGGAGCAATTGTTCATTTAACTCAAGTAATTAATGACTTGACAAATCA
ATTAGATAGGATGAAGGTAGCATTAAGGAATCATCACCAAGGGGAAGTCTTGAGGGAAGGAGTTGAAAATGAAGAGAATGAGGATAATGAAACCCTTTTAGCCGGACAAA
ATCCAAGAGGGAGGAGAGTAACCAATATTGATGAAGATGAAGATGATACAATGGCAAGATTCCTTGAAGGAATGAACCGAGAATTGGACAACCAAGTGGATAGACAAAAT
TACTTTGATATGAAAGAGTTGTTACATCTTGCTAATTCAAATAGTGGAAATGTGAATTTCAAACCTAAAGGAAAGATTGAGTTTGAAAAGAAAGATAAGGTTGAGAGCTC
CAAGGGTAAAGAAAAATTGGATGGATCGAAGGAAGTTCGAGAAAAGAGTCAATACATAAAATGTTGGAAATGCCAAGGGATTGGACATATTGCACGTGATTGTCCAAACA
AAAGAACCATGATCAATAAAAAGGTGAAGTTGTCACCAATGGAGAAGAAAGAATTTGAAGATGTATTTCAAGATGAAGCTCCTAAAGGATTACCACCTATTAGAGGTATT
GAGCACAAGATAGAGTTCATTCATGGGGCTACCATTCCAAATAGACCAGCCTATAAAGCAAATCCAGCTAGGATAAAAGAAATTCAAAGGCAAGTGGAGGAGCTCATGGA
GAAGAGGGCCATCAACAAAATCACTGTAAGGTATCGTCATCCTATTCCTAGGCTAGATGATCTGCTTGATGAATTGCATGGTGATGTTTTGATGCAAGACAAAAAGTCTT
TAATGTATTTTAGTGAAAAACTGAGTGGGGCAAGTTTGAACTACCCTACTTATAATAAAGAATTGTTTGCTCTCGTTCATGCCTTGCAGCTTAAGGGACAAAGCAAACTG
AATCGTAGGAACGCAAAGTGGATTGAGTTCATTGAATATAAGCAAGATAACGACAATGTAGTAGCTGATGTTTTGTCAAGAAGGTACAATCTCTTTACTTCTCTTAGTGC
AAAAATTCTTGGATTTGAACATGTTAAAGAGTTATATACAGATGACACTGATTTTGGAATGATTTATTCATCTTGTCTTGATGGAAAAATTATTGATGATTATTATGTGT
TTCATGATTTTTTGTTCAAGAAAAGTAAGCTTTGTGTTCCTAAGTGTTTTGAGGATCTTGCTAAAAAGGGTAAGGATAGTATTTTTGTTGTTGTGGACAGATTTAGTAAA
ATGATACATTTTATACCATGTAACAAAACTAATACTGCTAAGCATGTGTCTGACTTGTTCTTTAGAGAGGTTGTAAGGTTGCATGGAATTCCTAAAACTATTGCAACTAT
TTCAAAGAACGTTAAGTCTTGGAAGGATTCTTTACCATTTGTTAAATTTGCATATAATAGAGTGATTCATAGTACCACTCATTGTTCTTCTTTTGAAGTTGTTTATGGGT
TTAACCCACTTACACCTCTTTATTTGCTGTCTATTATCTGTAATGTGTTTGTAAGTGATGCAACTACAAACAAGGAGCAATTTCCGTCTCAAAGGAAAACAAAGTTACAT
CCAAGAGGGGATGGACCTTTTCAAGTGATGAATTTCTTGATTAGAGGACGAATCCTTTTGAAGAGAGGGAAAATGATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATCAAAAAGTGGAAAATCGTGGGGAGATTAATGAAGCTAGGAAGATAGAAGCACAAGGAGGAGCAATTGTTCATTTAACTCAAGTAATTAATGACTTGACAAATCA
ATTAGATAGGATGAAGGTAGCATTAAGGAATCATCACCAAGGGGAAGTCTTGAGGGAAGGAGTTGAAAATGAAGAGAATGAGGATAATGAAACCCTTTTAGCCGGACAAA
ATCCAAGAGGGAGGAGAGTAACCAATATTGATGAAGATGAAGATGATACAATGGCAAGATTCCTTGAAGGAATGAACCGAGAATTGGACAACCAAGTGGATAGACAAAAT
TACTTTGATATGAAAGAGTTGTTACATCTTGCTAATTCAAATAGTGGAAATGTGAATTTCAAACCTAAAGGAAAGATTGAGTTTGAAAAGAAAGATAAGGTTGAGAGCTC
CAAGGGTAAAGAAAAATTGGATGGATCGAAGGAAGTTCGAGAAAAGAGTCAATACATAAAATGTTGGAAATGCCAAGGGATTGGACATATTGCACGTGATTGTCCAAACA
AAAGAACCATGATCAATAAAAAGGTGAAGTTGTCACCAATGGAGAAGAAAGAATTTGAAGATGTATTTCAAGATGAAGCTCCTAAAGGATTACCACCTATTAGAGGTATT
GAGCACAAGATAGAGTTCATTCATGGGGCTACCATTCCAAATAGACCAGCCTATAAAGCAAATCCAGCTAGGATAAAAGAAATTCAAAGGCAAGTGGAGGAGCTCATGGA
GAAGAGGGCCATCAACAAAATCACTGTAAGGTATCGTCATCCTATTCCTAGGCTAGATGATCTGCTTGATGAATTGCATGGTGATGTTTTGATGCAAGACAAAAAGTCTT
TAATGTATTTTAGTGAAAAACTGAGTGGGGCAAGTTTGAACTACCCTACTTATAATAAAGAATTGTTTGCTCTCGTTCATGCCTTGCAGCTTAAGGGACAAAGCAAACTG
AATCGTAGGAACGCAAAGTGGATTGAGTTCATTGAATATAAGCAAGATAACGACAATGTAGTAGCTGATGTTTTGTCAAGAAGGTACAATCTCTTTACTTCTCTTAGTGC
AAAAATTCTTGGATTTGAACATGTTAAAGAGTTATATACAGATGACACTGATTTTGGAATGATTTATTCATCTTGTCTTGATGGAAAAATTATTGATGATTATTATGTGT
TTCATGATTTTTTGTTCAAGAAAAGTAAGCTTTGTGTTCCTAAGTGTTTTGAGGATCTTGCTAAAAAGGGTAAGGATAGTATTTTTGTTGTTGTGGACAGATTTAGTAAA
ATGATACATTTTATACCATGTAACAAAACTAATACTGCTAAGCATGTGTCTGACTTGTTCTTTAGAGAGGTTGTAAGGTTGCATGGAATTCCTAAAACTATTGCAACTAT
TTCAAAGAACGTTAAGTCTTGGAAGGATTCTTTACCATTTGTTAAATTTGCATATAATAGAGTGATTCATAGTACCACTCATTGTTCTTCTTTTGAAGTTGTTTATGGGT
TTAACCCACTTACACCTCTTTATTTGCTGTCTATTATCTGTAATGTGTTTGTAAGTGATGCAACTACAAACAAGGAGCAATTTCCGTCTCAAAGGAAAACAAAGTTACAT
CCAAGAGGGGATGGACCTTTTCAAGTGATGAATTTCTTGATTAGAGGACGAATCCTTTTGAAGAGAGGGAAAATGATGTGA
Protein sequenceShow/hide protein sequence
MNQKVENRGEINEARKIEAQGGAIVHLTQVINDLTNQLDRMKVALRNHHQGEVLREGVENEENEDNETLLAGQNPRGRRVTNIDEDEDDTMARFLEGMNRELDNQVDRQN
YFDMKELLHLANSNSGNVNFKPKGKIEFEKKDKVESSKGKEKLDGSKEVREKSQYIKCWKCQGIGHIARDCPNKRTMINKKVKLSPMEKKEFEDVFQDEAPKGLPPIRGI
EHKIEFIHGATIPNRPAYKANPARIKEIQRQVEELMEKRAINKITVRYRHPIPRLDDLLDELHGDVLMQDKKSLMYFSEKLSGASLNYPTYNKELFALVHALQLKGQSKL
NRRNAKWIEFIEYKQDNDNVVADVLSRRYNLFTSLSAKILGFEHVKELYTDDTDFGMIYSSCLDGKIIDDYYVFHDFLFKKSKLCVPKCFEDLAKKGKDSIFVVVDRFSK
MIHFIPCNKTNTAKHVSDLFFREVVRLHGIPKTIATISKNVKSWKDSLPFVKFAYNRVIHSTTHCSSFEVVYGFNPLTPLYLLSIICNVFVSDATTNKEQFPSQRKTKLH
PRGDGPFQVMNFLIRGRILLKRGKMM