; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G09050 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G09050
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationClcChr06:11379226..11382716
RNA-Seq ExpressionClc06G09050
SyntenyClc06G09050
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]8.8e-10040.67Show/hide
Query:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA
        +++E YYTKL TIWQ L+E+R T +CTCGG+K F+DHL+SE++M FLMGLN+ Y  +RAQIL+M P+PSI   F L+IQEE QRS        D +AL  
Subjt:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA

Query:  TTENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLH--------------------------------------------------------------
         +  A  T+++RKK  +RP C+ CGIKGH+ DKCYK H                                                              
Subjt:  TTENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLH--------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------ALNFCDDYCTIQDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSLPS-SLLKLTCHVCPL
                 +L+F    C IQD     MIGKA+ ++ LY+LN   ++N   A     AIS++TWH RLGHLSPKCLS L  TL L + S+   +CHVCPL
Subjt:  ---------ALNFCDDYCTIQDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSLPS-SLLKLTCHVCPL

Query:  AKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDLFA
        AKQ +LSF+SNN+VA + FDLVH D WGPFK P+Y GY+YF T+VDDC  FTW Y++R KSD L+I+P+FF L+ET FSK IK FRSDNAP+L+ T+ FA
Subjt:  AKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDLFA

Query:  TKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLF
         KGT+HQFSCVE+PQQNSVVERKHQHLLNVARAL F
Subjt:  TKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLF

KAA0068025.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]1.6e-6941.02Show/hide
Query:  VYYTKLITIWQELSEHRPT---QECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMAT
        +Y+TK  T+  EL+ +RP      C CGG     + L +E++M FLMGLN+ Y   R Q+L+M P+PSI++ F L++QEE QR++ +     +  A    
Subjt:  VYYTKLITIWQELSEHRPT---QECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMAT

Query:  TENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHALNFCDDYCTIQDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLS
                                                       + D+ + K IG A     LY+L   D      +     A     WH R+GH S
Subjt:  TENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHALNFCDDYCTIQDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLS

Query:  PKCLSLLKDTLSL-PSSLLKLT-CHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFF
         + L+ L+  L L PS++ K++ C +CPLAKQ KLSF SNNH++ N FDL+H D WGPF   TY+ Y YF TIVDD + +TW ++++ KSD + IIP FF
Subjt:  PKCLSLLKDTLSL-PSSLLKLT-CHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFF

Query:  ALVETHFSKTIKIFRSDNAPKLRFTDLFATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRF
         L+ET + K IK  RSDNA KL+FT  F  KG IHQ+SCV+ PQQNSVVE+KHQH+LN ARAL FQS++P+ F
Subjt:  ALVETHFSKTIKIFRSDNAPKLRFTDLFATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRF

KAA0068025.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]1.4e-1754.64Show/hide
Query:  VQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFL
        +QL+ + G L AKP   P     KL A+    L+  DA+ YRRLIGRLLYL ISRPD++FAVHKLSQ++AKP  +H+ AA++L++YLKG+ G+GI L
Subjt:  VQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFL

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]8.6e-7132.55Show/hide
Query:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQ--------V
        M V  YYTKL T+W EL +++PT  CTCG ++ + ++ + E VM FLMGLN+ Y  +RAQ+L++ P+P+I K F LVIQEE QRS+  +V         +
Subjt:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQ--------V

Query:  TDAMALMATTENAKRTNQSRKKD-SQRPICTNCGIKGHVIDKCYKLH-----------------------------------------------------
           +   A T  + RT+Q+ K     R IC++C  + H +DKCYKLH                                                     
Subjt:  TDAMALMATTENAKRTNQSRKKD-SQRPICTNCGIKGHVIDKCYKLH-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------ALNFCDDYCTIQDRLSLKMIGKANNKHELYLL----NFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSLPS
                          +++F  D C IQD   ++MIG       LY+L     F+ S   +T  + S     E WH R+GH S   LS LK+ L++ +
Subjt:  ------------------ALNFCDDYCTIQDRLSLKMIGKANNKHELYLL----NFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSLPS

Query:  SLLKLTCHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSD
        + +   CH C L+KQ +L   S N+++  IF+L+H DTWGPF   + +G+R+FFTIVDD S +TW Y+++SKSD L I P F  +V T F  T+K  RSD
Subjt:  SLLKLTCHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSD

Query:  NAPKLRFTDLFATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRF
        NAP+L F D FA  G  H  SCVERPQQNSVVERKHQH+LNVARALLFQS IP+ +
Subjt:  NAPKLRFTDLFATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRF

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]3.5e-1649.07Show/hide
Query:  GVLFLYLTVIVQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGT
        GV        + L+   G L  KP   P   + KL   +G  L+  D +SYRRLIGRLLYL I+RPD+ FAV+KLSQYV+ P   H+ AA N+L+Y+KGT
Subjt:  GVLFLYLTVIVQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGT

Query:  AGQGIFLS
         GQG+F S
Subjt:  AGQGIFLS

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]1.6e-6932.78Show/hide
Query:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNV---QVTDAMA
        M +  YYTKL  +W EL +++PT  C CG ++ ++ + + E VM FLMGLNE Y  +RAQ+L+M P+P I+K F LV+QEE QRS+ + V    +  +++
Subjt:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNV---QVTDAMA

Query:  L-MATTENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLH----------------------------------------------------------
        L   T+    R +++ + D  + +C++C  + H +DKCYKLH                                                          
Subjt:  L-MATTENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLH----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------ALNFCDDYCTIQDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSLPSSLLKL-TCHVCP
                  +++F  + C IQD    KMIG       LY+L  + S+++   A++      E WH R+GH SP  LS LKD L   S+ + +  CHVC 
Subjt:  ----------ALNFCDDYCTIQDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSLPSSLLKL-TCHVCP

Query:  LAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDLF
        L+KQ +L F SNN + D+ F+L+H D WGPF   + +GYR+F TIVDD + FTW YL+RSKSD   I P F  +V+T F   IK  RSDNAP+L F D F
Subjt:  LAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDLF

Query:  ATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPI
           G +H +SCVERPQQNSVVERKHQH+LNVARAL+FQS IPI
Subjt:  ATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPI

XP_022861542.1 uncharacterized protein LOC111381922 [Olea europaea var. sylvestris]3.7e-7439.86Show/hide
Query:  IWQELSEHRPT---QECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA---------
        +W+EL+  RP     +CTCGG+K    +   E++M FLMGL+E ++  R QIL+M P+P I K F L+ QEE+QR +      +DA    A         
Subjt:  IWQELSEHRPT---QECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA---------

Query:  TTENAKRTNQSRK-------KDSQRPICTNCGIKGHVIDKCYKLHAL----------NFCDDYCTIQ---------------------------------
        +T N +  N S +       + ++RP CT+C + GH I KCYK+H            NF      IQ                                 
Subjt:  TTENAKRTNQSRK-------KDSQRPICTNCGIKGHVIDKCYKLHAL----------NFCDDYCTIQ---------------------------------

Query:  -----DRLSLKMIGKANNKHELYLLN---FVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKD--TLSLPSSLLKLTCHVCPLAKQHKLSFNSNN
             +  + KMIG+ +   +LY+L+   F   SN      ++       WH+RLG+LS K L  LK+  T  +      L C++CP+AKQ +LSF SNN
Subjt:  -----DRLSLKMIGKANNKHELYLLN---FVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKD--TLSLPSSLLKLTCHVCPLAKQHKLSFNSNN

Query:  HVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDLFATKGTIHQFSCVE
        +++   FDL+HCD WGP+  P++ GYRYFFT+VDD S FT  Y++R K DA+ ++ RFF ++ T ++  IK FRSDNA +L F + FA KG +HQFSCVE
Subjt:  HVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDLFATKGTIHQFSCVE

Query:  RPQQNSVVERKHQHLLNVARALLF
        RPQQNSVVE KHQHLLNVARAL F
Subjt:  RPQQNSVVERKHQHLLNVARALLF

TrEMBL top hitse value%identityAlignment
A0A2N9FB96 Integrase catalytic domain-containing protein5.0e-1639.6Show/hide
Query:  VESNEATNLPHNITTLGEWLWMKKLMPWK--------EQELGVLFLYLTVIVQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLL
        + SN+A ++      L +   +K L P K          + G+        + ++ + GFLAAKP   P    LKL+   G  L   D S YRRLIGRLL
Subjt:  VESNEATNLPHNITTLGEWLWMKKLMPWK--------EQELGVLFLYLTVIVQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLL

Query:  YLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFLS
        YL ++RPD+S++V  LSQ++A+P   HL AAH +L+YLK + GQG+F S
Subjt:  YLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFLS

A0A2N9FB96 Integrase catalytic domain-containing protein4.1e-7132.55Show/hide
Query:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQ--------V
        M V  YYTKL T+W EL +++PT  CTCG ++ + ++ + E VM FLMGLN+ Y  +RAQ+L++ P+P+I K F LVIQEE QRS+  +V         +
Subjt:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQ--------V

Query:  TDAMALMATTENAKRTNQSRKKD-SQRPICTNCGIKGHVIDKCYKLH-----------------------------------------------------
           +   A T  + RT+Q+ K     R IC++C  + H +DKCYKLH                                                     
Subjt:  TDAMALMATTENAKRTNQSRKKD-SQRPICTNCGIKGHVIDKCYKLH-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------ALNFCDDYCTIQDRLSLKMIGKANNKHELYLL----NFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSLPS
                          +++F  D C IQD   ++MIG       LY+L     F+ S   +T  + S     E WH R+GH S   LS LK+ L++ +
Subjt:  ------------------ALNFCDDYCTIQDRLSLKMIGKANNKHELYLL----NFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSLPS

Query:  SLLKLTCHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSD
        + +   CH C L+KQ +L   S N+++  IF+L+H DTWGPF   + +G+R+FFTIVDD S +TW Y+++SKSD L I P F  +V T F  T+K  RSD
Subjt:  SLLKLTCHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSD

Query:  NAPKLRFTDLFATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRF
        NAP+L F D FA  G  H  SCVERPQQNSVVERKHQH+LNVARALLFQS IP+ +
Subjt:  NAPKLRFTDLFATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRF

A0A2N9HKE6 Uncharacterized protein9.5e-7636.82Show/hide
Query:  SVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVR--NNVQVTDAMALM
        SV  YYT+L ++W ELS  RP  +C+CG +K  LD+   E+VM FLMGLN+ ++ +RAQIL+  P+PSITK F LVIQEE QR++   +     D++AL 
Subjt:  SVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVR--NNVQVTDAMALM

Query:  ATTENAKRT---NQSRKKDSQRPICTNCGIKGHVIDKCYKLHA-------------------------LNFCDDYC------------------------
           E  +     NQS KKD  RPIC++CGI GH +DKCYKLH                          L F    C                        
Subjt:  ATTENAKRT---NQSRKKDSQRPICTNCGIKGHVIDKCYKLHA-------------------------LNFCDDYC------------------------

Query:  --------------------------------------------------------------TIQ---------DRLSLKMIGKANNKHELYLL----NF
                                                                      T+Q         D ++ K IG    K+ LY L    + 
Subjt:  --------------------------------------------------------------TIQ---------DRLSLKMIGKANNKHELYLL----NF

Query:  VDSSNHHTAAALSCAIS---IETWHHRLGHLSPKCLSLLKDTLS---LPSSLLKLTCHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGY
        V SS+  + AA +   +    + WHHRLGH S   LSLLK+ +S   +PS+     C VC ++KQ +L F++  H AD  FDL+HCD WGP+  PT +  
Subjt:  VDSSNHHTAAALSCAIS---IETWHHRLGHLSPKCLSLLKDTLS---LPSSLLKLTCHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGY

Query:  RYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDLFATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQS
        RYF TIVDDC+  TW +LM+ KS+   +I  FFAL++T FS +IK+ RSDN P+ +    +A  GT+HQ SCV  PQQN+ VERKHQHLL VARAL FQ+
Subjt:  RYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDLFATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQS

Query:  RIPIRFLEFLDVYAMH
         +P+ F  +  + A H
Subjt:  RIPIRFLEFLDVYAMH

A0A2N9HKE6 Uncharacterized protein1.4e-1346.88Show/hide
Query:  VQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIF
        ++++ + G L  KP   P   +LKL+   G PL L D + YRRLIGRL+YL ++RPD+ FAVHKLSQ++  P   H  AA ++L+Y+KG   QG+F
Subjt:  VQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIF

A0A2N9HKE6 Uncharacterized protein4.9e-7237.8Show/hide
Query:  MSVEVYYTKLITIWQELSEHRPTQECT------CGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSV-RNNVQVT
        +SV  YYTKL   W+EL  +RP   CT      CG ++  +D+     +M FLMGLNE +T +R QIL+M PMP I K F L+ QEE QRS+ +  +   
Subjt:  MSVEVYYTKLITIWQELSEHRPTQECT------CGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSV-RNNVQVT

Query:  DAMALMATTE-----NAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHAL------------------------------------------------
        ++ AL+  +E      AK+  Q R K    P C +CG  GH +DKCYK+H                                                  
Subjt:  DAMALMATTE-----NAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLHAL------------------------------------------------

Query:  ----NFCDDYCTIQDRLSLKMIGKANNKHELYLL-------------------NFVDSSNHHTA--AALSCA----ISIETWHHRLGHLSPKCLSLLKDT
            +   DYC IQ     +MIG     + LY+L                    F   S+++T+  A++ C      +++ WH+RLGH S   L  L   
Subjt:  ----NFCDDYCTIQDRLSLKMIGKANNKHELYLL-------------------NFVDSSNHHTA--AALSCA----ISIETWHHRLGHLSPKCLSLLKDT

Query:  LSLPSSLLKLT--CHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKT
        +    ++ K T  C+VCPLAKQ ++SF +  H+    F+L+HCD WGP+  PT  G++YF TIVDD S  TW YLM SKSD   ++  FF +VET F   
Subjt:  LSLPSSLLKLT--CHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKT

Query:  IKIFRSDNAPKLRFTDLFATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRF
        IK  RSD   +   +D F+ KG IHQ SC + PQQNSVVERKHQHLLNVARA+ FQS +P +F
Subjt:  IKIFRSDNAPKLRFTDLFATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRF

A0A2Z7AT15 Cysteine-rich RLK (Receptor-like protein kinase) 81.7e-1649.07Show/hide
Query:  GVLFLYLTVIVQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGT
        GV        + L+   G L  KP   P   + KL   +G  L+  D +SYRRLIGRLLYL I+RPD+ FAV+KLSQYV+ P   H+ AA N+L+Y+KGT
Subjt:  GVLFLYLTVIVQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGT

Query:  AGQGIFLS
         GQG+F S
Subjt:  AGQGIFLS

A0A2Z7AT15 Cysteine-rich RLK (Receptor-like protein kinase) 84.6e-7038.3Show/hide
Query:  MSVEVYYTKLITIWQELSEHRPTQECTCGG------IKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRS---------
        M V  Y+T+L  +W E   +RP   CTCG        ++ +D+   ++V  FLMGLN+ +  +R QIL+M P+P+I K F L+  +E QR          
Subjt:  MSVEVYYTKLITIWQELSEHRPTQECTCGG------IKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRS---------

Query:  --------VRNNVQV------TDAMALMATTENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLH----ALNFCDDYCTIQDRLSLKMI---GKANNK
                + N +        T + A    T+N K+  Q  +KD    IC++CG KGH  +KCYKLH              + +++S   +     A+N 
Subjt:  --------VRNNVQV------TDAMALMATTENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLH----ALNFCDDYCTIQDRLSLKMI---GKANNK

Query:  HEL-----------YLLNFV-----------DSSNHHTAAALSCAISIETWH----HRLGHLSPKCLSLLKDTL--SLPSSLLKLTCHVCPLAKQHKLSF
          +            LLN +           DS NH  A ++S   ++   H      LGH S   +  L   +  +  SS    TC VCPLAKQ KL F
Subjt:  HEL-----------YLLNFV-----------DSSNHHTAAALSCAISIETWH----HRLGHLSPKCLSLLKDTL--SLPSSLLKLTCHVCPLAKQHKLSF

Query:  NSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDLFATKGTIHQF
         +NNH++ N FDL+H D WGP+  PT  GYRYF T+VDDC+  TW YLM+SKSD   ++  F  ++ T F   IK  RSDN  +    D +A+KG IHQ 
Subjt:  NSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDLFATKGTIHQF

Query:  SCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRF
        SCVE PQQNSVVERKHQHLLNVARAL FQS +P+++
Subjt:  SCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRF

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 84.2e-10040.67Show/hide
Query:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA
        +++E YYTKL TIWQ L+E+R T +CTCGG+K F+DHL+SE++M FLMGLN+ Y  +RAQIL+M P+PSI   F L+IQEE QRS        D +AL  
Subjt:  MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMA

Query:  TTENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLH--------------------------------------------------------------
         +  A  T+++RKK  +RP C+ CGIKGH+ DKCYK H                                                              
Subjt:  TTENAKRTNQSRKKDSQRPICTNCGIKGHVIDKCYKLH--------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------ALNFCDDYCTIQDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSLPS-SLLKLTCHVCPL
                 +L+F    C IQD     MIGKA+ ++ LY+LN   ++N   A     AIS++TWH RLGHLSPKCLS L  TL L + S+   +CHVCPL
Subjt:  ---------ALNFCDDYCTIQDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSLPS-SLLKLTCHVCPL

Query:  AKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDLFA
        AKQ +LSF+SNN+VA + FDLVH D WGPFK P+Y GY+YF T+VDDC  FTW Y++R KSD L+I+P+FF L+ET FSK IK FRSDNAP+L+ T+ FA
Subjt:  AKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDLFA

Query:  TKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLF
         KGT+HQFSCVE+PQQNSVVERKHQHLLNVARAL F
Subjt:  TKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLF

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.6e-1127.98Show/hide
Query:  WHHRLGHLSP-KCLSLLKDTLSLPSSL---LKLTCHVCP---LAKQHKLSF---NSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWT
        WH R GH+S  K L + +  +    SL   L+L+C +C      KQ +L F       H+   +F +VH D  GP    T +   YF   VD  + +  T
Subjt:  WHHRLGHLSP-KCLSLLKDTLSLPSSL---LKLTCHVCP---LAKQHKLSF---NSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWT

Query:  YLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDL---FATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRF
        YL++ KSD   +   F A  E HF+  +     DN  +    ++      KG  +  +    PQ N V ER  + +   AR ++  +++   F
Subjt:  YLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDL---FATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-2532.11Show/hide
Query:  ISIETWHHRLGHLSPKCLSLL--KDTLSLPSSLLKLTCHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLM
        IS++ WH R+GH+S K L +L  K  +S         C  C   KQH++SF +++    NI DLV+ D  GP +  +  G +YF T +DD S   W Y++
Subjt:  ISIETWHHRLGHLSPKCLSLL--KDTLSLPSSLLKLTCHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLM

Query:  RSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKL---RFTDLFATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRF
        ++K     +  +F ALVE    + +K  RSDN  +     F +  ++ G  H+ +    PQ N V ER ++ ++   R++L  +++P  F
Subjt:  RSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKL---RFTDLFATKGTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRF

P92519 Uncharacterized mitochondrial protein AtMg008102.8e-0835.05Show/hide
Query:  QLVANGGFLAAKPALAPFPFDLKLT-ATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFL
        Q++ N G L  KP   P P  L  + +TA  P    D S +R ++G L YL ++RPD+S+AV+ + Q + +P          +LRY+KGT   G+++
Subjt:  QLVANGGFLAAKPALAPFPFDLKLT-ATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-1627.19Show/hide
Query:  KANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSL----PSSLLKLTCHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTW
        +   K ELY      S      A+ S   +  +WH RLGH +P  L+ +    SL    PS    L+C  C + K +K+ F+ +   +    + ++ D W
Subjt:  KANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSL----PSSLLKLTCHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTW

Query:  GPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPK-LRFTDLFATKGTIHQFSCVERPQQNSVVERKHQH
              +++ YRY+   VD  + +TW Y ++ KS        F  L+E  F   I  F SDN  + +   + F+  G  H  S    P+ N + ERKH+H
Subjt:  GPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPK-LRFTDLFATKGTIHQFSCVERPQQNSVVERKHQH

Query:  LLNVARALLFQSRIPIRFLEFLDVYAMH
        ++     LL  + IP  +  +    A++
Subjt:  LLNVARALLFQSRIPIRFLEFLDVYAMH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.8e-1341.84Show/hide
Query:  IVQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFL
        I+ L+A    + AKP   P     KL+  +G    L D + YR ++G L YL  +RPD+S+AV++LSQ++  P  +HL A   +LRYL GT   GIFL
Subjt:  IVQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.7e-1628.04Show/hide
Query:  KANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSLP---SSLLKLTCHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWG
        +   K ELY      S      A+     +  +WH RLGH S   L+ +    SLP    S   L+C  C + K HK+ F+++   +    + ++ D W 
Subjt:  KANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSLP---SSLLKLTCHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWG

Query:  PFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPK-LRFTDLFATKGTIHQFSCVERPQQNSVVERKHQHL
             + + YRY+   VD  + +TW Y ++ KS        F +LVE  F   I    SDN  + +   D  +  G  H  S    P+ N + ERKH+H+
Subjt:  PFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPK-LRFTDLFATKGTIHQFSCVERPQQNSVVERKHQHL

Query:  LNVARALLFQSRIP
        + +   LL  + +P
Subjt:  LNVARALLFQSRIP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.7e-1344.33Show/hide
Query:  VQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFL
        + L+A    L AKP   P     KLT  +G    L D + YR ++G L YL  +RPD+S+AV++LSQY+  P  DH  A   +LRYL GT   GIFL
Subjt:  VQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFL

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.7e-0636.78Show/hide
Query:  SVEVYYTKLITIWQELSEHRPTQECTCGG-----IKSFLDHLDSEFVMIFLMG--LNEIYTILRAQILVMSPMPSITKTFPLVIQEE
        SVE Y+ KL  +W ELSE+ P  EC CGG      K   +  + E    FLMG  LN+ +  +  +I+   P PS+ + F +V   E
Subjt:  SVEVYYTKLITIWQELSEHRPTQECTCGG-----IKSFLDHLDSEFVMIFLMG--LNEIYTILRAQILVMSPMPSITKTFPLVIQEE

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.9e-1646.94Show/hide
Query:  VQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFLS
        + L+   G L  KP  +  P D  +T +A    +  DA +YRRLIGRL+YLQI+R D+SFAV+KLSQ+   P   H  A   +L Y+KGT GQG+F S
Subjt:  VQLVANGGFLAAKPALAPFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFLS

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.2e-0748Show/hide
Query:  LYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFLS
        +YL I+RPD++FAV++LSQ+ +   T  + A + +L Y+KGT GQG+F S
Subjt:  LYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFLS

ATMG00810.1 DNA/RNA polymerases superfamily protein2.0e-0935.05Show/hide
Query:  QLVANGGFLAAKPALAPFPFDLKLT-ATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFL
        Q++ N G L  KP   P P  L  + +TA  P    D S +R ++G L YL ++RPD+S+AV+ + Q + +P          +LRY+KGT   G+++
Subjt:  QLVANGGFLAAKPALAPFPFDLKLT-ATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGTCGAAGTCTACTATACAAAACTCATCACAATTTGGCAAGAACTATCTGAACATCGGCCTACACAAGAATGTACCTGTGGAGGAATAAAATCCTTTCTTGATCA
CCTTGATTCTGAATTCGTCATGATTTTCTTAATGGGACTAAATGAGATCTATACCATTCTACGTGCTCAAATCTTGGTTATGAGTCCAATGCCTTCAATCACCAAAACCT
TCCCATTGGTAATTCAAGAGGAGCATCAACGATCTGTTCGAAACAATGTCCAAGTAACTGATGCAATGGCTTTAATGGCAACTACAGAGAATGCTAAGAGGACAAATCAA
TCACGAAAGAAAGATTCTCAACGACCTATTTGTACAAATTGTGGCATCAAAGGTCATGTCATTGACAAATGCTACAAACTTCATGCACTCAATTTCTGTGATGACTACTG
CACCATACAGGACAGACTTTCATTGAAGATGATTGGCAAGGCTAACAACAAACATGAACTCTATTTGCTCAATTTTGTTGACAGCTCCAATCATCATACTGCTGCTGCTC
TTTCTTGCGCCATCTCAATTGAAACTTGGCATCATCGCTTGGGCCATTTATCTCCCAAATGTTTATCATTGCTAAAAGATACTTTGTCCTTACCAAGTTCTCTATTAAAA
CTTACATGTCATGTATGTCCTTTAGCTAAACAACACAAACTATCCTTTAACTCCAACAATCATGTTGCTGATAATATTTTTGATCTAGTACATTGTGACACTTGGGGACC
ATTTAAACATCCGACTTATAATGGATATAGATATTTTTTTACTATTGTTGATGACTGTTCTTGCTTCACATGGACTTATTTGATGCGTTCAAAATCTGATGCTCTATATA
TTATTCCACGCTTCTTTGCTCTTGTTGAGACACATTTTTCCAAGACCATCAAAATTTTTCGATCAGACAATGCACCAAAACTTCGATTTACTGATCTTTTTGCTACAAAA
GGAACAATTCATCAATTCTCTTGTGTAGAACGACCACAACAGAACTCTGTTGTTGAGAGAAAACACCAGCACCTTCTCAATGTCGCTCGAGCATTATTGTTTCAATCTAG
AATTCCAATCAGATTTTTGGAGTTTTTGGATGTTTATGCTATGCATCCACACTATCCGCTCACAGAACAAAATTTGATCCACGAGCAACGCCTTGTATCTTCATTGGCTA
CCACCCCCCCCCCCCCCCCCGGGCATAAAAGCTGTCCTGAAGATAATGACAATGATTTATCTGACACATTGGCAAACCATGTTTTGCCGCTTCCTATTCAAGGAACATTA
CAACAAAATGAAGAGAATCACACAAGTCCTAACACTTCTGATATGTTTAATCCTGGAAATACTCCTCATACAGCAGAAGAACAAGTAACATTTAATGAAGAAATTATGCA
AAATCCTTCTACCATTGAAGCTATTGAGCCCAACAATACGGTTGAATCTAATGAAGCTACAAATCTTCCACATAACATCACTACTCTTGGAGAATGGCTATGGATGAAGA
AATTAATGCCATGGAAAGAACAAGAACTTGGAGTATTGTTCCTCTACCTGACGGTCATCGTGCAATTGGTTGCAAATGGTGGCTTTTTAGCAGCCAAACCAGCATTAGCC
CCATTTCCATTCGATTTAAAATTGACTGCTACCGCTGGCATTCCATTAAACTTGGATGATGCTTCTTCCTATAGAAGATTGATCGGGCGCCTCCTATATCTACAAATTTC
ACGACCAGATGTTTCTTTCGCAGTTCATAAACTCAGCCAATATGTTGCTAAGCCATATACTGACCATTTGTTTGCTGCTCATAACTTACTGCGTTATTTGAAAGGCACTG
CTGGGCAAGGAATTTTTCTCTCTTTTCGTGAGGGAAGTAACCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCGTCGAAGTCTACTATACAAAACTCATCACAATTTGGCAAGAACTATCTGAACATCGGCCTACACAAGAATGTACCTGTGGAGGAATAAAATCCTTTCTTGATCA
CCTTGATTCTGAATTCGTCATGATTTTCTTAATGGGACTAAATGAGATCTATACCATTCTACGTGCTCAAATCTTGGTTATGAGTCCAATGCCTTCAATCACCAAAACCT
TCCCATTGGTAATTCAAGAGGAGCATCAACGATCTGTTCGAAACAATGTCCAAGTAACTGATGCAATGGCTTTAATGGCAACTACAGAGAATGCTAAGAGGACAAATCAA
TCACGAAAGAAAGATTCTCAACGACCTATTTGTACAAATTGTGGCATCAAAGGTCATGTCATTGACAAATGCTACAAACTTCATGCACTCAATTTCTGTGATGACTACTG
CACCATACAGGACAGACTTTCATTGAAGATGATTGGCAAGGCTAACAACAAACATGAACTCTATTTGCTCAATTTTGTTGACAGCTCCAATCATCATACTGCTGCTGCTC
TTTCTTGCGCCATCTCAATTGAAACTTGGCATCATCGCTTGGGCCATTTATCTCCCAAATGTTTATCATTGCTAAAAGATACTTTGTCCTTACCAAGTTCTCTATTAAAA
CTTACATGTCATGTATGTCCTTTAGCTAAACAACACAAACTATCCTTTAACTCCAACAATCATGTTGCTGATAATATTTTTGATCTAGTACATTGTGACACTTGGGGACC
ATTTAAACATCCGACTTATAATGGATATAGATATTTTTTTACTATTGTTGATGACTGTTCTTGCTTCACATGGACTTATTTGATGCGTTCAAAATCTGATGCTCTATATA
TTATTCCACGCTTCTTTGCTCTTGTTGAGACACATTTTTCCAAGACCATCAAAATTTTTCGATCAGACAATGCACCAAAACTTCGATTTACTGATCTTTTTGCTACAAAA
GGAACAATTCATCAATTCTCTTGTGTAGAACGACCACAACAGAACTCTGTTGTTGAGAGAAAACACCAGCACCTTCTCAATGTCGCTCGAGCATTATTGTTTCAATCTAG
AATTCCAATCAGATTTTTGGAGTTTTTGGATGTTTATGCTATGCATCCACACTATCCGCTCACAGAACAAAATTTGATCCACGAGCAACGCCTTGTATCTTCATTGGCTA
CCACCCCCCCCCCCCCCCCCGGGCATAAAAGCTGTCCTGAAGATAATGACAATGATTTATCTGACACATTGGCAAACCATGTTTTGCCGCTTCCTATTCAAGGAACATTA
CAACAAAATGAAGAGAATCACACAAGTCCTAACACTTCTGATATGTTTAATCCTGGAAATACTCCTCATACAGCAGAAGAACAAGTAACATTTAATGAAGAAATTATGCA
AAATCCTTCTACCATTGAAGCTATTGAGCCCAACAATACGGTTGAATCTAATGAAGCTACAAATCTTCCACATAACATCACTACTCTTGGAGAATGGCTATGGATGAAGA
AATTAATGCCATGGAAAGAACAAGAACTTGGAGTATTGTTCCTCTACCTGACGGTCATCGTGCAATTGGTTGCAAATGGTGGCTTTTTAGCAGCCAAACCAGCATTAGCC
CCATTTCCATTCGATTTAAAATTGACTGCTACCGCTGGCATTCCATTAAACTTGGATGATGCTTCTTCCTATAGAAGATTGATCGGGCGCCTCCTATATCTACAAATTTC
ACGACCAGATGTTTCTTTCGCAGTTCATAAACTCAGCCAATATGTTGCTAAGCCATATACTGACCATTTGTTTGCTGCTCATAACTTACTGCGTTATTTGAAAGGCACTG
CTGGGCAAGGAATTTTTCTCTCTTTTCGTGAGGGAAGTAACCATTAA
Protein sequenceShow/hide protein sequence
MSVEVYYTKLITIWQELSEHRPTQECTCGGIKSFLDHLDSEFVMIFLMGLNEIYTILRAQILVMSPMPSITKTFPLVIQEEHQRSVRNNVQVTDAMALMATTENAKRTNQ
SRKKDSQRPICTNCGIKGHVIDKCYKLHALNFCDDYCTIQDRLSLKMIGKANNKHELYLLNFVDSSNHHTAAALSCAISIETWHHRLGHLSPKCLSLLKDTLSLPSSLLK
LTCHVCPLAKQHKLSFNSNNHVADNIFDLVHCDTWGPFKHPTYNGYRYFFTIVDDCSCFTWTYLMRSKSDALYIIPRFFALVETHFSKTIKIFRSDNAPKLRFTDLFATK
GTIHQFSCVERPQQNSVVERKHQHLLNVARALLFQSRIPIRFLEFLDVYAMHPHYPLTEQNLIHEQRLVSSLATTPPPPPGHKSCPEDNDNDLSDTLANHVLPLPIQGTL
QQNEENHTSPNTSDMFNPGNTPHTAEEQVTFNEEIMQNPSTIEAIEPNNTVESNEATNLPHNITTLGEWLWMKKLMPWKEQELGVLFLYLTVIVQLVANGGFLAAKPALA
PFPFDLKLTATAGIPLNLDDASSYRRLIGRLLYLQISRPDVSFAVHKLSQYVAKPYTDHLFAAHNLLRYLKGTAGQGIFLSFREGSNH