; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031588 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031588
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:10419821..10424237
RNA-Seq ExpressionLag0031588
SyntenyLag0031588
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5443558.1 hypothetical protein F2P56_036105, partial [Juglans regia]4.7e-6531.87Show/hide
Query:  MSSAKRLMGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWIV-WDDCRWRLTVFYGFPSADLRAQTWSLLSRLRGCEDTSGLIG--------
        M + K  +   HCF VD  G SGGL+LLW   +   + SFS +HID  I   D   WR T  YG P    R  TW+LL RL    D   L+G        
Subjt:  MSSAKRLMGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWIV-WDDCRWRLTVFYGFPSADLRAQTWSLLSRLRGCEDTSGLIG--------

Query:  ----------------------------DLGFVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPV-------ELILTASTM
                                    DLGF G  +TWCN R G   I ERLD       +  L+P +VV H   + SDH PV       E    A  +
Subjt:  ----------------------------DLGFVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPV-------ELILTASTM

Query:  L---VLGVRSEESSNTTPMV------------LADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGSRALLVQAKAQL------EEVL-
             + V  E+ S     V            +    ++C  +++ W +   GN    +++A   ++       +     ++ +A+ ++      EEV+ 
Subjt:  L---VLGVRSEESSNTTPMV------------LADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGSRALLVQAKAQL------EEVL-

Query:  ---------------------RTSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEEDVL
                             + + RRK N I GL+D+ GV + E     +++ D+F  LFT SNP     E+ L  +   V   MN  LL+P+  ++V 
Subjt:  ---------------------RTSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEEDVL

Query:  LAL---------------------RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNL
         AL                     R + DN ++ +E ++ LR +  G+  +++LKLDM KAYDRVEWSFL   ML+MG A  +V LV+ CV ++ FS  +
Subjt:  LAL---------------------RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNL

Query:  NEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSR
        N E    + P+ GLRQGD LSPYLFLLC EGL++LL  A+R + + G  + R +P +SHL FADDS++F +
Subjt:  NEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSR

XP_030483769.1 uncharacterized protein LOC115700339 [Cannabis sativa]7.9e-6529.55Show/hide
Query:  GWSGGLALLWDSSISFSLLSFSRNHIDGWIVWDD-CRWRLTVFYGFPSADLRAQTWSLLSRLRGCE--DTSGLIGDLG----------------------
        G  GG+ LLW   +  +LLS + NH D +++++D  RW  +  YGFP +  +  TW+L+ RL      D   LIGDL                       
Subjt:  GWSGGLALLWDSSISFSLLSFSRNHIDGWIVWDD-CRWRLTVFYGFPSADLRAQTWSLLSRLRGCE--DTSGLIGDLG----------------------

Query:  -------------FVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELILTAS-------------TMLVLGVRSEESS-
                      VGD FTW   R     + ERLDWCF    W+D   N ++ HLDY  SDHR + + ++ S                 + ++ EE + 
Subjt:  -------------FVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELILTAS-------------TMLVLGVRSEESS-

Query:  -----------NTTPMVLADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGSRALLVQ-AKAQLEEVL---------------------
                   N     L    ++C   +  W   K G     IS A +RV    +    SG+ +  VQ A+  LEE+L                     
Subjt:  -----------NTTPMVLADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGSRALLVQ-AKAQLEEVL---------------------

Query:  -------RTSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEEDVLLAL-----------
               + S R+  N I  L D+ G     K+ +  +V DYF  LFT+S          L  +  ++  + N  LL  FT  DVL AL           
Subjt:  -------RTSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEEDVLLAL-----------

Query:  -----------------------------------------------------------RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRV
                                                                   R + DN ++ FE +H L+ R RG   + ALKLDM KA+DRV
Subjt:  -----------------------------------------------------------RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRV

Query:  EWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADD
        EWSFL   M +MG    W++L++ C+ + +FSF +N E    V P  GLRQGDPLSPYLFL+C+EGLS LL+  +   ++ G  ++R SP+ISHL FADD
Subjt:  EWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADD

Query:  SLLFSRPRRVRCWSFR
        SLLF +     C + +
Subjt:  SLLFSRPRRVRCWSFR

XP_030483769.1 uncharacterized protein LOC115700339 [Cannabis sativa]8.6e-1136.63Show/hide
Query:  PSSSTVSELIFASDSWNEVMIRAHLSEADCKAILKIPLRYGLGDDRLIWHFEKHGTFSIKSGYRLAHSLAVQDRPSSLDPDRMRAWWSSLWKLNVPSKHR
        PSS+ VS+ I     WN  +++A  S  D   IL IPL Y    DR IWH +  G +S+ +GY  A SL  ++R  S   +    WW + W  N+P+K +
Subjt:  PSSSTVSELIFASDSWNEVMIRAHLSEADCKAILKIPLRYGLGDDRLIWHFEKHGTFSIKSGYRLAHSLAVQDRPSSLDPDRMRAWWSSLWKLNVPSKHR

Query:  L
        +
Subjt:  L

XP_030958760.1 uncharacterized protein LOC115980671 [Quercus lobata]5.0e-6731.37Show/hide
Query:  MSSAKRLMGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWIVWD-DCRWRLTVFYGFPSADLRAQTWSLLSRL-------------------
        M   K  +GF +   +  HG SGGLALLW   I+  + SFS  HID  +  D   +WR+T FYG P    R ++W LL  L                   
Subjt:  MSSAKRLMGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWIVWD-DCRWRLTVFYGFPSADLRAQTWSLLSRL-------------------

Query:  ------------RGCEDTSGLIG-----DLGFVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELILTASTML------
                    R  +D    I      DLGF G  FTWCN + G   +  RLD    T  W D Y +  V+HL  S SDH    L++T S +L      
Subjt:  ------------RGCEDTSGLIG-----DLGFVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELILTASTML------

Query:  -----VLGVRSEESSNT------------TPMVLADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGSRALLV-QAKAQLEEVL-----
              +  R +E  +             +P  +    ++C  +++ W     G  P  I    +RV + +      GS    + + + ++ ++L     
Subjt:  -----VLGVRSEESSNT------------TPMVLADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGSRALLV-QAKAQLEEVL-----

Query:  -----------------------RTSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEED
                               + S R+K N I  + D +G+W +    + +V   YF+ L+ +SNP +      ++ +   V  EMN++L++ FT E+
Subjt:  -----------------------RTSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEED

Query:  VLLALRCVVDNAILG---FECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQ
        V  AL+ +  +   G   FE +H L  +  G+  ++A+KLDM KA+DRVEW F+   M ++G   SW  L++ C+ S+++S  +N      + PS GLRQ
Subjt:  VLLALRCVVDNAILG---FECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQ

Query:  GDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSRPRRVRC
        GDPLSPYLFLLCA+G SSL+  A R   ++G  I R  P ISHLFFADDSLLF +     C
Subjt:  GDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSRPRRVRC

XP_030970961.1 uncharacterized protein LOC115991405 [Quercus lobata]4.2e-6631.37Show/hide
Query:  MSSAKRLMGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWIVWD-DCRWRLTVFYGFPSADLRAQTWSLLSRL-------------------
        M   K  +GF +   +  HG SGGLALLW   I+  + SFS  HID  +  D   +WR+T FYG P    R ++W LL  L                   
Subjt:  MSSAKRLMGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWIVWD-DCRWRLTVFYGFPSADLRAQTWSLLSRL-------------------

Query:  ------------RGCEDTSGLIG-----DLGFVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELILTASTML------
                    R  +D    I      DLGF G  FTWCN + G   +  RLD    T  W D Y +  V+HL  S SDH    L++T S +L      
Subjt:  ------------RGCEDTSGLIG-----DLGFVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELILTASTML------

Query:  -----VLGVRSEESSNT------------TPMVLADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGSRALLV-QAKAQLEEVL-----
              +  R +E  +             +P  +    ++C  +++ W     G  P  I    +RV + +      GS    + + + ++ ++L     
Subjt:  -----VLGVRSEESSNT------------TPMVLADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGSRALLV-QAKAQLEEVL-----

Query:  -----------------------RTSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEED
                               + S R+K N I  + D +G+W +    + +V   YF+ L+ +SNP +      ++ +   V  EMN++L++ FT E+
Subjt:  -----------------------RTSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEED

Query:  VLLALRCVVDNAILG---FECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQ
        V  AL+ +  +   G   FE +H L  +  G+  ++A+KLDM KA+DRVEW F+   M ++G   SW  L++ C+ S+++S  +N      + PS GLRQ
Subjt:  VLLALRCVVDNAILG---FECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQ

Query:  GDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSRPRRVRC
        GDPLSPYLFLLCA+G SSL+  A R   ++G  I R  P ISHLFFADDSLLF +     C
Subjt:  GDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSRPRRVRC

XP_042944657.1 uncharacterized protein LOC122278542 [Carya illinoinensis]3.1e-6932.04Show/hide
Query:  MSSAKRLMGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWIVWDD---CRWRLTVFYGFPSADLRAQTWSLLSRLRGCEDTSGL--------
        M   KR +GF +C  V   G  GGLAL W   +  ++L +S+NHI   I  ++    RW LT  YGFP    R  TW+L+  L+  ++   L        
Subjt:  MSSAKRLMGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWIVWDD---CRWRLTVFYGFPSADLRAQTWSLLSRLRGCEDTSGL--------

Query:  ----------------------------IGDLGFVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELILTASTMLVLGV
                                    I DLG+ G  +TW NRR   E I  RLD   +   W    P   V H   + SDH P+ +  T   +   G 
Subjt:  ----------------------------IGDLGFVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELILTASTMLVLGV

Query:  R----------------------SEESSNTTPMVLADKTERCMREMANWGRSKTGNFPTCISIA-------------------NQRVQSAI-------AE
        R                       E S       L   T++   ++  W +   GN    ++ A                   N  V+  +        E
Subjt:  R----------------------SEESSNTTPMVLADKTERCMREMANWGRSKTGNFPTCISIA-------------------NQRVQSAI-------AE

Query:  LGVSGSRALLVQAKAQLEEVL--RTSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEED
        +    S+AL +QA  Q       + S+R+K N I  L+D Q  W + +D + ++V +YFQ LF+SS   E+D   ++  +   V   MN  L +PFT  +
Subjt:  LGVSGSRALLVQAKAQLEEVL--RTSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEED

Query:  VLLAL-------------RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVE
        V LAL             R + DN ++ +E +H LR +  G+  ++++KLDM KAYDRVEW FL   M++MG   SW++L+++CV S++FS  LN     
Subjt:  VLLAL-------------RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVE

Query:  QVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSR
         ++P+ GLRQGDPLSPYLFLLC EGL S+L+ A   S I G  I R +P+I+HL FADDS++F +
Subjt:  QVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSR

TrEMBL top hitse value%identityAlignment
A0A2N9FYH3 CCHC-type domain-containing protein3.8e-7334.52Show/hide
Query:  MGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWI-VWDDCRWRLTVFYGFPSADLRAQTWSLLSRLRGCEDTSGLI-GDL----------GF
        +G   CF VD   + GGLALLWD S+   + S+S++HID W+       WR T FYG P    R  +W LL RL+G  +   L+ GD           GF
Subjt:  MGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWI-VWDDCRWRLTVFYGFPSADLRAQTWSLLSRLRGCEDTSGLI-GDL----------GF

Query:  VGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELILTASTMLVLG--------------VRSE------------ESSNT
         G  FTW N R  GE + ERLD   +TT W DL+P   + H  ++ SDH  + L+L   T+   G              +R E            +   T
Subjt:  VGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELILTASTMLVLG--------------VRSE------------ESSNT

Query:  TPMVLADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGS-----------RALLVQAKAQLEEVLRTSY-----------------RRK
            L  K ++C   + +W +S+    P  I+    R++      G   +           R LL + +    +  R ++                 R+K
Subjt:  TPMVLADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGS-----------RALLVQAKAQLEEVLRTSY-----------------RRK

Query:  LNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFT-EEDVLLALRCVVDNAILGFECIHELRRRSRGR
         N I G+ DS  +W+ +   + +VV +YF  ++TSSNP+  D     Q++   V   MN  L+ PFT EE   +  R + DN ++ FE +H L+ +  G+
Subjt:  LNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFT-EEDVLLALRCVVDNAILGFECIHELRRRSRGR

Query:  SKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGF
           +A+KLDM KAYDRVEW +L+  ML++G    WV L++ CV S+++S  +N E    V+PS GLRQGDPLSPYLFL+CAEGL++LLR A+R S + G 
Subjt:  SKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGF

Query:  WIARTSP
         I R  P
Subjt:  WIARTSP

A0A2N9GW67 Uncharacterized protein7.2e-7233.39Show/hide
Query:  GGLALLWDSSISFSLLSFSRNHIDGWIVWDD-CRWRLTVFYGFPSADLRAQTWSLLSRLRGCEDTSGLI-------------------------------
        GGLAL W SS++  + ++S NHID  IV +D   WRLT FYG P   LR  +W+LL  L        ++                               
Subjt:  GGLALLWDSSISFSLLSFSRNHIDGWIVWDD-CRWRLTVFYGFPSADLRAQTWSLLSRLRGCEDTSGLI-------------------------------

Query:  -----GDLGFVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELILTASTMLVLGVRS---------------EES----
              D+GF G  F+W NRR  G  +  RLD C +   W  ++PN+ V+H  ++ SDH  + +IL +  ++  G R                E+S    
Subjt:  -----GDLGFVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELILTASTMLVLGVRS---------------EES----

Query:  -----SNTTPMVLADKTERCMREMANWGRSKTGNFPTCISIANQRV-QSAIAELGVSGSRA---------LLVQAK----------AQLEEVLR------
             S T   ++A K + C   +  W +++    P  I    +R+ Q   + +    SR          +L++ +          A L+E  R      
Subjt:  -----SNTTPMVLADKTERCMREMANWGRSKTGNFPTCISIANQRV-QSAIAELGVSGSRA---------LLVQAK----------AQLEEVLR------

Query:  --TSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEEDVLLAL-----------------
           S R+K N I GL DSQGVWH E  ++  +  +YF +LF SSNP        +  +   V  +MN  L + FT E++  AL                 
Subjt:  --TSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEEDVLLAL-----------------

Query:  ----RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQGDPL
            R + DN I+ FE +H L+    G +  +A KLDM KAYDRVEW+FL+  +L++G  + WV+L++ CV S +F+  +N      ++PS GLRQGDPL
Subjt:  ----RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQGDPL

Query:  SPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSR
        SPYLFLLCAEGLS+L+R A+R   I G  I R  P +SHLFFADDS++F R
Subjt:  SPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSR

A0A2N9IZB6 Reverse transcriptase domain-containing protein1.3e-7333.51Show/hide
Query:  CFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWIVWDDCR-WRLTVFYGFPSADLRAQTWSLLSRLR-----------------------GCEDTS-
        CF VD HG+ GGLALLWDSS+S  + S+S  HID  +  +D + WR+T FYG P   LR +TW LL RL                        G ED S 
Subjt:  CFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWIVWDDCR-WRLTVFYGFPSADLRAQTWSLLSRLR-----------------------GCEDTS-

Query:  --------GLIG----DLGFVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDH-------RP----VELILTASTMLVL----G
                 L+     DLGF G  FTW NRR G + +  RLD   S   W  L+PN  V+H+    SDH       RP     +L+   S  +V     G
Subjt:  --------GLIG----DLGFVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDH-------RP----VELILTASTMLVL----G

Query:  VRSEESSNTTPMVLADKTERCMREMANW---GRSKTGNFPTCISIANQRVQSAIAELGVSGSRALLVQAKAQLEEVLRTSYRRKLNHIGGLEDSQGVWHQ
        V +     T+ +   +   R  R   NW   G   TG F  C                                     + R++ N I GL D +  W  
Subjt:  VRSEESSNTTPMVLADKTERCMREMANW---GRSKTGNFPTCISIANQRVQSAIAELGVSGSRALLVQAKAQLEEVLRTSYRRKLNHIGGLEDSQGVWHQ

Query:  EKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEEDVLLAL-----------------------------------------
        +   V Q+ T YF  LFTSS P++ D  + + +  ++ D  MN  LL+PF+ ++V  AL                                         
Subjt:  EKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEEDVLLAL-----------------------------------------

Query:  --------------------RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEK
                            R + DN I+ FE IH L+    G++  +A KLDM KAYDRVEW +LR  M ++G    WVDL++ CV S+++S  +N + 
Subjt:  --------------------RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEK

Query:  VEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSRPRRVRC
           ++P  G+RQGDPLSPYLFL+CAEGLS+L+R A+R   I G  I R  P ISHLFFADDS++F R + + C
Subjt:  VEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSRPRRVRC

A0A803NML1 Uncharacterized protein7.0e-7526.67Show/hide
Query:  SSAKRLMGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWI-VWDDCRWRLTVFYGFPSADLRAQTWSLLSRLRGCEDTSG--LIGDLG----
        ++++ L+ FNH   V   G SGGL LLW   I  +L  ++ N  DG++ V +  +W  T FYG P+   R  +W+LL RL+         +IGD      
Subjt:  SSAKRLMGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWI-VWDDCRWRLTVFYGFPSADLRAQTWSLLSRLRGCEDTSG--LIGDLG----

Query:  -------------------------------FVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELI----------LTA
                                       F GD FTW   R     + ERLDWCF    W+ ++      HLDY  SDHR + +           LT 
Subjt:  -------------------------------FVGDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELI----------LTA

Query:  ST---------------MLVLGVRSEESSNTTPMVLADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGSRAL-----LVQAKAQLEEV
         T                ++    S+  +    + LA+  + C   +  W   K GN    I+     +Q  ++ L  +  R++     L  ++A L+E+
Subjt:  ST---------------MLVLGVRSEESSNTTPMVLADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGSRAL-----LVQAKAQLEEV

Query:  LR---------------------TSY-------RRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQ
        L+                     TS+       R+  N I  L ++QG     K  +  V++DY+  LF S           L  +  ++  EMN++L  
Subjt:  LR---------------------TSY-------RRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQ

Query:  PFTEEDVLLAL-----------------------------------------------------------------------------------------
        PFT  +V  AL                                                                                         
Subjt:  PFTEEDVLLAL-----------------------------------------------------------------------------------------

Query:  -----------------RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQ
                         R + DN ++ FE IH LR R +GR  + ALKLDM KA+DRVEW +L   M +MG    W+ L++ C+ + +FSF+LN + V  
Subjt:  -----------------RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQ

Query:  VRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSRPRRVRCWSFRICWYCMRGLQARQCLIREI-------
        V P  GLRQGDPLSPYLFL+C+EGLS  L+  ++   + G  + R +PS+SHL FADDSLLF +       S +           ++ L++ +       
Subjt:  VRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSRPRRVRCWSFRICWYCMRGLQARQCLIREI-------

Query:  --------------HQVQSVSLLPSSS-TVSELIFASDSWNEVMIRAHLSEADCKAILKIPLRYGLGDDRLIWHFEKHGTFSIKSGYRLAHSLAVQDRPS
                      +  + VS L S S  VS  I  +  WN  ++ ++  + D   IL IPL Y  G DRL+WH   +G +S+K+G+ LA +L  +D+ S
Subjt:  --------------HQVQSVSLLPSSS-TVSELIFASDSWNEVMIRAHLSEADCKAILKIPLRYGLGDDRLIWHFEKHGTFSIKSGYRLAHSLAVQDRPS

Query:  SLDPDRMRAWWSSLWKLNVPSKHRL
        S   ++   WW   W L +P K R+
Subjt:  SLDPDRMRAWWSSLWKLNVPSKHRL

A0A803P5H2 Uncharacterized protein1.4e-7828.62Show/hide
Query:  MSSAKRLMGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWIVWDD-CRWRLTVFYGFPSADLRAQTWSLLSRLRGCE--DTSGLIGDLGFV-
        +S  K L+ F++   V   G  GGL LLW   +  +LLS + NH D +I++DD  RW  +  YGFP A  +  TW L+ RL      D   LIGD+  + 
Subjt:  MSSAKRLMGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWIVWDD-CRWRLTVFYGFPSADLRAQTWSLLSRLRGCE--DTSGLIGDLGFV-

Query:  ----------------------------------GDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPV------ELILTASTM
                                          GD FTW   R    ++ ERLDWCF    W+D + +  ++HLDY  SDHR +         L A   
Subjt:  ----------------------------------GDHFTWCNRRPGGETICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPV------ELILTASTM

Query:  LVLGVRSEE------------------SSNTTPMV-LADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGS-----RALLVQAKAQLEE
             R E+                  SS T P   L      C   + +W   K G     I +A    Q  +  L  S S      A +  A++ L+E
Subjt:  LVLGVRSEE------------------SSNTTPMV-LADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGS-----RALLVQAKAQLEE

Query:  VL----------------------------RTSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALL
        +L                            + S R   N I  L D  G     K+ + +VV DYFQ LFT+SN         L  +  ++ DE N  L 
Subjt:  VL----------------------------RTSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALL

Query:  QPFTEEDVLLAL----------------------------------------------------------------------------------------
        Q FT  +VL AL                                                                                        
Subjt:  QPFTEEDVLLAL----------------------------------------------------------------------------------------

Query:  ------------------RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVE
                          R + DN ++ FE +H L+ R+RG   + ALKLDM KA+DRVEWSFL   M +MG    W+ L++ C+ + +FSF +N E   
Subjt:  ------------------RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVE

Query:  QVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSRPRRVRCWSFR--------------------ICWYC
         V P  GLRQGDPLSPYLFL+C+EGLS LL+  ++  ++ G  ++R SPSI+HL FADDSLLF +     C S +                    I W  
Subjt:  QVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSRPRRVRCWSFR--------------------ICWYC

Query:  MRGLQARQCLIR-----EIHQVQSVSLLP-------------SSSTVSELIFASDSWNEVMIRAHLSEADCKAILKIPLRYGLGDDRLIWHFEKHGTFSI
         R L ++  +I+      ++  Q  S +P              S+ V++ I  +  W+  ++    S AD   IL IPL Y    DR  WH++  G +++
Subjt:  MRGLQARQCLIR-----EIHQVQSVSLLP-------------SSSTVSELIFASDSWNEVMIRAHLSEADCKAILKIPLRYGLGDDRLIWHFEKHGTFSI

Query:  KSGYRLAHSLAVQDRPSSLDPDRMRAWWSSLWKLNVPSKHRL
        KSGY LA SL  +D  SS       AWW   W LN+PSK R+
Subjt:  KSGYRLAHSLAVQDRPSSLDPDRMRAWWSSLWKLNVPSKHRL

SwissProt top hitse value%identityAlignment
P0CV25 Secreted RxLR effector protein 781.6e-0437.65Show/hide
Query:  SKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLS
        S  V L LD +KAYD V   FL + +LR   +  +V ++       T  F +N E  E      G+RQG  L+P LF+L AE L+
Subjt:  SKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLS

P11369 LINE-1 retrotransposable element ORF2 protein4.2e-0830.17Show/hide
Query:  LKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIART
        + LD  KA+D+++  F+   + R G+   +++++         +  +N EK+E +    G RQG PLSPYLF +  E L+   R  +++ +I G  I + 
Subjt:  LKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIART

Query:  SPSISHLFFADDSLLF
           IS L  ADD +++
Subjt:  SPSISHLFFADDSLLF

P92555 Uncharacterized mitochondrial protein AtMg012506.6e-1450Show/hide
Query:  VCSITFSFNLNEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDS
        VC     F +N      V PS GLRQGDPLSPYLF+LC E LS L R A+ + ++ G  ++  SP I+HL FADD+
Subjt:  VCSITFSFNLNEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDS

Q03278 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)1.1e-0826.15Show/hide
Query:  TEEDVLLALRCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLR
        T +   +    V +N  L    I E R + +G   ++A+ LD++KA+D VE   +  A+ R  L     + ++W   +      + + K   +RP+ G+R
Subjt:  TEEDVLLALRCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLR

Query:  QGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSRPRRVRCWSFRICWYCMRGLQARQCLIREIHQVQSVSLLPSSSTV
        QGDPLSP LF    + +       +R  + TGF +   +  I  L FADD +L +  R               GLQA    I    Q Q + ++P     
Subjt:  QGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDSLLFSRPRRVRCWSFRICWYCMRGLQARQCLIREIHQVQSVSLLPSSSTV

Query:  SELIFASDSWNEVMIRAH
          L+  S    ++ +  H
Subjt:  SELIFASDSWNEVMIRAH

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.0e-0929.37Show/hide
Query:  RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPS---------WGLR
        R   DN +   E +H +RR+ +G   W+ LKLD+ KAYDR+ W +L   ++  G  + W+      +   TF       +V +   S         WG R
Subjt:  RCVVDNAILGFECIHELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPS---------WGLR

Query:  QGDPLSPYL--FLLCAEGLSSLLRGA
          D  +P+    + CAE L  + RG+
Subjt:  QGDPLSPYL--FLLCAEGLSSLLRGA

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)4.7e-1550Show/hide
Query:  VCSITFSFNLNEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDS
        VC     F +N      V PS GLRQGDPLSPYLF+LC E LS L R A+ + ++ G  ++  SP I+HL FADD+
Subjt:  VCSITFSFNLNEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGFWIARTSPSISHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCTGCGAAACGTTTGATGGGTTTTAACCATTGCTTTTGTGTTGATTGGCATGGATGGAGTGGTGGGTTGGCTCTCCTATGGGATTCATCTATCTCTTTTAGTCT
CCTTTCTTTCTCCAGGAATCACATTGATGGATGGATCGTTTGGGATGATTGTAGGTGGCGGCTCACTGTTTTCTATGGTTTCCCTTCGGCGGACCTACGAGCTCAGACTT
GGTCCCTTCTCTCTAGATTGAGAGGTTGTGAGGATACGTCCGGGCTGATTGGAGATCTGGGCTTTGTTGGGGATCATTTTACTTGGTGTAATAGACGACCAGGGGGTGAA
ACGATCTGTGAACGGTTGGATTGGTGTTTTAGCACCACAACTTGGCAAGACCTTTACCCAAACTATGTGGTTAATCATCTTGATTATAGTCAGTCTGATCATAGGCCAGT
GGAACTGATCCTTACCGCCTCCACAATGTTGGTCCTGGGTGTGAGGTCTGAGGAGTCTAGTAATACAACTCCTATGGTTCTGGCAGATAAGACAGAGAGATGTATGCGCG
AGATGGCTAATTGGGGTCGATCAAAGACTGGGAATTTCCCGACGTGTATCAGTATTGCCAATCAGAGGGTTCAATCGGCTATTGCTGAGTTAGGTGTATCTGGTTCTCGT
GCCTTGCTCGTTCAGGCTAAGGCTCAGTTGGAGGAGGTGCTTCGAACCTCTTACCGTCGAAAGCTTAATCACATTGGGGGTTTGGAGGATAGTCAGGGAGTGTGGCATCA
AGAGAAGGATGCAGTTATTCAGGTGGTAACTGACTACTTTCAGCATCTTTTTACCTCTTCGAATCCGAGTGAGCAGGATTTTGAGATTGCTCTGCAGGATTTGACCCTGT
CGGTGGATGATGAGATGAACCGAGCTTTGTTGCAGCCTTTTACTGAGGAGGATGTTCTGTTGGCTTTGAGATGTGTTGTAGATAATGCCATCTTAGGGTTTGAGTGTATT
CATGAATTAAGGCGGCGATCCAGGGGACGGTCCAAGTGGGTTGCACTGAAGCTAGACATGAGAAAAGCCTATGATAGGGTGGAGTGGTCTTTCCTTCGAATGGCTATGTT
ACGAATGGGGCTTGCTCAGTCGTGGGTTGATCTGGTCCTTTGGTGTGTTTGTTCGATAACATTCTCTTTCAACCTGAATGAGGAGAAAGTGGAGCAGGTGAGACCTTCTT
GGGGTCTACGACAGGGGGATCCTTTATCCCCATATCTTTTTCTATTGTGCGCAGAAGGTTTATCTAGTCTATTGCGTGGGGCTAAGCGGAGATCTCAGATCACGGGTTTT
TGGATTGCACGTACTAGTCCATCGATCTCTCATCTTTTCTTTGCAGATGACAGTCTCCTATTTTCAAGGCCAAGGCGAGTGAGGTGTTGGTCATTCAGGATCTGTTGGTA
CTGTATGAGAGGACTTCAGGCCAGACAGTGTTTGATCAGAGAGATCCATCAGGTACAGTCGGTCTCATTGCTTCCTTCTTCGAGTACAGTGAGTGAGCTAATTTTTGCTT
CGGATAGTTGGAATGAGGTTATGATCAGAGCCCATTTAAGTGAGGCTGATTGCAAGGCCATTTTGAAAATCCCATTACGTTATGGTTTAGGTGATGATCGATTAATTTGG
CACTTTGAGAAACATGGGACCTTTTCTATTAAGAGTGGGTATCGGCTTGCTCATTCGTTGGCTGTTCAGGATCGTCCTTCCTCTTTGGACCCTGATAGAATGCGTGCGTG
GTGGTCTTCCTTGTGGAAGTTGAATGTGCCTAGTAAACATAGGCTTGAGGAAATCATTGGAGCGATGAAGAATAATCTTGCGGGGTCGGATTTTGAACTTGTGGTCATTT
TTTGGTGGTCCGTGTGGAATTTTCAGAACAATTTAAGTTGGGGTGATCAGTCAGATGGGCGGGACTTATGGTTGTATGCGACTGATTACCTTAGTGTCTTTCACGCAGTT
GGGAGGCGTCGCCTATCAAGAGACTATTTACGAACCCAGCCGAGTGATCACGTGTCTTATCAGCAGGATCCTCCCGGAACTCTGGATTTTTTGGAATGTGTTTCAACTCC
AATGACGTGCTACCGATCAAGCACTAGGCAACTGGTTATCCTTAACTCAGTCACTGGCGTCCACCAACACATAATGGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCTGCGAAACGTTTGATGGGTTTTAACCATTGCTTTTGTGTTGATTGGCATGGATGGAGTGGTGGGTTGGCTCTCCTATGGGATTCATCTATCTCTTTTAGTCT
CCTTTCTTTCTCCAGGAATCACATTGATGGATGGATCGTTTGGGATGATTGTAGGTGGCGGCTCACTGTTTTCTATGGTTTCCCTTCGGCGGACCTACGAGCTCAGACTT
GGTCCCTTCTCTCTAGATTGAGAGGTTGTGAGGATACGTCCGGGCTGATTGGAGATCTGGGCTTTGTTGGGGATCATTTTACTTGGTGTAATAGACGACCAGGGGGTGAA
ACGATCTGTGAACGGTTGGATTGGTGTTTTAGCACCACAACTTGGCAAGACCTTTACCCAAACTATGTGGTTAATCATCTTGATTATAGTCAGTCTGATCATAGGCCAGT
GGAACTGATCCTTACCGCCTCCACAATGTTGGTCCTGGGTGTGAGGTCTGAGGAGTCTAGTAATACAACTCCTATGGTTCTGGCAGATAAGACAGAGAGATGTATGCGCG
AGATGGCTAATTGGGGTCGATCAAAGACTGGGAATTTCCCGACGTGTATCAGTATTGCCAATCAGAGGGTTCAATCGGCTATTGCTGAGTTAGGTGTATCTGGTTCTCGT
GCCTTGCTCGTTCAGGCTAAGGCTCAGTTGGAGGAGGTGCTTCGAACCTCTTACCGTCGAAAGCTTAATCACATTGGGGGTTTGGAGGATAGTCAGGGAGTGTGGCATCA
AGAGAAGGATGCAGTTATTCAGGTGGTAACTGACTACTTTCAGCATCTTTTTACCTCTTCGAATCCGAGTGAGCAGGATTTTGAGATTGCTCTGCAGGATTTGACCCTGT
CGGTGGATGATGAGATGAACCGAGCTTTGTTGCAGCCTTTTACTGAGGAGGATGTTCTGTTGGCTTTGAGATGTGTTGTAGATAATGCCATCTTAGGGTTTGAGTGTATT
CATGAATTAAGGCGGCGATCCAGGGGACGGTCCAAGTGGGTTGCACTGAAGCTAGACATGAGAAAAGCCTATGATAGGGTGGAGTGGTCTTTCCTTCGAATGGCTATGTT
ACGAATGGGGCTTGCTCAGTCGTGGGTTGATCTGGTCCTTTGGTGTGTTTGTTCGATAACATTCTCTTTCAACCTGAATGAGGAGAAAGTGGAGCAGGTGAGACCTTCTT
GGGGTCTACGACAGGGGGATCCTTTATCCCCATATCTTTTTCTATTGTGCGCAGAAGGTTTATCTAGTCTATTGCGTGGGGCTAAGCGGAGATCTCAGATCACGGGTTTT
TGGATTGCACGTACTAGTCCATCGATCTCTCATCTTTTCTTTGCAGATGACAGTCTCCTATTTTCAAGGCCAAGGCGAGTGAGGTGTTGGTCATTCAGGATCTGTTGGTA
CTGTATGAGAGGACTTCAGGCCAGACAGTGTTTGATCAGAGAGATCCATCAGGTACAGTCGGTCTCATTGCTTCCTTCTTCGAGTACAGTGAGTGAGCTAATTTTTGCTT
CGGATAGTTGGAATGAGGTTATGATCAGAGCCCATTTAAGTGAGGCTGATTGCAAGGCCATTTTGAAAATCCCATTACGTTATGGTTTAGGTGATGATCGATTAATTTGG
CACTTTGAGAAACATGGGACCTTTTCTATTAAGAGTGGGTATCGGCTTGCTCATTCGTTGGCTGTTCAGGATCGTCCTTCCTCTTTGGACCCTGATAGAATGCGTGCGTG
GTGGTCTTCCTTGTGGAAGTTGAATGTGCCTAGTAAACATAGGCTTGAGGAAATCATTGGAGCGATGAAGAATAATCTTGCGGGGTCGGATTTTGAACTTGTGGTCATTT
TTTGGTGGTCCGTGTGGAATTTTCAGAACAATTTAAGTTGGGGTGATCAGTCAGATGGGCGGGACTTATGGTTGTATGCGACTGATTACCTTAGTGTCTTTCACGCAGTT
GGGAGGCGTCGCCTATCAAGAGACTATTTACGAACCCAGCCGAGTGATCACGTGTCTTATCAGCAGGATCCTCCCGGAACTCTGGATTTTTTGGAATGTGTTTCAACTCC
AATGACGTGCTACCGATCAAGCACTAGGCAACTGGTTATCCTTAACTCAGTCACTGGCGTCCACCAACACATAATGGTCTAG
Protein sequenceShow/hide protein sequence
MSSAKRLMGFNHCFCVDWHGWSGGLALLWDSSISFSLLSFSRNHIDGWIVWDDCRWRLTVFYGFPSADLRAQTWSLLSRLRGCEDTSGLIGDLGFVGDHFTWCNRRPGGE
TICERLDWCFSTTTWQDLYPNYVVNHLDYSQSDHRPVELILTASTMLVLGVRSEESSNTTPMVLADKTERCMREMANWGRSKTGNFPTCISIANQRVQSAIAELGVSGSR
ALLVQAKAQLEEVLRTSYRRKLNHIGGLEDSQGVWHQEKDAVIQVVTDYFQHLFTSSNPSEQDFEIALQDLTLSVDDEMNRALLQPFTEEDVLLALRCVVDNAILGFECI
HELRRRSRGRSKWVALKLDMRKAYDRVEWSFLRMAMLRMGLAQSWVDLVLWCVCSITFSFNLNEEKVEQVRPSWGLRQGDPLSPYLFLLCAEGLSSLLRGAKRRSQITGF
WIARTSPSISHLFFADDSLLFSRPRRVRCWSFRICWYCMRGLQARQCLIREIHQVQSVSLLPSSSTVSELIFASDSWNEVMIRAHLSEADCKAILKIPLRYGLGDDRLIW
HFEKHGTFSIKSGYRLAHSLAVQDRPSSLDPDRMRAWWSSLWKLNVPSKHRLEEIIGAMKNNLAGSDFELVVIFWWSVWNFQNNLSWGDQSDGRDLWLYATDYLSVFHAV
GRRRLSRDYLRTQPSDHVSYQQDPPGTLDFLECVSTPMTCYRSSTRQLVILNSVTGVHQHIMV