CuGenDBv2

Lag0008231 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008231
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr9:15337408..15340553
RNA-Seq Expression: Lag0008231
Synteny: Lag0008231
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
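
The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names `parse_location` and `span_length` are hypothetical helpers introduced for illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr9:15337408..15340553")
print(chrom, start, end, span_length(start, end))
# prints: chr9 15337408 15340553 3146
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.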


Homology
GenBank top hits (e value, % identity, alignment)
CAN67403.1 hypothetical protein VITISV_025614 [Vitis vinifera] (e value: 7.0e-59; % identity: 28.51)
Query:  QVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGEPLSYRDH
        + NP++++W++++R+++SW+YSSL  +  G+I+G   +   W  L  ++ +SS AR+M LR + Q  RK SLTM +Y+ K+K +    AAIGEP++ RD 
Subjt:  QVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGEPLSYRDH

Query:  LVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLL--------AYVDHKLCAPPKWPNNNSTS--------------------------PHCQICGKLG
        ++ +L GL ++YN  V S+      PF + ++  L           +   L  P     NN  S                          P CQ+CGK G
Subjt:  LVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLL--------AYVDHKLCAPPKWPNNNSTS--------------------------PHCQICGKLG

Query:  HTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMDSRETHHVTPDATNLQQSSVYYGNELVVIDNRKI--
        HT + CY+R +  +   NP            + ++  ++     N      ++PS   +E+WF D+  THH++     L     Y GN+ V++ N     
Subjt:  HTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMDSRETHHVTPDATNLQQSSVYYGNELVVIDNRKI--

Query:  -------------FFPRQS-----------FFGVILMGVSK----------SSPTFHIGLLVSRV-------SLLFFYQP----FKMFSCGI--PVRPSD
                     + P Q+             G+ L+  +            +  F I  L ++V        +LF   P    FK+F C     +RP +
Subjt:  -------------FFPRQS-----------FFGVILMGVSK----------SSPTFHIGLLVSRV-------SLLFFYQP----FKMFSCGI--PVRPSD

Query:  ISNCSKSAKN------FSSHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVYQTFPNYFFPPDS---LMSLSSFTPSPLNINPVSPTSLSIT
         +  S  +         S+HKG +CL+  +G+ +V+RHVVF+E+VFPFQS   ++   V  T P   F P S   + SL S T    +  P++    S  
Subjt:  ISNCSKSAKN------FSSHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVYQTFPNYFFPPDS---LMSLSSFTPSPLNINPVSPTSLSIT

Query:  SSPTITELNDQN-QASSPSPDPPTPTFSSDQS--------FPSHHTSPASLPSSMPPS-------------------PILSSPSQSYRV----CHPLHLL
        S P + ++   +   S P P    P  +  ++        F SH + P +   ++  S                    ++  PS    +     + L   
Subjt:  SSPTITELNDQN-QASSPSPDPPTPTFSSDQS--------FPSHHTSPASLPSSMPPS-------------------PILSSPSQSYRV----CHPLHLL

Query:  AQLHLFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLLH
            + ++ +  +AQG+TQ  G++YFETFSPVVK++TIRI+L++A++ NW+V QLDV NAFLHGDL+E VFM+QP  F +  +P+HV  L+
Subjt:  AQLHLFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLLH

CAN71553.1 hypothetical protein VITISV_034738 [Vitis vinifera] (e value: 7.2e-48; % identity: 24.8)
Query:  VAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGEPLS
        ++  +VNP++LVW++Y+R+++SW+YSSL  +  G+I+G   + E W  L+  + +S+ AR M LR   Q  +K SLTM +Y+ K+K I+   AAIGEP+ 
Subjt:  VAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGEPLS

Query:  YRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAY-----------VDHKLCA----------------------------------------
         +D ++ +L GL +EYNP V S+  R D   L  V S+LL +            +  L A                                        
Subjt:  YRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAY-----------VDHKLCA----------------------------------------

Query:  ------PPKWPNNNSTS---PHCQICGKLGHTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMDSRETH
              P ++ NN S +   P CQ+CGK GH  L CY+R +  Y     P             +STS    TP   +    + PS    +SWF+DS  TH
Subjt:  ------PPKWPNNNSTS---PHCQICGKLGHTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMDSRETH

Query:  HVTPDATNLQQSSVYYGNELVVI---------DNRKIF-FPRQSFF------------------------------------------------------
        H++  A N+   + Y G + V++         DN  I  F   SFF                                                      
Subjt:  HVTPDATNLQQSSVYYGNELVVI---------DNRKIF-FPRQSFF------------------------------------------------------

Query:  ----GVILMGV------------------------------------------------------------SKSSPTFHIGLLVSRVSLLFFYQPFKMFS
            GVIL  +                                                             K   T H G  ++    L  +     FS
Subjt:  ----GVILMGV------------------------------------------------------------SKSSPTFHIGLLVSRVSLLFFYQPFKMFS

Query:  CG--------------------------------------------IPVRPSDISN------------------------CSKSAKNFS-----------
        C                                             I   PS + N                        C    K+ +           
Subjt:  CG--------------------------------------------IPVRPSDISN------------------------CSKSAKNFS-----------

Query:  -------SHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVY-QTFPNYFFPPDSLM---SLSSFTPSPLNINPVSPTSLSITSSPTI-----
               SHKG LCL+ ++ + ++SRHVVF E+ FPFQ+ S  +  S +    P +  PP  ++   + SS   +P    P SP + S++  P I     
Subjt:  -------SHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVY-QTFPNYFFPPDSLM---SLSSFTPSPLNINPVSPTSLSITSSPTI-----

Query:  ---TELNDQNQASSPSPDPPTPTFSSDQS--------FPSHHTSPASLPSSMPPSPILSSPSQSYRV--------------------CHPLHLLAQL---
            E    +   S +P P  P  +  +S          S    P ++  ++       +  Q Y+                     C  +  L      
Subjt:  ---TELNDQNQASSPSPDPPTPTFSSDQS--------FPSHHTSPASLPSSMPPSPILSSPSQSYRV--------------------CHPLHLLAQL---

Query:  HLFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLL
         + ++ +  +AQG+ Q YGID+FETFSPVVK  TIR+VLS+AV++NW ++QLDVHNAFL+GDL+E+VFM QP  FED   P HV  L
Subjt:  HLFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLL

CAN75478.1 hypothetical protein VITISV_020209 [Vitis vinifera] (e value: 9.7e-53; % identity: 29.09)
Query:  VAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGEPLS
        ++  +VNP++LV ++Y+R+++SW+YSSL  D   +I+G   + E W  L+  + +S+ AR M LR   Q  +K SLTM +Y+ K+K I+   AAIGEP+ 
Subjt:  VAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGEPLS

Query:  YRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAYVDHKLCAPPKWPNNNSTSPHCQICGKLGHTALVCYNRHN-PLYHASNPPT----PQAF
         +D ++ +L GL +EYNP V S+  R D   L  V S+LL + + +L              H Q         +  +  HN P    SN P+    PQ+F
Subjt:  YRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAYVDHKLCAPPKWPNNNSTSPHCQICGKLGHTALVCYNRHN-PLYHASNPPT----PQAF

Query:  FTQVQSST------------------NSTSSSAATPD-----NASHFYFSTPSPHPNESWFMDSRETHHVTPDATNLQQSSVYYGNELVVIDN-------
           ++  T                  N+ S +   P         H  FS    H  +  +   R THH++  A N+   + Y G + V++ N       
Subjt:  FTQVQSST------------------NSTSSSAATPD-----NASHFYFSTPSPHPNESWFMDSRETHHVTPDATNLQQSSVYYGNELVVIDN-------

Query:  --------RKI----------------FFPRQSFFGVILMGVSKSSPTFHIGLLVSRVSLLFFYQP----FKMFSCGIPVRPSDISNCSKSAKN------
                R +                F+P  +F   + +     SP  +     S  SLL+ + P    FK+F C        +++     ++      
Subjt:  --------RKI----------------FFPRQSFFGVILMGVSKSSPTFHIGLLVSRVSLLFFYQP----FKMFSCGIPVRPSDISNCSKSAKN------

Query:  --FSSHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVY-QTFPNYFFPPDSLM---SLSSFTPSPLNINPVSPTSLSITSSPTI--------
            SHKG LCL+ ++ + ++SRHVVF E+ FPFQ+ S  +  S +    P +  PP  ++   + SS   +P    P SP + S++  P I        
Subjt:  --FSSHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVY-QTFPNYFFPPDSLM---SLSSFTPSPLNINPVSPTSLSITSSPTI--------

Query:  TELNDQNQASSPSPDPPTPTFSSDQS--------FPSHHTSPASLPSSMPPSPILSSPSQSYRV--------------------CHPLHLLAQL---HLF
         E    +   S +P P  P  +  +S          S    P ++  ++       +  Q Y+                     C  +  L       + 
Subjt:  TELNDQNQASSPSPDPPTPTFSSDQS--------FPSHHTSPASLPSSMPPSPILSSPSQSYRV--------------------CHPLHLLAQL---HLF

Query:  QFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLL
        ++ +  +AQG+ Q YGID+FETFSPVVK  TIR+VLS+AV++NW ++QLDVHNAFL+GDL+E+VFM QP  FED   P HV  L
Subjt:  QFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLL

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 1.7e-49; % identity: 41.75)
Query:  APIQFLDVAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFA
        +P ++LD A  QVNP ++ W + N+++MSW+YSSL     G+I+    A +IW  L   YES S A +M+L SQLQ+++K  + +S+YL+++K +  +FA
Subjt:  APIQFLDVAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFA

Query:  AIGEPLSYRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAY--------VDHKLCAP---PKWPNNNSTSPHCQICGKLGHTALVCYNRHNP
         IGEPLSYRD L  ILEGL  EY+ FVTSI NR+DRP L +V SLL  Y        +D  L  P   P+ P  N++ P CQICGK GH AL  Y+R N 
Subjt:  AIGEPLSYRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAY--------VDHKLCAP---PKWPNNNSTSPHCQICGKLGHTALVCYNRHNP

Query:  LYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMDSRETHHVTPDATNLQQSSVYYGNELVVIDNRK
         YH    P   AF       T+S  S+  T   A     S  S   + SW+MDS  THH TP+  ++  +  Y   +  ++ N K
Subjt:  LYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMDSRETHHVTPDATNLQQSSVYYGNELVVIDNRK

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] (e value: 2.6e-61; % identity: 43.33)
Query:  PIQFLDVAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAA
        P QFLD  Q Q NP Y  W++YNR+LM W+YSSL+E+K GE++      +IW  L  VY+S +TAR+M L+++LQ +RKD  ++S+YLAKIK+I  +FAA
Subjt:  PIQFLDVAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAA

Query:  IGEPLSYRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAY------------------------VDHKLCAPP-------------------
        +GEPLSYRDHL ++L+GL SEYN FVTSI NR D P L DVRSLLLAY                        + H    PP                   
Subjt:  IGEPLSYRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAY------------------------VDHKLCAPP-------------------

Query:  ---------------KWPNNNSTSP-HCQICGKLGHTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMD
                       KWP   S+S   CQICGKLGH+A VCY+R N  YH +   +PQA +  VQ S    SS         H +      HP+ESWFMD
Subjt:  ---------------KWPNNNSTSP-HCQICGKLGHTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMD

Query:  SRETHHVTPDATNLQQSSVYYGNELVVIDN
        S  THH+TPD++ L   + Y G E V + N
Subjt:  SRETHHVTPDATNLQQSSVYYGNELVVIDN

TrEMBL top hits (e value, % identity, alignment)
A0A2N9GWM4 Uncharacterized protein (e value: 1.1e-49; % identity: 25.61)
Query:  INGT-PAPIQFLDVAQTQV-----NPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYL
        I+GT PAP   L V+ +       NP +  W   +++++S + SSL+E     ++ C  + ++W  L  ++ S S AR M L  QL  ++K   +M+ + 
Subjt:  INGT-PAPIQFLDVAQTQV-----NPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYL

Query:  AKIKDITYQFAAIGEPLSYRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAY----------VDHKLCAP----------------------
         K   +    AAI +PL   D + + L GL S+Y+  VT+IQ R     L ++    L++          VD  L +                       
Subjt:  AKIKDITYQFAAIGEPLSYRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAY----------VDHKLCAP----------------------

Query:  ------------------PKWPNNNSTSPHCQICGKLGHTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESW
                           + P+ NS+ P CQ+C K GHTAL CY+R +  Y    PP  QA                           +TP+  P+ +W
Subjt:  ------------------PKWPNNNSTSPHCQICGKLGHTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESW

Query:  FMDSRETHHVTPDATNLQ-QSSVYYG-NELVVIDNRKIFFPRQSFFGVILMGVSKSSPTFHIGLLVSRVSLLFFYQPFKMFSCGIPVRPSDISNCSKSAK
        + DS  THH+T D  NL  ++  Y+G +++  +D+ K           +L G SK                     PF +        P+ +     S  
Subjt:  FMDSRETHHVTPDATNLQ-QSSVYYG-NELVVIDNRKIFFPRQSFFGVILMGVSKSSPTFHIGLLVSRVSLLFFYQPFKMFSCGIPVRPSDISNCSKSAK

Query:  NFSSHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVYQTFPNYFFPPDSLMSLSSFTPSPLNINPVSPTSLSITS----------SPTITEL
         + S  G  C  + +G++++SR V+F E+ FPFQ    + P  V Q+ P+   PP  L+ L S  P     +PV P + S  S          +P++  L
Subjt:  NFSSHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVYQTFPNYFFPPDSLMSLSSFTPSPLNINPVSPTSLSITS----------SPTITEL

Query:  NDQNQASSPSPDP------PTPTFSSDQSFPSHHTSPASLPSSMPPSP-------------ILSSPSQ----SYRVCHPLHLLAQLH-------------
           N +S P  DP      PT T  +  SF  +  +P +  ++ PP P              +S P      + R   P  LLA+ H             
Subjt:  NDQNQASSPSPDP------PTPTFSSDQSFPSHHTSPASLPSSMPPSP-------------ILSSPSQ----SYRVCHPLHLLAQLH-------------

Query:  ---------------------------------------------------LFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVR
                                                           + +F +  +A+G+ Q  GIDY ET+SPV+K  T+R +LS+A++  W++R
Subjt:  ---------------------------------------------------LFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVR

Query:  QLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLLHVIVW
        Q+D+ NAFLHG L E+VFM QP  ++ P +P+HV  L+  ++
Subjt:  QLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLLHVIVW

A0A6J1DQX7 uncharacterized protein LOC111022315 (e value: 1.2e-61; % identity: 43.33)
Query:  PIQFLDVAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAA
        P QFLD  Q Q NP Y  W++YNR+LM W+YSSL+E+K GE++      +IW  L  VY+S +TAR+M L+++LQ +RKD  ++S+YLAKIK+I  +FAA
Subjt:  PIQFLDVAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAA

Query:  IGEPLSYRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAY------------------------VDHKLCAPP-------------------
        +GEPLSYRDHL ++L+GL SEYN FVTSI NR D P L DVRSLLLAY                        + H    PP                   
Subjt:  IGEPLSYRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAY------------------------VDHKLCAPP-------------------

Query:  ---------------KWPNNNSTSP-HCQICGKLGHTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMD
                       KWP   S+S   CQICGKLGH+A VCY+R N  YH +   +PQA +  VQ S    SS         H +      HP+ESWFMD
Subjt:  ---------------KWPNNNSTSP-HCQICGKLGHTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMD

Query:  SRETHHVTPDATNLQQSSVYYGNELVVIDN
        S  THH+TPD++ L   + Y G E V + N
Subjt:  SRETHHVTPDATNLQQSSVYYGNELVVIDN

A5AQ04 Integrase catalytic domain-containing protein (e value: 3.5e-48; % identity: 24.8)
Query:  VAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGEPLS
        ++  +VNP++LVW++Y+R+++SW+YSSL  +  G+I+G   + E W  L+  + +S+ AR M LR   Q  +K SLTM +Y+ K+K I+   AAIGEP+ 
Subjt:  VAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGEPLS

Query:  YRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAY-----------VDHKLCA----------------------------------------
         +D ++ +L GL +EYNP V S+  R D   L  V S+LL +            +  L A                                        
Subjt:  YRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAY-----------VDHKLCA----------------------------------------

Query:  ------PPKWPNNNSTS---PHCQICGKLGHTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMDSRETH
              P ++ NN S +   P CQ+CGK GH  L CY+R +  Y     P             +STS    TP   +    + PS    +SWF+DS  TH
Subjt:  ------PPKWPNNNSTS---PHCQICGKLGHTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMDSRETH

Query:  HVTPDATNLQQSSVYYGNELVVI---------DNRKIF-FPRQSFF------------------------------------------------------
        H++  A N+   + Y G + V++         DN  I  F   SFF                                                      
Subjt:  HVTPDATNLQQSSVYYGNELVVI---------DNRKIF-FPRQSFF------------------------------------------------------

Query:  ----GVILMGV------------------------------------------------------------SKSSPTFHIGLLVSRVSLLFFYQPFKMFS
            GVIL  +                                                             K   T H G  ++    L  +     FS
Subjt:  ----GVILMGV------------------------------------------------------------SKSSPTFHIGLLVSRVSLLFFYQPFKMFS

Query:  CG--------------------------------------------IPVRPSDISN------------------------CSKSAKNFS-----------
        C                                             I   PS + N                        C    K+ +           
Subjt:  CG--------------------------------------------IPVRPSDISN------------------------CSKSAKNFS-----------

Query:  -------SHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVY-QTFPNYFFPPDSLM---SLSSFTPSPLNINPVSPTSLSITSSPTI-----
               SHKG LCL+ ++ + ++SRHVVF E+ FPFQ+ S  +  S +    P +  PP  ++   + SS   +P    P SP + S++  P I     
Subjt:  -------SHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVY-QTFPNYFFPPDSLM---SLSSFTPSPLNINPVSPTSLSITSSPTI-----

Query:  ---TELNDQNQASSPSPDPPTPTFSSDQS--------FPSHHTSPASLPSSMPPSPILSSPSQSYRV--------------------CHPLHLLAQL---
            E    +   S +P P  P  +  +S          S    P ++  ++       +  Q Y+                     C  +  L      
Subjt:  ---TELNDQNQASSPSPDPPTPTFSSDQS--------FPSHHTSPASLPSSMPPSPILSSPSQSYRV--------------------CHPLHLLAQL---

Query:  HLFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLL
         + ++ +  +AQG+ Q YGID+FETFSPVVK  TIR+VLS+AV++NW ++QLDVHNAFL+GDL+E+VFM QP  FED   P HV  L
Subjt:  HLFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLL

A5B1N8 Integrase catalytic domain-containing protein (e value: 3.4e-59; % identity: 28.51)
Query:  QVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGEPLSYRDH
        + NP++++W++++R+++SW+YSSL  +  G+I+G   +   W  L  ++ +SS AR+M LR + Q  RK SLTM +Y+ K+K +    AAIGEP++ RD 
Subjt:  QVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGEPLSYRDH

Query:  LVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLL--------AYVDHKLCAPPKWPNNNSTS--------------------------PHCQICGKLG
        ++ +L GL ++YN  V S+      PF + ++  L           +   L  P     NN  S                          P CQ+CGK G
Subjt:  LVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLL--------AYVDHKLCAPPKWPNNNSTS--------------------------PHCQICGKLG

Query:  HTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMDSRETHHVTPDATNLQQSSVYYGNELVVIDNRKI--
        HT + CY+R +  +   NP            + ++  ++     N      ++PS   +E+WF D+  THH++     L     Y GN+ V++ N     
Subjt:  HTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMDSRETHHVTPDATNLQQSSVYYGNELVVIDNRKI--

Query:  -------------FFPRQS-----------FFGVILMGVSK----------SSPTFHIGLLVSRV-------SLLFFYQP----FKMFSCGI--PVRPSD
                     + P Q+             G+ L+  +            +  F I  L ++V        +LF   P    FK+F C     +RP +
Subjt:  -------------FFPRQS-----------FFGVILMGVSK----------SSPTFHIGLLVSRV-------SLLFFYQP----FKMFSCGI--PVRPSD

Query:  ISNCSKSAKN------FSSHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVYQTFPNYFFPPDS---LMSLSSFTPSPLNINPVSPTSLSIT
         +  S  +         S+HKG +CL+  +G+ +V+RHVVF+E+VFPFQS   ++   V  T P   F P S   + SL S T    +  P++    S  
Subjt:  ISNCSKSAKN------FSSHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVYQTFPNYFFPPDS---LMSLSSFTPSPLNINPVSPTSLSIT

Query:  SSPTITELNDQN-QASSPSPDPPTPTFSSDQS--------FPSHHTSPASLPSSMPPS-------------------PILSSPSQSYRV----CHPLHLL
        S P + ++   +   S P P    P  +  ++        F SH + P +   ++  S                    ++  PS    +     + L   
Subjt:  SSPTITELNDQN-QASSPSPDPPTPTFSSDQS--------FPSHHTSPASLPSSMPPS-------------------PILSSPSQSYRV----CHPLHLL

Query:  AQLHLFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLLH
            + ++ +  +AQG+TQ  G++YFETFSPVVK++TIRI+L++A++ NW+V QLDV NAFLHGDL+E VFM+QP  F +  +P+HV  L+
Subjt:  AQLHLFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLLH

A5CAE5 Reverse transcriptase Ty1/copia-type domain-containing protein (e value: 4.7e-53; % identity: 29.09)
Query:  VAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGEPLS
        ++  +VNP++LV ++Y+R+++SW+YSSL  D   +I+G   + E W  L+  + +S+ AR M LR   Q  +K SLTM +Y+ K+K I+   AAIGEP+ 
Subjt:  VAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGEPLS

Query:  YRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAYVDHKLCAPPKWPNNNSTSPHCQICGKLGHTALVCYNRHN-PLYHASNPPT----PQAF
         +D ++ +L GL +EYNP V S+  R D   L  V S+LL + + +L              H Q         +  +  HN P    SN P+    PQ+F
Subjt:  YRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAYVDHKLCAPPKWPNNNSTSPHCQICGKLGHTALVCYNRHN-PLYHASNPPT----PQAF

Query:  FTQVQSST------------------NSTSSSAATPD-----NASHFYFSTPSPHPNESWFMDSRETHHVTPDATNLQQSSVYYGNELVVIDN-------
           ++  T                  N+ S +   P         H  FS    H  +  +   R THH++  A N+   + Y G + V++ N       
Subjt:  FTQVQSST------------------NSTSSSAATPD-----NASHFYFSTPSPHPNESWFMDSRETHHVTPDATNLQQSSVYYGNELVVIDN-------

Query:  --------RKI----------------FFPRQSFFGVILMGVSKSSPTFHIGLLVSRVSLLFFYQP----FKMFSCGIPVRPSDISNCSKSAKN------
                R +                F+P  +F   + +     SP  +     S  SLL+ + P    FK+F C        +++     ++      
Subjt:  --------RKI----------------FFPRQSFFGVILMGVSKSSPTFHIGLLVSRVSLLFFYQP----FKMFSCGIPVRPSDISNCSKSAKN------

Query:  --FSSHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVY-QTFPNYFFPPDSLM---SLSSFTPSPLNINPVSPTSLSITSSPTI--------
            SHKG LCL+ ++ + ++SRHVVF E+ FPFQ+ S  +  S +    P +  PP  ++   + SS   +P    P SP + S++  P I        
Subjt:  --FSSHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVY-QTFPNYFFPPDSLM---SLSSFTPSPLNINPVSPTSLSITSSPTI--------

Query:  TELNDQNQASSPSPDPPTPTFSSDQS--------FPSHHTSPASLPSSMPPSPILSSPSQSYRV--------------------CHPLHLLAQL---HLF
         E    +   S +P P  P  +  +S          S    P ++  ++       +  Q Y+                     C  +  L       + 
Subjt:  TELNDQNQASSPSPDPPTPTFSSDQS--------FPSHHTSPASLPSSMPPSPILSSPSQSYRV--------------------CHPLHLLAQL---HLF

Query:  QFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLL
        ++ +  +AQG+ Q YGID+FETFSPVVK  TIR+VLS+AV++NW ++QLDVHNAFL+GDL+E+VFM QP  FED   P HV  L
Subjt:  QFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLL

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 1.1e-11; % identity: 47.83)
Query:  QFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQP
        ++ +  +A+G+TQ Y IDY ETF+PV + ++ R +LSL +  N  V Q+DV  AFL+G LKEE++MR P
Subjt:  QFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.6e-13; % identity: 50.67)
Query:  LFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFE
        L ++ +  + +G+ Q  GID+ E FSPVVK  +IR +LSLA + +  V QLDV  AFLHGDL+EE++M QP  FE
Subjt:  LFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 6.6e-20; % identity: 24.37)
Query:  AQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGEPLSY
        A  +VNP+Y  W++ ++++ S +  +++      +     A +IWE LR +Y + S   +  LR+QL++  K + T+  Y+  +     Q A +G+P+ +
Subjt:  AQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGEPLSY

Query:  RDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAY-------------------VDHKLCAPPK----------------------W-------
         + +  +LE L  EY P +  I  +   P L ++   LL +                   V H+                            W       
Subjt:  RDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAY-------------------VDHKLCAPPK----------------------W-------

Query:  -PNNNSTSPH---CQICGKLGHTALVCYNRHNPL--YHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMDSRETHHVTPDAT
         PNNN + P+   CQICG  GH+A  C    + L   ++  PP+P   FT  Q   N                 +  SP+ + +W +DS  THH+T D  
Subjt:  -PNNNSTSPH---CQICGKLGHTALVCYNRHNPL--YHASNPPTPQAFFTQVQSSTNSTSSSAATPDNASHFYFSTPSPHPNESWFMDSRETHHVTPDAT

Query:  NLQQSSVYYGNELVVI
        NL     Y G + V++
Subjt:  NLQQSSVYYGNELVVI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.1e-17; % identity: 52.33)
Query:  LFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLL
        L ++ +  +A+GY Q  G+DY ETFSPV+KS +IRIVL +AV  +W +RQLDV+NAFL G L ++V+M QP  F D   PN+V  L
Subjt:  LFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 8.1e-18; % identity: 49.45)
Query:  LFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLLHVIVW
        L ++ +  +A+GY Q  G+DY ETFSPV+KS +IRIVL +AV  +W +RQLDV+NAFL G L +EV+M QP  F D   P++V  L   ++
Subjt:  LFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLLHVIVW

Arabidopsis top hits (e value, % identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 1.7e-07; % identity: 21.83)
Query:  LDVAQTQVNPNYLVWQKYNRILMSWMYSSLNEDK-TGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGE
        +D      N N + WQK + I+   +Y +L   +  G  +    + +IW  ++  + ++  AR + L S+L+      + ++ Y  K+K +      +  
Subjt:  LDVAQTQVNPNYLVWQKYNRILMSWMYSSLNEDK-TGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAAIGE

Query:  PLSYRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLL
        P++ R+ ++Y+L GL  +++  +  I++R   P   D  ++L
Subjt:  PLSYRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLL

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 8.6e-15; % identity: 42.05)
Query:  QFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFE----DPFFPNHVLLL
        ++ +  +A+GYTQ  GID+ ETFSPV K  +++++L+++   N+T+ QLD+ NAFL+GDL EE++M+ P  +     D   PN V  L
Subjt:  QFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQLDVHNAFLHGDLKEEVFMRQPASFE----DPFFPNHVLLL


Sequences
CDS sequence
ATGGAGAGTTTGATCAATGGAACTCCTGCTCCTATTCAGTTTTTGGATGTGGCTCAAACCCAGGTAAACCCTAATTACTTAGTTTGGCAAAAATACAATCGCATTCTGAT
GAGCTGGATGTATTCGTCCCTAAATGAAGATAAAACAGGTGAGATTATTGGTTGTGGTTTTGCTTTTGAAATATGGGAACATCTTCGTGTGGTTTATGAATCTTCTTCTA
CTGCTCGTTTAATGGCTTTACGATCTCAGTTGCAAAAGGTTAGAAAAGATAGTCTCACTATGTCTAAGTACCTTGCCAAAATTAAGGATATTACATATCAGTTTGCTGCA
ATAGGAGAACCTCTGTCTTATAGGGATCATCTTGTTTACATTTTGGAAGGTCTTCGCTCGGAATACAACCCGTTTGTCACCTCAATTCAGAATAGAACAGATCGCCCTTT
TTTAGCTGATGTTAGGAGTCTCTTGCTTGCGTATGTCGATCACAAACTCTGCGCCCCCCCAAAATGGCCTAACAACAATTCAACCAGCCCTCATTGCCAAATTTGTGGAA
AACTTGGCCATACCGCCCTTGTCTGTTACAATCGGCACAATCCCTTATATCATGCTTCAAATCCACCCACCCCTCAGGCATTTTTTACCCAAGTTCAATCCTCAACCAAC
TCCACCTCCTCTTCAGCTGCTACACCAGATAATGCATCCCATTTTTACTTCTCTACTCCATCACCACACCCTAACGAATCTTGGTTCATGGACTCCAGGGAAACTCATCA
CGTGACTCCTGATGCAACTAACCTTCAGCAATCCTCTGTATACTATGGTAATGAACTGGTTGTAATCGACAATAGAAAGATCTTCTTTCCAAGACAATCCTTCTTTGGGG
TCATTTTGATGGGGGTCTCTAAAAGCTCTCCAACTTTTCACATCGGTCTCCTTGTTTCTCGAGTCAGCCTGCTGTTTTTTTATCAACCTTTCAAGATGTTCAGCTGTGGC
ATTCCTGTTAGGCCATCCGACATTTCCAATTGTTCAAAGAGTGCTAAAAACTTTTCCTCTCATAAGGGGAATCTCTGTCTTGACCTGTCTTCTGGAAAATCATTTGTCTC
ACGTCATGTGGTGTTTAATGAATCAGTTTTTCCTTTTCAATCTAACTCGGTTAAGGCTCCTATGTCTGTTTACCAGACCTTTCCAAATTACTTTTTTCCTCCTGATTCTC
TTATGTCTTTATCTTCTTTTACTCCTTCCCCCTTAAATATTAATCCTGTGTCTCCAACATCATTGTCGATTACTTCCTCCCCTACCATTACAGAGTTAAATGATCAGAAC
CAAGCTTCTTCACCATCTCCAGACCCTCCCACCCCTACCTTTTCGAGCGACCAATCTTTCCCCTCTCACCACACATCACCTGCTTCCTTACCTAGTTCCATGCCTCCTTC
TCCCATCTTGAGCTCACCTTCACAGAGTTACAGAGTTTGCCATCCTCTCCATCTTCTGGCTCAGTTGCACCTCTTCCAATTCTCTTCATCCTACTTGGCTCAAGGGTATA
CTCAAGCATATGGAATTGACTACTTTGAGACGTTCAGTCCTGTTGTCAAGTCGGCTACTATTCGAATTGTATTGTCCTTGGCAGTCACTAATAATTGGACCGTTCGACAA
TTAGATGTGCATAACGCCTTCTTGCATGGTGATCTAAAGGAAGAAGTGTTCATGAGACAACCTGCTAGTTTTGAGGATCCATTTTTTCCCAATCATGTTCTGCTTTTACA
CGTCATTGTATGGCCTTAA
mRNA sequence
ATGGAGAGTTTGATCAATGGAACTCCTGCTCCTATTCAGTTTTTGGATGTGGCTCAAACCCAGGTAAACCCTAATTACTTAGTTTGGCAAAAATACAATCGCATTCTGAT
GAGCTGGATGTATTCGTCCCTAAATGAAGATAAAACAGGTGAGATTATTGGTTGTGGTTTTGCTTTTGAAATATGGGAACATCTTCGTGTGGTTTATGAATCTTCTTCTA
CTGCTCGTTTAATGGCTTTACGATCTCAGTTGCAAAAGGTTAGAAAAGATAGTCTCACTATGTCTAAGTACCTTGCCAAAATTAAGGATATTACATATCAGTTTGCTGCA
ATAGGAGAACCTCTGTCTTATAGGGATCATCTTGTTTACATTTTGGAAGGTCTTCGCTCGGAATACAACCCGTTTGTCACCTCAATTCAGAATAGAACAGATCGCCCTTT
TTTAGCTGATGTTAGGAGTCTCTTGCTTGCGTATGTCGATCACAAACTCTGCGCCCCCCCAAAATGGCCTAACAACAATTCAACCAGCCCTCATTGCCAAATTTGTGGAA
AACTTGGCCATACCGCCCTTGTCTGTTACAATCGGCACAATCCCTTATATCATGCTTCAAATCCACCCACCCCTCAGGCATTTTTTACCCAAGTTCAATCCTCAACCAAC
TCCACCTCCTCTTCAGCTGCTACACCAGATAATGCATCCCATTTTTACTTCTCTACTCCATCACCACACCCTAACGAATCTTGGTTCATGGACTCCAGGGAAACTCATCA
CGTGACTCCTGATGCAACTAACCTTCAGCAATCCTCTGTATACTATGGTAATGAACTGGTTGTAATCGACAATAGAAAGATCTTCTTTCCAAGACAATCCTTCTTTGGGG
TCATTTTGATGGGGGTCTCTAAAAGCTCTCCAACTTTTCACATCGGTCTCCTTGTTTCTCGAGTCAGCCTGCTGTTTTTTTATCAACCTTTCAAGATGTTCAGCTGTGGC
ATTCCTGTTAGGCCATCCGACATTTCCAATTGTTCAAAGAGTGCTAAAAACTTTTCCTCTCATAAGGGGAATCTCTGTCTTGACCTGTCTTCTGGAAAATCATTTGTCTC
ACGTCATGTGGTGTTTAATGAATCAGTTTTTCCTTTTCAATCTAACTCGGTTAAGGCTCCTATGTCTGTTTACCAGACCTTTCCAAATTACTTTTTTCCTCCTGATTCTC
TTATGTCTTTATCTTCTTTTACTCCTTCCCCCTTAAATATTAATCCTGTGTCTCCAACATCATTGTCGATTACTTCCTCCCCTACCATTACAGAGTTAAATGATCAGAAC
CAAGCTTCTTCACCATCTCCAGACCCTCCCACCCCTACCTTTTCGAGCGACCAATCTTTCCCCTCTCACCACACATCACCTGCTTCCTTACCTAGTTCCATGCCTCCTTC
TCCCATCTTGAGCTCACCTTCACAGAGTTACAGAGTTTGCCATCCTCTCCATCTTCTGGCTCAGTTGCACCTCTTCCAATTCTCTTCATCCTACTTGGCTCAAGGGTATA
CTCAAGCATATGGAATTGACTACTTTGAGACGTTCAGTCCTGTTGTCAAGTCGGCTACTATTCGAATTGTATTGTCCTTGGCAGTCACTAATAATTGGACCGTTCGACAA
TTAGATGTGCATAACGCCTTCTTGCATGGTGATCTAAAGGAAGAAGTGTTCATGAGACAACCTGCTAGTTTTGAGGATCCATTTTTTCCCAATCATGTTCTGCTTTTACA
CGTCATTGTATGGCCTTAA
Protein sequence
MESLINGTPAPIQFLDVAQTQVNPNYLVWQKYNRILMSWMYSSLNEDKTGEIIGCGFAFEIWEHLRVVYESSSTARLMALRSQLQKVRKDSLTMSKYLAKIKDITYQFAA
IGEPLSYRDHLVYILEGLRSEYNPFVTSIQNRTDRPFLADVRSLLLAYVDHKLCAPPKWPNNNSTSPHCQICGKLGHTALVCYNRHNPLYHASNPPTPQAFFTQVQSSTN
STSSSAATPDNASHFYFSTPSPHPNESWFMDSRETHHVTPDATNLQQSSVYYGNELVVIDNRKIFFPRQSFFGVILMGVSKSSPTFHIGLLVSRVSLLFFYQPFKMFSCG
IPVRPSDISNCSKSAKNFSSHKGNLCLDLSSGKSFVSRHVVFNESVFPFQSNSVKAPMSVYQTFPNYFFPPDSLMSLSSFTPSPLNINPVSPTSLSITSSPTITELNDQN
QASSPSPDPPTPTFSSDQSFPSHHTSPASLPSSMPPSPILSSPSQSYRVCHPLHLLAQLHLFQFSSSYLAQGYTQAYGIDYFETFSPVVKSATIRIVLSLAVTNNWTVRQ
LDVHNAFLHGDLKEEVFMRQPASFEDPFFPNHVLLLHVIVWP
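
The protein sequence above can be related to the CDS by codon-wise translation. The following is a minimal sketch that assumes the standard nuclear genetic code; the page does not say which translation table was used. The `translate` function and `CODON_TABLE` name are hypothetical, introduced only for illustration. The sketch translates the first 24 nucleotides and the last 21 nucleotides of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code laid out in the conventional T, C, A, G base order.
# ASSUMPTION: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 nucleotides of the CDS shown on this page.
print(translate("ATGGAGAGTTTGATCAATGGAACT"))  # prints: MESLINGT
# Last 21 nucleotides of the CDS, ending in the TAA stop codon.
print(translate("CACGTCATTGTATGGCCTTAA"))  # prints: HVIVWP
```

Both results agree with the start (MESLINGT) and end (HVIVWP) of the protein sequence listed above. To translate the full CDS, join its lines into one string before calling `translate`, since the line breaks do not fall on codon boundaries.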