; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008792 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008792
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr9:30212206..30214448
RNA-Seq ExpressionLag0008792
SyntenyLag0008792
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW59875.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.3e-7133.84Show/hide
Query:  IPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTS
        +P++  A      L   L +KL  NN++LW+ Q+ N V ANG   +++G    PPQ      T  NPD+  W R++R I+ WIYSSL+ E MG+IV   S
Subjt:  IPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTS

Query:  AFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLA
        +   W +L R + + + AR+M L+ + Q  RK  LT+ +Y+ ++K + D  +AIGEP++ RD +  +L GLG++YN+ V ++  R D  SL  V S+LL 
Subjt:  AFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLA

Query:  YEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTK------PSVSTQNTNQQQ---TFSSPTLICHHRTNLAYQTPPPQALVHSVQPS
        +E RL  Q  V + N+  ANL+    QH N +  +  Q   S   T+       S S+Q+  Q Q    F    + C+HR ++ +Q   P   + +VQ +
Subjt:  YEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTK------PSVSTQNTNQQQ---TFSSPTLICHHRTNLAYQTPPPQALVHSVQPS

Query:  -PTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSF-STSKPITLQ--------------------
         P + +   +   +  T  DE+WF D+GATHH++  +  L +  PY G +++ VGNGK + I H G+  F S+SK   L+                    
Subjt:  -PTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSF-STSKPITLQ--------------------

Query:  ---------------TDLKSKKILLQGRLEDGLYKLSS---PQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSI-ANEFCND
                        D  +KKILLQG LE GLY+  +   P P     S    +  L+  + ++ WH RLGHPA + LK +L+ CN+S     N  C  
Subjt:  ---------------TDLKSKKILLQGRLEDGLYKLSS---PQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSI-ANEFCND

Query:  CQLAKNHRLPFAVVETKTGEPFQIVHSDV
        CQ AK+H+LPF V  ++   P  ++H+D+
Subjt:  CQLAKNHRLPFAVVETKTGEPFQIVHSDV

RVW64314.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.3e-7133.84Show/hide
Query:  IPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTS
        +P++  A      L   L +KL  NN++LW+ Q+ N V ANG   +++G    PPQ      T  NPD+  W R++R I+ WIYSSL+ E MG+IV   S
Subjt:  IPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTS

Query:  AFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLA
        +   W +L R + + + AR+M L+ + Q  RK  LT+ +Y+ ++K + D  +AIGEP++ RD +  +L GLG++YN+ V ++  R D  SL  V S+LL 
Subjt:  AFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLA

Query:  YEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTK------PSVSTQNTNQQQ---TFSSPTLICHHRTNLAYQTPPPQALVHSVQPS
        +E RL  Q  V + N+  ANL+    QH N +  +  Q   S   T+       S S+Q+  Q Q    F    + C+HR ++ +Q   P   + +VQ +
Subjt:  YEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTK------PSVSTQNTNQQQ---TFSSPTLICHHRTNLAYQTPPPQALVHSVQPS

Query:  -PTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSF-STSKPITLQ--------------------
         P + +   +   +  T  DE+WF D+GATHH++  +  L +  PY G +++ VGNGK + I H G+  F S+SK   L+                    
Subjt:  -PTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSF-STSKPITLQ--------------------

Query:  ---------------TDLKSKKILLQGRLEDGLYKLSS---PQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSI-ANEFCND
                        D  +KKILLQG LE GLY+  +   P P     S    +  L+  + ++ WH RLGHPA + LK +L+ CN+S     N  C  
Subjt:  ---------------TDLKSKKILLQGRLEDGLYKLSS---PQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSI-ANEFCND

Query:  CQLAKNHRLPFAVVETKTGEPFQIVHSDV
        CQ AK+H+LPF V  ++   P  ++H+D+
Subjt:  CQLAKNHRLPFAVVETKTGEPFQIVHSDV

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]3.7e-8738.4Show/hide
Query:  QQPYPQIPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGE
        Q P PQI        P P+L Q L++KL + N LL K+QLLN +IANGL  ++D    SPP++LD    Q NP++  W+R N+ +M WIYSSL+   +G+
Subjt:  QQPYPQIPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGE

Query:  IVNLTSAFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDV
        IV  ++A DIW SL+  Y+S + A +M L +QLQ+I+K  + +S+YL+++K + D+F+ IGEP+SYRD L  IL+GL  EY+ FVT+I NR+D PSL++V
Subjt:  IVNLTSAFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDV

Query:  RSLLLAYEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTKPSVSTQNTNQQQTFSSPTLICHHRTNLAYQTP--PPQALVHSVQP--
         SLL  YE RL ++++   LN  QA               N  Q   +N   +  +  ++           L  +HRTNL Y  P  P  A  +   P  
Subjt:  RSLLLAYEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTKPSVSTQNTNQQQTFSSPTLICHHRTNLAYQTP--PPQALVHSVQP--

Query:  --SPTSFSDTSSQAPTDYT--HPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIG-SFSFSTSKPITLQ----------------
          SP S   T+S APT  +    D SW++DSGATHH TP+   + + + Y+ G+   VGN K+I ISHIG +   S+ KPI L                 
Subjt:  --SPTSFSDTSSQAPTDYT--HPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIG-SFSFSTSKPITLQ----------------

Query:  -------------------TDLKSKKILLQGRLEDGLYKLSSPQPRLSSDSFSHPA----------VCLASASVSSTWHLRLGHPAASTLKQVLSLCNVS
                            D ++K++LLQG LE GLYKL+ P    SS   S P+            L+  +    WH RLGHPA   + QVL  CN+ 
Subjt:  -------------------TDLKSKKILLQGRLEDGLYKLSSPQPRLSSDSFSHPA----------VCLASASVSSTWHLRLGHPAASTLKQVLSLCNVS

Query:  SSIANEFCNDCQLAKNHRLPFAVVETKTGEPFQIVHSDV
         S +   C+ CQLAK+HRLPF + E++  +PF +V+SD+
Subjt:  SSIANEFCNDCQLAKNHRLPFAVVETKTGEPFQIVHSDV

RVX06084.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.3e-7133.84Show/hide
Query:  IPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTS
        +P++  A      L   L +KL  NN++LW+ Q+ N V ANG   +++G    PPQ      T  NPD+  W R++R I+ WIYSSL+ E MG+IV   S
Subjt:  IPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTS

Query:  AFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLA
        +   W +L R + + + AR+M L+ + Q  RK  LT+ +Y+ ++K + D  +AIGEP++ RD +  +L GLG++YN+ V ++  R D  SL  V S+LL 
Subjt:  AFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLA

Query:  YEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTK------PSVSTQNTNQQQ---TFSSPTLICHHRTNLAYQTPPPQALVHSVQPS
        +E RL  Q  V + N+  ANL+    QH N +  +  Q   S   T+       S S+Q+  Q Q    F    + C+HR ++ +Q   P   + +VQ +
Subjt:  YEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTK------PSVSTQNTNQQQ---TFSSPTLICHHRTNLAYQTPPPQALVHSVQPS

Query:  -PTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSF-STSKPITLQ--------------------
         P + +   +   +  T  DE+WF D+GATHH++  +  L +  PY G +++ VGNGK + I H G+  F S+SK   L+                    
Subjt:  -PTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSF-STSKPITLQ--------------------

Query:  ---------------TDLKSKKILLQGRLEDGLYKLSS---PQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSI-ANEFCND
                        D  +KKILLQG LE GLY+  +   P P     S    +  L+  + ++ WH RLGHPA + LK +L+ CN+S     N  C  
Subjt:  ---------------TDLKSKKILLQGRLEDGLYKLSS---PQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSI-ANEFCND

Query:  CQLAKNHRLPFAVVETKTGEPFQIVHSDV
        CQ AK+H+LPF V  ++   P  ++H+D+
Subjt:  CQLAKNHRLPFAVVETKTGEPFQIVHSDV

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]2.1e-12760.59Show/hide
Query:  QFSPHQPNFFAQPFYLRPLFPAQQPYPQIPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASW
        QF P  PNF AQP                PN + A NP+PTLPQPL VKL DNNFLLWKNQLLNAVIANGL GYLDG+I  PPQFLD    QPNP Y +W
Subjt:  QFSPHQPNFFAQPFYLRPLFPAQQPYPQIPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASW

Query:  ERYNRFIMCWIYSSLSEEKMGEIVNLTSAFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLG
        ERYNR +MCWIYSSLSEEKMGE+V+L +  DIW+SL+R YDSKTTARIMGLKT+LQ +RKDG +VSQYLA+IK+I DKF+A+GEP+SYRDHLAH+LDGLG
Subjt:  ERYNRFIMCWIYSSLSEEKMGEIVNLTSAFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLG

Query:  SEYNAFVTTIQNRTDNPSLEDVRSLLLAYEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPF---------------------------
        SEYNAFVT+I NR D+PSLEDVRSLLLAYEARL+KQ  VDQLNIAQANL  L+LQH ++R    P+F+  N +                           
Subjt:  SEYNAFVTTIQNRTDNPSLEDVRSLLLAYEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPF---------------------------

Query:  -TKPSVSTQNTNQQQTFSSPTLICHHRTNLAYQTPPPQALVHSVQPSPTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITV
          KPS S               +C+HRTN+AY    PQAL H VQPSPT  S     +  ++ HPDESWF+DSGATHHMTPD S LCNP PY+GGEQ+TV
Subjt:  -TKPSVSTQNTNQQQTFSSPTLICHHRTNLAYQTPPPQALVHSVQPSPTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITV

Query:  GNGKKI
        GNG  +
Subjt:  GNGKKI

TrEMBL top hitse value%identityAlignment
A0A438FIP9 Retrovirus-related Pol polyprotein from transposon RE16.1e-7233.84Show/hide
Query:  IPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTS
        +P++  A      L   L +KL  NN++LW+ Q+ N V ANG   +++G    PPQ      T  NPD+  W R++R I+ WIYSSL+ E MG+IV   S
Subjt:  IPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTS

Query:  AFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLA
        +   W +L R + + + AR+M L+ + Q  RK  LT+ +Y+ ++K + D  +AIGEP++ RD +  +L GLG++YN+ V ++  R D  SL  V S+LL 
Subjt:  AFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLA

Query:  YEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTK------PSVSTQNTNQQQ---TFSSPTLICHHRTNLAYQTPPPQALVHSVQPS
        +E RL  Q  V + N+  ANL+    QH N +  +  Q   S   T+       S S+Q+  Q Q    F    + C+HR ++ +Q   P   + +VQ +
Subjt:  YEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTK------PSVSTQNTNQQQ---TFSSPTLICHHRTNLAYQTPPPQALVHSVQPS

Query:  -PTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSF-STSKPITLQ--------------------
         P + +   +   +  T  DE+WF D+GATHH++  +  L +  PY G +++ VGNGK + I H G+  F S+SK   L+                    
Subjt:  -PTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSF-STSKPITLQ--------------------

Query:  ---------------TDLKSKKILLQGRLEDGLYKLSS---PQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSI-ANEFCND
                        D  +KKILLQG LE GLY+  +   P P     S    +  L+  + ++ WH RLGHPA + LK +L+ CN+S     N  C  
Subjt:  ---------------TDLKSKKILLQGRLEDGLYKLSS---PQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSI-ANEFCND

Query:  CQLAKNHRLPFAVVETKTGEPFQIVHSDV
        CQ AK+H+LPF V  ++   P  ++H+D+
Subjt:  CQLAKNHRLPFAVVETKTGEPFQIVHSDV

A0A438FWJ3 Retrovirus-related Pol polyprotein from transposon TNT 1-946.1e-7233.84Show/hide
Query:  IPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTS
        +P++  A      L   L +KL  NN++LW+ Q+ N V ANG   +++G    PPQ      T  NPD+  W R++R I+ WIYSSL+ E MG+IV   S
Subjt:  IPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTS

Query:  AFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLA
        +   W +L R + + + AR+M L+ + Q  RK  LT+ +Y+ ++K + D  +AIGEP++ RD +  +L GLG++YN+ V ++  R D  SL  V S+LL 
Subjt:  AFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLA

Query:  YEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTK------PSVSTQNTNQQQ---TFSSPTLICHHRTNLAYQTPPPQALVHSVQPS
        +E RL  Q  V + N+  ANL+    QH N +  +  Q   S   T+       S S+Q+  Q Q    F    + C+HR ++ +Q   P   + +VQ +
Subjt:  YEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTK------PSVSTQNTNQQQ---TFSSPTLICHHRTNLAYQTPPPQALVHSVQPS

Query:  -PTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSF-STSKPITLQ--------------------
         P + +   +   +  T  DE+WF D+GATHH++  +  L +  PY G +++ VGNGK + I H G+  F S+SK   L+                    
Subjt:  -PTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSF-STSKPITLQ--------------------

Query:  ---------------TDLKSKKILLQGRLEDGLYKLSS---PQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSI-ANEFCND
                        D  +KKILLQG LE GLY+  +   P P     S    +  L+  + ++ WH RLGHPA + LK +L+ CN+S     N  C  
Subjt:  ---------------TDLKSKKILLQGRLEDGLYKLSS---PQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSI-ANEFCND

Query:  CQLAKNHRLPFAVVETKTGEPFQIVHSDV
        CQ AK+H+LPF V  ++   P  ++H+D+
Subjt:  CQLAKNHRLPFAVVETKTGEPFQIVHSDV

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE11.8e-8738.4Show/hide
Query:  QQPYPQIPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGE
        Q P PQI        P P+L Q L++KL + N LL K+QLLN +IANGL  ++D    SPP++LD    Q NP++  W+R N+ +M WIYSSL+   +G+
Subjt:  QQPYPQIPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGE

Query:  IVNLTSAFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDV
        IV  ++A DIW SL+  Y+S + A +M L +QLQ+I+K  + +S+YL+++K + D+F+ IGEP+SYRD L  IL+GL  EY+ FVT+I NR+D PSL++V
Subjt:  IVNLTSAFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDV

Query:  RSLLLAYEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTKPSVSTQNTNQQQTFSSPTLICHHRTNLAYQTP--PPQALVHSVQP--
         SLL  YE RL ++++   LN  QA               N  Q   +N   +  +  ++           L  +HRTNL Y  P  P  A  +   P  
Subjt:  RSLLLAYEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTKPSVSTQNTNQQQTFSSPTLICHHRTNLAYQTP--PPQALVHSVQP--

Query:  --SPTSFSDTSSQAPTDYT--HPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIG-SFSFSTSKPITLQ----------------
          SP S   T+S APT  +    D SW++DSGATHH TP+   + + + Y+ G+   VGN K+I ISHIG +   S+ KPI L                 
Subjt:  --SPTSFSDTSSQAPTDYT--HPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIG-SFSFSTSKPITLQ----------------

Query:  -------------------TDLKSKKILLQGRLEDGLYKLSSPQPRLSSDSFSHPA----------VCLASASVSSTWHLRLGHPAASTLKQVLSLCNVS
                            D ++K++LLQG LE GLYKL+ P    SS   S P+            L+  +    WH RLGHPA   + QVL  CN+ 
Subjt:  -------------------TDLKSKKILLQGRLEDGLYKLSSPQPRLSSDSFSHPA----------VCLASASVSSTWHLRLGHPAASTLKQVLSLCNVS

Query:  SSIANEFCNDCQLAKNHRLPFAVVETKTGEPFQIVHSDV
         S +   C+ CQLAK+HRLPF + E++  +PF +V+SD+
Subjt:  SSIANEFCNDCQLAKNHRLPFAVVETKTGEPFQIVHSDV

A0A438JAU4 Retrovirus-related Pol polyprotein from transposon TNT 1-946.1e-7233.84Show/hide
Query:  IPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTS
        +P++  A      L   L +KL  NN++LW+ Q+ N V ANG   +++G    PPQ      T  NPD+  W R++R I+ WIYSSL+ E MG+IV   S
Subjt:  IPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTS

Query:  AFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLA
        +   W +L R + + + AR+M L+ + Q  RK  LT+ +Y+ ++K + D  +AIGEP++ RD +  +L GLG++YN+ V ++  R D  SL  V S+LL 
Subjt:  AFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLA

Query:  YEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTK------PSVSTQNTNQQQ---TFSSPTLICHHRTNLAYQTPPPQALVHSVQPS
        +E RL  Q  V + N+  ANL+    QH N +  +  Q   S   T+       S S+Q+  Q Q    F    + C+HR ++ +Q   P   + +VQ +
Subjt:  YEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTK------PSVSTQNTNQQQ---TFSSPTLICHHRTNLAYQTPPPQALVHSVQPS

Query:  -PTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSF-STSKPITLQ--------------------
         P + +   +   +  T  DE+WF D+GATHH++  +  L +  PY G +++ VGNGK + I H G+  F S+SK   L+                    
Subjt:  -PTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSF-STSKPITLQ--------------------

Query:  ---------------TDLKSKKILLQGRLEDGLYKLSS---PQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSI-ANEFCND
                        D  +KKILLQG LE GLY+  +   P P     S    +  L+  + ++ WH RLGHPA + LK +L+ CN+S     N  C  
Subjt:  ---------------TDLKSKKILLQGRLEDGLYKLSS---PQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSI-ANEFCND

Query:  CQLAKNHRLPFAVVETKTGEPFQIVHSDV
        CQ AK+H+LPF V  ++   P  ++H+D+
Subjt:  CQLAKNHRLPFAVVETKTGEPFQIVHSDV

A0A6J1DQX7 uncharacterized protein LOC1110223151.0e-12760.59Show/hide
Query:  QFSPHQPNFFAQPFYLRPLFPAQQPYPQIPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASW
        QF P  PNF AQP                PN + A NP+PTLPQPL VKL DNNFLLWKNQLLNAVIANGL GYLDG+I  PPQFLD    QPNP Y +W
Subjt:  QFSPHQPNFFAQPFYLRPLFPAQQPYPQIPNTYPAPNPYPTLPQPLAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASW

Query:  ERYNRFIMCWIYSSLSEEKMGEIVNLTSAFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLG
        ERYNR +MCWIYSSLSEEKMGE+V+L +  DIW+SL+R YDSKTTARIMGLKT+LQ +RKDG +VSQYLA+IK+I DKF+A+GEP+SYRDHLAH+LDGLG
Subjt:  ERYNRFIMCWIYSSLSEEKMGEIVNLTSAFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLG

Query:  SEYNAFVTTIQNRTDNPSLEDVRSLLLAYEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPF---------------------------
        SEYNAFVT+I NR D+PSLEDVRSLLLAYEARL+KQ  VDQLNIAQANL  L+LQH ++R    P+F+  N +                           
Subjt:  SEYNAFVTTIQNRTDNPSLEDVRSLLLAYEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPF---------------------------

Query:  -TKPSVSTQNTNQQQTFSSPTLICHHRTNLAYQTPPPQALVHSVQPSPTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITV
          KPS S               +C+HRTN+AY    PQAL H VQPSPT  S     +  ++ HPDESWF+DSGATHHMTPD S LCNP PY+GGEQ+TV
Subjt:  -TKPSVSTQNTNQQQTFSSPTLICHHRTNLAYQTPPPQALVHSVQPSPTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITV

Query:  GNGKKI
        GNG  +
Subjt:  GNGKKI

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-4828.68Show/hide
Query:  KLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHT-QPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTSAFDIWTSLSRSYDSKTTAR
        KLT  N+L+W  Q+        L+G+LDGS   PP  +      + NPDY  W+R ++ I   +  ++S      +   T+A  IW +L + Y + +   
Subjt:  KLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHT-QPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTSAFDIWTSLSRSYDSKTTAR

Query:  IMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLAYEARLEKQTVVDQLNIAQA
        +  L+TQL++  K   T+  Y+  +    D+ + +G+PM + + +  +L+ L  EY   +  I  +   P+L ++   LL +E+++   +    + I   
Subjt:  IMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLAYEARLEKQTVVDQLNIAQA

Query:  NLSCLNLQHT--------NRRTFNKPQFALSNPFTKPSVSTQ-NTNQQQTFSSPTLICHHRTNLAYQTPPPQALVHSV--QPSPTSFSDTSSQAPTDYTH
         +S  N   T        N R  N+     S P+ + S +   N NQ + +     IC  + + A +    Q  + SV  Q  P+ F+    +A      
Subjt:  NLSCLNLQHT--------NRRTFNKPQFALSNPFTKPSVSTQ-NTNQQQTFSSPTLICHHRTNLAYQTPPPQALVHSV--QPSPTSFSDTSSQAPTDYTH

Query:  P--DESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSFST-SKPITLQT-----------------------------------
        P    +W LDSGATHH+T D ++L    PYTGG+ + V +G  IPISH GS S ST S+P+ L                                     
Subjt:  P--DESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSFST-SKPITLQT-----------------------------------

Query:  DLKSKKILLQGRLEDGLYK--LSSPQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSIANE-----FCNDCQLAKNHRLPFAV
        DL +   LLQG+ +D LY+  ++S QP      F+ P    +S +  S+WH RLGHPA S L  V+S  N S S+ N       C+DC + K++++PF+ 
Subjt:  DLKSKKILLQGRLEDGLYK--LSSPQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSIANE-----FCNDCQLAKNHRLPFAV

Query:  VETKTGEPFQIVHSDV
            +  P + ++SDV
Subjt:  VETKTGEPFQIVHSDV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.6e-3926.74Show/hide
Query:  KLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHT-QPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTSAFDIWTSLSRSYDSKTTAR
        KLT  N+L+W  Q+        L+G+LDGS P PP  +      + NPDY  W R ++ I   I  ++S      +   T+A  IW +L + Y + +   
Subjt:  KLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHT-QPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTSAFDIWTSLSRSYDSKTTAR

Query:  IMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLAYEARL----EKQTVVDQLN
        +    TQL+ I +                D+ + +G+PM + + +  +L+ L  +Y   +  I  +   PSL ++   L+  E++L      + V    N
Subjt:  IMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLAYEARL----EKQTVVDQLN

Query:  IAQANLSCLNLQHTNR---RTFNKPQFALSNPFTKPSVSTQNTNQQQTFSSPTLICHHRTNLAYQTPPPQALVHSVQPSPTSFSDTSSQAP--------T
        +     +  N    NR   R +N      ++     S S  +  Q + +     IC  + + A + P     +H  Q +      TS   P         
Subjt:  IAQANLSCLNLQHTNR---RTFNKPQFALSNPFTKPSVSTQNTNQQQTFSSPTLICHHRTNLAYQTPPPQALVHSVQPSPTSFSDTSSQAP--------T

Query:  DYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSFSTSK-----------------------------------PITLQ
        +  +   +W LDSGATHH+T D ++L    PYTGG+ + + +G  IPI+H GS S  TS                                    P + Q
Subjt:  DYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGSFSFSTSK-----------------------------------PITLQ

Query:  T-DLKSKKILLQGRLEDGLYKLSSPQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSIANE-----FCNDCQLAKNHRLPFAV
          DL +   LLQG+ +D LY+     P  SS + S  A   + A+ SS WH RLGHP+ + L  V+S  N S  + N       C+DC + K+H++PF+ 
Subjt:  T-DLKSKKILLQGRLEDGLYKLSSPQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSIANE-----FCNDCQLAKNHRLPFAV

Query:  VETKTGEPFQIVHSDV
            + +P + ++SDV
Subjt:  VETKTGEPFQIVHSDV

Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.3e-1724Show/hide
Query:  LAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNL-TSAFDIWTSLSRSYDSKT
        + + L   N+ +W+       ++ G+ G++DGS  S P  + ++          W+  +  +  WIY ++++  +  I+ +  +A D+W SL   +    
Subjt:  LAVKLTDNNFLLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNL-TSAFDIWTSLSRSYDSKT

Query:  TARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLAYEARLEKQTVVDQLNI
         AR +  + +L+    D L+V +Y  ++K ++D  + +  P+S R  + H+L+GL  +Y+  +  I++++  PS  + RS+LL  E+RL           
Subjt:  TARIMGLKTQLQKIRKDGLTVSQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLAYEARLEKQTVVDQLNI

Query:  AQANLSCLNLQHTNRRTFNKPQFAL
          +N S  +L HTN  + +   F +
Subjt:  AQANLSCLNLQHTNRRTFNKPQFAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGATTGAAGAAGGGTCTTCATCCTCTTCCTCTGCAGCACCTCAAATTACACCAGTGATGACCCCTAGTTCTCCATTAACAACTCCAGTTGTTACTCCCATTCATAA
CCCTATACCTCCAAATGCAGCTCGTCCCTCTCTTCCGAATTATCCTCAACCTAATCAGTTCTCTCCACACCAACCAAATTTCTTTGCCCAACCATTTTATCTCAGGCCTC
TGTTTCCAGCACAGCAACCTTACCCCCAAATTCCAAATACCTACCCAGCCCCAAATCCATATCCAACCTTACCTCAACCATTAGCCGTCAAACTCACAGATAATAACTTC
CTCCTTTGGAAGAACCAGCTTCTCAACGCTGTTATTGCGAATGGATTGTCCGGTTATCTAGATGGGTCGATTCCCTCTCCTCCACAATTCCTTGATCAACAACACACTCA
ACCGAATCCGGACTATGCAAGCTGGGAACGCTACAACCGCTTCATCATGTGTTGGATTTATTCATCCTTATCAGAAGAGAAAATGGGTGAGATCGTTAACTTAACCTCTG
CCTTTGATATTTGGACTTCTCTTTCTCGTTCATATGATTCAAAGACTACTGCTCGTATAATGGGTTTAAAGACACAGCTGCAAAAGATTAGAAAAGATGGATTGACTGTT
AGCCAATATTTGGCTCAAATTAAAGATATTACTGATAAATTCTCTGCTATTGGCGAACCTATGTCATATCGTGATCATCTTGCACATATATTAGATGGATTAGGAAGTGA
ATATAATGCTTTTGTAACTACTATTCAGAATAGAACTGATAATCCGTCTTTAGAAGATGTTAGGAGCTTGTTATTGGCATATGAAGCTAGACTTGAGAAGCAAACTGTTG
TTGATCAACTCAATATTGCCCAAGCTAATTTAAGCTGCCTTAATCTTCAGCACACTAATCGTAGGACCTTTAACAAGCCTCAGTTCGCCTTGTCTAATCCCTTTACTAAG
CCTTCTGTTTCTACTCAAAATACCAATCAGCAACAGACCTTTTCCTCTCCTACCCTCATATGCCATCATCGAACAAACTTAGCTTACCAAACTCCACCCCCACAGGCCTT
AGTTCATTCAGTCCAGCCATCTCCCACTTCCTTTTCTGATACTTCATCCCAGGCACCAACTGATTATACACACCCTGACGAATCCTGGTTCCTTGATTCTGGTGCAACCC
ACCATATGACTCCAGATATGTCTTCCCTGTGCAACCCAATTCCTTACACTGGTGGTGAACAAATTACTGTTGGAAATGGTAAGAAAATCCCTATTTCTCATATTGGTTCA
TTTTCTTTTTCTACATCTAAACCCATTACTTTACAAACTGATCTCAAGTCCAAGAAAATCCTCCTTCAGGGCAGACTTGAAGATGGGCTCTATAAGCTGTCATCTCCTCA
GCCACGGTTGTCTTCCGATTCATTCAGTCACCCTGCTGTTTGCTTGGCGTCTGCTTCAGTTTCCTCTACTTGGCATCTGCGATTAGGCCACCCTGCTGCTTCAACCTTGA
AGCAAGTTTTGTCTCTTTGTAATGTTTCTTCTAGTATAGCCAATGAATTTTGTAATGATTGCCAATTGGCTAAAAATCATCGATTACCTTTTGCTGTAGTTGAAACCAAA
ACTGGTGAGCCTTTTCAAATAGTTCACTCGGATGTCTGTGTGTGCTCCGATGATCCACGTCTTCAGCTTGAAATGCCTCGGTCAGAATCTGGGTACAAAACGCCTCCACG
CCTTCGTTTCAGATCTAGGATGAAACACCTCCACGCCTCCTCCAGATTTAGGTACGAACCGGCGGTCTCTATGGGTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGATTGAAGAAGGGTCTTCATCCTCTTCCTCTGCAGCACCTCAAATTACACCAGTGATGACCCCTAGTTCTCCATTAACAACTCCAGTTGTTACTCCCATTCATAA
CCCTATACCTCCAAATGCAGCTCGTCCCTCTCTTCCGAATTATCCTCAACCTAATCAGTTCTCTCCACACCAACCAAATTTCTTTGCCCAACCATTTTATCTCAGGCCTC
TGTTTCCAGCACAGCAACCTTACCCCCAAATTCCAAATACCTACCCAGCCCCAAATCCATATCCAACCTTACCTCAACCATTAGCCGTCAAACTCACAGATAATAACTTC
CTCCTTTGGAAGAACCAGCTTCTCAACGCTGTTATTGCGAATGGATTGTCCGGTTATCTAGATGGGTCGATTCCCTCTCCTCCACAATTCCTTGATCAACAACACACTCA
ACCGAATCCGGACTATGCAAGCTGGGAACGCTACAACCGCTTCATCATGTGTTGGATTTATTCATCCTTATCAGAAGAGAAAATGGGTGAGATCGTTAACTTAACCTCTG
CCTTTGATATTTGGACTTCTCTTTCTCGTTCATATGATTCAAAGACTACTGCTCGTATAATGGGTTTAAAGACACAGCTGCAAAAGATTAGAAAAGATGGATTGACTGTT
AGCCAATATTTGGCTCAAATTAAAGATATTACTGATAAATTCTCTGCTATTGGCGAACCTATGTCATATCGTGATCATCTTGCACATATATTAGATGGATTAGGAAGTGA
ATATAATGCTTTTGTAACTACTATTCAGAATAGAACTGATAATCCGTCTTTAGAAGATGTTAGGAGCTTGTTATTGGCATATGAAGCTAGACTTGAGAAGCAAACTGTTG
TTGATCAACTCAATATTGCCCAAGCTAATTTAAGCTGCCTTAATCTTCAGCACACTAATCGTAGGACCTTTAACAAGCCTCAGTTCGCCTTGTCTAATCCCTTTACTAAG
CCTTCTGTTTCTACTCAAAATACCAATCAGCAACAGACCTTTTCCTCTCCTACCCTCATATGCCATCATCGAACAAACTTAGCTTACCAAACTCCACCCCCACAGGCCTT
AGTTCATTCAGTCCAGCCATCTCCCACTTCCTTTTCTGATACTTCATCCCAGGCACCAACTGATTATACACACCCTGACGAATCCTGGTTCCTTGATTCTGGTGCAACCC
ACCATATGACTCCAGATATGTCTTCCCTGTGCAACCCAATTCCTTACACTGGTGGTGAACAAATTACTGTTGGAAATGGTAAGAAAATCCCTATTTCTCATATTGGTTCA
TTTTCTTTTTCTACATCTAAACCCATTACTTTACAAACTGATCTCAAGTCCAAGAAAATCCTCCTTCAGGGCAGACTTGAAGATGGGCTCTATAAGCTGTCATCTCCTCA
GCCACGGTTGTCTTCCGATTCATTCAGTCACCCTGCTGTTTGCTTGGCGTCTGCTTCAGTTTCCTCTACTTGGCATCTGCGATTAGGCCACCCTGCTGCTTCAACCTTGA
AGCAAGTTTTGTCTCTTTGTAATGTTTCTTCTAGTATAGCCAATGAATTTTGTAATGATTGCCAATTGGCTAAAAATCATCGATTACCTTTTGCTGTAGTTGAAACCAAA
ACTGGTGAGCCTTTTCAAATAGTTCACTCGGATGTCTGTGTGTGCTCCGATGATCCACGTCTTCAGCTTGAAATGCCTCGGTCAGAATCTGGGTACAAAACGCCTCCACG
CCTTCGTTTCAGATCTAGGATGAAACACCTCCACGCCTCCTCCAGATTTAGGTACGAACCGGCGGTCTCTATGGGTTTTTAA
Protein sequenceShow/hide protein sequence
MMIEEGSSSSSSAAPQITPVMTPSSPLTTPVVTPIHNPIPPNAARPSLPNYPQPNQFSPHQPNFFAQPFYLRPLFPAQQPYPQIPNTYPAPNPYPTLPQPLAVKLTDNNF
LLWKNQLLNAVIANGLSGYLDGSIPSPPQFLDQQHTQPNPDYASWERYNRFIMCWIYSSLSEEKMGEIVNLTSAFDIWTSLSRSYDSKTTARIMGLKTQLQKIRKDGLTV
SQYLAQIKDITDKFSAIGEPMSYRDHLAHILDGLGSEYNAFVTTIQNRTDNPSLEDVRSLLLAYEARLEKQTVVDQLNIAQANLSCLNLQHTNRRTFNKPQFALSNPFTK
PSVSTQNTNQQQTFSSPTLICHHRTNLAYQTPPPQALVHSVQPSPTSFSDTSSQAPTDYTHPDESWFLDSGATHHMTPDMSSLCNPIPYTGGEQITVGNGKKIPISHIGS
FSFSTSKPITLQTDLKSKKILLQGRLEDGLYKLSSPQPRLSSDSFSHPAVCLASASVSSTWHLRLGHPAASTLKQVLSLCNVSSSIANEFCNDCQLAKNHRLPFAVVETK
TGEPFQIVHSDVCVCSDDPRLQLEMPRSESGYKTPPRLRFRSRMKHLHASSRFRYEPAVSMGF