; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017723 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017723
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase
Genome locationchr5:7749836..7760973
RNA-Seq ExpressionLag0017723
SyntenyLag0017723
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR021109 - Aspartic peptidase domain superfamily
IPR025724 - GAG-pre-integrase domain
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051601.1 integrase [Cucumis melo var. makuwa]1.3e-14363.85Show/hide
Query:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG
        ++ ETIEEFFNR+L+IVN L SNGE +GDQRVVEKILRSM RK+EHI+VAIEESKDL TLSINSLM SLQSHELRLK+FD    EEAF MQTS RG S G
Subjt:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG

Query:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG
        RRGG G RG GR+ D+++   SE S   +S +RGRG  S RGR    GR++G G G+FS I+CFNCG+ GHFQA CW+ K     T + MH EQ+  D+G
Subjt:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG

Query:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF
        +LFL  +VQ+   E  WYLDSGCSNHMTG + IFVTLDES   EVKTGDN +L+V+G+GDILVKTK G KR+T+V++VPGLK NLLS+GQLL +G  V F
Subjt:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF

Query:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN
        +  IC IK + G LI KV MT NKMFP+  +Y ++ CF +++KD SWLWH+ YGHL+F +LS++C+ HMVRG+ NI+ E  +CE CIL KHHR  FPTG 
Subjt:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN

Query:  AWRASKPLELVHTDLCGPMRTTTLGG
        AWRASKPLEL+HTDLCGPMRTTT GG
Subjt:  AWRASKPLELVHTDLCGPMRTTTLGG

KAA0053129.1 integrase [Cucumis melo var. makuwa]6.0e-14463.38Show/hide
Query:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG
        ++ ETIEEFFNR+L+IVN L SNGE +GDQRVVEKILRSM RK+EHI+VAIEESKDL TLSINSLM SLQSHELRLK+FD    EEAF MQTS RG S G
Subjt:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG

Query:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKK--TNQAKTKLMHEQQDNDQG
        RRGG G RG GR+ D+++   SE S    S +RGRG  S RG     GR++G G G+FS I+CFNCG+ GHFQA+CW+ K     A   +  EQ+ ND+G
Subjt:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKK--TNQAKTKLMHEQQDNDQG

Query:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF
        +LFL  +VQ+   E  WYLDSGCSNHMTG + IFVTLDES   EVKTGDN +L+V+G+GDILVKTK G KR+T+V++VPGLK NLLS+GQLL +G  V F
Subjt:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF

Query:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN
        +  IC IK + G LI KV MT NKMFP+  +Y ++ CF +++KD SWLWH+ YGHL+F +LS++C+ HMVRG+ NI+ E  +CE CIL KHHR  FPTG 
Subjt:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN

Query:  AWRASKPLELVHTDLCGPMRTTTLGG
        AWRASKPLEL+HTDLCGPMRTTT GG
Subjt:  AWRASKPLELVHTDLCGPMRTTTLGG

KAA0060377.1 integrase [Cucumis melo var. makuwa]2.3e-14363.85Show/hide
Query:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG
        ++ ETIEEFFNR+L+IVN L SNGE +GDQRVVEKILRSM RK+EHI+VAIEESKDL TLSINSLM SLQSHELRLK+FD    EEAF MQTS RG S G
Subjt:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG

Query:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG
        RRGG G RG GR+ D+++   SE S   +S +RGRG  S RGR    GR++G G G+FS I+CFNCG+ GHFQA CW+ K     T + MH EQ+  D+G
Subjt:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG

Query:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF
        +LFL  +VQ+   E  WYLDSGCSNHMTG + IFVTLDES   EVKTGDN +L+V+G+GDILVKTK G KR+T+V++VPGLK NLLS+GQLL +G  V F
Subjt:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF

Query:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN
        +  IC IK + G LI KV MT NKMFP+  +Y ++ CF +++KD SWLWH+ YGHL+F +LS++C+ HMVRG+ NI+ E  +CE CIL KHHR  FPTG 
Subjt:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN

Query:  AWRASKPLELVHTDLCGPMRTTTLGG
        AWRASKPLEL+HTDLCGPMRTTT GG
Subjt:  AWRASKPLELVHTDLCGPMRTTTLGG

KAA0060690.1 integrase [Cucumis melo var. makuwa]1.3e-14363.85Show/hide
Query:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG
        ++ ETIEEFFNR+L+IVN L SNGE +GDQRVVEKILRSM RK+EHI+VAIEESKDL TLSINSLM SLQSHELRLK+FD    EEAF MQTS RG S G
Subjt:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG

Query:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG
        RRGG G RG GR+ D+++   SE S   +S +RGRG  S RGR    GR++G G G+FS I+CFNCG+ GHFQA CW+ K     T + MH EQ+  D+G
Subjt:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG

Query:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF
        +LFL  +VQ+   E  WYLDSGCSNHMTG + IFVTLDES   EVKTGDN +L+V+G+GDILVKTK G KR+T+V++VPGLK NLLS+GQLL +G  V F
Subjt:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF

Query:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN
        +  IC IK + G LI KV MT NKMFP+  +Y ++ CF +++KD SWLWH+ YGHL+F +LS++C+ HMVRG+ NI+ E  +CE CIL KHHR  FPTG 
Subjt:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN

Query:  AWRASKPLELVHTDLCGPMRTTTLGG
        AWRASKPLEL+HTDLCGPMRTTT GG
Subjt:  AWRASKPLELVHTDLCGPMRTTTLGG

TYJ95504.1 integrase [Cucumis melo var. makuwa]2.3e-14363.85Show/hide
Query:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG
        ++ ETIEEFFNR+L+IVN L SNGE +GDQRVVEKILRSM RK+EHI+VAIEESKDL TLSINSLM SLQSHELRLK+FD    EEAF MQTS RG S G
Subjt:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG

Query:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG
        RRGG G RG GR+ D+++   SE S   +S +RGRG  S RGR    GR++G G G+FS I+CFNCG+ GHFQA CW+ K     T + MH EQ+  D+G
Subjt:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG

Query:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF
        +LFL  +VQ+   E  WYLDSGCSNHMTG + IFVTLDES   EVKTGDN +L+V+G+GDILVKTK G KR+T+V++VPGLK NLLS+GQLL +G  V F
Subjt:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF

Query:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN
        +  IC IK + G LI KV MT NKMFP+  +Y ++ CF +++KD SWLWH+ YGHL+F +LS++C+ HMVRG+ NI+ E  +CE CIL KHHR  FPTG 
Subjt:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN

Query:  AWRASKPLELVHTDLCGPMRTTTLGG
        AWRASKPLEL+HTDLCGPMRTTT GG
Subjt:  AWRASKPLELVHTDLCGPMRTTTLGG

TrEMBL top hitse value%identityAlignment
A0A5A7UDJ2 Integrase6.5e-14463.85Show/hide
Query:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG
        ++ ETIEEFFNR+L+IVN L SNGE +GDQRVVEKILRSM RK+EHI+VAIEESKDL TLSINSLM SLQSHELRLK+FD    EEAF MQTS RG S G
Subjt:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG

Query:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG
        RRGG G RG GR+ D+++   SE S   +S +RGRG  S RGR    GR++G G G+FS I+CFNCG+ GHFQA CW+ K     T + MH EQ+  D+G
Subjt:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG

Query:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF
        +LFL  +VQ+   E  WYLDSGCSNHMTG + IFVTLDES   EVKTGDN +L+V+G+GDILVKTK G KR+T+V++VPGLK NLLS+GQLL +G  V F
Subjt:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF

Query:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN
        +  IC IK + G LI KV MT NKMFP+  +Y ++ CF +++KD SWLWH+ YGHL+F +LS++C+ HMVRG+ NI+ E  +CE CIL KHHR  FPTG 
Subjt:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN

Query:  AWRASKPLELVHTDLCGPMRTTTLGG
        AWRASKPLEL+HTDLCGPMRTTT GG
Subjt:  AWRASKPLELVHTDLCGPMRTTTLGG

A0A5A7UI36 Integrase2.9e-14463.38Show/hide
Query:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG
        ++ ETIEEFFNR+L+IVN L SNGE +GDQRVVEKILRSM RK+EHI+VAIEESKDL TLSINSLM SLQSHELRLK+FD    EEAF MQTS RG S G
Subjt:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG

Query:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKK--TNQAKTKLMHEQQDNDQG
        RRGG G RG GR+ D+++   SE S    S +RGRG  S RG     GR++G G G+FS I+CFNCG+ GHFQA+CW+ K     A   +  EQ+ ND+G
Subjt:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKK--TNQAKTKLMHEQQDNDQG

Query:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF
        +LFL  +VQ+   E  WYLDSGCSNHMTG + IFVTLDES   EVKTGDN +L+V+G+GDILVKTK G KR+T+V++VPGLK NLLS+GQLL +G  V F
Subjt:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF

Query:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN
        +  IC IK + G LI KV MT NKMFP+  +Y ++ CF +++KD SWLWH+ YGHL+F +LS++C+ HMVRG+ NI+ E  +CE CIL KHHR  FPTG 
Subjt:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN

Query:  AWRASKPLELVHTDLCGPMRTTTLGG
        AWRASKPLEL+HTDLCGPMRTTT GG
Subjt:  AWRASKPLELVHTDLCGPMRTTTLGG

A0A5A7V047 Integrase6.5e-14463.85Show/hide
Query:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG
        ++ ETIEEFFNR+L+IVN L SNGE +GDQRVVEKILRSM RK+EHI+VAIEESKDL TLSINSLM SLQSHELRLK+FD    EEAF MQTS RG S G
Subjt:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG

Query:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG
        RRGG G RG GR+ D+++   SE S   +S +RGRG  S RGR    GR++G G G+FS I+CFNCG+ GHFQA CW+ K     T + MH EQ+  D+G
Subjt:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG

Query:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF
        +LFL  +VQ+   E  WYLDSGCSNHMTG + IFVTLDES   EVKTGDN +L+V+G+GDILVKTK G KR+T+V++VPGLK NLLS+GQLL +G  V F
Subjt:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF

Query:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN
        +  IC IK + G LI KV MT NKMFP+  +Y ++ CF +++KD SWLWH+ YGHL+F +LS++C+ HMVRG+ NI+ E  +CE CIL KHHR  FPTG 
Subjt:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN

Query:  AWRASKPLELVHTDLCGPMRTTTLGG
        AWRASKPLEL+HTDLCGPMRTTT GG
Subjt:  AWRASKPLELVHTDLCGPMRTTTLGG

A0A5D3BQ81 Integrase1.1e-14363.85Show/hide
Query:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG
        ++ ETIEEFFNR+L+IVN L SNGE +GDQRVVEKILRSM RK+EHI+VAIEESKDL TLSINSLM SLQSHELRLK+FD    EEAF MQTS RG S G
Subjt:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG

Query:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG
        RRGG G RG GR+ D+++   SE S   +S +RGRG  S RGR    GR++G G G+FS I+CFNCG+ GHFQA CW+ K     T + MH EQ+  D+G
Subjt:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG

Query:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF
        +LFL  +VQ+   E  WYLDSGCSNHMTG + IFVTLDES   EVKTGDN +L+V+G+GDILVKTK G KR+T+V++VPGLK NLLS+GQLL +G  V F
Subjt:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF

Query:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN
        +  IC IK + G LI KV MT NKMFP+  +Y ++ CF +++KD SWLWH+ YGHL+F +LS++C+ HMVRG+ NI+ E  +CE CIL KHHR  FPTG 
Subjt:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN

Query:  AWRASKPLELVHTDLCGPMRTTTLGG
        AWRASKPLEL+HTDLCGPMRTTT GG
Subjt:  AWRASKPLELVHTDLCGPMRTTTLGG

A0A5D3CLV1 Integrase1.1e-14363.85Show/hide
Query:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG
        ++ ETIEEFFNR+L+IVN L SNGE +GDQRVVEKILRSM RK+EHI+VAIEESKDL TLSINSLM SLQSHELRLK+FD    EEAF MQTS RG S G
Subjt:  EDAETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSG

Query:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG
        RRGG G RG GR+ D+++   SE S   +S +RGRG  S RGR    GR++G G G+FS I+CFNCG+ GHFQA CW+ K     T + MH EQ+  D+G
Subjt:  RRGGRGGRGNGRSKDSKN-FESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKL-MH-EQQDNDQG

Query:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF
        +LFL  +VQ+   E  WYLDSGCSNHMTG + IFVTLDES   EVKTGDN +L+V+G+GDILVKTK G KR+T+V++VPGLK NLLS+GQLL +G  V F
Subjt:  LLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIF

Query:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN
        +  IC IK + G LI KV MT NKMFP+  +Y ++ CF +++KD SWLWH+ YGHL+F +LS++C+ HMVRG+ NI+ E  +CE CIL KHHR  FPTG 
Subjt:  KDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGN

Query:  AWRASKPLELVHTDLCGPMRTTTLGG
        AWRASKPLEL+HTDLCGPMRTTT GG
Subjt:  AWRASKPLELVHTDLCGPMRTTTLGG

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.2e-1624.88Show/hide
Query:  IVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEE-SKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSGRRGGRGGRGNGRSK
        ++++L++ G  I +   +  +L ++   Y+ II AIE  S++ LTL+   +   L   E+++K                                N  SK
Subjt:  IVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEE-SKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSGRRGGRGGRGNGRSK

Query:  DSKNFESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKK---TNQAKTKLMHEQQDNDQGLLFLTLNVQETSTE
           N     ++N   +N  + R  ++ +   +G S+ +       +KC +CGR GH + +C+  K    N+ K      Q     G+ F+   V  TS  
Subjt:  DSKNFESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKK---TNQAKTKLMHEQQDNDQGLLFLTLNVQETSTE

Query:  DI--WYLDSGCSNHMTGRKDIFVTLDESLHKE-VKTGDNKKLEVQGRGDILVKTKNGAKR--------ITDVYFVPGLKQNLLSVGQLLLKGHDVIFKDS
        D   + LDSG S+H+          DESL+ + V+     K+ V  +G+ +  TK G  R        + DV F      NL+SV +L   G  + F  S
Subjt:  DI--WYLDSGCSNHMTGRKDIFVTLDESLHKE-VKTGDNKKLEVQGRGDILVKTKNGAKR--------ITDVYFVPGLKQNLLSVGQLLLKGHDVIFKDS

Query:  ICEIK------TKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRG---MPNIHKEDQLCETCILGKHHRS
           I        KN G++  V +   + + I   +          K+   LWH  +GH+S   L  + +++M      + N+    ++CE C+ GK  R 
Subjt:  ICEIK------TKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRG---MPNIHKEDQLCETCILGKHHRS

Query:  PF-PTGNAWRASKPLELVHTDLCGPMRTTTLGGK
        PF    +     +PL +VH+D+CGP+   TL  K
Subjt:  PF-PTGNAWRASKPLELVHTDLCGPMRTTTLGGK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.2e-1926.36Show/hide
Query:  GRGNGRSKDSKNFESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKLMHEQ-------QDNDQGLL
        GRG    + S N+               GRS +RG+S +R +SR R         C+NC + GHF+ +C + +  + +T             Q+ND  +L
Subjt:  GRGNGRSKDSKNFESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKLMHEQ-------QDNDQGLL

Query:  FLTLNVQE-----TSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRI-TDVYFVPGLKQNLLSVGQLLLKGH
        F  +N +E     +  E  W +D+  S+H T  +D+F          VK G+    ++ G GDI +KT  G   +  DV  VP L+ NL+S   L   G+
Subjt:  FLTLNVQE-----TSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRI-TDVYFVPGLKQNLLSVGQLLLKGH

Query:  DVIFKDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPF
        +  F +   + +   G L+    +    ++       +        + +  LWH   GH+S   L  + ++ ++        +   C+ C+ GK HR  F
Subjt:  DVIFKDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLLCFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPF

Query:  PTGNAWRASKPLELVHTDLCGPMRTTTLGG
         T +  R    L+LV++D+CGPM   ++GG
Subjt:  PTGNAWRASKPLELVHTDLCGPMRTTTLGG

Arabidopsis top hitse value%identityAlignment
AT3G20980.1 Gag-Pol-related retrotransposon family protein5.5e-1035.61Show/hide
Query:  TSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVK--TGDNKKLE---VQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIFKDSIC
        T  E+IW + S  SNHMT     F TLD S   +VK  +GD  +     V+G GD+   T  G K I +V +VPG++ N LSV QL   G +V       
Subjt:  TSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVK--TGDNKKLE---VQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIFKDSIC

Query:  EIKTKNGGLITKVHMTTNKMFPIKMSYEKLLC
         ++ + G  +     TT KMF   M  ++  C
Subjt:  EIKTKNGGLITKVHMTTNKMFPIKMSYEKLLC

AT3G21000.1 Gag-Pol-related retrotransposon family protein1.6e-0931.11Show/hide
Query:  CFNCGRNGHFQAECWSKKTNQAKTKLMHEQQDNDQGLLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVK
        C  C +N H Q +C  +     + K   ++   D  L  +     +T  +DIW +      +MT     F TLD +    V T D   L V+G+GD+ ++
Subjt:  CFNCGRNGHFQAECWSKKTNQAKTKLMHEQQDNDQGLLFLTLNVQETSTEDIWYLDSGCSNHMTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVK

Query:  TKNGAKR-ITDVYFVPGLKQNLLSVGQLLLKGHDV
         K G K+ I +V FVPGL +N+LS G+++ K + +
Subjt:  TKNGAKR-ITDVYFVPGLKQNLLSVGQLLLKGHDV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACGAGAGGGTTTTCTGTTTGATGGTTGGACCACAAACAGGTTGTTCATTAGAGGAGAACTGATCCGTGGACACAGAAAATATGTCTGCAGTGAGAAGAGTGCAAC
TACTCTCAAGACAAGACCTAGAATTCTCTCTAGCCTCCCTCTTAGAGAAAGACTCCCACAAGTCTTTTGCCTCCTAGACTCAGAGTCATACCGGTGTAACCTCTGTGGTT
ATTGTGTCAATCAAGAAGTAATTTCCAGCGATAAAAGCAAAGAGGCTACTGCGTTTTCGTTCGTTGGAGCGTCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGATGCT
GAAACTATTGAAGAATTTTTTAATCGTGTTCTCTTAATTGTTAACCAATTGATATCAAATGGAGAAACAATTGGAGATCAAAGAGTGGTCGAAAAGATCCTCAGAAGCAT
GACTAGAAAATATGAACACATTATCGTAGCAATTGAAGAATCTAAAGATTTGTTGACTCTCTCCATTAATAGCTTAATGAGATCTCTCCAATCTCATGAGCTTAGATTGA
AGCGGTTCGATTCTACTTTTTCAGAAGAAGCTTTTCATATGCAAACTTCTTCTAGAGGGAGATCTAGTGGAAGAAGAGGTGGACGTGGTGGTCGAGGTAATGGCAGATCC
AAGGACTCCAAAAATTTTGAGTCCGAAAGAAGTGATAATCCAAATTCTTCAAATAGAGGAAGAGGAAGAAGTTCAAGCAGAGGAAGAAGTTCAAGCAGAGGAAGAAGCAG
AGGTAGAGGTCATGGAGATTTTTCTCACATTAAATGTTTCAATTGTGGACGCAATGGACATTTTCAAGCAGAGTGTTGGTCCAAAAAGACTAACCAGGCAAAGACAAAAC
TAATGCATGAACAACAGGATAATGACCAAGGTCTTCTTTTCCTCACCTTAAATGTCCAAGAAACAAGCACTGAAGATATATGGTATCTTGATAGTGGTTGCAGTAATCAC
ATGACGGGAAGGAAGGATATTTTCGTAACTTTGGATGAATCTCTTCATAAAGAGGTGAAGACTGGTGACAATAAGAAGCTCGAAGTTCAAGGAAGAGGAGATATCCTTGT
TAAGACAAAGAATGGAGCAAAAAGAATCACTGATGTATATTTTGTTCCAGGTCTTAAACAAAATCTCTTAAGTGTTGGACAACTGTTATTGAAAGGGCATGATGTAATCT
TCAAAGACAGCATTTGCGAGATCAAAACCAAGAATGGAGGTCTCATAACAAAGGTACATATGACTACAAACAAGATGTTTCCTATCAAAATGTCTTATGAGAAGCTTTTA
TGTTTTGAGACGTTGGTTAAAGATACTTCATGGCTTTGGCATTATCTATATGGACACTTGAGTTTTGACACTTTATCGCACATGTGTCAACAACATATGGTGAGAGGAAT
GCCAAACATTCACAAGGAAGATCAACTTTGTGAAACATGTATTTTGGGGAAGCATCATCGAAGTCCATTTCCAACAGGAAATGCTTGGAGAGCATCAAAACCTCTTGAGC
TTGTCCATACAGATTTATGTGGACCCATGCGAACTACTACACTTGGAGGGAAATGGAATCAAACACCAAAAGAAAGTTCGAAGAACTCCCCAACAAAATGGAGTTGCAGA
GAGAATGAACAGGATAATAATGGAGCTTGCAAGAAATGCCGCAATTCAAAGAAATCAAGCTTCAATGAGAGCCCTGGAATTGCAATGGGTCAGCTTGCTAATGAGCTGAA
GGCACGTCCTCAAGGTAAGCTTCCTTTGGAAACTGAACACCTTACGAGGGAAGGTAAGAAGCAGGTGCAGGCAGTGACTTTAAAGAGTGGTAAGCCAGTAGAAGAGAGGA
AAAAGCCTAGTAAACCCCAGGAAGTAGAAAAGAATTGTGATAAAAATATTGTTGTTGAGAAAGAATTGGAGACTGGTCAGGGTGCTGGAGGCAGCAATAGTGATGCTGGA
GCATTTGGTTCTGTTCCAGATTTGGAACCACCTTATGTACTGCCCCCACCCTATGATCCACCCTTACCTTTTCCACAAAGGCAAAAGCCTAAGAACCAGGATGGTCAATT
TAAGAAGTTCTTAGAGATTCTTAAGAAATTGCATATAAATATCCCTTTAGTAGAAGCTATAGACCAAATGCCTAATTATGTTAAATTTCTTAAGGACATTTTGACTAAAA
AGAAGAGATTAGGAGAGTTTGAGACTGTGGCTCTTACTGAGGAATGTAGTTCTATTCTTAAGAATGGGCTACCTTCCAAGGTTGAGGATCCAATATCATTAACTATTCCT
GTCTCCATAGGTGGAAAAGAGTTGGGGAGAGCACTTTGTGATTCAGGCGCAAGAATTAACCTTATGCCTCTTTCGGTTTATCGAAAGCTAGGAATATTTGAAGCTAGGCC
TACGACAGTCACACTCCAATTAGCAGATAGGTCTATCGCATATCCTGAAGGTAAGATTGAGGATGTTCTGGTCCAAGTAGATAAATTTATTTTTCCTGTCGATTTCATTA
TTCTAGATTATGAGGCAGATAAGAATGTCCCAATTATTCTTGGCCGTCCATTTTTGGCAACTGGTAGATCATTGATAGATGTCCAACAGGGGGAGCTTACAATGAGGATG
CATGACCAAGAGGTGAAGTTTTATATGTTTGATGCAATGAAATATCCTAATGATATTGAGGATTGCTCGTGCATTCAGGTGTTGGATGAGTTTGTTGAGGATCATTTTGA
GAAGGATTTGATGGAGTACCATACCAAAAAATTTGGAGAAATCCAAATTGAGGATTTGGAAATAGGTGGATTGGAGCATGAGCATAAAGATGTAGGTGAGATTTCTAGTC
TTAAGAGGAGTTTTGAATCCTTAGAGCCAATAGATAAGAAATCCAAGCCTATTGAACCTTATAATTCATTGACATTGTTCCAGCAACCTGAGAGTAGGAAATCCTTCATT
GATGAGAGGTTACTTACTGTAGCTCATATTAAGGTAGTGAAAACACCTTGGTATGGTGACTTTTCCAATTACCTTGATTTTGAAACATTGCCTCCTGGTTTATCAAGAGA
ACAGATGAAAGAATTTTTCCATGGGGTGAAGTTTTATTTATCAAATGATGCATATGTGGTTAGACAATGTGTTGATGATGGTTGTGAGTTTAAACCCCCCCAGCTGATCT
CGCAGCCGCCGCCGCCGACCAGACCCTTGCCGCCGCCGCTTAGTTTGCGCCGCTGCCCAGGTCCTGCAGCTGCCGCTGTCGACCTGCCAAGCCGCGCCGTCTTCGTCGCG
CGATTCTCTTTTTCCTCTGGTGGTCGTCGCCCAGCCCATTGTCTCCCCCATTCTCCGCGAGCTCTTTCTCTCTCCCGTCGGTCTCCCTCCCTCTCTTCGGTCTCTCTCTC
TCATGAGGTCGTGAGTCCAGCCGCAACCGCTGTCTTGTCGAGCCGCTGCCGCCGTGTGGGTTTTCTCGCCGGGTCTCTCGTTTCCTTGCGTTTTTGGCTAAGAAAACCCG
TGGATCTCACGTGTCTAGCGATTCAGAGTCCCTTCGTCCTCGTTTCAGTCAATTTCGCCTCTGTCCAGCAGCCTTTTTGGGTGTTTTCGGCACCGCTTGACTATTCCGAA
TTAAATACCCATTCACTTAAGTGCTGGAGTTTAAGCTTTGATGTTGAAATTCTGATCTGCAGCGCCGTCTTTGCTCAGTTCGTGAGTGGTTCGGCGTCAATTATCGCTTC
CGCGCCATTGAAGTGTTCGATTGAGTTCGATACACTCCAACTCGAATACCCATTGCCCACGGAGCGTTCTAACACGTTGTTAGAGCATATGCCCATGGCCCGTAGTGCCT
TAGACATGGTTGCCCATGGTCTAAAGCTTTTGGGTTCTCATTTTGGGGAATGTTGGAGACTGTTTAGCAAGGAAATAGGCCTAGTAGGAGTTCATCTTGGCTGTTTTAGG
CTTGTTGAGGTTGCCTTGGAAACGCTATATAGCATGTTGCTTGTGTTGGTTGTTTGTTTGTGGTTTGGTAAGTGTGAGCTGCTTACCAGTACCACGATTGTACTGATACC
CCCTTCCCCACCTTCCCCCAATATTTTAGATGTTGCAGGTTTCGTTCATGAGCTGGATCCTGGTGAAGGAAAAAAATTGGTTCAGGCCATTTTACGTCGTTATGCTACCG
AAATTTTCGTTGCTGGCAGATTAAGGGAGCGTAGAGAAAAGCCAGTAGAGGAAAAAGAAGACAAGGGGAATGAAGTTGTGACTGAGATGAAACTGAGTGAAGAAGTTGAG
CTTGATGAAAGATTCTGGTACGAGAGATTTATCCATGAAGAGGCAAGAAAAAAATATCAAGAAGTCCTGAAGCGAGATGTTCTAATGGAGCGCGACTTCGATGGTGGCAA
AGAGCTTCCACATTTTCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCACGAGAGGGTTTTCTGTTTGATGGTTGGACCACAAACAGGTTGTTCATTAGAGGAGAACTGATCCGTGGACACAGAAAATATGTCTGCAGTGAGAAGAGTGCAAC
TACTCTCAAGACAAGACCTAGAATTCTCTCTAGCCTCCCTCTTAGAGAAAGACTCCCACAAGTCTTTTGCCTCCTAGACTCAGAGTCATACCGGTGTAACCTCTGTGGTT
ATTGTGTCAATCAAGAAGTAATTTCCAGCGATAAAAGCAAAGAGGCTACTGCGTTTTCGTTCGTTGGAGCGTCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGATGCT
GAAACTATTGAAGAATTTTTTAATCGTGTTCTCTTAATTGTTAACCAATTGATATCAAATGGAGAAACAATTGGAGATCAAAGAGTGGTCGAAAAGATCCTCAGAAGCAT
GACTAGAAAATATGAACACATTATCGTAGCAATTGAAGAATCTAAAGATTTGTTGACTCTCTCCATTAATAGCTTAATGAGATCTCTCCAATCTCATGAGCTTAGATTGA
AGCGGTTCGATTCTACTTTTTCAGAAGAAGCTTTTCATATGCAAACTTCTTCTAGAGGGAGATCTAGTGGAAGAAGAGGTGGACGTGGTGGTCGAGGTAATGGCAGATCC
AAGGACTCCAAAAATTTTGAGTCCGAAAGAAGTGATAATCCAAATTCTTCAAATAGAGGAAGAGGAAGAAGTTCAAGCAGAGGAAGAAGTTCAAGCAGAGGAAGAAGCAG
AGGTAGAGGTCATGGAGATTTTTCTCACATTAAATGTTTCAATTGTGGACGCAATGGACATTTTCAAGCAGAGTGTTGGTCCAAAAAGACTAACCAGGCAAAGACAAAAC
TAATGCATGAACAACAGGATAATGACCAAGGTCTTCTTTTCCTCACCTTAAATGTCCAAGAAACAAGCACTGAAGATATATGGTATCTTGATAGTGGTTGCAGTAATCAC
ATGACGGGAAGGAAGGATATTTTCGTAACTTTGGATGAATCTCTTCATAAAGAGGTGAAGACTGGTGACAATAAGAAGCTCGAAGTTCAAGGAAGAGGAGATATCCTTGT
TAAGACAAAGAATGGAGCAAAAAGAATCACTGATGTATATTTTGTTCCAGGTCTTAAACAAAATCTCTTAAGTGTTGGACAACTGTTATTGAAAGGGCATGATGTAATCT
TCAAAGACAGCATTTGCGAGATCAAAACCAAGAATGGAGGTCTCATAACAAAGGTACATATGACTACAAACAAGATGTTTCCTATCAAAATGTCTTATGAGAAGCTTTTA
TGTTTTGAGACGTTGGTTAAAGATACTTCATGGCTTTGGCATTATCTATATGGACACTTGAGTTTTGACACTTTATCGCACATGTGTCAACAACATATGGTGAGAGGAAT
GCCAAACATTCACAAGGAAGATCAACTTTGTGAAACATGTATTTTGGGGAAGCATCATCGAAGTCCATTTCCAACAGGAAATGCTTGGAGAGCATCAAAACCTCTTGAGC
TTGTCCATACAGATTTATGTGGACCCATGCGAACTACTACACTTGGAGGGAAATGGAATCAAACACCAAAAGAAAGTTCGAAGAACTCCCCAACAAAATGGAGTTGCAGA
GAGAATGAACAGGATAATAATGGAGCTTGCAAGAAATGCCGCAATTCAAAGAAATCAAGCTTCAATGAGAGCCCTGGAATTGCAATGGGTCAGCTTGCTAATGAGCTGAA
GGCACGTCCTCAAGGTAAGCTTCCTTTGGAAACTGAACACCTTACGAGGGAAGGTAAGAAGCAGGTGCAGGCAGTGACTTTAAAGAGTGGTAAGCCAGTAGAAGAGAGGA
AAAAGCCTAGTAAACCCCAGGAAGTAGAAAAGAATTGTGATAAAAATATTGTTGTTGAGAAAGAATTGGAGACTGGTCAGGGTGCTGGAGGCAGCAATAGTGATGCTGGA
GCATTTGGTTCTGTTCCAGATTTGGAACCACCTTATGTACTGCCCCCACCCTATGATCCACCCTTACCTTTTCCACAAAGGCAAAAGCCTAAGAACCAGGATGGTCAATT
TAAGAAGTTCTTAGAGATTCTTAAGAAATTGCATATAAATATCCCTTTAGTAGAAGCTATAGACCAAATGCCTAATTATGTTAAATTTCTTAAGGACATTTTGACTAAAA
AGAAGAGATTAGGAGAGTTTGAGACTGTGGCTCTTACTGAGGAATGTAGTTCTATTCTTAAGAATGGGCTACCTTCCAAGGTTGAGGATCCAATATCATTAACTATTCCT
GTCTCCATAGGTGGAAAAGAGTTGGGGAGAGCACTTTGTGATTCAGGCGCAAGAATTAACCTTATGCCTCTTTCGGTTTATCGAAAGCTAGGAATATTTGAAGCTAGGCC
TACGACAGTCACACTCCAATTAGCAGATAGGTCTATCGCATATCCTGAAGGTAAGATTGAGGATGTTCTGGTCCAAGTAGATAAATTTATTTTTCCTGTCGATTTCATTA
TTCTAGATTATGAGGCAGATAAGAATGTCCCAATTATTCTTGGCCGTCCATTTTTGGCAACTGGTAGATCATTGATAGATGTCCAACAGGGGGAGCTTACAATGAGGATG
CATGACCAAGAGGTGAAGTTTTATATGTTTGATGCAATGAAATATCCTAATGATATTGAGGATTGCTCGTGCATTCAGGTGTTGGATGAGTTTGTTGAGGATCATTTTGA
GAAGGATTTGATGGAGTACCATACCAAAAAATTTGGAGAAATCCAAATTGAGGATTTGGAAATAGGTGGATTGGAGCATGAGCATAAAGATGTAGGTGAGATTTCTAGTC
TTAAGAGGAGTTTTGAATCCTTAGAGCCAATAGATAAGAAATCCAAGCCTATTGAACCTTATAATTCATTGACATTGTTCCAGCAACCTGAGAGTAGGAAATCCTTCATT
GATGAGAGGTTACTTACTGTAGCTCATATTAAGGTAGTGAAAACACCTTGGTATGGTGACTTTTCCAATTACCTTGATTTTGAAACATTGCCTCCTGGTTTATCAAGAGA
ACAGATGAAAGAATTTTTCCATGGGGTGAAGTTTTATTTATCAAATGATGCATATGTGGTTAGACAATGTGTTGATGATGGTTGTGAGTTTAAACCCCCCCAGCTGATCT
CGCAGCCGCCGCCGCCGACCAGACCCTTGCCGCCGCCGCTTAGTTTGCGCCGCTGCCCAGGTCCTGCAGCTGCCGCTGTCGACCTGCCAAGCCGCGCCGTCTTCGTCGCG
CGATTCTCTTTTTCCTCTGGTGGTCGTCGCCCAGCCCATTGTCTCCCCCATTCTCCGCGAGCTCTTTCTCTCTCCCGTCGGTCTCCCTCCCTCTCTTCGGTCTCTCTCTC
TCATGAGGTCGTGAGTCCAGCCGCAACCGCTGTCTTGTCGAGCCGCTGCCGCCGTGTGGGTTTTCTCGCCGGGTCTCTCGTTTCCTTGCGTTTTTGGCTAAGAAAACCCG
TGGATCTCACGTGTCTAGCGATTCAGAGTCCCTTCGTCCTCGTTTCAGTCAATTTCGCCTCTGTCCAGCAGCCTTTTTGGGTGTTTTCGGCACCGCTTGACTATTCCGAA
TTAAATACCCATTCACTTAAGTGCTGGAGTTTAAGCTTTGATGTTGAAATTCTGATCTGCAGCGCCGTCTTTGCTCAGTTCGTGAGTGGTTCGGCGTCAATTATCGCTTC
CGCGCCATTGAAGTGTTCGATTGAGTTCGATACACTCCAACTCGAATACCCATTGCCCACGGAGCGTTCTAACACGTTGTTAGAGCATATGCCCATGGCCCGTAGTGCCT
TAGACATGGTTGCCCATGGTCTAAAGCTTTTGGGTTCTCATTTTGGGGAATGTTGGAGACTGTTTAGCAAGGAAATAGGCCTAGTAGGAGTTCATCTTGGCTGTTTTAGG
CTTGTTGAGGTTGCCTTGGAAACGCTATATAGCATGTTGCTTGTGTTGGTTGTTTGTTTGTGGTTTGGTAAGTGTGAGCTGCTTACCAGTACCACGATTGTACTGATACC
CCCTTCCCCACCTTCCCCCAATATTTTAGATGTTGCAGGTTTCGTTCATGAGCTGGATCCTGGTGAAGGAAAAAAATTGGTTCAGGCCATTTTACGTCGTTATGCTACCG
AAATTTTCGTTGCTGGCAGATTAAGGGAGCGTAGAGAAAAGCCAGTAGAGGAAAAAGAAGACAAGGGGAATGAAGTTGTGACTGAGATGAAACTGAGTGAAGAAGTTGAG
CTTGATGAAAGATTCTGGTACGAGAGATTTATCCATGAAGAGGCAAGAAAAAAATATCAAGAAGTCCTGAAGCGAGATGTTCTAATGGAGCGCGACTTCGATGGTGGCAA
AGAGCTTCCACATTTTCTTTAA
Protein sequenceShow/hide protein sequence
MAREGFLFDGWTTNRLFIRGELIRGHRKYVCSEKSATTLKTRPRILSSLPLRERLPQVFCLLDSESYRCNLCGYCVNQEVISSDKSKEATAFSFVGASLAKNGQVYNEDA
ETIEEFFNRVLLIVNQLISNGETIGDQRVVEKILRSMTRKYEHIIVAIEESKDLLTLSINSLMRSLQSHELRLKRFDSTFSEEAFHMQTSSRGRSSGRRGGRGGRGNGRS
KDSKNFESERSDNPNSSNRGRGRSSSRGRSSSRGRSRGRGHGDFSHIKCFNCGRNGHFQAECWSKKTNQAKTKLMHEQQDNDQGLLFLTLNVQETSTEDIWYLDSGCSNH
MTGRKDIFVTLDESLHKEVKTGDNKKLEVQGRGDILVKTKNGAKRITDVYFVPGLKQNLLSVGQLLLKGHDVIFKDSICEIKTKNGGLITKVHMTTNKMFPIKMSYEKLL
CFETLVKDTSWLWHYLYGHLSFDTLSHMCQQHMVRGMPNIHKEDQLCETCILGKHHRSPFPTGNAWRASKPLELVHTDLCGPMRTTTLGGKWNQTPKESSKNSPTKWSCR
ENEQDNNGACKKCRNSKKSSFNESPGIAMGQLANELKARPQGKLPLETEHLTREGKKQVQAVTLKSGKPVEERKKPSKPQEVEKNCDKNIVVEKELETGQGAGGSNSDAG
AFGSVPDLEPPYVLPPPYDPPLPFPQRQKPKNQDGQFKKFLEILKKLHINIPLVEAIDQMPNYVKFLKDILTKKKRLGEFETVALTEECSSILKNGLPSKVEDPISLTIP
VSIGGKELGRALCDSGARINLMPLSVYRKLGIFEARPTTVTLQLADRSIAYPEGKIEDVLVQVDKFIFPVDFIILDYEADKNVPIILGRPFLATGRSLIDVQQGELTMRM
HDQEVKFYMFDAMKYPNDIEDCSCIQVLDEFVEDHFEKDLMEYHTKKFGEIQIEDLEIGGLEHEHKDVGEISSLKRSFESLEPIDKKSKPIEPYNSLTLFQQPESRKSFI
DERLLTVAHIKVVKTPWYGDFSNYLDFETLPPGLSREQMKEFFHGVKFYLSNDAYVVRQCVDDGCEFKPPQLISQPPPPTRPLPPPLSLRRCPGPAAAAVDLPSRAVFVA
RFSFSSGGRRPAHCLPHSPRALSLSRRSPSLSSVSLSHEVVSPAATAVLSSRCRRVGFLAGSLVSLRFWLRKPVDLTCLAIQSPFVLVSVNFASVQQPFWVFSAPLDYSE
LNTHSLKCWSLSFDVEILICSAVFAQFVSGSASIIASAPLKCSIEFDTLQLEYPLPTERSNTLLEHMPMARSALDMVAHGLKLLGSHFGECWRLFSKEIGLVGVHLGCFR
LVEVALETLYSMLLVLVVCLWFGKCELLTSTTIVLIPPSPPSPNILDVAGFVHELDPGEGKKLVQAILRRYATEIFVAGRLRERREKPVEEKEDKGNEVVTEMKLSEEVE
LDERFWYERFIHEEARKKYQEVLKRDVLMERDFDGGKELPHFL