; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G09000 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G09000
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTy3/gypsy retrotransposon protein
Genome locationClcChr09:7743440..7755529
RNA-Seq ExpressionClc09G09000
SyntenyClc09G09000
Gene Ontology termsGO:0030026 - cellular manganese ion homeostasis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0005384 - manganese ion transmembrane transporter activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005162 - Retrotransposon gag domain
IPR008217 - Ccc1 family
IPR016197 - Chromo-like domain superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025132.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-28746.49Show/hide
Query:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA
        R  EER++ ++Q        IS+LP++EE +  LSK+I  ++ Q +KQ      I   L    +GI  D+S                     PG    + 
Subjt:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA

Query:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK
        +S      ET +T  +    ++ K+E    +LD++KFKKVEMP+F G+DPD W FRADRYF+IH LSDSEK+T+AVISF+G ALDWYR Q E+++F  W 
Subjt:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK

Query:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG
        +LK  +L+RFR  RDG+L GRF  IKQE+TV+EYRN FDK +AP+  +       TFMNGL PW+K+EVE WEPVGLA  +  A      E      G  
Subjt:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG

Query:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV
          Y+ K +          +     + +A GW               R+ G  +   DA+     E+G+     +    G  C+T D       +ELR+L+
Subjt:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV

Query:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS
        V ++  +LE++E E+  + + E++  +  +      ELSINSV GLTN G MK+KG +GKE VVVLIDCG THNFIAE+LV  L+LP+ ET NYGVILGS
Subjt:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS

Query:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY
        G  IKGKG+C+ VEL +G   V  SFLPLELGGVD IL                                                      DQG+L+E 
Subjt:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY

Query:  RAMEGGVSLAEWYEVDDVYTISESN--PAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------
        R +E      E  EVD   T +E     A+++ F DVFE P  L PQR IEHHI+ K G DPVNVRPYR                               
Subjt:  RAMEGGVSLAEWYEVDDVYTISESN--PAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------

Query:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN
                   WRFCVDY+ALNNVT+ D FPIPV+EELFDEL+GA +FSKIDLK+GYHQIRMHP DIEKT FR HEGHYEF+VMPFGLTNAPSTFQALMN
Subjt:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN

Query:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD
         +FKP+LRRFVLVFFDDI + S+ +E+H QHLE V  ILR++ELYAN +KC FAK R+ YLGH IS KG+E DPEKIRA+ EWP P N+RE+ GFLGLT 
Subjt:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD

Query:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL
        YYRRFV +YGS++APL+QLLK GA++W E+T+ AFE+LK AM+TL VL +PDF+L FE+E+DASG+G+G+VL Q K+P+AY++  LS RDR + VYEREL
Subjt:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL

Query:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP
        MAVV AVQ                                                     +NK  DALSRI PT HL  L+AP ++DVEVI+ EV  DP
Subjt:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP

Query:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA
         L+E++  + +    +P +   QG LK+KGRLVLSK+STLIPT++HTYHDSVFGG+                                      SALSPA
Subjt:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA

Query:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL
        GLL+PLEIPD IWSDISMDFIE LPK+ G+ VILVVVDRLSKY HF+ LKHP+T KT     + EV+
Subjt:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL

KAA0025132.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.5e-1460.27Show/hide
Query:  FTTKTNPTTNIWEVLVGWKGLPAHEATWEIFDDFKQQFPDFHLEDKVVLEGESSVRPPIILTYS-RKGNKGKR
        +  + N  T  WE LV WKGLP HEATWE + D K QFP+FHLEDKV LE ES  RPPI+ TY+ RK +KG R
Subjt:  FTTKTNPTTNIWEVLVGWKGLPAHEATWEIFDDFKQQFPDFHLEDKVVLEGESSVRPPIILTYS-RKGNKGKR

KAA0025132.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.6e-28746.49Show/hide
Query:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA
        R  EER++ ++Q        IS+LP++EE +  LSK+I  ++ Q +KQ      I   L    +GI  D+S                     PG    + 
Subjt:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA

Query:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK
        +S      ET +T  +    ++ K+E    +LD++KFKKVEMP+F G+DPD W FRADRYF+IH LSDSEK+T+AVISF+G ALDWYR Q E+++F  W 
Subjt:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK

Query:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG
        +LK  +L+RFR  RDG+L GRF  IKQE+TV+EYRN FDK +AP+  +       TFMNGL PW+K+EVE WEPVGLA  +  A      E      G  
Subjt:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG

Query:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV
          Y+ K +          +     + +A GW               R+ G  +   DA+     E+G+     +    G  C+T D       +ELR+L+
Subjt:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV

Query:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS
        V ++  +LE++E E+  + + E++  +  +      ELSINSV GLTN G MK+KG +GKE VVVLIDCG THNFIAE+LV  L+LP+ ET NYGVILGS
Subjt:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS

Query:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY
        G  IKGKG+C+ VEL +G   V  SFLPLELGGVD IL                                                      DQG+L+E 
Subjt:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY

Query:  RAMEGGVSLAEWYEVDDVYTISESN--PAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------
        R +E      E  EVD   T +E     A+++ F DVFE P  L PQR IEHHI+ K G DPVNVRPYR                               
Subjt:  RAMEGGVSLAEWYEVDDVYTISESN--PAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------

Query:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN
                   WRFCVDY+ALNNVT+ D FPIPV+EELFDEL+GA +FSKIDLK+GYHQIRMHP DIEKT FR HEGHYEF+VMPFGLTNAPSTFQALMN
Subjt:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN

Query:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD
         +FKP+LRRFVLVFFDDI + S+ +E+H QHLE V  ILR++ELYAN +KC FAK R+ YLGH IS KG+E DPEKIRA+ EWP P N+RE+ GFLGLT 
Subjt:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD

Query:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL
        YYRRFV +YGS++APL+QLLK GA++W E+T+ AFE+LK AM+TL VL +PDF+L FE+E+DASG+G+G+VL Q K+P+AY++  LS RDR + VYEREL
Subjt:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL

Query:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP
        MAVV AVQ                                                     +NK  DALSRI PT HL  L+AP ++DVEVI+ EV  DP
Subjt:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP

Query:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA
         L+E++  + +    +P +   QG LK+KGRLVLSK+STLIPT++HTYHDSVFGG+                                      SALSPA
Subjt:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA

Query:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL
        GLL+PLEIPD IWSDISMDFIE LPK+ G+ VILVVVDRLSKY HF+ LKHP+T KT     + EV+
Subjt:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL

KAA0049776.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.6e-28746.49Show/hide
Query:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA
        R  EER++ ++Q        IS+LP++EE +  LSK+I  ++ Q +KQ      I   L    +GI  D+S                     PG    + 
Subjt:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA

Query:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK
        +S      ET +T  +    ++ K+E    +LD++KFKKVEMP+F G+DPD W FRADRYF+IH LSDSEK+T+AVISF+G ALDWYR Q E+++F  W 
Subjt:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK

Query:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG
        +LK  +L+RFR  RDG+L GRF  IKQE+TV+EYRN FDK +AP+  +       TFMNGL PW+K+EVE WEPVGLA  +  A      E      G  
Subjt:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG

Query:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV
          Y+ K +          +     + +A GW               R+ G  +   DA+     E+G+     +    G  C+T D       +ELR+L+
Subjt:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV

Query:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS
        V ++  +LE++E E+  + + E++  +  +      ELSINSV GLTN G MK+KG +GKE VVVLIDCG THNFIAE+LV  L+LP+ ET NYGVILGS
Subjt:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS

Query:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY
        G  IKGKG+C+ VEL +G   V  SFLPLELGGVD IL                                                      DQG+L+E 
Subjt:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY

Query:  RAMEGGVSLAEWYEVDDVYTISESN--PAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------
        R +E      E  EVD   T +E     A+++ F DVFE P  L PQR IEHHI+ K G DPVNVRPYR                               
Subjt:  RAMEGGVSLAEWYEVDDVYTISESN--PAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------

Query:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN
                   WRFCVDY+ALNNVT+ D FPIPV+EELFDEL+GA +FSKIDLK+GYHQIRMHP DIEKT FR HEGHYEF+VMPFGLTNAPSTFQALMN
Subjt:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN

Query:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD
         +FKP+LRRFVLVFFDDI + S+ +E+H QHLE V  ILR++ELYAN +KC FAK R+ YLGH IS KG+E DPEKIRA+ EWP P N+RE+ GFLGLT 
Subjt:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD

Query:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL
        YYRRFV +YGS++APL+QLLK GA++W E+T+ AFE+LK AM+TL VL +PDF+L FE+E+DASG+G+G+VL Q K+P+AY++  LS RDR + VYEREL
Subjt:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL

Query:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP
        MAVV AVQ                                                     +NK  DALSRI PT HL  L+AP ++DVEVI+ EV  DP
Subjt:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP

Query:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA
         L+E++  + +    +P +   QG LK+KGRLVLSK+STLIPT++HTYHDSVFGG+                                      SALSPA
Subjt:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA

Query:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL
        GLL+PLEIPD IWSDISMDFIE LPK+ G+ VILVVVDRLSKY HF+ LKHP+T KT     + EV+
Subjt:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL

KAA0049776.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.5e-1460.27Show/hide
Query:  FTTKTNPTTNIWEVLVGWKGLPAHEATWEIFDDFKQQFPDFHLEDKVVLEGESSVRPPIILTYS-RKGNKGKR
        +  + N  T  WE LV WKGLP HEATWE + D K QFP+FHLEDKV LE ES  RPPI+ TY+ RK +KG R
Subjt:  FTTKTNPTTNIWEVLVGWKGLPAHEATWEIFDDFKQQFPDFHLEDKVVLEGESSVRPPIILTYS-RKGNKGKR

XP_038898254.1 uncharacterized protein LOC120085971 isoform X1 [Benincasa hispida]2.3e-29185.23Show/hide
Query:  GNNVESLGIRAEIQVDRAPVPAQSVDYAVTDKNEDDHVAAQAHPSLKATSEEGMIVQEEALASK--KGNNVESIEFGEREIFQVQETTDPQENQVDKAPV
        GN+VE+LGIR E QVDRAPVPAQSVDYAVTDKNEDDH AAQAHPSLKATSEEG+IV+EE LAS   KGNNVE+IEF ER+I QVQET D Q++QV++APV
Subjt:  GNNVESLGIRAEIQVDRAPVPAQSVDYAVTDKNEDDHVAAQAHPSLKATSEEGMIVQEEALASK--KGNNVESIEFGEREIFQVQETTDPQENQVDKAPV

Query:  PNQSVDYAVSDKNEVVPDPSVGNGVADQIRGIHAVSNSKPTHPSLNTTVAEQEVVPVNGVESKQGNTIESIIVGRPEPLLESAETVYDQGIQADSINKTQ
        P+QSV+YAV+DKNEVVPDPSVGN VADQ R IHAVSNSKPTHPSLN  VAE+  V   GV+S QGN IESIIVGRP+PLLESAET+YDQ IQADSINKT 
Subjt:  PNQSVDYAVSDKNEVVPDPSVGNGVADQIRGIHAVSNSKPTHPSLNTTVAEQEVVPVNGVESKQGNTIESIIVGRPEPLLESAETVYDQGIQADSINKTQ

Query:  VPDQLVEFDMWTNDNSLETKVDSTIDSSILDGAK-DTKKGMTVENIVVGIPYTSHASNGSVQD-DNQTSTANKVLVQNQSNGFAVLSESEIDTNVNSTPG
        VPDQL EFDMWTNDNSLE KVDST  SSILDGAK DTKKG+TVENIVVGIPYTSH  NGS+QD DNQTST NKV VQNQSNGF+VLSESE D  VN T G
Subjt:  VPDQLVEFDMWTNDNSLETKVDSTIDSSILDGAK-DTKKGMTVENIVVGIPYTSHASNGSVQD-DNQTSTANKVLVQNQSNGFAVLSESEIDTNVNSTPG

Query:  LTSLEAMMAETVTDTVDEKKGIDVENVVVGIPY-SLESKGGLLDRFRLPALFNKAPVPDQSAAVAETEIPKTPEPVGATVPDSSPVSPSLGAPTAAERVT
         TSLEAMMAETVTDT DEK+GI+VENVVVGIPY SLESKG LLDRF LPAL NKAPVPDQSAAVAET+I KTPE V ATVPDSSPVS SL APTAAERVT
Subjt:  LTSLEAMMAETVTDTVDEKKGIDVENVVVGIPY-SLESKGGLLDRFRLPALFNKAPVPDQSAAVAETEIPKTPEPVGATVPDSSPVSPSLGAPTAAERVT

Query:  DSAVGSREVEGGPVAISIDDSRDEQVEPEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQFRRT
        DSAVGSREVE GPV ISI +S DEQV+PEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANT TGNIV+LALANLISG FILGHNLTGLKSEQF RT
Subjt:  DSAVGSREVEGGPVAISIDDSRDEQVEPEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQFRRT

Query:  SNETDDVEHVDRYEVVLGNRENYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDKDLKLAAVAGASLLCIILLALGKAYIQRPNRWDVYFKMVASYVVIA
        SNET D E +DRYEVVLGNRENYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDKDLKLAAVAGASLLC+ILLA GKAYIQRPNRWDVY K+VASY+VIA
Subjt:  SNETDDVEHVDRYEVVLGNRENYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDKDLKLAAVAGASLLCIILLALGKAYIQRPNRWDVYFKMVASYVVIA

Query:  AGAGGFSYLAGDLLDKVIKKYGWFEENPAFNLISLPLPEMSLAKPAWGSS
        AGAGGFSYLAGDL+DK+IKKYGWFEENPAFNL SLPLPEMSLAKPAWGSS
Subjt:  AGAGGFSYLAGDLLDKVIKKYGWFEENPAFNLISLPLPEMSLAKPAWGSS

XP_038898255.1 uncharacterized protein LOC120085971 isoform X2 [Benincasa hispida]2.3e-29185.23Show/hide
Query:  GNNVESLGIRAEIQVDRAPVPAQSVDYAVTDKNEDDHVAAQAHPSLKATSEEGMIVQEEALASK--KGNNVESIEFGEREIFQVQETTDPQENQVDKAPV
        GN+VE+LGIR E QVDRAPVPAQSVDYAVTDKNEDDH AAQAHPSLKATSEEG+IV+EE LAS   KGNNVE+IEF ER+I QVQET D Q++QV++APV
Subjt:  GNNVESLGIRAEIQVDRAPVPAQSVDYAVTDKNEDDHVAAQAHPSLKATSEEGMIVQEEALASK--KGNNVESIEFGEREIFQVQETTDPQENQVDKAPV

Query:  PNQSVDYAVSDKNEVVPDPSVGNGVADQIRGIHAVSNSKPTHPSLNTTVAEQEVVPVNGVESKQGNTIESIIVGRPEPLLESAETVYDQGIQADSINKTQ
        P+QSV+YAV+DKNEVVPDPSVGN VADQ R IHAVSNSKPTHPSLN  VAE+  V   GV+S QGN IESIIVGRP+PLLESAET+YDQ IQADSINKT 
Subjt:  PNQSVDYAVSDKNEVVPDPSVGNGVADQIRGIHAVSNSKPTHPSLNTTVAEQEVVPVNGVESKQGNTIESIIVGRPEPLLESAETVYDQGIQADSINKTQ

Query:  VPDQLVEFDMWTNDNSLETKVDSTIDSSILDGAK-DTKKGMTVENIVVGIPYTSHASNGSVQD-DNQTSTANKVLVQNQSNGFAVLSESEIDTNVNSTPG
        VPDQL EFDMWTNDNSLE KVDST  SSILDGAK DTKKG+TVENIVVGIPYTSH  NGS+QD DNQTST NKV VQNQSNGF+VLSESE D  VN T G
Subjt:  VPDQLVEFDMWTNDNSLETKVDSTIDSSILDGAK-DTKKGMTVENIVVGIPYTSHASNGSVQD-DNQTSTANKVLVQNQSNGFAVLSESEIDTNVNSTPG

Query:  LTSLEAMMAETVTDTVDEKKGIDVENVVVGIPY-SLESKGGLLDRFRLPALFNKAPVPDQSAAVAETEIPKTPEPVGATVPDSSPVSPSLGAPTAAERVT
         TSLEAMMAETVTDT DEK+GI+VENVVVGIPY SLESKG LLDRF LPAL NKAPVPDQSAAVAET+I KTPE V ATVPDSSPVS SL APTAAERVT
Subjt:  LTSLEAMMAETVTDTVDEKKGIDVENVVVGIPY-SLESKGGLLDRFRLPALFNKAPVPDQSAAVAETEIPKTPEPVGATVPDSSPVSPSLGAPTAAERVT

Query:  DSAVGSREVEGGPVAISIDDSRDEQVEPEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQFRRT
        DSAVGSREVE GPV ISI +S DEQV+PEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANT TGNIV+LALANLISG FILGHNLTGLKSEQF RT
Subjt:  DSAVGSREVEGGPVAISIDDSRDEQVEPEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQFRRT

Query:  SNETDDVEHVDRYEVVLGNRENYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDKDLKLAAVAGASLLCIILLALGKAYIQRPNRWDVYFKMVASYVVIA
        SNET D E +DRYEVVLGNRENYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDKDLKLAAVAGASLLC+ILLA GKAYIQRPNRWDVY K+VASY+VIA
Subjt:  SNETDDVEHVDRYEVVLGNRENYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDKDLKLAAVAGASLLCIILLALGKAYIQRPNRWDVYFKMVASYVVIA

Query:  AGAGGFSYLAGDLLDKVIKKYGWFEENPAFNLISLPLPEMSLAKPAWGSS
        AGAGGFSYLAGDL+DK+IKKYGWFEENPAFNL SLPLPEMSLAKPAWGSS
Subjt:  AGAGGFSYLAGDLLDKVIKKYGWFEENPAFNLISLPLPEMSLAKPAWGSS

TrEMBL top hitse value%identityAlignment
A0A5A7SIV7 Ty3/gypsy retrotransposon protein5.8e-28846.49Show/hide
Query:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA
        R  EER++ ++Q        IS+LP++EE +  LSK+I  ++ Q +KQ      I   L    +GI  D+S                     PG    + 
Subjt:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA

Query:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK
        +S      ET +T  +    ++ K+E    +LD++KFKKVEMP+F G+DPD W FRADRYF+IH LSDSEK+T+AVISF+G ALDWYR Q E+++F  W 
Subjt:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK

Query:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG
        +LK  +L+RFR  RDG+L GRF  IKQE+TV+EYRN FDK +AP+  +       TFMNGL PW+K+EVE WEPVGLA  +  A      E      G  
Subjt:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG

Query:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV
          Y+ K +          +     + +A GW               R+ G  +   DA+     E+G+     +    G  C+T D       +ELR+L+
Subjt:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV

Query:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS
        V ++  +LE++E E+  + + E++  +  +      ELSINSV GLTN G MK+KG +GKE VVVLIDCG THNFIAE+LV  L+LP+ ET NYGVILGS
Subjt:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS

Query:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY
        G  IKGKG+C+ VEL +G   V  SFLPLELGGVD IL                                                      DQG+L+E 
Subjt:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY

Query:  RAMEGGVSLAEWYEVDDVYTISESN--PAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------
        R +E      E  EVD   T +E     A+++ F DVFE P  L PQR IEHHI+ K G DPVNVRPYR                               
Subjt:  RAMEGGVSLAEWYEVDDVYTISESN--PAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------

Query:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN
                   WRFCVDY+ALNNVT+ D FPIPV+EELFDEL+GA +FSKIDLK+GYHQIRMHP DIEKT FR HEGHYEF+VMPFGLTNAPSTFQALMN
Subjt:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN

Query:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD
         +FKP+LRRFVLVFFDDI + S+ +E+H QHLE V  ILR++ELYAN +KC FAK R+ YLGH IS KG+E DPEKIRA+ EWP P N+RE+ GFLGLT 
Subjt:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD

Query:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL
        YYRRFV +YGS++APL+QLLK GA++W E+T+ AFE+LK AM+TL VL +PDF+L FE+E+DASG+G+G+VL Q K+P+AY++  LS RDR + VYEREL
Subjt:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL

Query:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP
        MAVV AVQ                                                     +NK  DALSRI PT HL  L+AP ++DVEVI+ EV  DP
Subjt:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP

Query:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA
         L+E++  + +    +P +   QG LK+KGRLVLSK+STLIPT++HTYHDSVFGG+                                      SALSPA
Subjt:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA

Query:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL
        GLL+PLEIPD IWSDISMDFIE LPK+ G+ VILVVVDRLSKY HF+ LKHP+T KT     + EV+
Subjt:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL

A0A5A7SIV7 Ty3/gypsy retrotransposon protein1.2e-1460.27Show/hide
Query:  FTTKTNPTTNIWEVLVGWKGLPAHEATWEIFDDFKQQFPDFHLEDKVVLEGESSVRPPIILTYS-RKGNKGKR
        +  + N  T  WE LV WKGLP HEATWE + D K QFP+FHLEDKV LE ES  RPPI+ TY+ RK +KG R
Subjt:  FTTKTNPTTNIWEVLVGWKGLPAHEATWEIFDDFKQQFPDFHLEDKVVLEGESSVRPPIILTYS-RKGNKGKR

A0A5A7SIV7 Ty3/gypsy retrotransposon protein7.6e-28846.49Show/hide
Query:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA
        R  EER++ ++Q        IS+LP++EE +  LSK+I  ++ Q +KQ      I   L    +GI  D+S                     PG    + 
Subjt:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA

Query:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK
        +S      ET +T  +    ++ K+E    +LD++KFKKVEMP+F G+DPD W FRADRYF+IH LSDSEK+T+AVISF+G ALDWYR Q E+++F  W 
Subjt:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK

Query:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG
        +LK  +L+RFR  RDG+L GRF  IKQE+TV+EYRN FDK +AP+  +       TFMNGL PW+K+EVE WEPVGLA  +  A      E      G  
Subjt:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG

Query:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV
          Y+ K +          +     + +A GW               R+ G  +   DA+     E+G+     +    G  C+T D       +ELR+L+
Subjt:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV

Query:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS
        V ++  +LE++E E+  + + E++  +  +      ELSINSV GLTN G MK+KG +GKE VVVLIDCG THNFIAE+LV  L+LP+ ET NYGVILGS
Subjt:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS

Query:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY
        G  IKGKG+C+ VEL +G   V  SFLPLELGGVD IL                                                      DQG+L+E 
Subjt:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY

Query:  RAMEGGVSLAEWYEVDDVYTISESN--PAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------
        R +E      E  EVD   T +E     A+++ F DVFE P  L PQR IEHHI+ K G DPVNVRPYR                               
Subjt:  RAMEGGVSLAEWYEVDDVYTISESN--PAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------

Query:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN
                   WRFCVDY+ALNNVT+ D FPIPV+EELFDEL+GA +FSKIDLK+GYHQIRMHP DIEKT FR HEGHYEF+VMPFGLTNAPSTFQALMN
Subjt:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN

Query:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD
         +FKP+LRRFVLVFFDDI + S+ +E+H QHLE V  ILR++ELYAN +KC FAK R+ YLGH IS KG+E DPEKIRA+ EWP P N+RE+ GFLGLT 
Subjt:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD

Query:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL
        YYRRFV +YGS++APL+QLLK GA++W E+T+ AFE+LK AM+TL VL +PDF+L FE+E+DASG+G+G+VL Q K+P+AY++  LS RDR + VYEREL
Subjt:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL

Query:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP
        MAVV AVQ                                                     +NK  DALSRI PT HL  L+AP ++DVEVI+ EV  DP
Subjt:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP

Query:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA
         L+E++  + +    +P +   QG LK+KGRLVLSK+STLIPT++HTYHDSVFGG+                                      SALSPA
Subjt:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA

Query:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL
        GLL+PLEIPD IWSDISMDFIE LPK+ G+ VILVVVDRLSKY HF+ LKHP+T KT     + EV+
Subjt:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL

A0A5A7U6J3 Ty3/gypsy retrotransposon protein7.6e-28846.49Show/hide
Query:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA
        R  EER++ ++Q        IS+LP++EE +  LSK+I  ++ Q +KQ      I   L    +GI  D+S                     PG    + 
Subjt:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA

Query:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK
        +S      ET +T  +    ++ K+E    +LD++KFKKVEMP+F G+DPD W FRADRYF+IH LSDSEK+T+AVISF+G ALDWYR Q E+++F  W 
Subjt:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK

Query:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG
        +LK  +L+RFR  RDG+L GRF  IKQE+TV+EYRN FDK +AP+  +       TFMNGL PW+K+EVE WEPVGLA  +  A      E      G  
Subjt:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG

Query:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV
          Y+ K +          +     + +A GW               R+ G  +   DA+     E+G+     +    G  C+T D       +ELR+L+
Subjt:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV

Query:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS
        V ++  +LE++E E+  + + E++  +  +      ELSINSV GLTN G MK+KG +GKE VVVLIDCG THNFIAE+LV  L+LP+ ET NYGVILGS
Subjt:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS

Query:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY
        G  IKGKG+C+ VEL +G   V  SFLPLELGGVD IL                                                      DQG+L+E 
Subjt:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY

Query:  RAMEGGVSLAEWYEVDDVYTISESN--PAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------
        R +E      E  EVD   T +E     A+++ F DVFE P  L PQR IEHHI+ K G DPVNVRPYR                               
Subjt:  RAMEGGVSLAEWYEVDDVYTISESN--PAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------

Query:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN
                   WRFCVDY+ALNNVT+ D FPIPV+EELFDEL+GA +FSKIDLK+GYHQIRMHP DIEKT FR HEGHYEF+VMPFGLTNAPSTFQALMN
Subjt:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN

Query:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD
         +FKP+LRRFVLVFFDDI + S+ +E+H QHLE V  ILR++ELYAN +KC FAK R+ YLGH IS KG+E DPEKIRA+ EWP P N+RE+ GFLGLT 
Subjt:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD

Query:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL
        YYRRFV +YGS++APL+QLLK GA++W E+T+ AFE+LK AM+TL VL +PDF+L FE+E+DASG+G+G+VL Q K+P+AY++  LS RDR + VYEREL
Subjt:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL

Query:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP
        MAVV AVQ                                                     +NK  DALSRI PT HL  L+AP ++DVEVI+ EV  DP
Subjt:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP

Query:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA
         L+E++  + +    +P +   QG LK+KGRLVLSK+STLIPT++HTYHDSVFGG+                                      SALSPA
Subjt:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA

Query:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL
        GLL+PLEIPD IWSDISMDFIE LPK+ G+ VILVVVDRLSKY HF+ LKHP+T KT     + EV+
Subjt:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL

A0A5A7U6J3 Ty3/gypsy retrotransposon protein1.2e-1460.27Show/hide
Query:  FTTKTNPTTNIWEVLVGWKGLPAHEATWEIFDDFKQQFPDFHLEDKVVLEGESSVRPPIILTYS-RKGNKGKR
        +  + N  T  WE LV WKGLP HEATWE + D K QFP+FHLEDKV LE ES  RPPI+ TY+ RK +KG R
Subjt:  FTTKTNPTTNIWEVLVGWKGLPAHEATWEIFDDFKQQFPDFHLEDKVVLEGESSVRPPIILTYS-RKGNKGKR

A0A5A7U6J3 Ty3/gypsy retrotransposon protein9.9e-28846.41Show/hide
Query:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA
        R  EER++ ++Q        IS+LP++EE +  LSK+I  ++ Q +KQ      I   L    +GI  D+S                     PG    + 
Subjt:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA

Query:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK
        +S      ET +T  +    ++ K+E    +LD++KFKKVEMP+F G+DPD W FRADRYF+IH LSDSEK+T+AVISF+G ALDWYR Q E+++F  W 
Subjt:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK

Query:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG
        +LK  +L+RFR  RDG+L GRF  IKQE+TV+EYRN FDK +AP+  +       TFMNGL PW+K+EVE WEPVGLA  +  A      E      G  
Subjt:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG

Query:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV
          Y+ K +          +     + +A GW               R+ G  +   DA+     E+G+     +    G  C+T D       +ELR+L+
Subjt:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV

Query:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS
        V ++  +LE++E E+  + + E++  +  +      ELSINSV GLTN G MK+KG +GKE VVVLIDCG THNFIAE+LV  L+LP+ ET NYGVILGS
Subjt:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS

Query:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY
        G  IKGKG+C+ VEL +G   V  SFLPLELGGVD IL                                                      DQG+L+E 
Subjt:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY

Query:  RAMEGGVSLAEWYEVDDVYTISES--NPAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------
        R +E      E  EVD   T +E     A+++ F DVFE P  L PQR IEHHI+ K G DP+NVRPYR                               
Subjt:  RAMEGGVSLAEWYEVDDVYTISES--NPAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------

Query:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN
                   WRFCVDY+ALNNVT+ D FPIPV+EELFDEL+GA +FSKIDLK+GYHQIRMHP DIEKT FR HEGHYEF+VMPFGLTNAPSTFQALMN
Subjt:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN

Query:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD
         +FKP+LRRFVLVFFDDI + S+ +E+H QHLE V  ILR++ELYAN +KC FAK R+ YLGH IS KG+E DPEKIRA+ EWP P N+RE+ GFLGLT 
Subjt:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD

Query:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL
        YYRRFV +YGS++APL+QLLK GA++W E+T+ AFE+LK AM+TL VL +PDF+L FE+E+DASG+G+G+VL Q K+P+AY++  LS RDR + VYEREL
Subjt:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL

Query:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP
        MAVV AVQ                                                     +NK  DALSRI PT HL  L+AP ++DVEVI+ EV  DP
Subjt:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP

Query:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA
         L+E++  + +    +P +   QG LK+KGRLVLSK+STLIPT++HTYHDSVFGG+                                      SALSPA
Subjt:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA

Query:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL
        GLL+PLEIPD IWSDISMDFIE LPK+ G+ VILVVVDRLSKY HF+ LKHP+T KT     + EV+
Subjt:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL

A0A5D3DI73 Ty3/gypsy retrotransposon protein9.9e-28846.49Show/hide
Query:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA
        R  EER++ ++Q        IS+LP++EE +  LSK+I  ++ Q +KQ      I   L    +GI  D+S                     PG    + 
Subjt:  RKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKA

Query:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK
        +S      ET +T  +    ++ K+E    +LD++KFKKVEMP+F G+DPD W FRADRYF+IH LSDSEK+T+AVISF+G ALDWYR Q E+++F  W 
Subjt:  TSETPGTSETPVTLAAK-ATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWK

Query:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG
        +LK  +L+RFR  RDG+L GRF  IKQE+TV+EYRN FDK +AP+  +       TFMNGL PW+K+EVE WEPVGLA  +  A      E      G  
Subjt:  NLKNHLLMRFRLERDGSLCGRFFAIKQESTVDEYRNLFDKLVAPLRLI--CQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAF----GCG

Query:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV
          Y+ K +          +     + +A GW               R+ G  +   DA+     E+G+     +    G  C+T D       +ELR+L+
Subjt:  FGYEVKWSAAVVRKERFRT-----HGSAKGW---------------RKNG--REPIDADANNHIERGV----DDGKSKGRTCETIDGCRVSGERELRVLV

Query:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS
        V ++  +LE++E E+  + + E++  +  +      ELSINSV GLTN G MK+KG +GKE VVVLIDCG THNFIAE+LV  L+LP+ ET NYGVILGS
Subjt:  VTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCGLTHNFIAERLVLSLKLPIVETSNYGVILGS

Query:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY
        G  IKGKG+C+ VEL +G   V  SFLPLELGGVD IL                                                      DQG+L+E 
Subjt:  GLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVIL-----------------------------------------------------VDQGFLIEY

Query:  RAMEGGVSLAEWYEVDDVYTISESN--PAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------
        R +E      E  EVD   T +E     A+++ F DVFE P  L PQR IEHHI+ K G DPVNVRPYR                               
Subjt:  RAMEGGVSLAEWYEVDDVYTISESN--PAVIDSFADVFEWPAAL-PQRSIEHHIHFKMGTDPVNVRPYR-------------------------------

Query:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN
                   WRFCVDY+ALNNVT+ D FPIPV+EELFDEL+GA +FSKIDLK+GYHQIRMHP DIEKT FR HEGHYEF+VMPFGLTNAPSTFQALMN
Subjt:  -----------WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMN

Query:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD
         +FKP+LRRFVLVFFDDI + S+ +E+H QHLE V  ILR++ELYAN +KC FAK R+ YLGH IS KG+E DPEKIRA+ EWP P N+RE+ GFLGLT 
Subjt:  NIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTD

Query:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL
        YYRRFV +YGS++APL+QLLK GA++W E+T+ AFE+LK AM+TL VL +PDF+L FE+E+DASG+G+G+VL Q K+P+AY++  LS RDR + VYEREL
Subjt:  YYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYEREL

Query:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP
        MAVV AVQ                                                     +NK  DALSRI PT HL  L+AP ++DVEVI+ EV  DP
Subjt:  MAVVMAVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDP

Query:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA
         L+E++  + +    +P +   QG LK+KGRLVLSK+STLIPT++HTYHDSVFGG+                                      SALSPA
Subjt:  KLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGY--------------------------------------SALSPA

Query:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL
        GLL+PLEIPD IWSDISMDFIE LPK+ G+ VILVVVDRLSKY HF+ LKHP+T KT     + EV+
Subjt:  GLLMPLEIPDVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL

A0A5D3DI73 Ty3/gypsy retrotransposon protein1.2e-1460.27Show/hide
Query:  FTTKTNPTTNIWEVLVGWKGLPAHEATWEIFDDFKQQFPDFHLEDKVVLEGESSVRPPIILTYS-RKGNKGKR
        +  + N  T  WE LV WKGLP HEATWE + D K QFP+FHLEDKV LE ES  RPPI+ TY+ RK +KG R
Subjt:  FTTKTNPTTNIWEVLVGWKGLPAHEATWEIFDDFKQQFPDFHLEDKVVLEGESSVRPPIILTYS-RKGNKGKR

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.3e-6239.74Show/hide
Query:  RWRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMNNIFKPYLRRF
        ++R  +DY+ LN +T+ D  PIP ++E+  +L     F+ IDL  G+HQI M P  + KT F    GHYE+L MPFGL NAP+TFQ  MN+I +P L + 
Subjt:  RWRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMNNIFKPYLRRF

Query:  VLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTDYYRRFVHHYG
         LV+ DDI V S ++++H+Q L  VF  L    L    +KC+F K+   +LGH+++  G++ +PEKI A+ ++PIPT  +E+  FLGLT YYR+F+ ++ 
Subjt:  VLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTDYYRRFVHHYG

Query:  SLAAPLSQLLKGGAFEWIEDT----QVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYERELMAVVMA
         +A P+++ LK      I+ T      AF++LK  +    +L +PDF   F + TDAS   +G+VL+Q   P++Y + TL+  +      E+EL+A+V A
Subjt:  SLAAPLSQLLKGGAFEWIEDT----QVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYERELMAVVMA

Query:  VQ
         +
Subjt:  VQ

P10401 Retrovirus-related Pol polyprotein from transposon gypsy9.9e-5938.2Show/hide
Query:  KMGTDPVNVRPYRWRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQAL
        K GTD     P + R  +D++ LN  TI D +P+P I  +   L  A  F+ +DLKSGYHQI +   D EKT+F ++ G YEF  +PFGL NA S FQ  
Subjt:  KMGTDPVNVRPYRWRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQAL

Query:  MNNIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGL
        ++++ +  + +   V+ DD+ + S    DHV+H++ V   L D  +  ++ K +F KE VEYLG ++S  G ++DPEK++A+ E+P P  + +V  FLGL
Subjt:  MNNIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGL

Query:  TDYYRRFVHHYGSLAAPLSQLLKGG------------AFEWIEDTQVAFERLKNAMITL-LVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHT
          YYR F+  + ++A P++ +LKG               E+ E  + AF+RL+N + +  ++L  PDF   F++ TDAS  GIG+VL+Q  RPI   + T
Subjt:  TDYYRRFVHHYGSLAAPLSQLLKGG------------AFEWIEDTQVAFERLKNAMITL-LVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHT

Query:  LSLRDRGKQVYERELMAVVMAV
        L   ++     EREL+A+V A+
Subjt:  LSLRDRGKQVYERELMAVVMAV

P20825 Retrovirus-related Pol polyprotein from transposon 2975.4e-6540.46Show/hide
Query:  RWRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMNNIFKPYLRRF
        ++R  +DY+ LN +TI D +PIP ++E+  +L     F+ IDL  G+HQI M    I KT F    GHYE+L MPFGL NAP+TFQ  MNNI +P L + 
Subjt:  RWRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMNNIFKPYLRRF

Query:  VLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTDYYRRFVHHYG
         LV+ DDI + S ++ +H+  ++ VF  L D  L    +KC+F K+   +LGH+++  G++ +P K++A++ +PIPT  +E+  FLGLT YYR+F+ +Y 
Subjt:  VLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTDYYRRFVHHYG

Query:  SLAAPLSQLLKGGA------FEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYERELMAVV
         +A P++  LK          E+IE    AFE+LK  +I   +L LPDF+  F + TDAS   +G+VL+Q   PI++ + TL+  +      E+EL+A+V
Subjt:  SLAAPLSQLLKGGA------FEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYERELMAVV

Query:  MAVQ
         A +
Subjt:  MAVQ

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein8.1e-6130.94Show/hide
Query:  WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMNNIFKPYLRRFV
        +R CVDY+ LN  TI D FP+P I+ L   +  A +F+ +DL SGYHQI M P+D  KT F    G YE+ VMPFGL NAPSTF   M + F+    RFV
Subjt:  WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMNNIFKPYLRRFV

Query:  LVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTDYYRRFVHHYGS
         V+ DDI + S + E+H +HL+ V   L++  L   K KC+FA E  E+LG+ I  + +     K  A+ ++P P  +++   FLG+ +YYRRF+ +   
Subjt:  LVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTDYYRRFVHHYGS

Query:  LAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRP------IAYYNHTLSLRDRGKQVYERELMAVVM
        +A P+ QL      +W E    A E+LK A+    VLV  +    + + TDAS  GIG+VL ++         + Y++ +L    +     E EL+ ++ 
Subjt:  LAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRP------IAYYNHTLSLRDRGKQVYERELMAVVM

Query:  AVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDPKLKEV
        A+                                                      +N V DA+SR   TI     S P  ID E  K+   +DP    V
Subjt:  AVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDPKLKEV

Query:  ---VEKLVQH---ADSVPGFVWQQGTLK----------YKGRLVLSKSSTLIP-----TVLHTYHD-SVFGGY-------SALSPA--------------
           +++L QH    + +  F   Q  L+           +  ++  +   ++P      V+  YHD ++FGG+       + +SP               
Subjt:  ---VEKLVQH---ADSVPGFVWQQGTLK----------YKGRLVLSKSSTLIP-----TVLHTYHD-SVFGGY-------SALSPA--------------

Query:  -----------------GLLMPLEIPDVIWSDISMDFIEAL-PKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL
                         GLL PL I +  W DISMDF+  L P +   N+ILVVVDR SK  HFIA     T KT   T + ++L
Subjt:  -----------------GLLMPLEIPDVIWSDISMDFIEAL-PKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL

Q99315 Transposon Ty3-G Gag-Pol polyprotein6.2e-6130.77Show/hide
Query:  WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMNNIFKPYLRRFV
        +R CVDY+ LN  TI D FP+P I+ L   +  A +F+ +DL SGYHQI M P+D  KT F    G YE+ VMPFGL NAPSTF   M + F+    RFV
Subjt:  WRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGLTNAPSTFQALMNNIFKPYLRRFV

Query:  LVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTDYYRRFVHHYGS
         V+ DDI + S + E+H +HL+ V   L++  L   K KC+FA E  E+LG+ I  + +     K  A+ ++P P  +++   FLG+ +YYRRF+ +   
Subjt:  LVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGLTDYYRRFVHHYGS

Query:  LAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRP------IAYYNHTLSLRDRGKQVYERELMAVVM
        +A P+ QL      +W E    A ++LK+A+    VLV  +    + + TDAS  GIG+VL ++         + Y++ +L    +     E EL+ ++ 
Subjt:  LAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRP------IAYYNHTLSLRDRGKQVYERELMAVVM

Query:  AVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDPKLKEV
        A+                                                      +N V DA+SR   TI     S P  ID E  K+   +DP    V
Subjt:  AVQ-----------------------------------------------------QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDPKLKEV

Query:  ---VEKLVQH---ADSVPGFVWQQGTLK----------YKGRLVLSKSSTLIP-----TVLHTYHD-SVFGGY-------SALSPA--------------
           +++L QH    + +  F   Q  L+           +  ++  +   ++P      V+  YHD ++FGG+       + +SP               
Subjt:  ---VEKLVQH---ADSVPGFVWQQGTLK----------YKGRLVLSKSSTLIP-----TVLHTYHD-SVFGGY-------SALSPA--------------

Query:  -----------------GLLMPLEIPDVIWSDISMDFIEAL-PKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL
                         GLL PL I +  W DISMDF+  L P +   N+ILVVVDR SK  HFIA     T KT   T + ++L
Subjt:  -----------------GLLMPLEIPDVIWSDISMDFIEAL-PKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVL

Arabidopsis top hitse value%identityAlignment
AT4G27860.1 vacuolar iron transporter (VIT) family protein1.3e-3745.93Show/hide
Query:  VEPEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQFRRTSNETDDVE----HVDRYEVVLGNRE
        VEP     N+ EI+KSIVYGGL ESITSL  V SAA++  +T N++ L +ANL SGL +  H+L  L +E+ R+ +N  D  E      DRYE VLG RE
Subjt:  VEPEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQFRRTSNETDDVE----HVDRYEVVLGNRE

Query:  NYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDK--DLKLAAVAGASLLCIILLALGKAYIQRPNRWDVYFKMVASYVVIAAGAGGFSYLAGDLLDKVIK
           +H  +AI SFV+FGL+PPLVYGFSF K  +K  + K+ AV   SLLCI+LL++ KAY+ +      Y K +  Y   A  A GFS   G L+ + ++
Subjt:  NYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDK--DLKLAAVAGASLLCIILLALGKAYIQRPNRWDVYFKMVASYVVIAAGAGGFSYLAGDLLDKVIK

Query:  KYGWFEENP
        K G+++++P
Subjt:  KYGWFEENP

AT4G27860.2 vacuolar iron transporter (VIT) family protein1.3e-3745.93Show/hide
Query:  VEPEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQFRRTSNETDDVE----HVDRYEVVLGNRE
        VEP     N+ EI+KSIVYGGL ESITSL  V SAA++  +T N++ L +ANL SGL +  H+L  L +E+ R+ +N  D  E      DRYE VLG RE
Subjt:  VEPEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQFRRTSNETDDVE----HVDRYEVVLGNRE

Query:  NYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDK--DLKLAAVAGASLLCIILLALGKAYIQRPNRWDVYFKMVASYVVIAAGAGGFSYLAGDLLDKVIK
           +H  +AI SFV+FGL+PPLVYGFSF K  +K  + K+ AV   SLLCI+LL++ KAY+ +      Y K +  Y   A  A GFS   G L+ + ++
Subjt:  NYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDK--DLKLAAVAGASLLCIILLALGKAYIQRPNRWDVYFKMVASYVVIAAGAGGFSYLAGDLLDKVIK

Query:  KYGWFEENP
        K G+++++P
Subjt:  KYGWFEENP

AT4G27870.1 Vacuolar iron transporter (VIT) family protein2.8e-4032.98Show/hide
Query:  SNGSVQDDNQTSTANKVLVQNQSNGFAVLSESEIDTNVNSTPGLTSLEAMMAETVTDTVDEKKGIDVENVVVGIPYSLESKGGLLDRFRLPALFNKAPVP
        S G++    + +T + V+  + +    V   ++    VN+   +   +  +A  V +   +K   D  +   G   S + +   +      ++ N     
Subjt:  SNGSVQDDNQTSTANKVLVQNQSNGFAVLSESEIDTNVNSTPGLTSLEAMMAETVTDTVDEKKGIDVENVVVGIPYSLESKGGLLDRFRLPALFNKAPVP

Query:  DQSAAVAETEIPKTPEPV---GATVPDSSPVSPSLGAPTAAERVTDSAVGSREVEGGPVAISIDDSRDEQVEPEPSRY----------NRWEIVKSIVYG
             V ET I +   P+   G   PD+S     L   T    V D+  G R V  GP    ID S    +E    R            + EI+KSIVYG
Subjt:  DQSAAVAETEIPKTPEPV---GATVPDSSPVSPSLGAPTAAERVTDSAVGSREVEGGPVAISIDDSRDEQVEPEPSRY----------NRWEIVKSIVYG

Query:  GLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQFRRTSNETDDV----EHVDRYEVVLGNRENYILHFTLAIFSFVLFGLVP
        GL E+ITSLG+++SAA +  +  NI++L LANL+ GL ++ HNL  L+ E+  RT+ E +      E   RY+ +LG REN+ LH T+AI SF++ G++P
Subjt:  GLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQFRRTSNETDDV----EHVDRYEVVLGNRENYILHFTLAIFSFVLFGLVP

Query:  PLVYGFSFTKSNDKDLKLAAVAGASLLCIILLALGKAYIQRPNRWDVYFKMVASYVVIAAGAGGFSYLAGDLLDKVIKKYGW
        P+VY FSF++ ++KD K+A+V GASL CI+LLA+ KA+++ P     Y K +  Y  IA    G SY+ G+ L+++++K+GW
Subjt:  PLVYGFSFTKSNDKDLKLAAVAGASLLCIILLALGKAYIQRPNRWDVYFKMVASYVVIAAGAGGFSYLAGDLLDKVIKKYGW

AT5G24290.1 Vacuolar iron transporter (VIT) family protein1.9e-3335.69Show/hide
Query:  VEGGPVAISIDDSRD-----------EQVEPEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQF
        + G PV  S+++  D            Q++P+  R+   E++KS VYGGL E+ITSLG+V+SA+++ ++T NI+ LA+ANL  GL +L  N   L+    
Subjt:  VEGGPVAISIDDSRD-----------EQVEPEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQF

Query:  RRTSNETDDVEHVDRYEVVLGNRENYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDKDLKLAAVAGASLLCIILLALGKAYIQRPNR----WDVYFKMV
            N +D  +  DRYE +LG R    +H  +A+ S++ FGL+PPLVY FSF ++  K+ KL +V   SL+C+ILL   K Y+++P         Y K  
Subjt:  RRTSNETDDVEHVDRYEVVLGNRENYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDKDLKLAAVAGASLLCIILLALGKAYIQRPNR----WDVYFKMV

Query:  ASYVVIAAGAGGFSYLAGDLLDKVIKKYGWFEENPAFNLISLPLPEMSLAKPAWG
        A Y  I   + G SY+ GD++ + I+K           L  + L ++S+  P +G
Subjt:  ASYVVIAAGAGGFSYLAGDLLDKVIKKYGWFEENPAFNLISLPLPEMSLAKPAWG

AT5G24290.2 Vacuolar iron transporter (VIT) family protein1.9e-3335.69Show/hide
Query:  VEGGPVAISIDDSRD-----------EQVEPEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQF
        + G PV  S+++  D            Q++P+  R+   E++KS VYGGL E+ITSLG+V+SA+++ ++T NI+ LA+ANL  GL +L  N   L+    
Subjt:  VEGGPVAISIDDSRD-----------EQVEPEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQF

Query:  RRTSNETDDVEHVDRYEVVLGNRENYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDKDLKLAAVAGASLLCIILLALGKAYIQRPNR----WDVYFKMV
            N +D  +  DRYE +LG R    +H  +A+ S++ FGL+PPLVY FSF ++  K+ KL +V   SL+C+ILL   K Y+++P         Y K  
Subjt:  RRTSNETDDVEHVDRYEVVLGNRENYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDKDLKLAAVAGASLLCIILLALGKAYIQRPNR----WDVYFKMV

Query:  ASYVVIAAGAGGFSYLAGDLLDKVIKKYGWFEENPAFNLISLPLPEMSLAKPAWG
        A Y  I   + G SY+ GD++ + I+K           L  + L ++S+  P +G
Subjt:  ASYVVIAAGAGGFSYLAGDLLDKVIKKYGWFEENPAFNLISLPLPEMSLAKPAWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCAACGGAAGTTCGAAGAACGGATGGAAGCAATGGATCAAGAGGTATCGGAAATTCGGGTAGAAATTAGCAAATTGCCTGTAATAGAAGAGACCTTGGTTTCTTT
ATCCAAAAGTATCGAGAGATTGAGTGTTCAAACGGAAAAACAACAGATAATGTTAGCGACAATTGCTATCGACTTAGCGAGAGGACAGAAAGGTATCGCTGGAGATGAAT
CGGGCAATAAAAGTGGGACGAGTGAGACACTTAGGACGAGCGAGACCCCTGGGACTAGCGAGACACCTGGTATGTTAACAGCGAAGGCGACGAGCGAGACACCTGGGACT
AGCGAGACACCTGTTACGTTAGCAGCAAAGGCGACGATGAAAGCAAAATCGGAAACTAACGACACATCATTGGATCAGAATAAATTCAAGAAAGTCGAGATGCCGGTGTT
TAATGGAGTTGATCCGGATGGGTGGTTCTTCAGAGCTGATCGTTATTTCCAAATTCATAGATTATCGGATTCAGAGAAGATGACGATGGCAGTTATTAGTTTCAATGGTG
TCGCTCTTGATTGGTATCGGCCCCAAGGGGAGCAAGATTCATTCAGCGATTGGAAAAACTTGAAGAACCATTTATTGATGCGCTTCAGATTGGAGCGAGATGGATCACTT
TGTGGCCGATTTTTTGCAATTAAGCAGGAGTCTACCGTGGATGAATATCGGAACCTCTTTGACAAGTTGGTTGCGCCATTGCGCCTCATCTGTCAAACGAGGTGGACTTT
CATGAATGGGTTGCCTCCTTGGATTAAAGCAGAGGTAGAATGTTGGGAACCCGTGGGGTTAGCGCATGAAGTTGGCTCGGCGAGTGGAAAATCGGGAATTGAGGCGTTTG
GATGCGGGTTTGGGTACGAAGTCAAGTGGTCGGCTGCAGTTGTTCGAAAAGAAAGATTTCGTACCCACGGCAGTGCAAAAGGATGGAGGAAGAATGGCAGAGAACCCATC
GATGCCGATGCGAACAATCACATTGAGAGGGGTGTTGACGATGGGAAATCAAAGGGAAGGACCTGCGAGACGATTGACGGATGCCGAGTTTCAGGCGAAAGGGAACTTCG
AGTGTTGGTTGTTACTGACAGCATTAATGATTTGGAGGTTGTGGAAAATGAGGATAATAGGGAAGAGCAAAATGAGTTGCAAATGATCAAGGCAGATAATGAAATTCAAA
CAATATCGGAATTGTCAATTAATTCCGTGGGGGGCTTGACGAATCTGGGAATGATGAAGCTTAAAGGCAACATTGGGAAAGAGGCAGTCGTGGTGTTAATTGATTGTGGA
CTGACACACAATTTTATTGCAGAAAGGTTAGTTTTGTCGTTGAAATTGCCGATTGTTGAAACCTCCAATTATGGAGTAATTTTGGGGTCGGGATTGGTGATAAAAGGGAA
GGGCATTTGTAATGTTGTGGAGTTAGTAATCGGAGAGTTGATTGTGCGTGGTAGTTTCTTGCCCTTAGAATTGGGAGGAGTAGATGTGATATTAGTAGACCAAGGATTTC
TCATTGAGTACCGTGCTATGGAAGGAGGAGTGTCGCTGGCTGAATGGTATGAGGTAGATGATGTCTATACCATCTCTGAATCAAATCCAGCAGTGATTGATAGCTTTGCT
GATGTGTTTGAGTGGCCAGCAGCCTTACCGCAGAGGAGTATAGAGCATCACATCCATTTCAAGATGGGAACGGATCCGGTTAATGTGAGACCATATAGGTGGAGATTCTG
TGTTGATTATCAAGCTTTGAATAATGTCACTATACTTGATAATTTTCCTATTCCGGTAATTGAAGAACTGTTTGATGAATTGCATGGAGCGACATTATTTTCCAAGATTG
ATTTGAAATCAGGTTATCACCAAATTAGGATGCATCCAAGGGACATTGAGAAAACAACTTTCCGCATGCATGAGGGCCATTATGAGTTTTTGGTGATGCCTTTTGGGCTG
ACTAATGCTCCATCGACGTTTCAAGCACTCATGAATAACATCTTTAAACCATACTTGAGGAGATTCGTACTTGTATTCTTTGATGATATTTTTGTGTGTAGTAGAACTGT
CGAGGATCATGTTCAGCATTTGGAGGAGGTATTTTTGATATTGAGGGATAATGAGTTGTATGCAAATAAAAACAAATGTCAATTTGCCAAGGAGAGAGTGGAGTATTTGG
GGCATCTAATCTCAGGTAAAGGTATGGAAGCGGATCCCGAGAAGATTCGAGCAATGCTGGAATGGCCTATACCTACGAATATCAGGGAGGTAAGTGGATTTTTGGGTTTG
ACCGATTATTATAGAAGATTTGTTCATCACTATGGTAGTTTGGCTGCTCCTTTGAGTCAATTATTGAAAGGGGGAGCTTTTGAATGGATTGAGGATACTCAAGTTGCTTT
TGAACGTCTGAAGAATGCCATGATAACATTACTAGTATTGGTGTTACCTGATTTTGATTTACTGTTTGAAGTGGAGACGGATGCATCGGGATATGGGATTGGATCGGTAT
TGACACAACTAAAGAGACCTATAGCTTATTACAACCACACCTTGTCTTTACGAGATAGAGGAAAGCAAGTGTATGAACGAGAGTTGATGGCTGTAGTGATGGCAGTGCAG
CAGAACAAGGTTGTTGACGCTTTATCTCGCATTCCACCAACGATACATTTAGCCTACTTATCAGCTCCTACTATCATAGATGTGGAAGTTATCAAAACGGAGGTGGCAAA
TGATCCGAAGTTGAAAGAAGTGGTTGAGAAATTGGTGCAGCATGCTGACAGTGTACCTGGGTTTGTGTGGCAACAAGGTACTTTGAAATACAAGGGTCGTCTGGTCTTAT
CTAAGTCCTCTACTCTCATTCCTACAGTTTTGCATACTTATCATGATTCTGTATTTGGGGGGTATTCAGCTTTGTCTCCCGCGGGGTTGTTGATGCCCTTGGAGATTCCT
GATGTTATTTGGAGTGATATTTCGATGGATTTCATTGAGGCGTTGCCAAAGGCAAAGGGATACAATGTTATTTTGGTAGTGGTAGATAGATTAAGCAAGTACGACCATTT
CATTGCATTGAAACATCCATTCACCACTAAAACGAATCCGACAACCAATATATGGGAAGTGTTGGTGGGATGGAAAGGGTTACCAGCTCATGAAGCTACATGGGAAATCT
TTGATGACTTCAAACAACAATTTCCAGACTTTCACCTTGAGGACAAGGTGGTTCTTGAAGGGGAGAGTAGTGTTAGACCACCTATAATATTAACTTATAGTAGAAAAGGG
AATAAAGGAAAGAGAATTGTTTGTGGAAGTGTAGCTTCTACTAGCCAAAGGAGGGAAGGAAACTTGAAGTTTCCTTCTGTTCTAGTCTTAGGGCTGCAATCTTGTGATTC
AAATAAATGCACAGTGAAGTTTCTTATCAGTTTCCTACTAAATATGCATGCAGGAAATAACGTTGAAAGCTTAGGGATCAGAGCGGAAATTCAAGTTGATAGAGCTCCAG
TTCCTGCTCAGTCAGTTGATTATGCAGTAACCGATAAAAATGAAGATGATCATGTTGCAGCCCAGGCTCATCCTTCACTTAAAGCAACAAGTGAAGAGGGAATGATTGTA
CAAGAAGAAGCATTGGCTAGCAAGAAAGGAAATAACGTTGAAAGCATAGAGTTCGGAGAAAGAGAGATATTTCAAGTTCAGGAAACAACAGATCCTCAAGAAAATCAAGT
TGATAAAGCTCCAGTTCCCAACCAGTCAGTTGATTATGCAGTATCTGATAAAAATGAAGTCGTACCTGATCCATCAGTTGGTAATGGAGTAGCTGATCAAATACGAGGCA
TCCATGCTGTATCTAATTCAAAGCCAACTCATCCTTCACTTAACACAACAGTTGCTGAGCAAGAGGTTGTGCCAGTTAATGGAGTGGAAAGCAAACAAGGGAACACGATT
GAGAGCATCATAGTTGGAAGACCAGAGCCACTGCTGGAATCCGCTGAAACAGTATATGATCAAGGAATTCAAGCTGATTCCATAAATAAAACCCAGGTTCCTGATCAGCT
GGTTGAATTTGATATGTGGACCAATGACAACTCCCTAGAAACTAAAGTAGATTCTACCATAGATTCGAGCATACTAGATGGTGCTAAGGACACCAAGAAAGGAATGACTG
TTGAGAATATTGTAGTTGGAATACCATACACATCACATGCGTCGAATGGATCAGTGCAGGATGATAATCAAACTTCAACAGCTAACAAAGTGCTAGTTCAGAATCAGTCA
AATGGATTTGCAGTATTATCAGAGTCTGAAATTGATACTAATGTTAATTCGACACCGGGTCTTACTTCTCTTGAAGCAATGATGGCTGAGACTGTAACAGATACAGTCGA
TGAAAAAAAAGGGATTGATGTTGAGAATGTTGTGGTTGGAATTCCATACTCCCTAGAGTCAAAGGGAGGACTGCTCGATCGATTTAGGCTACCTGCATTGTTTAACAAAG
CTCCTGTTCCTGATCAGTCTGCTGCAGTTGCTGAAACAGAGATCCCAAAAACACCAGAGCCTGTAGGAGCTACCGTGCCTGATTCTTCTCCAGTTTCTCCTTCCCTTGGA
GCACCAACTGCTGCCGAGAGAGTAACAGACAGTGCAGTGGGCAGTCGTGAAGTTGAAGGAGGACCTGTAGCTATATCCATTGATGATTCTCGGGATGAACAAGTAGAACC
TGAACCAAGTAGGTATAATAGGTGGGAAATTGTCAAAAGTATAGTTTATGGTGGTTTGGCTGAGTCAATTACAAGCCTTGGCATTGTGGCGTCTGCAGCAAGTGCCAATA
CTGCAACAGGGAACATTGTAATTTTGGCGTTGGCAAACCTGATCAGTGGGCTTTTCATCCTTGGACATAATTTGACAGGACTCAAGAGCGAGCAGTTCAGAAGGACATCT
AATGAAACAGACGATGTTGAACACGTAGACCGATACGAAGTAGTATTGGGAAACCGAGAGAATTACATTCTACATTTCACGCTTGCTATCTTCTCGTTTGTACTCTTCGG
TCTAGTCCCTCCTCTCGTTTACGGTTTCTCATTCACCAAGAGCAATGACAAGGATCTCAAGCTTGCAGCAGTAGCAGGAGCTTCTCTTTTATGCATCATATTGCTTGCTC
TCGGGAAGGCTTACATTCAGAGACCAAATAGGTGGGATGTGTACTTCAAAATGGTAGCATCATATGTCGTTATAGCGGCTGGGGCCGGGGGTTTCTCATATTTGGCCGGT
GATCTGCTTGACAAGGTCATCAAGAAATATGGTTGGTTTGAGGAAAACCCTGCATTCAATCTTATTAGTCTGCCCCTTCCAGAGATGAGTTTGGCGAAACCTGCATGGGG
ATCCTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCAACGGAAGTTCGAAGAACGGATGGAAGCAATGGATCAAGAGGTATCGGAAATTCGGGTAGAAATTAGCAAATTGCCTGTAATAGAAGAGACCTTGGTTTCTTT
ATCCAAAAGTATCGAGAGATTGAGTGTTCAAACGGAAAAACAACAGATAATGTTAGCGACAATTGCTATCGACTTAGCGAGAGGACAGAAAGGTATCGCTGGAGATGAAT
CGGGCAATAAAAGTGGGACGAGTGAGACACTTAGGACGAGCGAGACCCCTGGGACTAGCGAGACACCTGGTATGTTAACAGCGAAGGCGACGAGCGAGACACCTGGGACT
AGCGAGACACCTGTTACGTTAGCAGCAAAGGCGACGATGAAAGCAAAATCGGAAACTAACGACACATCATTGGATCAGAATAAATTCAAGAAAGTCGAGATGCCGGTGTT
TAATGGAGTTGATCCGGATGGGTGGTTCTTCAGAGCTGATCGTTATTTCCAAATTCATAGATTATCGGATTCAGAGAAGATGACGATGGCAGTTATTAGTTTCAATGGTG
TCGCTCTTGATTGGTATCGGCCCCAAGGGGAGCAAGATTCATTCAGCGATTGGAAAAACTTGAAGAACCATTTATTGATGCGCTTCAGATTGGAGCGAGATGGATCACTT
TGTGGCCGATTTTTTGCAATTAAGCAGGAGTCTACCGTGGATGAATATCGGAACCTCTTTGACAAGTTGGTTGCGCCATTGCGCCTCATCTGTCAAACGAGGTGGACTTT
CATGAATGGGTTGCCTCCTTGGATTAAAGCAGAGGTAGAATGTTGGGAACCCGTGGGGTTAGCGCATGAAGTTGGCTCGGCGAGTGGAAAATCGGGAATTGAGGCGTTTG
GATGCGGGTTTGGGTACGAAGTCAAGTGGTCGGCTGCAGTTGTTCGAAAAGAAAGATTTCGTACCCACGGCAGTGCAAAAGGATGGAGGAAGAATGGCAGAGAACCCATC
GATGCCGATGCGAACAATCACATTGAGAGGGGTGTTGACGATGGGAAATCAAAGGGAAGGACCTGCGAGACGATTGACGGATGCCGAGTTTCAGGCGAAAGGGAACTTCG
AGTGTTGGTTGTTACTGACAGCATTAATGATTTGGAGGTTGTGGAAAATGAGGATAATAGGGAAGAGCAAAATGAGTTGCAAATGATCAAGGCAGATAATGAAATTCAAA
CAATATCGGAATTGTCAATTAATTCCGTGGGGGGCTTGACGAATCTGGGAATGATGAAGCTTAAAGGCAACATTGGGAAAGAGGCAGTCGTGGTGTTAATTGATTGTGGA
CTGACACACAATTTTATTGCAGAAAGGTTAGTTTTGTCGTTGAAATTGCCGATTGTTGAAACCTCCAATTATGGAGTAATTTTGGGGTCGGGATTGGTGATAAAAGGGAA
GGGCATTTGTAATGTTGTGGAGTTAGTAATCGGAGAGTTGATTGTGCGTGGTAGTTTCTTGCCCTTAGAATTGGGAGGAGTAGATGTGATATTAGTAGACCAAGGATTTC
TCATTGAGTACCGTGCTATGGAAGGAGGAGTGTCGCTGGCTGAATGGTATGAGGTAGATGATGTCTATACCATCTCTGAATCAAATCCAGCAGTGATTGATAGCTTTGCT
GATGTGTTTGAGTGGCCAGCAGCCTTACCGCAGAGGAGTATAGAGCATCACATCCATTTCAAGATGGGAACGGATCCGGTTAATGTGAGACCATATAGGTGGAGATTCTG
TGTTGATTATCAAGCTTTGAATAATGTCACTATACTTGATAATTTTCCTATTCCGGTAATTGAAGAACTGTTTGATGAATTGCATGGAGCGACATTATTTTCCAAGATTG
ATTTGAAATCAGGTTATCACCAAATTAGGATGCATCCAAGGGACATTGAGAAAACAACTTTCCGCATGCATGAGGGCCATTATGAGTTTTTGGTGATGCCTTTTGGGCTG
ACTAATGCTCCATCGACGTTTCAAGCACTCATGAATAACATCTTTAAACCATACTTGAGGAGATTCGTACTTGTATTCTTTGATGATATTTTTGTGTGTAGTAGAACTGT
CGAGGATCATGTTCAGCATTTGGAGGAGGTATTTTTGATATTGAGGGATAATGAGTTGTATGCAAATAAAAACAAATGTCAATTTGCCAAGGAGAGAGTGGAGTATTTGG
GGCATCTAATCTCAGGTAAAGGTATGGAAGCGGATCCCGAGAAGATTCGAGCAATGCTGGAATGGCCTATACCTACGAATATCAGGGAGGTAAGTGGATTTTTGGGTTTG
ACCGATTATTATAGAAGATTTGTTCATCACTATGGTAGTTTGGCTGCTCCTTTGAGTCAATTATTGAAAGGGGGAGCTTTTGAATGGATTGAGGATACTCAAGTTGCTTT
TGAACGTCTGAAGAATGCCATGATAACATTACTAGTATTGGTGTTACCTGATTTTGATTTACTGTTTGAAGTGGAGACGGATGCATCGGGATATGGGATTGGATCGGTAT
TGACACAACTAAAGAGACCTATAGCTTATTACAACCACACCTTGTCTTTACGAGATAGAGGAAAGCAAGTGTATGAACGAGAGTTGATGGCTGTAGTGATGGCAGTGCAG
CAGAACAAGGTTGTTGACGCTTTATCTCGCATTCCACCAACGATACATTTAGCCTACTTATCAGCTCCTACTATCATAGATGTGGAAGTTATCAAAACGGAGGTGGCAAA
TGATCCGAAGTTGAAAGAAGTGGTTGAGAAATTGGTGCAGCATGCTGACAGTGTACCTGGGTTTGTGTGGCAACAAGGTACTTTGAAATACAAGGGTCGTCTGGTCTTAT
CTAAGTCCTCTACTCTCATTCCTACAGTTTTGCATACTTATCATGATTCTGTATTTGGGGGGTATTCAGCTTTGTCTCCCGCGGGGTTGTTGATGCCCTTGGAGATTCCT
GATGTTATTTGGAGTGATATTTCGATGGATTTCATTGAGGCGTTGCCAAAGGCAAAGGGATACAATGTTATTTTGGTAGTGGTAGATAGATTAAGCAAGTACGACCATTT
CATTGCATTGAAACATCCATTCACCACTAAAACGAATCCGACAACCAATATATGGGAAGTGTTGGTGGGATGGAAAGGGTTACCAGCTCATGAAGCTACATGGGAAATCT
TTGATGACTTCAAACAACAATTTCCAGACTTTCACCTTGAGGACAAGGTGGTTCTTGAAGGGGAGAGTAGTGTTAGACCACCTATAATATTAACTTATAGTAGAAAAGGG
AATAAAGGAAAGAGAATTGTTTGTGGAAGTGTAGCTTCTACTAGCCAAAGGAGGGAAGGAAACTTGAAGTTTCCTTCTGTTCTAGTCTTAGGGCTGCAATCTTGTGATTC
AAATAAATGCACAGTGAAGTTTCTTATCAGTTTCCTACTAAATATGCATGCAGGAAATAACGTTGAAAGCTTAGGGATCAGAGCGGAAATTCAAGTTGATAGAGCTCCAG
TTCCTGCTCAGTCAGTTGATTATGCAGTAACCGATAAAAATGAAGATGATCATGTTGCAGCCCAGGCTCATCCTTCACTTAAAGCAACAAGTGAAGAGGGAATGATTGTA
CAAGAAGAAGCATTGGCTAGCAAGAAAGGAAATAACGTTGAAAGCATAGAGTTCGGAGAAAGAGAGATATTTCAAGTTCAGGAAACAACAGATCCTCAAGAAAATCAAGT
TGATAAAGCTCCAGTTCCCAACCAGTCAGTTGATTATGCAGTATCTGATAAAAATGAAGTCGTACCTGATCCATCAGTTGGTAATGGAGTAGCTGATCAAATACGAGGCA
TCCATGCTGTATCTAATTCAAAGCCAACTCATCCTTCACTTAACACAACAGTTGCTGAGCAAGAGGTTGTGCCAGTTAATGGAGTGGAAAGCAAACAAGGGAACACGATT
GAGAGCATCATAGTTGGAAGACCAGAGCCACTGCTGGAATCCGCTGAAACAGTATATGATCAAGGAATTCAAGCTGATTCCATAAATAAAACCCAGGTTCCTGATCAGCT
GGTTGAATTTGATATGTGGACCAATGACAACTCCCTAGAAACTAAAGTAGATTCTACCATAGATTCGAGCATACTAGATGGTGCTAAGGACACCAAGAAAGGAATGACTG
TTGAGAATATTGTAGTTGGAATACCATACACATCACATGCGTCGAATGGATCAGTGCAGGATGATAATCAAACTTCAACAGCTAACAAAGTGCTAGTTCAGAATCAGTCA
AATGGATTTGCAGTATTATCAGAGTCTGAAATTGATACTAATGTTAATTCGACACCGGGTCTTACTTCTCTTGAAGCAATGATGGCTGAGACTGTAACAGATACAGTCGA
TGAAAAAAAAGGGATTGATGTTGAGAATGTTGTGGTTGGAATTCCATACTCCCTAGAGTCAAAGGGAGGACTGCTCGATCGATTTAGGCTACCTGCATTGTTTAACAAAG
CTCCTGTTCCTGATCAGTCTGCTGCAGTTGCTGAAACAGAGATCCCAAAAACACCAGAGCCTGTAGGAGCTACCGTGCCTGATTCTTCTCCAGTTTCTCCTTCCCTTGGA
GCACCAACTGCTGCCGAGAGAGTAACAGACAGTGCAGTGGGCAGTCGTGAAGTTGAAGGAGGACCTGTAGCTATATCCATTGATGATTCTCGGGATGAACAAGTAGAACC
TGAACCAAGTAGGTATAATAGGTGGGAAATTGTCAAAAGTATAGTTTATGGTGGTTTGGCTGAGTCAATTACAAGCCTTGGCATTGTGGCGTCTGCAGCAAGTGCCAATA
CTGCAACAGGGAACATTGTAATTTTGGCGTTGGCAAACCTGATCAGTGGGCTTTTCATCCTTGGACATAATTTGACAGGACTCAAGAGCGAGCAGTTCAGAAGGACATCT
AATGAAACAGACGATGTTGAACACGTAGACCGATACGAAGTAGTATTGGGAAACCGAGAGAATTACATTCTACATTTCACGCTTGCTATCTTCTCGTTTGTACTCTTCGG
TCTAGTCCCTCCTCTCGTTTACGGTTTCTCATTCACCAAGAGCAATGACAAGGATCTCAAGCTTGCAGCAGTAGCAGGAGCTTCTCTTTTATGCATCATATTGCTTGCTC
TCGGGAAGGCTTACATTCAGAGACCAAATAGGTGGGATGTGTACTTCAAAATGGTAGCATCATATGTCGTTATAGCGGCTGGGGCCGGGGGTTTCTCATATTTGGCCGGT
GATCTGCTTGACAAGGTCATCAAGAAATATGGTTGGTTTGAGGAAAACCCTGCATTCAATCTTATTAGTCTGCCCCTTCCAGAGATGAGTTTGGCGAAACCTGCATGGGG
ATCCTCTTGA
Protein sequenceShow/hide protein sequence
MAQRKFEERMEAMDQEVSEIRVEISKLPVIEETLVSLSKSIERLSVQTEKQQIMLATIAIDLARGQKGIAGDESGNKSGTSETLRTSETPGTSETPGMLTAKATSETPGT
SETPVTLAAKATMKAKSETNDTSLDQNKFKKVEMPVFNGVDPDGWFFRADRYFQIHRLSDSEKMTMAVISFNGVALDWYRPQGEQDSFSDWKNLKNHLLMRFRLERDGSL
CGRFFAIKQESTVDEYRNLFDKLVAPLRLICQTRWTFMNGLPPWIKAEVECWEPVGLAHEVGSASGKSGIEAFGCGFGYEVKWSAAVVRKERFRTHGSAKGWRKNGREPI
DADANNHIERGVDDGKSKGRTCETIDGCRVSGERELRVLVVTDSINDLEVVENEDNREEQNELQMIKADNEIQTISELSINSVGGLTNLGMMKLKGNIGKEAVVVLIDCG
LTHNFIAERLVLSLKLPIVETSNYGVILGSGLVIKGKGICNVVELVIGELIVRGSFLPLELGGVDVILVDQGFLIEYRAMEGGVSLAEWYEVDDVYTISESNPAVIDSFA
DVFEWPAALPQRSIEHHIHFKMGTDPVNVRPYRWRFCVDYQALNNVTILDNFPIPVIEELFDELHGATLFSKIDLKSGYHQIRMHPRDIEKTTFRMHEGHYEFLVMPFGL
TNAPSTFQALMNNIFKPYLRRFVLVFFDDIFVCSRTVEDHVQHLEEVFLILRDNELYANKNKCQFAKERVEYLGHLISGKGMEADPEKIRAMLEWPIPTNIREVSGFLGL
TDYYRRFVHHYGSLAAPLSQLLKGGAFEWIEDTQVAFERLKNAMITLLVLVLPDFDLLFEVETDASGYGIGSVLTQLKRPIAYYNHTLSLRDRGKQVYERELMAVVMAVQ
QNKVVDALSRIPPTIHLAYLSAPTIIDVEVIKTEVANDPKLKEVVEKLVQHADSVPGFVWQQGTLKYKGRLVLSKSSTLIPTVLHTYHDSVFGGYSALSPAGLLMPLEIP
DVIWSDISMDFIEALPKAKGYNVILVVVDRLSKYDHFIALKHPFTTKTNPTTNIWEVLVGWKGLPAHEATWEIFDDFKQQFPDFHLEDKVVLEGESSVRPPIILTYSRKG
NKGKRIVCGSVASTSQRREGNLKFPSVLVLGLQSCDSNKCTVKFLISFLLNMHAGNNVESLGIRAEIQVDRAPVPAQSVDYAVTDKNEDDHVAAQAHPSLKATSEEGMIV
QEEALASKKGNNVESIEFGEREIFQVQETTDPQENQVDKAPVPNQSVDYAVSDKNEVVPDPSVGNGVADQIRGIHAVSNSKPTHPSLNTTVAEQEVVPVNGVESKQGNTI
ESIIVGRPEPLLESAETVYDQGIQADSINKTQVPDQLVEFDMWTNDNSLETKVDSTIDSSILDGAKDTKKGMTVENIVVGIPYTSHASNGSVQDDNQTSTANKVLVQNQS
NGFAVLSESEIDTNVNSTPGLTSLEAMMAETVTDTVDEKKGIDVENVVVGIPYSLESKGGLLDRFRLPALFNKAPVPDQSAAVAETEIPKTPEPVGATVPDSSPVSPSLG
APTAAERVTDSAVGSREVEGGPVAISIDDSRDEQVEPEPSRYNRWEIVKSIVYGGLAESITSLGIVASAASANTATGNIVILALANLISGLFILGHNLTGLKSEQFRRTS
NETDDVEHVDRYEVVLGNRENYILHFTLAIFSFVLFGLVPPLVYGFSFTKSNDKDLKLAAVAGASLLCIILLALGKAYIQRPNRWDVYFKMVASYVVIAAGAGGFSYLAG
DLLDKVIKKYGWFEENPAFNLISLPLPEMSLAKPAWGSS