CuGenDBv2

Tan0017199 (gene) of Snake gourd v1 genome

Gene ID: Tan0017199
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Genome location: LG07:5492008..5494510
RNA-Seq Expression: Tan0017199
Synteny: Tan0017199
Gene Ontology terms: NA
InterPro domains: IPR024752 - Myb/SANT-like domain; IPR027806 - Harbinger transposase-derived nuclease domain
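
The genome location string LG07:5492008..5494510 can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); the names `GenomeLocation` and `parse_location` are hypothetical and not part of CuGenDB.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A location of the form 'SEQID:START..END' (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a string such as 'LG07:5492008..5494510'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return GenomeLocation(seqid, start, end)


loc = parse_location("LG07:5492008..5494510")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2503 bp on LG07.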


Homology
GenBank top hits (e value, % identity, alignment)
ADN33754.1 retrotransposon protein [Cucumis melo subsp. melo]; e value: 1.3e-129; % identity: 42.77
Query:  LIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLM-------VYITSV
        +I+ESDL C +STRMDRR F +LC +LR    L +TE VDVEEM+AMFLH+++HDVKNR+I++ F RSGET+SRHFN VL A+L+L        V +TS 
Subjt:  LIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLM-------VYITSV

Query:  VDA---FAIQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEG
         +       +NCLGALDGTYIKVNV   +RP +RTRKGEI T+VL VC+  G+F++V+ GWEGSAADSR+LRDAIS+ NGL+VPKGYYYLCD GYPNAEG
Subjt:  VDA---FAIQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEG

Query:  FLVPYRGERYHLSEWRGMGSS------------------------IAMENW------------------------------------------KEEETSS
        FL PY+G+RYHL EWRG  ++                        +    W                                          + + T +
Subjt:  FLVPYRGERYHLSEWRGMGSS------------------------IAMENW------------------------------------------KEEETSS

Query:  SSSNGDHINFIETSSEWNQQRDEMAERMFTETMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATTTID
        +++  + I +IET++EW+Q RD++A  MFT+            W +F       C + +V  G W+SDNGTFRPGYL+ L+R++ EK+  C ++ATT ID
Subjt:  SSSNGDHINFIETSSEWNQQRDEMAERMFTETMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATTTID

Query:  YRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPGAKGLRCNSFPHFDELSIVFGKDRATGADAETPDDMASNDAMPLGADDEMNI
         R+K LK+ + AI+EMLGP CSGFGWN+E K +  EK++FD WV++ P AKGL  N FP++DEL+ VFG+DRATG  AET  D+ SN+  P G  D  ++
Subjt:  YRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPGAKGLRCNSFPHFDELSIVFGKDRATGADAETPDDMASNDAMPLGADDEMNI

Query:  SQEPYEQNPETPTSGV---------------QRSATTSRGSKRKRTSYQSEMLDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEVAYSLCQLTPALS
          +  E  P   + GV                   T S GSKRKR S +   ++ +  A+D  N QL +IA W   N A ++  R E  + + +  P L+
Subjt:  SQEPYEQNPETPTSGV---------------QRSATTSRGSKRKRTSYQSEMLDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEVAYSLCQLTPALS

Query:  RQERVRLMDILFADAFKPSSFLAVPVDDR
          +R  L   L +       F+ +P D+R
Subjt:  RQERVRLMDILFADAFKPSSFLAVPVDDR

ADN34114.1 retrotransposon protein [Cucumis melo subsp. melo]; e value: 3.6e-140; % identity: 44.58
Query:  MENRELITILTVITASQRQTLQLLDILINNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMF
        M+  EL +I+    ASQRQ L +L++L N+ +RI H     RH+IRQL  FR+I+                         T   L +TE VDVEEM+AMF
Subjt:  MENRELITILTVITASQRQTLQLLDILINNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMF

Query:  LHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLMVYITSVVDAFA----------IQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVC
        LHI++HDVK+R+I+R F RSGET+SRHFN VL A+++L   +                   +NCLGALDGTYIKVNV  ++R RYRTRKGE+ T+VL VC
Subjt:  LHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLMVYITSVVDAFA----------IQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVC

Query:  NPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRG-------------------------------
        +  G+F++V+ GWEGSAADSR+LRDA+SRPN LKVPKGYYYL DVGYPNAEGFL PYRG+RYHL EWRG                               
Subjt:  NPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRG-------------------------------

Query:  ----------------------------------MGSSIAMENWKEEETSSSSSNGDHINFIETSSEWNQQRDEMAERMFTETMTGVERMPKHTWTRFED
                                          M +    +N  E +++ +++  D I++IETS+EW+Q RD +AE    E MT   R+PKHTWT+ E+
Subjt:  ----------------------------------MGSSIAMENWKEEETSSSSSNGDHINFIETSSEWNQQRDEMAERMFTETMTGVERMPKHTWTRFED

Query:  AKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPG
        A LVECLV +V+ G WRSDNGTFRPGYL+ L R++  KIP   I A +TID R+K +K+ + A++EM GP CSGFGWN+E K +  EK+VFD W  +HP 
Subjt:  AKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPG

Query:  AKGLRCNSFPHFDELSIVFGKDRATGADAETPDDMASND--AMPLGADDEMNISQEPYEQNP----------ETPTSGVQRSATTSRGSKRKRTSYQSEM
        AKGL   SF H+DELS VFGKDRATG  AE+  D+ SND      GA D M  +  P   +P          ET T+ V      S GSKRKR  + ++ 
Subjt:  AKGLRCNSFPHFDELSIVFGKDRATGADAETPDDMASND--AMPLGADDEMNISQEPYEQNP----------ETPTSGVQRSATTSRGSKRKRTSYQSEM

Query:  LDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEVAYSLCQLTPALSRQERVRLMDILFADAFKPSSFLAVP
         D+VRTA++  N QL RIA W +         R+E+   L +  P L+  +R RLM IL  +     +FL VP
Subjt:  LDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEVAYSLCQLTPALSRQERVRLMDILFADAFKPSSFLAVP

KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa]; e value: 1.1e-144; % identity: 47.41
Query:  MENRELITILTVITASQRQTLQLLDILINNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMF
        M+  EL +I+    ASQRQ L +L++L N+ +RI H     RH+IRQL  FR+I+ SDL C +STRMDRRCF +LC +LRT   L +TE VDVEEM+AMF
Subjt:  MENRELITILTVITASQRQTLQLLDILINNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMF

Query:  LHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLMVYITSVVDAFA----------IQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVC
        LHI++HDVKNR+I+R F RSGET+SRHFN VL A+++L   +                   +NCLGALDGTYIKVNV  ++R RYRTRKGE+ T+VL V 
Subjt:  LHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLMVYITSVVDAFA----------IQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVC

Query:  NPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRGMGSSIAMENWKEEETSSSSSNGDHINFIETS
        +  G+F++V+ GWEGSAADSR+LRDA+SRPN LKVPKGYYYL D GYPNAEGFL PYRG+RYHL EWRG  ++ +          SS+ N     F    
Subjt:  NPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRGMGSSIAMENWKEEETSSSSSNGDHINFIETS

Query:  SEWNQQRDE--------------------MAERMFTE--------TMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLRE
          W   R +                    +  R  T         +MT   R+PKHTWT+ E+A LVE    +V+ G WRSDNGTFRPGYL+ L R++  
Subjt:  SEWNQQRDE--------------------MAERMFTE--------TMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLRE

Query:  KIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPGAKGLRCNSFPHFDELSIVFGKDRATGADAETPDDMAS
        KIP C I A +TID R+K +K+ + A++EM GP CSGFGWN+E K +  EK+VFD W  +HP AKGL   SF H+DELS VFGKDRATG  AE+  D+ S
Subjt:  KIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPGAKGLRCNSFPHFDELSIVFGKDRATGADAETPDDMAS

Query:  N----------DAMP---LGADDEMNISQEPYEQNPETPTSGVQRSATTSRGSKRKRTSYQSEMLDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEV
        N          DA+P         + ++  P +   ET T+ V      S GSKRKR  + ++  D+VRTA++  N QL RIA W +         R+E+
Subjt:  N----------DAMP---LGADDEMNISQEPYEQNPETPTSGVQRSATTSRGSKRKRTSYQSEMLDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEV

Query:  AYSLCQLTPALSRQERVRLMDILFADAFKPSSFLAVP
           L +  P L+  +R RLM IL  +     +FL VP
Subjt:  AYSLCQLTPALSRQERVRLMDILFADAFKPSSFLAVP

KAA0036474.1 retrotransposon protein [Cucumis melo var. makuwa]; e value: 1.2e-124; % identity: 51.12
Query:  INNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRH
        +NN++R+ H  P  RH+IR+L  FR+I+ESDL C +STRMDRR F +LC +LR    L +TE VDVEEM+AMFLHI +HDVKNR+I+R F RSGET+SRH
Subjt:  INNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRH

Query:  FNAVLFAILQLM-------VYITSVVDA---FAIQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAI
        FN VL A+L+L        V +TS  +       +NCLGALDGTYIKVNV   +RP +RTRKGEI T+VL VC+  G+F++V+ GW+GSAADSR+LRDAI
Subjt:  FNAVLFAILQLM-------VYITSVVDA---FAIQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAI

Query:  SRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRGMGSS------------------------IAMENW-------------------KEEET
        SR NGL+VPKGYYYLCD GYPNAEGFL PYRG+RYHL EWRG  ++                        +    W                   + + T
Subjt:  SRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRGMGSS------------------------IAMENW-------------------KEEET

Query:  SSSSSNGDHINFIETSSEWNQQRDEMAERMFTE-TMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATT
         ++++  + I +IET++EW+Q RD++A  MF +  M+   R P+H WTR E+  LVECL+ +V  G W+SDNGTFR GYL+ L+R++ EK+  C ++ATT
Subjt:  SSSSSNGDHINFIETSSEWNQQRDEMAERMFTE-TMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATT

Query:  TIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVK
         ID R+K LK+ + AI+EM GP CSGFGWN+E K +  EK++FD WV+
Subjt:  TIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVK

KAA0045638.1 retrotransposon protein [Cucumis melo var. makuwa]; e value: 6.4e-105; % identity: 40.55
Query:  INNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRH
        +NN++R+ H  P  RH+IR+L  FR+I+ESDL C +STRMDRR F +LC +L     L +TE VDVEEM+AMFLH+++HDVKN +I+R F          
Subjt:  INNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRH

Query:  FNAVLFAILQLMVYITSVVDAFAIQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPK
                                 NCLGALDGTYIKVNV   +RP +RTRKGEI T+VL VC+  G+F++V+ GWEGSAADSR+LR             
Subjt:  FNAVLFAILQLMVYITSVVDAFAIQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPK

Query:  GYYYLCDVGYPNAEGFLVPYRGERYHLSEWRGMGSSIAMENWKEEETSSSSSNGDHINFIETSSEWNQQRDEMAERMFTETMTGVERMPKHTWTRFEDAK
         YYYLCD GYPNAEGFLV  RG+RYHL EWRG  +  A  N KE              +  T                           KH+ TR     
Subjt:  GYYYLCDVGYPNAEGFLVPYRGERYHLSEWRGMGSSIAMENWKEEETSSSSSNGDHINFIETSSEWNQQRDEMAERMFTETMTGVERMPKHTWTRFEDAK

Query:  LVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPGAK
        ++EC   ++       DN TFRPGYL+ L+R++ EK+P C ++ATT ID R+K LK+ + AI+EM GP CSG GWN+E K +  +K++FD WV++HP AK
Subjt:  LVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPGAK

Query:  GLRCNSFPHFDELSIVFGKDRATGADAETPDDMASNDAMPLGADDEMNISQ------EPYEQNPETPTSGVQRS--------ATTSRGSKRKRTSYQSEM
        GL   SF ++DEL+ VFG++R     AET  D+ SN+  P G  D  ++          Y Q  +     V+ S         T S GSKRKR S Q   
Subjt:  GLRCNSFPHFDELSIVFGKDRATGADAETPDDMASNDAMPLGADDEMNISQ------EPYEQNPETPTSGVQRS--------ATTSRGSKRKRTSYQSEM

Query:  LDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEVAYSLCQLTPALSRQERVRLMDILFADAFKPSSFLAVPVDDR
        ++ +  A+D  N QL +IA W   N A ++  R E  + + +  P L+  +R  L   L +       F+ +P D+R
Subjt:  LDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEVAYSLCQLTPALSRQERVRLMDILFADAFKPSSFLAVPVDDR

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SWD8 Retrotransposon protein; e value: 5.3e-145; % identity: 47.41
Query:  MENRELITILTVITASQRQTLQLLDILINNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMF
        M+  EL +I+    ASQRQ L +L++L N+ +RI H     RH+IRQL  FR+I+ SDL C +STRMDRRCF +LC +LRT   L +TE VDVEEM+AMF
Subjt:  MENRELITILTVITASQRQTLQLLDILINNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMF

Query:  LHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLMVYITSVVDAFA----------IQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVC
        LHI++HDVKNR+I+R F RSGET+SRHFN VL A+++L   +                   +NCLGALDGTYIKVNV  ++R RYRTRKGE+ T+VL V 
Subjt:  LHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLMVYITSVVDAFA----------IQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVC

Query:  NPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRGMGSSIAMENWKEEETSSSSSNGDHINFIETS
        +  G+F++V+ GWEGSAADSR+LRDA+SRPN LKVPKGYYYL D GYPNAEGFL PYRG+RYHL EWRG  ++ +          SS+ N     F    
Subjt:  NPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRGMGSSIAMENWKEEETSSSSSNGDHINFIETS

Query:  SEWNQQRDE--------------------MAERMFTE--------TMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLRE
          W   R +                    +  R  T         +MT   R+PKHTWT+ E+A LVE    +V+ G WRSDNGTFRPGYL+ L R++  
Subjt:  SEWNQQRDE--------------------MAERMFTE--------TMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLRE

Query:  KIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPGAKGLRCNSFPHFDELSIVFGKDRATGADAETPDDMAS
        KIP C I A +TID R+K +K+ + A++EM GP CSGFGWN+E K +  EK+VFD W  +HP AKGL   SF H+DELS VFGKDRATG  AE+  D+ S
Subjt:  KIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPGAKGLRCNSFPHFDELSIVFGKDRATGADAETPDDMAS

Query:  N----------DAMP---LGADDEMNISQEPYEQNPETPTSGVQRSATTSRGSKRKRTSYQSEMLDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEV
        N          DA+P         + ++  P +   ET T+ V      S GSKRKR  + ++  D+VRTA++  N QL RIA W +         R+E+
Subjt:  N----------DAMP---LGADDEMNISQEPYEQNPETPTSGVQRSATTSRGSKRKRTSYQSEMLDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEV

Query:  AYSLCQLTPALSRQERVRLMDILFADAFKPSSFLAVP
           L +  P L+  +R RLM IL  +     +FL VP
Subjt:  AYSLCQLTPALSRQERVRLMDILFADAFKPSSFLAVP

A0A5A7SYW1 Retrotransposon protein; e value: 6.0e-125; % identity: 51.12
Query:  INNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRH
        +NN++R+ H  P  RH+IR+L  FR+I+ESDL C +STRMDRR F +LC +LR    L +TE VDVEEM+AMFLHI +HDVKNR+I+R F RSGET+SRH
Subjt:  INNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRH

Query:  FNAVLFAILQLM-------VYITSVVDA---FAIQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAI
        FN VL A+L+L        V +TS  +       +NCLGALDGTYIKVNV   +RP +RTRKGEI T+VL VC+  G+F++V+ GW+GSAADSR+LRDAI
Subjt:  FNAVLFAILQLM-------VYITSVVDA---FAIQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAI

Query:  SRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRGMGSS------------------------IAMENW-------------------KEEET
        SR NGL+VPKGYYYLCD GYPNAEGFL PYRG+RYHL EWRG  ++                        +    W                   + + T
Subjt:  SRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRGMGSS------------------------IAMENW-------------------KEEET

Query:  SSSSSNGDHINFIETSSEWNQQRDEMAERMFTE-TMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATT
         ++++  + I +IET++EW+Q RD++A  MF +  M+   R P+H WTR E+  LVECL+ +V  G W+SDNGTFR GYL+ L+R++ EK+  C ++ATT
Subjt:  SSSSSNGDHINFIETSSEWNQQRDEMAERMFTE-TMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATT

Query:  TIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVK
         ID R+K LK+ + AI+EM GP CSGFGWN+E K +  EK++FD WV+
Subjt:  TIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVK

A0A803QNC5 Uncharacterized protein; e value: 3.7e-114; % identity: 45.2
Query:  MDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLMVYITSVVDAFA----------IQNCLG
        MDRR F +LC  L+TTG L+ ++ VDVEEM+A+FLHII+HDVKNRI+RR FARSGET+SRHFN VL A+L L   +     A             +NCLG
Subjt:  MDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLMVYITSVVDAFA----------IQNCLG

Query:  ALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSE
        ALDGTYIKVNV  +NRPRYRTRK EI T+VL V +   +FI+V+PGWEGSAADSRVLRDAI R NG KVP+GYYYLCD GYPN EGFL PYRG+RYHL++
Subjt:  ALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSE

Query:  WRGMGSSIAMENWKEEETSSSSSNGDHINFIETSSEWN------------QQRDEMAERMFTETMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSD
        W    +S   E +      SS+ N     F      W             Q R  + + M   + +      KH WT  +D+KLVECLV M + G W++D
Subjt:  WRGMGSSIAMENWKEEETSSSSSNGDHINFIETSSEWN------------QQRDEMAERMFTETMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSD

Query:  NGTFRPGYLSHLLRLLREKIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPGAKGLRCNSFPHFDELSIVF
        NGTF+PGYL  L +++ ++IPN  I+A   ID R+K LK++Y AIS+MLGP  SGFGWNE+ K V  +K VFD WVK+HP AKGL    FP++DEL+IV+
Subjt:  NGTFRPGYLSHLLRLLREKIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPGAKGLRCNSFPHFDELSIVF

Query:  GKDRATGADA----ETPDDMA-------SNDAMPLGADDEMNISQEPYEQNPETPTSGVQRSATTSRGSKRKRTSYQSEMLDVVRTAMDMQNSQLERIAS
        GKDRATG  A    ET D++A       ++D  P    DEMN +      N   P+S   R A     +        S+ +    T     +  ++++A 
Subjt:  GKDRATGADA----ETPDDMA-------SNDAMPLGADDEMNISQEPYEQNPETPTSGVQRSATTSRGSKRKRTSYQSEMLDVVRTAMDMQNSQLERIAS

Query:  WLVSNYALEDSRRKEVAYSLCQLTPALSRQERVRLMDILFAD
             +  + + R+   Y   +    L+  +R+++  +L ++
Subjt:  WLVSNYALEDSRRKEVAYSLCQLTPALSRQERVRLMDILFAD

E5GBB2 Retrotransposon protein; e value: 6.2e-130; % identity: 42.77
Query:  LIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLM-------VYITSV
        +I+ESDL C +STRMDRR F +LC +LR    L +TE VDVEEM+AMFLH+++HDVKNR+I++ F RSGET+SRHFN VL A+L+L        V +TS 
Subjt:  LIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLM-------VYITSV

Query:  VDA---FAIQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEG
         +       +NCLGALDGTYIKVNV   +RP +RTRKGEI T+VL VC+  G+F++V+ GWEGSAADSR+LRDAIS+ NGL+VPKGYYYLCD GYPNAEG
Subjt:  VDA---FAIQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEG

Query:  FLVPYRGERYHLSEWRGMGSS------------------------IAMENW------------------------------------------KEEETSS
        FL PY+G+RYHL EWRG  ++                        +    W                                          + + T +
Subjt:  FLVPYRGERYHLSEWRGMGSS------------------------IAMENW------------------------------------------KEEETSS

Query:  SSSNGDHINFIETSSEWNQQRDEMAERMFTETMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATTTID
        +++  + I +IET++EW+Q RD++A  MFT+            W +F       C + +V  G W+SDNGTFRPGYL+ L+R++ EK+  C ++ATT ID
Subjt:  SSSNGDHINFIETSSEWNQQRDEMAERMFTETMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATTTID

Query:  YRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPGAKGLRCNSFPHFDELSIVFGKDRATGADAETPDDMASNDAMPLGADDEMNI
         R+K LK+ + AI+EMLGP CSGFGWN+E K +  EK++FD WV++ P AKGL  N FP++DEL+ VFG+DRATG  AET  D+ SN+  P G  D  ++
Subjt:  YRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPGAKGLRCNSFPHFDELSIVFGKDRATGADAETPDDMASNDAMPLGADDEMNI

Query:  SQEPYEQNPETPTSGV---------------QRSATTSRGSKRKRTSYQSEMLDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEVAYSLCQLTPALS
          +  E  P   + GV                   T S GSKRKR S +   ++ +  A+D  N QL +IA W   N A ++  R E  + + +  P L+
Subjt:  SQEPYEQNPETPTSGV---------------QRSATTSRGSKRKRTSYQSEMLDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEVAYSLCQLTPALS

Query:  RQERVRLMDILFADAFKPSSFLAVPVDDR
          +R  L   L +       F+ +P D+R
Subjt:  RQERVRLMDILFADAFKPSSFLAVPVDDR

E5GCB5 Retrotransposon protein; e value: 1.7e-140; % identity: 44.58
Query:  MENRELITILTVITASQRQTLQLLDILINNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMF
        M+  EL +I+    ASQRQ L +L++L N+ +RI H     RH+IRQL  FR+I+                         T   L +TE VDVEEM+AMF
Subjt:  MENRELITILTVITASQRQTLQLLDILINNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMF

Query:  LHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLMVYITSVVDAFA----------IQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVC
        LHI++HDVK+R+I+R F RSGET+SRHFN VL A+++L   +                   +NCLGALDGTYIKVNV  ++R RYRTRKGE+ T+VL VC
Subjt:  LHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLMVYITSVVDAFA----------IQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVC

Query:  NPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRG-------------------------------
        +  G+F++V+ GWEGSAADSR+LRDA+SRPN LKVPKGYYYL DVGYPNAEGFL PYRG+RYHL EWRG                               
Subjt:  NPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRG-------------------------------

Query:  ----------------------------------MGSSIAMENWKEEETSSSSSNGDHINFIETSSEWNQQRDEMAERMFTETMTGVERMPKHTWTRFED
                                          M +    +N  E +++ +++  D I++IETS+EW+Q RD +AE    E MT   R+PKHTWT+ E+
Subjt:  ----------------------------------MGSSIAMENWKEEETSSSSSNGDHINFIETSSEWNQQRDEMAERMFTETMTGVERMPKHTWTRFED

Query:  AKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPG
        A LVECLV +V+ G WRSDNGTFRPGYL+ L R++  KIP   I A +TID R+K +K+ + A++EM GP CSGFGWN+E K +  EK+VFD W  +HP 
Subjt:  AKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPG

Query:  AKGLRCNSFPHFDELSIVFGKDRATGADAETPDDMASND--AMPLGADDEMNISQEPYEQNP----------ETPTSGVQRSATTSRGSKRKRTSYQSEM
        AKGL   SF H+DELS VFGKDRATG  AE+  D+ SND      GA D M  +  P   +P          ET T+ V      S GSKRKR  + ++ 
Subjt:  AKGLRCNSFPHFDELSIVFGKDRATGADAETPDDMASND--AMPLGADDEMNISQEPYEQNP----------ETPTSGVQRSATTSRGSKRKRTSYQSEM

Query:  LDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEVAYSLCQLTPALSRQERVRLMDILFADAFKPSSFLAVP
         D+VRTA++  N QL RIA W +         R+E+   L +  P L+  +R RLM IL  +     +FL VP
Subjt:  LDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEVAYSLCQLTPALSRQERVRLMDILFADAFKPSSFLAVP

SwissProt top hits (e value, % identity, alignment)
O82368 Uncharacterized protein At2g29880; e value: 3.1e-09; % identity: 26.56
Query:  ETMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEF
        ET    ++ P  +W+  E  +L   LV  +  G WR  NGT     +   +  L  K   C    T  +  R+K++KK+Y+  + +     SGFGW+   
Subjt:  ETMTGVERMPKHTWTRFEDAKLVECLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEF

Query:  KYVEVEKDVFDAWVKAHPGAKGLRCNSFPHFDELSIVFGKDRATGADA------------ETPDDMASND---AMPLGADDEMNISQEPYEQNPETPTSG
        K      DV+ A++  HP    +R ++F  F++L ++F    A G +A            E  DD+ + D    M +  DDE+N    P E+ P    S 
Subjt:  KYVEVEKDVFDAWVKAHPGAKGLRCNSFPHFDELSIVFGKDRATGADA------------ETPDDMASND---AMPLGADDEMNISQEPYEQNPETPTSG

Query:  VQRSATTSRGSKRKRTSYQ--SEMLDVVRTAMDMQNSQLER
          R+   S       +S +  SEM+ V    +++   + ER
Subjt:  VQRSATTSRGSKRKRTSYQ--SEMLDVVRTAMDMQNSQLER

Arabidopsis top hits (e value, % identity, alignment)
AT1G43722.1 unknown protein; e value: 2.9e-23; % identity: 31.17
Query:  VFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAI-LQLMVYITSVV--
        ++R + +    C++  RM   CF  LC+ML+T   L+ T  + +EE +AMFL I  H+   R +   F R+ ET+ R F  VL A  L    YI +    
Subjt:  VFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAI-LQLMVYITSVV--

Query:  DAFAIQNCL--------------GALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKG-YYY
        + + I   L              GA+DGT++ V V    +  Y  R      +++ +C+    F ++  G  GS  D+ VL+ A    +   +P    YY
Subjt:  DAFAIQNCL--------------GALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKG-YYY

Query:  LCDVGYPNAEGFLVPYRGE-----RYHLSEW
        L D GYPN +G L PYR       RYH+S++
Subjt:  LCDVGYPNAEGFLVPYRGE-----RYHLSEW

AT5G28730.1 unknown protein; e value: 2.3e-15; % identity: 32.34
Query:  IYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLMV-YI--TSVVDAFA
        IY +++ C    RM    F  LC +L     L+++  + ++E +A+FL I + +   R I   F  + ET+ R F+ VL A+ +L V YI    V +  A
Subjt:  IYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLMV-YI--TSVVDAFA

Query:  IQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKV-PKGYYYLCDVGYPNAEGFLVPYRG
        I N          ++   T   P      G    +VL +C+    F +   G  GS  D+RVL  AIS      V P   YYL D GY N  G+L PYR 
Subjt:  IQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKV-PKGYYYLCDVGYPNAEGFLVPYRG

Query:  E
        E
Subjt:  E

AT5G28950.1 unknown protein; e value: 5.0e-15; % identity: 35.66
Query:  QNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISR-PNGLKVP---KGYYYLCDVGYPNAEGFLVPY
        ++C+GA+D T+I   VS    P +R RKG+I  ++L  CN   EF++V+ GWEGSA DS+VL DA++R  N L VP   +    + +    N +  L   
Subjt:  QNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISR-PNGLKVP---KGYYYLCDVGYPNAEGFLVPY

Query:  RGERYHLSEWRGMGSSIAMENWKEEETSS
          +R + ++WR    +IA   W     +S
Subjt:  RGERYHLSEWRGMGSSIAMENWKEEETSS

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); e value: 3.4e-11; % identity: 50
Query:  FIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRG
        FI+V+ GWEGSA DSRVL DA+ +          +YL D G+ N   FL P+RG RYHL E+ G
Subjt:  FIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRG

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); e value: 8.3e-34; % identity: 36.54
Query:  VFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLMVYI-------
        V++++   +  C E+ RMD+  F  LC +L+T G L  T  + +E  +A+FL II H+++ R ++  F  SGET+SRHFN VL A++ +           
Subjt:  VFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKNRIIRRHFARSGETMSRHFNAVLFAILQLMVYI-------

Query:  -TSVVDAFAIQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAE
         T   D    ++C+G +D  +I V V    +  +R   G +  +VL   +    F +V+ GWEGSA+D +VL  A++R N L+VP+G YY+ D  YPN  
Subjt:  -TSVVDAFAIQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAE

Query:  GFLVPYRG
        GF+ PY G
Subjt:  GFLVPYRG
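
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a blank otherwise. The % identity column can be approximated from that line. The sketch below assumes identity is the number of identical columns divided by the alignment length (gaps included), which is the usual BLAST convention; the function name `percent_identity` is hypothetical, and the example strings are copied from the AT5G35695.1 hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity over the alignment length, read from a BLAST-style midline.

    Assumption: letters in the midline mark identical columns; '+' and
    blanks mark non-identical columns; the alignment length is len(query).
    """
    if not query:
        raise ValueError("empty alignment")
    # Trailing blanks of the midline are often trimmed when text is copied.
    midline = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


query = "FIFVMPGWEGSAADSRVLRDAISRPNGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRG"
midline = "FI+V+ GWEGSA DSRVL DA+ +          +YL D G+ N   FL P+RG RYHL E+ G"
identity = percent_identity(query, midline)
print(f"{identity:.2f}")  # 50.00 for this pair (32 identical columns of 64)
```

For this hit the result agrees with the listed value of 50; longer alignments that wrap across several rows would need their rows concatenated before counting.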


Sequences
CDS sequence
ATGGAAAATCGTGAGTTGATCACAATATTGACAGTAATTACGGCCTCCCAACGTCAAACATTACAACTGCTAGATATATTAATTAACAACCATCGTAGAATAGAGCACCA
GTCGCCATACCTTAGACATCAAATTAGGCAACTAACAGTATTCCGCTTGATCTATGAAAGTGACTTATGTTGCATTGAGAGCACGAGGATGGATAGAAGGTGTTTTGTTG
TCCTTTGTAGTATGTTGAGGACAACAGGTCGTTTGGAGGCGACGGAATATGTAGATGTCGAGGAGATGATTGCCATGTTCCTACATATCATATCACATGATGTTAAGAAT
CGAATCATTCGTAGACATTTTGCAAGGTCCGGTGAGACAATGTCAAGACACTTTAATGCTGTACTTTTTGCAATATTGCAACTAATGGTGTACATAACTTCCGTAGTGGA
TGCATTTGCAATACAGAATTGTTTGGGTGCGTTGGATGGCACGTACATCAAGGTGAATGTCAGTACCGCAAATCGACCGAGATATAGGACGCGTAAGGGTGAGATCGTGA
CCGATGTCCTTGTTGTTTGCAATCCGAGTGGTGAATTCATATTCGTAATGCCAGGATGGGAAGGGTCTGCAGCTGATTCTCGAGTTCTTCGAGATGCAATATCAAGACCT
AATGGCTTGAAGGTTCCCAAGGGATACTATTATCTTTGTGATGTTGGGTACCCAAATGCAGAAGGATTCTTGGTACCATATAGAGGGGAGCGATACCACCTTTCTGAATG
GCGTGGGATGGGCTCGAGTATAGCAATGGAAAACTGGAAAGAGGAAGAAACCTCCTCAAGTTCCTCTAATGGAGATCATATAAATTTCATTGAAACTTCTAGTGAATGGA
ACCAACAACGAGATGAAATGGCTGAACGAATGTTCACTGAGACAATGACAGGTGTGGAGCGAATGCCAAAACACACATGGACTAGATTCGAGGATGCCAAGTTGGTAGAA
TGTCTCGTTGCAATGGTCCACGAGGGATGCTGGAGATCAGACAATGGAACATTCCGACCCGGCTACCTATCGCATCTGTTACGGTTGCTTAGAGAAAAAATTCCAAATTG
TGCAATTCAAGCAACAACCACTATTGACTACAGAGTGAAGAACTTGAAGAAGAAGTACGCAGCCATCTCGGAGATGCTAGGTCCTGGGTGCAGTGGCTTTGGTTGGAATG
AAGAGTTCAAATATGTGGAGGTTGAGAAGGACGTGTTTGACGCATGGGTCAAGGCTCATCCTGGAGCAAAGGGGCTACGATGTAACTCATTCCCCCATTTTGACGAGTTG
TCAATCGTCTTTGGAAAAGATCGCGCGACTGGGGCTGATGCAGAGACCCCAGATGACATGGCATCTAACGATGCCATGCCACTAGGGGCAGATGATGAGATGAACATATC
ACAAGAACCTTATGAACAAAATCCAGAGACGCCAACTAGTGGAGTTCAAAGATCAGCCACGACATCGCGTGGAAGCAAGCGAAAACGTACATCTTATCAATCTGAAATGC
TAGATGTTGTACGCACTGCGATGGACATGCAAAATTCCCAGCTGGAGCGGATTGCATCATGGCTTGTATCAAACTATGCTCTCGAGGACAGTCGACGCAAAGAAGTAGCA
TACAGTCTTTGCCAGTTGACACCAGCACTATCTAGACAAGAAAGAGTAAGATTGATGGACATATTATTCGCTGATGCATTTAAGCCCAGCAGCTTCTTGGCAGTCCCGGT
GGACGATAGATGGGAGTAA
mRNA sequence
ATGGAAAATCGTGAGTTGATCACAATATTGACAGTAATTACGGCCTCCCAACGTCAAACATTACAACTGCTAGATATATTAATTAACAACCATCGTAGAATAGAGCACCA
GTCGCCATACCTTAGACATCAAATTAGGCAACTAACAGTATTCCGCTTGATCTATGAAAGTGACTTATGTTGCATTGAGAGCACGAGGATGGATAGAAGGTGTTTTGTTG
TCCTTTGTAGTATGTTGAGGACAACAGGTCGTTTGGAGGCGACGGAATATGTAGATGTCGAGGAGATGATTGCCATGTTCCTACATATCATATCACATGATGTTAAGAAT
CGAATCATTCGTAGACATTTTGCAAGGTCCGGTGAGACAATGTCAAGACACTTTAATGCTGTACTTTTTGCAATATTGCAACTAATGGTGTACATAACTTCCGTAGTGGA
TGCATTTGCAATACAGAATTGTTTGGGTGCGTTGGATGGCACGTACATCAAGGTGAATGTCAGTACCGCAAATCGACCGAGATATAGGACGCGTAAGGGTGAGATCGTGA
CCGATGTCCTTGTTGTTTGCAATCCGAGTGGTGAATTCATATTCGTAATGCCAGGATGGGAAGGGTCTGCAGCTGATTCTCGAGTTCTTCGAGATGCAATATCAAGACCT
AATGGCTTGAAGGTTCCCAAGGGATACTATTATCTTTGTGATGTTGGGTACCCAAATGCAGAAGGATTCTTGGTACCATATAGAGGGGAGCGATACCACCTTTCTGAATG
GCGTGGGATGGGCTCGAGTATAGCAATGGAAAACTGGAAAGAGGAAGAAACCTCCTCAAGTTCCTCTAATGGAGATCATATAAATTTCATTGAAACTTCTAGTGAATGGA
ACCAACAACGAGATGAAATGGCTGAACGAATGTTCACTGAGACAATGACAGGTGTGGAGCGAATGCCAAAACACACATGGACTAGATTCGAGGATGCCAAGTTGGTAGAA
TGTCTCGTTGCAATGGTCCACGAGGGATGCTGGAGATCAGACAATGGAACATTCCGACCCGGCTACCTATCGCATCTGTTACGGTTGCTTAGAGAAAAAATTCCAAATTG
TGCAATTCAAGCAACAACCACTATTGACTACAGAGTGAAGAACTTGAAGAAGAAGTACGCAGCCATCTCGGAGATGCTAGGTCCTGGGTGCAGTGGCTTTGGTTGGAATG
AAGAGTTCAAATATGTGGAGGTTGAGAAGGACGTGTTTGACGCATGGGTCAAGGCTCATCCTGGAGCAAAGGGGCTACGATGTAACTCATTCCCCCATTTTGACGAGTTG
TCAATCGTCTTTGGAAAAGATCGCGCGACTGGGGCTGATGCAGAGACCCCAGATGACATGGCATCTAACGATGCCATGCCACTAGGGGCAGATGATGAGATGAACATATC
ACAAGAACCTTATGAACAAAATCCAGAGACGCCAACTAGTGGAGTTCAAAGATCAGCCACGACATCGCGTGGAAGCAAGCGAAAACGTACATCTTATCAATCTGAAATGC
TAGATGTTGTACGCACTGCGATGGACATGCAAAATTCCCAGCTGGAGCGGATTGCATCATGGCTTGTATCAAACTATGCTCTCGAGGACAGTCGACGCAAAGAAGTAGCA
TACAGTCTTTGCCAGTTGACACCAGCACTATCTAGACAAGAAAGAGTAAGATTGATGGACATATTATTCGCTGATGCATTTAAGCCCAGCAGCTTCTTGGCAGTCCCGGT
GGACGATAGATGGGAGTAA
Protein sequence
MENRELITILTVITASQRQTLQLLDILINNHRRIEHQSPYLRHQIRQLTVFRLIYESDLCCIESTRMDRRCFVVLCSMLRTTGRLEATEYVDVEEMIAMFLHIISHDVKN
RIIRRHFARSGETMSRHFNAVLFAILQLMVYITSVVDAFAIQNCLGALDGTYIKVNVSTANRPRYRTRKGEIVTDVLVVCNPSGEFIFVMPGWEGSAADSRVLRDAISRP
NGLKVPKGYYYLCDVGYPNAEGFLVPYRGERYHLSEWRGMGSSIAMENWKEEETSSSSSNGDHINFIETSSEWNQQRDEMAERMFTETMTGVERMPKHTWTRFEDAKLVE
CLVAMVHEGCWRSDNGTFRPGYLSHLLRLLREKIPNCAIQATTTIDYRVKNLKKKYAAISEMLGPGCSGFGWNEEFKYVEVEKDVFDAWVKAHPGAKGLRCNSFPHFDEL
SIVFGKDRATGADAETPDDMASNDAMPLGADDEMNISQEPYEQNPETPTSGVQRSATTSRGSKRKRTSYQSEMLDVVRTAMDMQNSQLERIASWLVSNYALEDSRRKEVA
YSLCQLTPALSRQERVRLMDILFADAFKPSSFLAVPVDDRWE
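
The protein sequence above appears to be the translation of the CDS under the standard genetic code. A minimal sketch of that translation follows, assuming the standard code and reading frame 1; the function `translate` and the table construction are illustrative and not taken from CuGenDB. Only the first 30 and last 21 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first; codons that are not
    in the table (for example, ones containing N) become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGAAAATCGTGAGTTGATCACAATATTG"))  # start of the CDS
print(translate("GTGGACGATAGATGGGAGTAA"))           # end of the CDS
```

The two fragments translate to MENRELITIL and VDDRWE, matching the start and end of the listed protein sequence.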