CuGenDBv2

Moc09g10510 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g10510
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr9:8909070..8913314
RNA-Seq Expression: Moc09g10510
Synteny: Moc09g10510
Gene Ontology terms: NA
InterPro domains: NA
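The genome location above follows a `chromosome:start..end` pattern. A minimal Python sketch of splitting it into its parts is given here; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError("unrecognised location string: %r" % location)
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr9:8909070..8913314")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 4,245 bp on chr9.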


Homology
GenBank top hits (e value, % identity, alignment)
KAA0056838.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 1.3e-47 | % identity: 23.16
Query:  ITESTKDRAFFISLHWSSIAWPKTIFSTLLSIPQNHKFFKETHIDDQTLWVEKTNNHKGT--LAEIAKLDINGGLTKLLIPIGEDRKGWKAFYRLLHEPI
        +TE  + ++F + +   ++AW +  F  LL       FF E  ++D  +WV KT N   T   AEI ++D  G    +L+P G D  GWK F  L    I
Subjt:  ITESTKDRAFFISLHWSSIAWPKTIFSTLLSIPQNHKFFKETHIDDQTLWVEKTNNHKGT--LAEIAKLDINGGLTKLLIPIGEDRKGWKAFYRLLHEPI

Query:  QQHPHAPT-HTRQNIPDQPIPSVIVGTQTYRDAITKGKQKLNPLPAPLPTQHRRKNLHPTNITEYEEADLQLG------------SSIIVVRRFFHDDWL
             APT   R  I  +P+ +      +  D+  K   K   + +       +K    T+          +G             ++I+ RR FHDDW 
Subjt:  QQHPHAPT-HTRQNIPDQPIPSVIVGTQTYRDAITKGKQKLNPLPAPLPTQHRRKNLHPTNITEYEEADLQLG------------SSIIVVRRFFHDDWL

Query:  KITRALQTHISTLCTLNPFTADKAL--LRCEDKKQGDALSQYKDWHQVGDTQLKFLAWSDLDPSHSSVVPSY--------------------------GG
        +I  +L+       +  PF ADKA+  L  +  K   +      W  VG+ Q+KF +W     S  SV+PSY                          GG
Subjt:  KITRALQTHISTLCTLNPFTADKAL--LRCEDKKQGDALSQYKDWHQVGDTQLKFLAWSDLDPSHSSVVPSY--------------------------GG

Query:  FIEIAKKSLSKLDLLEAKIRVKSNPFGFIPGEVGINTAGFGHFKVQIDPFHRTEHYLGFSVGIHGAF----------------------LAGIPSDP---
        F+++AK+++    L++AKI+V+ N  GF+P  + I      +F V           +  +V +HG+F                         IP +P   
Subjt:  FIEIAKKSLSKLDLLEAKIRVKSNPFGFIPGEVGINTAGFGHFKVQIDPFHRTEHYLGFSVGIHGAF----------------------LAGIPSDP---

Query:  --------------------PLGTTSEGLLGPCP-----------------------------------KLSFL----------EKDCRLLDERLSVST-
                                +SE    P                                     K+SFL            +     + L +ST 
Subjt:  --------------------PLGTTSEGLLGPCP-----------------------------------KLSFL----------EKDCRLLDERLSVST-

Query:  --------SPR----TCLDIHATKNPSASSISHSPSSPVPVTDPSATSAFPADNKP--PVTS-VPLHRSDPYQCISSPYPLSTEPPSPSASTYLI-----
                SPR    T L     K+P  S+  H+ S      + S       D  P  P+ S +    +     +++  P      + SA    +     
Subjt:  --------SPR----TCLDIHATKNPSASSISHSPSSPVPVTDPSATSAFPADNKP--PVTS-VPLHRSDPYQCISSPYPLSTEPPSPSASTYLI-----

Query:  -----DSYASKPFGEGDSQSALHMFDPSIHPSFYLQVVAPWLHTIGMGIFP-----IP--TKLPKKAASLQKEYRGIRELSG------LTTTVNYEKRQP
             +  AS+   EG+S+ A    +  I  +F  ++V  WL    + + P     +P  +  P   +    +  G   L        L     ++    
Subjt:  -----DSYASKPFGEGDSQSALHMFDPSIHPSFYLQVVAPWLHTIGMGIFP-----IP--TKLPKKAASLQKEYRGIRELSG------LTTTVNYEKRQP

Query:  KKG-------------------VYGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFN-------------------------------------------
        K G                   VYGP    ++   W EL  L SLC  +WL+ GDFN                                           
Subjt:  KKG-------------------VYGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFN-------------------------------------------

Query:  --------------------PNA---HVKR-LNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIML
                             NA   H  R L R  SDH+PI L     KWGP PFR  N+ +     Q     WW N+   G+PG+ FI  L  L   +
Subjt:  --------------------PNA---HVKR-LNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIML

Query:  KTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTTG
        K W           K  L  E+ I+D+LE +G ++T  H +R ++K+ LL +   +   W QR + +W L GD N S+FHRI    +R+N I  I    G
Subjt:  KTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTTG

Query:  ISLFDEKDIETEFLSFFDS
         SL    DI   F+S F +
Subjt:  ISLFDEKDIETEFLSFFDS

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 2.7e-48 | % identity: 23.35
Query:  ITESTKDRAFFISLHWSSIAWPKTIFSTLLSIPQNHKFFKETHIDDQTLWVEKTNNHKGT--LAEIAKLDINGGLTKLLIPIGEDRKGWKAFYRLLHEPI
        +TE  + ++F + +   ++AW +  F  LL       FF E  ++D  +WV KT N   T   AEI ++D  G    +L+P G D  GWK+F  L    I
Subjt:  ITESTKDRAFFISLHWSSIAWPKTIFSTLLSIPQNHKFFKETHIDDQTLWVEKTNNHKGT--LAEIAKLDINGGLTKLLIPIGEDRKGWKAFYRLLHEPI

Query:  QQHPHAPT-HTRQNIPDQPIPSVIVGTQTYRDAITKGKQKLNPLPAPLPTQHRRKNLHPTNITEYEEADLQLG------------SSIIVVRRFFHDDWL
             APT   R  I  +P+ +      +  D+  K   K   + +       +K    T+          +G             ++I+ RR FHDDW 
Subjt:  QQHPHAPT-HTRQNIPDQPIPSVIVGTQTYRDAITKGKQKLNPLPAPLPTQHRRKNLHPTNITEYEEADLQLG------------SSIIVVRRFFHDDWL

Query:  KITRALQTHISTLCTLNPFTADKAL--LRCEDKKQGDALSQYKDWHQVGDTQLKFLAWSDLDPSHSSVVPSY--------------------------GG
        +I  +L+       +  PF ADKA+  L  +  K   +      W  VG+ Q+KF +W     S  SV+PSY                          GG
Subjt:  KITRALQTHISTLCTLNPFTADKAL--LRCEDKKQGDALSQYKDWHQVGDTQLKFLAWSDLDPSHSSVVPSY--------------------------GG

Query:  FIEIAKKSLSKLDLLEAKIRVKSNPFGFIPGEVGINTAGFGHFKVQIDPFHRTEHYLGFSVGIHGAF----------------------LAGIPSDP---
        F+++AK+++    L++AKI+V+ N  GF+P  + I      +F V           +  +V +HG+F                         IP +P   
Subjt:  FIEIAKKSLSKLDLLEAKIRVKSNPFGFIPGEVGINTAGFGHFKVQIDPFHRTEHYLGFSVGIHGAF----------------------LAGIPSDP---

Query:  --------------------PLGTTSEGLLGPCP-----------------------------------KLSFL----------EKDCRLLDERLSVST-
                                +SE    P                                     K+SFL            +     + L +ST 
Subjt:  --------------------PLGTTSEGLLGPCP-----------------------------------KLSFL----------EKDCRLLDERLSVST-

Query:  --------SPR----TCLDIHATKNPSASSISHSPSSPVPVTDPSATSAFPADNKP------PVTSVPLHRSDPYQCISSPYPLSTEPPSPSASTYLI--
                SPR    T L     K+P  S+  H  S      + S       D  P       + S   H  D +    +P   S    S  A    +  
Subjt:  --------SPR----TCLDIHATKNPSASSISHSPSSPVPVTDPSATSAFPADNKP------PVTSVPLHRSDPYQCISSPYPLSTEPPSPSASTYLI--

Query:  ------DSYASKPFGEGDSQSALHMFDPSIHPSFYLQVVAPWLHTIGMGIFP-----IPTK--LPKKAASLQKEYRGIRELSG------LTTTVNYEKRQ
              +  AS+   EG+S+ A    +  I  +F  ++V  WL    + + P     +P+    P   +    +  G   L        L    N++   
Subjt:  ------DSYASKPFGEGDSQSALHMFDPSIHPSFYLQVVAPWLHTIGMGIFP-----IPTK--LPKKAASLQKEYRGIRELSG------LTTTVNYEKRQ

Query:  PKKG-------------------VYGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFN------------------------------------------
         K G                   VYGP    ++   W EL  L SLC  +WL+ GDFN                                          
Subjt:  PKKG-------------------VYGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFN------------------------------------------

Query:  ---------------------PNA---HVKR-LNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIM
                              NA   H  R L R  SDH+PI L     KWGP PFR  N+ +     Q     WW ++   G+PG+ FI  L  L   
Subjt:  ---------------------PNA---HVKR-LNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIM

Query:  LKTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTT
        +K W           K  L  E+ I+D+LE +G ++T  H +R ++K+ LL +   +   W QR + +W L GD N S+FHRI    +R+N I  I    
Subjt:  LKTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTT

Query:  GISLFDEKDIETEFLSFFDSKVFEEASGLKINHQKSEILGINLEDSELTNLASKYNCK
        G SL    DI   F+S F +   +E+          EIL  NL  + ++ L     CK
Subjt:  GISLFDEKDIETEFLSFFDSKVFEEASGLKINHQKSEILGINLEDSELTNLASKYNCK

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 8.0e-53 | % identity: 22.09
Query:  PQHITIERKRFTIAIDHRYRGSGVRITESTKDRAFFISLHWSSIAWPKTIFSTLLSIPQNHKFFKETHIDDQTLWVEKTNNHKGTLAEIAKLDINGGLTK
        P+   +ERK F + +D   + +   +TE+   +AF I +    + W +    +L++ P  ++FF ET   +Q +W+ KT N KG  AEI ++D     + 
Subjt:  PQHITIERKRFTIAIDHRYRGSGVRITESTKDRAFFISLHWSSIAWPKTIFSTLLSIPQNHKFFKETHIDDQTLWVEKTNNHKGTLAEIAKLDINGGLTK

Query:  LLIPIGEDRKGWKAFYRLLHEPIQ-QHPHAPTHTRQNIPDQPI-PSVIVGTQTYRDAITKGKQKLNPLPAPLPTQHRRKNLHPTNITEYEEADLQLGSSI
        +L+P G D+ GW +F  ++   ++ +    PT   +  PD  + P +    ++Y  A+T+G+       +         +    +  +   +DL L +++
Subjt:  LLIPIGEDRKGWKAFYRLLHEPIQ-QHPHAPTHTRQNIPDQPI-PSVIVGTQTYRDAITKGKQKLNPLPAPLPTQHRRKNLHPTNITEYEEADLQLGSSI

Query:  IVVRRFFHDDWLKITRALQTHISTLCTLNPFTADKALLRCEDKKQGDALSQYKDWHQVGDTQLKFLAWSDLDPSHSSVVPSYGGF---------------
        ++VRRFFHDDW KI + L+       T N F A+KAL+        + L Q K W  VG   ++F  WS +  +   ++PSYGG+               
Subjt:  IVVRRFFHDDWLKITRALQTHISTLCTLNPFTADKALLRCEDKKQGDALSQYKDWHQVGDTQLKFLAWSDLDPSHSSVVPSYGGF---------------

Query:  -----------IEIAKKSLSKLDLLEAKIRVKSNPFGFIPGEVGINTAGFGHFKVQIDPFHRTEHYLGFSVGIHGA-------------------FLAGI
                   I++A+++ S  +L+EA+I+V+ N  GF+P  V I       F VQ+      +  +  +V +HG                    F  G 
Subjt:  -----------IEIAKKSLSKLDLLEAKIRVKSNPFGFIPGEVGINTAGFGHFKVQIDPFHRTEHYLGFSVGIHGA-------------------FLAGI

Query:  PSDPP--LGTTSEGLLGPCPKL------------------SFLEKDC------------------------RLLD---ERLSVSTSPRTCLDIHATK---
         +  P  L T+S+G     P                    SFL ++                          +LD   +++ +   P + L++  +K   
Subjt:  PSDPP--LGTTSEGLLGPCPKL------------------SFLEKDC------------------------RLLD---ERLSVSTSPRTCLDIHATK---

Query:  ------------NPSASSISHSPSSPVP--------------------------------VTDPSATSAFPADNKPPVTSV-----PLHRSDPYQCISSP
                    NP ++  +HSPS   P                                +T P    A   D      S+      L   DP + +   
Subjt:  ------------NPSASSISHSPSSPVP--------------------------------VTDPSATSAFPADNKPPVTSV-----PLHRSDPYQCISSP

Query:  YPLSTEPPSPSASTYLIDS--YASKPFGEGDSQSALHMFDPSIH--------------------PSFYLQVVAPWLHTIGMGI-----------------
        +           +T ++        P  E  + S+   +    H                     +F  Q+V+ WL   G+ +                 
Subjt:  YPLSTEPPSPSASTYLIDS--YASKPFGEGDSQSALHMFDPSIH--------------------PSFYLQVVAPWLHTIGMGI-----------------

Query:  ------FPIPTK--------------LPKKAA---------------SLQKEYRGIRELSGLTTTVNYEKRQPKKGVYGPSNSEEKPAFWRELSDLSSLC
                I  K              + K A+               SL  +  G+  LS     +N        G+YGP    E+  FW EL +L  L 
Subjt:  ------FPIPTK--------------LPKKAA---------------SLQKEYRGIRELSGLTTTVNYEKRQPKKGVYGPSNSEEKPAFWRELSDLSSLC

Query:  NDSWLLGGDFN------------PNAHVKR-------------------------------------------------------LNRPTSDHYPIQLSF
        +  W+LGGD N             ++H  R                                                       L R TSDH+P+    
Subjt:  NDSWLLGGDFN------------PNAHVKR-------------------------------------------------------LNRPTSDHYPIQLSF

Query:  GMTK--WGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIMLKTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKT
           K  WGP PFR  +  +     +  +  WW+N+   G+PG  FI +LK L   +K W K    S+   K  +  E+  +D+ E +  L   +  +R  
Subjt:  GMTK--WGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIMLKTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKT

Query:  IKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTTGISLFDEKDIETEFLSFFDSKVFEEAS
        +K  L E+  KE   W QR K  WL EGD N+SFFHRI +++++R+ I EI    G        I T F+ FF S+++  ++
Subjt:  IKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTTGISLFDEKDIETEFLSFFDSKVFEEAS

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 2.7e-48 | % identity: 23.35
Query:  ITESTKDRAFFISLHWSSIAWPKTIFSTLLSIPQNHKFFKETHIDDQTLWVEKTNNHKGT--LAEIAKLDINGGLTKLLIPIGEDRKGWKAFYRLLHEPI
        +TE  + ++F + +   ++AW +  F  LL       FF E  ++D  +WV KT N   T   AEI ++D  G    +L+P G D  GWK+F  L    I
Subjt:  ITESTKDRAFFISLHWSSIAWPKTIFSTLLSIPQNHKFFKETHIDDQTLWVEKTNNHKGT--LAEIAKLDINGGLTKLLIPIGEDRKGWKAFYRLLHEPI

Query:  QQHPHAPT-HTRQNIPDQPIPSVIVGTQTYRDAITKGKQKLNPLPAPLPTQHRRKNLHPTNITEYEEADLQLG------------SSIIVVRRFFHDDWL
             APT   R  I  +P+ +      +  D+  K   K   + +       +K    T+          +G             ++I+ RR FHDDW 
Subjt:  QQHPHAPT-HTRQNIPDQPIPSVIVGTQTYRDAITKGKQKLNPLPAPLPTQHRRKNLHPTNITEYEEADLQLG------------SSIIVVRRFFHDDWL

Query:  KITRALQTHISTLCTLNPFTADKAL--LRCEDKKQGDALSQYKDWHQVGDTQLKFLAWSDLDPSHSSVVPSY--------------------------GG
        +I  +L+       +  PF ADKA+  L  +  K   +      W  VG+ Q+KF +W     S  SV+PSY                          GG
Subjt:  KITRALQTHISTLCTLNPFTADKAL--LRCEDKKQGDALSQYKDWHQVGDTQLKFLAWSDLDPSHSSVVPSY--------------------------GG

Query:  FIEIAKKSLSKLDLLEAKIRVKSNPFGFIPGEVGINTAGFGHFKVQIDPFHRTEHYLGFSVGIHGAF----------------------LAGIPSDP---
        F+++AK+++    L++AKI+V+ N  GF+P  + I      +F V           +  +V +HG+F                         IP +P   
Subjt:  FIEIAKKSLSKLDLLEAKIRVKSNPFGFIPGEVGINTAGFGHFKVQIDPFHRTEHYLGFSVGIHGAF----------------------LAGIPSDP---

Query:  --------------------PLGTTSEGLLGPCP-----------------------------------KLSFL----------EKDCRLLDERLSVST-
                                +SE    P                                     K+SFL            +     + L +ST 
Subjt:  --------------------PLGTTSEGLLGPCP-----------------------------------KLSFL----------EKDCRLLDERLSVST-

Query:  --------SPR----TCLDIHATKNPSASSISHSPSSPVPVTDPSATSAFPADNKP------PVTSVPLHRSDPYQCISSPYPLSTEPPSPSASTYLI--
                SPR    T L     K+P  S+  H  S      + S       D  P       + S   H  D +    +P   S    S  A    +  
Subjt:  --------SPR----TCLDIHATKNPSASSISHSPSSPVPVTDPSATSAFPADNKP------PVTSVPLHRSDPYQCISSPYPLSTEPPSPSASTYLI--

Query:  ------DSYASKPFGEGDSQSALHMFDPSIHPSFYLQVVAPWLHTIGMGIFP-----IPTK--LPKKAASLQKEYRGIRELSG------LTTTVNYEKRQ
              +  AS+   EG+S+ A    +  I  +F  ++V  WL    + + P     +P+    P   +    +  G   L        L    N++   
Subjt:  ------DSYASKPFGEGDSQSALHMFDPSIHPSFYLQVVAPWLHTIGMGIFP-----IPTK--LPKKAASLQKEYRGIRELSG------LTTTVNYEKRQ

Query:  PKKG-------------------VYGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFN------------------------------------------
         K G                   VYGP    ++   W EL  L SLC  +WL+ GDFN                                          
Subjt:  PKKG-------------------VYGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFN------------------------------------------

Query:  ---------------------PNA---HVKR-LNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIM
                              NA   H  R L R  SDH+PI L     KWGP PFR  N+ +     Q     WW ++   G+PG+ FI  L  L   
Subjt:  ---------------------PNA---HVKR-LNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIM

Query:  LKTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTT
        +K W           K  L  E+ I+D+LE +G ++T  H +R ++K+ LL +   +   W QR + +W L GD N S+FHRI    +R+N I  I    
Subjt:  LKTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTT

Query:  GISLFDEKDIETEFLSFFDSKVFEEASGLKINHQKSEILGINLEDSELTNLASKYNCK
        G SL    DI   F+S F +   +E+          EIL  NL  + ++ L     CK
Subjt:  GISLFDEKDIETEFLSFFDSKVFEEASGLKINHQKSEILGINLEDSELTNLASKYNCK

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia] | e value: 4.6e-56 | % identity: 42.81
Query:  GVYGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFNP--------------------NAHV---------------------------------------
        G+YGPS +E    FW+EL DLS LC + W+L GDFN                     N+ +                                       
Subjt:  GVYGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFNP--------------------NAHV---------------------------------------

Query:  -----KRLNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIMLKTWSKTTFGSIAQQKNQLTSELAI
             KR+ R TSDH+PI L FG   WG +PFRFEN W+ H + +P +E WW N PL GWPGHG +MKLK LK  +K W    F  I  QK  LT+ +  
Subjt:  -----KRLNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIMLKTWSKTTFGSIAQQKNQLTSELAI

Query:  LDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTTGISLFDEKDIETEFLSF
        LD+LE    +   Q   R   K  LL VVAKEE  WRQRCK KWL EGD NT FFHR +A KRRR+ ITEILS  GI L   KDIE EF+ F
Subjt:  LDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTTGISLFDEKDIETEFLSF

TrEMBL top hits (e value, % identity, alignment)
A0A438ET41 Uncharacterized protein | e value: 9.6e-44 | % identity: 36.4
Query:  YGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFNPNAHVKR------------------------LNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHH
        YGP+    +  FW EL DL  L    W +GGDF+    + +                        L R TSDH PI L     KWGP+PFRFEN W+LHH
Subjt:  YGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFNPNAHVKR------------------------LNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHH

Query:  SLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIMLKTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKS
          +     WW+   + GW GH F+ KLK +K  LK W+K  FG + ++K  + S+LA +D +E+EG LN    ++R   + +L +++ KEE+HWRQ+ + 
Subjt:  SLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIMLKTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKS

Query:  KWLLEGDNNTSFFHRIVAAKRRRNTITEILSTTGISLFDEKDIETEFLSFFDSKVFEEASG
        KW+ EGD N+ FFHR+   ++ R  I  ++S   ++L + K I  E ++FF  K++ +  G
Subjt:  KWLLEGDNNTSFFHRIVAAKRRRNTITEILSTTGISLFDEKDIETEFLSFFDSKVFEEASG

A0A438I181 Transposon TX1 uncharacterized 149 kDa protein | e value: 2.1e-43 | % identity: 39.04
Query:  VYGPSNSEEKPAFWRELSDL--------SSLCN--DSWLLGGDFN---PNAHVKRLNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWW
        VYGP+NS  +  FW ELSD+        + +C   D +L   ++    P +    L R TSDH+PI L     KWGP+PFRFEN W+ H S +     WW
Subjt:  VYGPSNSEEKPAFWRELSDL--------SSLCN--DSWLLGGDFN---PNAHVKRLNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWW

Query:  KNTPLRGWPGHGFIMKLKGLKIMLKTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNT
              GW GH F+ KL+ +K  LK W+KT+FG ++++K  + + LA  D LE+EG L+  +  QR   K +L E++ +EEIHWRQ+ + KW+ EGD N+
Subjt:  KNTPLRGWPGHGFIMKLKGLKIMLKTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNT

Query:  SFFHRIVAAKRRRNTITEILSTTGISLFDEKDIETEFLSFFDSKVFEEASG
         FFH++   +R R  I E+ + +G+ L + + I+ E L +F+ K++   SG
Subjt:  SFFHRIVAAKRRRNTITEILSTTGISLFDEKDIETEFLSFFDSKVFEEASG

A0A5D3BKT8 LINE-1 retrotransposable element ORF2 protein | e value: 6.4e-48 | % identity: 23.16
Query:  ITESTKDRAFFISLHWSSIAWPKTIFSTLLSIPQNHKFFKETHIDDQTLWVEKTNNHKGT--LAEIAKLDINGGLTKLLIPIGEDRKGWKAFYRLLHEPI
        +TE  + ++F + +   ++AW +  F  LL       FF E  ++D  +WV KT N   T   AEI ++D  G    +L+P G D  GWK F  L    I
Subjt:  ITESTKDRAFFISLHWSSIAWPKTIFSTLLSIPQNHKFFKETHIDDQTLWVEKTNNHKGT--LAEIAKLDINGGLTKLLIPIGEDRKGWKAFYRLLHEPI

Query:  QQHPHAPT-HTRQNIPDQPIPSVIVGTQTYRDAITKGKQKLNPLPAPLPTQHRRKNLHPTNITEYEEADLQLG------------SSIIVVRRFFHDDWL
             APT   R  I  +P+ +      +  D+  K   K   + +       +K    T+          +G             ++I+ RR FHDDW 
Subjt:  QQHPHAPT-HTRQNIPDQPIPSVIVGTQTYRDAITKGKQKLNPLPAPLPTQHRRKNLHPTNITEYEEADLQLG------------SSIIVVRRFFHDDWL

Query:  KITRALQTHISTLCTLNPFTADKAL--LRCEDKKQGDALSQYKDWHQVGDTQLKFLAWSDLDPSHSSVVPSY--------------------------GG
        +I  +L+       +  PF ADKA+  L  +  K   +      W  VG+ Q+KF +W     S  SV+PSY                          GG
Subjt:  KITRALQTHISTLCTLNPFTADKAL--LRCEDKKQGDALSQYKDWHQVGDTQLKFLAWSDLDPSHSSVVPSY--------------------------GG

Query:  FIEIAKKSLSKLDLLEAKIRVKSNPFGFIPGEVGINTAGFGHFKVQIDPFHRTEHYLGFSVGIHGAF----------------------LAGIPSDP---
        F+++AK+++    L++AKI+V+ N  GF+P  + I      +F V           +  +V +HG+F                         IP +P   
Subjt:  FIEIAKKSLSKLDLLEAKIRVKSNPFGFIPGEVGINTAGFGHFKVQIDPFHRTEHYLGFSVGIHGAF----------------------LAGIPSDP---

Query:  --------------------PLGTTSEGLLGPCP-----------------------------------KLSFL----------EKDCRLLDERLSVST-
                                +SE    P                                     K+SFL            +     + L +ST 
Subjt:  --------------------PLGTTSEGLLGPCP-----------------------------------KLSFL----------EKDCRLLDERLSVST-

Query:  --------SPR----TCLDIHATKNPSASSISHSPSSPVPVTDPSATSAFPADNKP--PVTS-VPLHRSDPYQCISSPYPLSTEPPSPSASTYLI-----
                SPR    T L     K+P  S+  H+ S      + S       D  P  P+ S +    +     +++  P      + SA    +     
Subjt:  --------SPR----TCLDIHATKNPSASSISHSPSSPVPVTDPSATSAFPADNKP--PVTS-VPLHRSDPYQCISSPYPLSTEPPSPSASTYLI-----

Query:  -----DSYASKPFGEGDSQSALHMFDPSIHPSFYLQVVAPWLHTIGMGIFP-----IP--TKLPKKAASLQKEYRGIRELSG------LTTTVNYEKRQP
             +  AS+   EG+S+ A    +  I  +F  ++V  WL    + + P     +P  +  P   +    +  G   L        L     ++    
Subjt:  -----DSYASKPFGEGDSQSALHMFDPSIHPSFYLQVVAPWLHTIGMGIFP-----IP--TKLPKKAASLQKEYRGIRELSG------LTTTVNYEKRQP

Query:  KKG-------------------VYGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFN-------------------------------------------
        K G                   VYGP    ++   W EL  L SLC  +WL+ GDFN                                           
Subjt:  KKG-------------------VYGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFN-------------------------------------------

Query:  --------------------PNA---HVKR-LNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIML
                             NA   H  R L R  SDH+PI L     KWGP PFR  N+ +     Q     WW N+   G+PG+ FI  L  L   +
Subjt:  --------------------PNA---HVKR-LNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIML

Query:  KTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTTG
        K W           K  L  E+ I+D+LE +G ++T  H +R ++K+ LL +   +   W QR + +W L GD N S+FHRI    +R+N I  I    G
Subjt:  KTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTTG

Query:  ISLFDEKDIETEFLSFFDS
         SL    DI   F+S F +
Subjt:  ISLFDEKDIETEFLSFFDS

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein | e value: 3.9e-53 | % identity: 22.09
Query:  PQHITIERKRFTIAIDHRYRGSGVRITESTKDRAFFISLHWSSIAWPKTIFSTLLSIPQNHKFFKETHIDDQTLWVEKTNNHKGTLAEIAKLDINGGLTK
        P+   +ERK F + +D   + +   +TE+   +AF I +    + W +    +L++ P  ++FF ET   +Q +W+ KT N KG  AEI ++D     + 
Subjt:  PQHITIERKRFTIAIDHRYRGSGVRITESTKDRAFFISLHWSSIAWPKTIFSTLLSIPQNHKFFKETHIDDQTLWVEKTNNHKGTLAEIAKLDINGGLTK

Query:  LLIPIGEDRKGWKAFYRLLHEPIQ-QHPHAPTHTRQNIPDQPI-PSVIVGTQTYRDAITKGKQKLNPLPAPLPTQHRRKNLHPTNITEYEEADLQLGSSI
        +L+P G D+ GW +F  ++   ++ +    PT   +  PD  + P +    ++Y  A+T+G+       +         +    +  +   +DL L +++
Subjt:  LLIPIGEDRKGWKAFYRLLHEPIQ-QHPHAPTHTRQNIPDQPI-PSVIVGTQTYRDAITKGKQKLNPLPAPLPTQHRRKNLHPTNITEYEEADLQLGSSI

Query:  IVVRRFFHDDWLKITRALQTHISTLCTLNPFTADKALLRCEDKKQGDALSQYKDWHQVGDTQLKFLAWSDLDPSHSSVVPSYGGF---------------
        ++VRRFFHDDW KI + L+       T N F A+KAL+        + L Q K W  VG   ++F  WS +  +   ++PSYGG+               
Subjt:  IVVRRFFHDDWLKITRALQTHISTLCTLNPFTADKALLRCEDKKQGDALSQYKDWHQVGDTQLKFLAWSDLDPSHSSVVPSYGGF---------------

Query:  -----------IEIAKKSLSKLDLLEAKIRVKSNPFGFIPGEVGINTAGFGHFKVQIDPFHRTEHYLGFSVGIHGA-------------------FLAGI
                   I++A+++ S  +L+EA+I+V+ N  GF+P  V I       F VQ+      +  +  +V +HG                    F  G 
Subjt:  -----------IEIAKKSLSKLDLLEAKIRVKSNPFGFIPGEVGINTAGFGHFKVQIDPFHRTEHYLGFSVGIHGA-------------------FLAGI

Query:  PSDPP--LGTTSEGLLGPCPKL------------------SFLEKDC------------------------RLLD---ERLSVSTSPRTCLDIHATK---
         +  P  L T+S+G     P                    SFL ++                          +LD   +++ +   P + L++  +K   
Subjt:  PSDPP--LGTTSEGLLGPCPKL------------------SFLEKDC------------------------RLLD---ERLSVSTSPRTCLDIHATK---

Query:  ------------NPSASSISHSPSSPVP--------------------------------VTDPSATSAFPADNKPPVTSV-----PLHRSDPYQCISSP
                    NP ++  +HSPS   P                                +T P    A   D      S+      L   DP + +   
Subjt:  ------------NPSASSISHSPSSPVP--------------------------------VTDPSATSAFPADNKPPVTSV-----PLHRSDPYQCISSP

Query:  YPLSTEPPSPSASTYLIDS--YASKPFGEGDSQSALHMFDPSIH--------------------PSFYLQVVAPWLHTIGMGI-----------------
        +           +T ++        P  E  + S+   +    H                     +F  Q+V+ WL   G+ +                 
Subjt:  YPLSTEPPSPSASTYLIDS--YASKPFGEGDSQSALHMFDPSIH--------------------PSFYLQVVAPWLHTIGMGI-----------------

Query:  ------FPIPTK--------------LPKKAA---------------SLQKEYRGIRELSGLTTTVNYEKRQPKKGVYGPSNSEEKPAFWRELSDLSSLC
                I  K              + K A+               SL  +  G+  LS     +N        G+YGP    E+  FW EL +L  L 
Subjt:  ------FPIPTK--------------LPKKAA---------------SLQKEYRGIRELSGLTTTVNYEKRQPKKGVYGPSNSEEKPAFWRELSDLSSLC

Query:  NDSWLLGGDFN------------PNAHVKR-------------------------------------------------------LNRPTSDHYPIQLSF
        +  W+LGGD N             ++H  R                                                       L R TSDH+P+    
Subjt:  NDSWLLGGDFN------------PNAHVKR-------------------------------------------------------LNRPTSDHYPIQLSF

Query:  GMTK--WGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIMLKTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKT
           K  WGP PFR  +  +     +  +  WW+N+   G+PG  FI +LK L   +K W K    S+   K  +  E+  +D+ E +  L   +  +R  
Subjt:  GMTK--WGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIMLKTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKT

Query:  IKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTTGISLFDEKDIETEFLSFFDSKVFEEAS
        +K  L E+  KE   W QR K  WL EGD N+SFFHRI +++++R+ I EI    G        I T F+ FF S+++  ++
Subjt:  IKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTTGISLFDEKDIETEFLSFFDSKVFEEAS

A0A6J1E2G6 uncharacterized protein LOC111025405 | e value: 2.2e-56 | % identity: 42.81
Query:  GVYGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFNP--------------------NAHV---------------------------------------
        G+YGPS +E    FW+EL DLS LC + W+L GDFN                     N+ +                                       
Subjt:  GVYGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFNP--------------------NAHV---------------------------------------

Query:  -----KRLNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIMLKTWSKTTFGSIAQQKNQLTSELAI
             KR+ R TSDH+PI L FG   WG +PFRFEN W+ H + +P +E WW N PL GWPGHG +MKLK LK  +K W    F  I  QK  LT+ +  
Subjt:  -----KRLNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGHGFIMKLKGLKIMLKTWSKTTFGSIAQQKNQLTSELAI

Query:  LDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTTGISLFDEKDIETEFLSF
        LD+LE    +   Q   R   K  LL VVAKEE  WRQRCK KWL EGD NT FFHR +A KRRR+ ITEILS  GI L   KDIE EF+ F
Subjt:  LDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILSTTGISLFDEKDIETEFLSF

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein | e value: 1.6e-06 | % identity: 28.84
Query:  PKKGV-YGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFNPNAHVKRLNRPTSDHYP-IQLSFGMTKWGPSPFRFENAWMLHHS-LQPLIEYWWKNTPLR
        P +GV Y  SN ++     R+L    ++ N  W     F     V  L+   SDH P I +   + K     FR+ +    H + L  L   W +  P+ 
Subjt:  PKKGV-YGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFNPNAHVKRLNRPTSDHYP-IQLSFGMTKWGPSPFRFENAWMLHHS-LQPLIEYWWKNTPLR

Query:  GWPGHGFIM--KLKGLKIMLKTWSKTTFGSIAQQ-KNQLTSELAILDEL--EEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTS
            H F +   LK  K   K  ++  FG+I  + K  L S  +I  +L       L  V+H  RK          A  E  +RQ+ + KWL +GD NT 
Subjt:  GWPGHGFIM--KLKGLKIMLKTWSKTTFGSIAQQ-KNQLTSELAILDEL--EEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTS

Query:  FFHRIVAAKRRRNTI
        FFH+++ A + +N I
Subjt:  FFHRIVAAKRRRNTI


Sequences
CDS sequence
ATGGCCGCCTCTCCCCACCCACAACACATCACCATCGAAAGAAAACGCTTCACCATTGCCATTGACCACAGATACCGAGGTAGCGGAGTTCGCATCACTGAATCCACCAA
AGACAGAGCTTTTTTCATCTCCCTCCATTGGTCTTCCATTGCCTGGCCGAAGACAATTTTCTCCACCCTCTTATCTATCCCTCAAAACCATAAATTCTTCAAGGAGACGC
ATATCGATGACCAGACACTCTGGGTCGAGAAGACAAACAACCATAAAGGCACACTTGCTGAGATTGCTAAACTGGACATTAATGGAGGACTCACTAAACTTCTGATCCCC
ATCGGCGAAGATAGAAAGGGTTGGAAAGCCTTCTATAGACTCCTACATGAACCCATTCAACAACACCCCCATGCTCCCACCCACACTAGACAAAACATCCCTGATCAGCC
CATCCCCTCTGTCATCGTGGGTACACAGACATACCGAGATGCCATAACTAAAGGAAAACAGAAACTTAATCCTTTACCCGCTCCACTCCCGACACAGCACAGAAGAAAGA
ACCTGCATCCTACAAACATCACCGAATACGAAGAAGCAGATCTCCAATTGGGTTCATCCATTATTGTCGTTAGAAGATTCTTCCACGACGATTGGCTCAAGATAACCAGA
GCTCTTCAAACCCACATCAGTACACTCTGTACTCTCAACCCCTTTACTGCTGATAAAGCGCTCCTCCGCTGTGAAGACAAAAAACAAGGAGACGCACTCAGCCAATATAA
GGATTGGCATCAAGTAGGTGATACACAGCTGAAGTTTCTGGCATGGAGCGACCTTGACCCATCACACTCTTCTGTAGTACCTTCATATGGTGGTTTTATTGAAATTGCTA
AGAAATCTCTGTCCAAACTTGATCTTTTGGAGGCGAAAATTAGAGTAAAGTCAAACCCTTTCGGCTTTATCCCGGGAGAAGTTGGGATTAACACCGCTGGCTTTGGCCAT
TTCAAAGTACAAATTGATCCTTTCCACAGGACGGAACATTACCTGGGTTTTTCAGTCGGAATCCATGGAGCTTTTCTGGCCGGAATCCCTTCCGATCCGCCATTGGGAAC
AACCTCCGAGGGCCTTTTGGGTCCGTGCCCGAAGCTTTCCTTTTTGGAGAAAGACTGCAGACTTCTTGATGAAAGACTCTCTGTCTCTACCTCACCTCGTACCTGCCTTG
ACATACACGCCACAAAAAACCCGTCTGCTTCATCTATCAGCCACAGTCCCTCGAGCCCTGTGCCTGTGACTGACCCCTCTGCTACTTCTGCCTTCCCTGCTGACAATAAA
CCCCCTGTCACATCGGTACCCTTACACCGATCTGACCCCTACCAATGTATCTCTAGCCCGTATCCCCTCTCCACTGAGCCTCCCTCCCCATCTGCCTCCACATACTTGAT
CGACAGTTATGCCTCAAAACCTTTTGGAGAAGGCGACTCTCAATCTGCCTTACACATGTTCGATCCCTCTATCCACCCCTCATTTTATCTCCAGGTAGTTGCTCCTTGGC
TTCATACTATTGGGATGGGCATCTTCCCCATTCCTACAAAGCTCCCAAAGAAAGCTGCCTCTTTGCAGAAGGAATACAGGGGCATTAGGGAACTTTCTGGCCTTACTACT
ACAGTCAACTATGAAAAACGCCAACCAAAAAAGGGTGTATACGGTCCATCAAATTCAGAGGAAAAGCCAGCTTTCTGGAGGGAGCTAAGCGACCTTTCATCCCTCTGCAA
TGACTCATGGCTTTTAGGGGGAGACTTCAATCCTAATGCACACGTCAAAAGATTGAATCGGCCTACCTCTGATCATTATCCCATACAACTCTCCTTCGGCATGACTAAAT
GGGGTCCATCTCCCTTCCGGTTTGAAAATGCATGGATGCTGCATCACTCCTTACAACCCCTCATCGAATATTGGTGGAAGAACACCCCCCTCAGAGGCTGGCCAGGCCAC
GGTTTCATCATGAAACTTAAAGGGCTGAAAATCATGCTCAAAACCTGGAGCAAAACAACTTTCGGGAGCATTGCACAACAAAAAAATCAGCTCACCTCAGAGCTCGCCAT
TCTAGATGAATTAGAGGAAGAGGGACTTCTAAACACAGTCCAACACGCCCAACGGAAAACTATCAAAACACAACTTCTTGAGGTTGTGGCCAAAGAGGAGATCCATTGGC
GCCAACGCTGCAAATCCAAATGGTTACTGGAAGGAGACAACAATACATCCTTCTTCCACCGGATTGTTGCAGCAAAGAGACGTCGGAACACCATTACAGAAATCCTCTCC
ACAACTGGTATCAGTCTTTTTGATGAGAAGGACATAGAAACAGAATTTCTTTCTTTTTTTGACAGCAAGGTTTTTGAGGAAGCTTCAGGGCTCAAAATCAACCACCAGAA
ATCTGAAATTCTGGGCATAAACCTAGAAGACAGTGAGCTAACAAATCTTGCATCCAAGTATAATTGCAAAAAGGGATCTTAG
mRNA sequence
ATGGCCGCCTCTCCCCACCCACAACACATCACCATCGAAAGAAAACGCTTCACCATTGCCATTGACCACAGATACCGAGGTAGCGGAGTTCGCATCACTGAATCCACCAA
AGACAGAGCTTTTTTCATCTCCCTCCATTGGTCTTCCATTGCCTGGCCGAAGACAATTTTCTCCACCCTCTTATCTATCCCTCAAAACCATAAATTCTTCAAGGAGACGC
ATATCGATGACCAGACACTCTGGGTCGAGAAGACAAACAACCATAAAGGCACACTTGCTGAGATTGCTAAACTGGACATTAATGGAGGACTCACTAAACTTCTGATCCCC
ATCGGCGAAGATAGAAAGGGTTGGAAAGCCTTCTATAGACTCCTACATGAACCCATTCAACAACACCCCCATGCTCCCACCCACACTAGACAAAACATCCCTGATCAGCC
CATCCCCTCTGTCATCGTGGGTACACAGACATACCGAGATGCCATAACTAAAGGAAAACAGAAACTTAATCCTTTACCCGCTCCACTCCCGACACAGCACAGAAGAAAGA
ACCTGCATCCTACAAACATCACCGAATACGAAGAAGCAGATCTCCAATTGGGTTCATCCATTATTGTCGTTAGAAGATTCTTCCACGACGATTGGCTCAAGATAACCAGA
GCTCTTCAAACCCACATCAGTACACTCTGTACTCTCAACCCCTTTACTGCTGATAAAGCGCTCCTCCGCTGTGAAGACAAAAAACAAGGAGACGCACTCAGCCAATATAA
GGATTGGCATCAAGTAGGTGATACACAGCTGAAGTTTCTGGCATGGAGCGACCTTGACCCATCACACTCTTCTGTAGTACCTTCATATGGTGGTTTTATTGAAATTGCTA
AGAAATCTCTGTCCAAACTTGATCTTTTGGAGGCGAAAATTAGAGTAAAGTCAAACCCTTTCGGCTTTATCCCGGGAGAAGTTGGGATTAACACCGCTGGCTTTGGCCAT
TTCAAAGTACAAATTGATCCTTTCCACAGGACGGAACATTACCTGGGTTTTTCAGTCGGAATCCATGGAGCTTTTCTGGCCGGAATCCCTTCCGATCCGCCATTGGGAAC
AACCTCCGAGGGCCTTTTGGGTCCGTGCCCGAAGCTTTCCTTTTTGGAGAAAGACTGCAGACTTCTTGATGAAAGACTCTCTGTCTCTACCTCACCTCGTACCTGCCTTG
ACATACACGCCACAAAAAACCCGTCTGCTTCATCTATCAGCCACAGTCCCTCGAGCCCTGTGCCTGTGACTGACCCCTCTGCTACTTCTGCCTTCCCTGCTGACAATAAA
CCCCCTGTCACATCGGTACCCTTACACCGATCTGACCCCTACCAATGTATCTCTAGCCCGTATCCCCTCTCCACTGAGCCTCCCTCCCCATCTGCCTCCACATACTTGAT
CGACAGTTATGCCTCAAAACCTTTTGGAGAAGGCGACTCTCAATCTGCCTTACACATGTTCGATCCCTCTATCCACCCCTCATTTTATCTCCAGGTAGTTGCTCCTTGGC
TTCATACTATTGGGATGGGCATCTTCCCCATTCCTACAAAGCTCCCAAAGAAAGCTGCCTCTTTGCAGAAGGAATACAGGGGCATTAGGGAACTTTCTGGCCTTACTACT
ACAGTCAACTATGAAAAACGCCAACCAAAAAAGGGTGTATACGGTCCATCAAATTCAGAGGAAAAGCCAGCTTTCTGGAGGGAGCTAAGCGACCTTTCATCCCTCTGCAA
TGACTCATGGCTTTTAGGGGGAGACTTCAATCCTAATGCACACGTCAAAAGATTGAATCGGCCTACCTCTGATCATTATCCCATACAACTCTCCTTCGGCATGACTAAAT
GGGGTCCATCTCCCTTCCGGTTTGAAAATGCATGGATGCTGCATCACTCCTTACAACCCCTCATCGAATATTGGTGGAAGAACACCCCCCTCAGAGGCTGGCCAGGCCAC
GGTTTCATCATGAAACTTAAAGGGCTGAAAATCATGCTCAAAACCTGGAGCAAAACAACTTTCGGGAGCATTGCACAACAAAAAAATCAGCTCACCTCAGAGCTCGCCAT
TCTAGATGAATTAGAGGAAGAGGGACTTCTAAACACAGTCCAACACGCCCAACGGAAAACTATCAAAACACAACTTCTTGAGGTTGTGGCCAAAGAGGAGATCCATTGGC
GCCAACGCTGCAAATCCAAATGGTTACTGGAAGGAGACAACAATACATCCTTCTTCCACCGGATTGTTGCAGCAAAGAGACGTCGGAACACCATTACAGAAATCCTCTCC
ACAACTGGTATCAGTCTTTTTGATGAGAAGGACATAGAAACAGAATTTCTTTCTTTTTTTGACAGCAAGGTTTTTGAGGAAGCTTCAGGGCTCAAAATCAACCACCAGAA
ATCTGAAATTCTGGGCATAAACCTAGAAGACAGTGAGCTAACAAATCTTGCATCCAAGTATAATTGCAAAAAGGGATCTTAG
Protein sequence
MAASPHPQHITIERKRFTIAIDHRYRGSGVRITESTKDRAFFISLHWSSIAWPKTIFSTLLSIPQNHKFFKETHIDDQTLWVEKTNNHKGTLAEIAKLDINGGLTKLLIP
IGEDRKGWKAFYRLLHEPIQQHPHAPTHTRQNIPDQPIPSVIVGTQTYRDAITKGKQKLNPLPAPLPTQHRRKNLHPTNITEYEEADLQLGSSIIVVRRFFHDDWLKITR
ALQTHISTLCTLNPFTADKALLRCEDKKQGDALSQYKDWHQVGDTQLKFLAWSDLDPSHSSVVPSYGGFIEIAKKSLSKLDLLEAKIRVKSNPFGFIPGEVGINTAGFGH
FKVQIDPFHRTEHYLGFSVGIHGAFLAGIPSDPPLGTTSEGLLGPCPKLSFLEKDCRLLDERLSVSTSPRTCLDIHATKNPSASSISHSPSSPVPVTDPSATSAFPADNK
PPVTSVPLHRSDPYQCISSPYPLSTEPPSPSASTYLIDSYASKPFGEGDSQSALHMFDPSIHPSFYLQVVAPWLHTIGMGIFPIPTKLPKKAASLQKEYRGIRELSGLTT
TVNYEKRQPKKGVYGPSNSEEKPAFWRELSDLSSLCNDSWLLGGDFNPNAHVKRLNRPTSDHYPIQLSFGMTKWGPSPFRFENAWMLHHSLQPLIEYWWKNTPLRGWPGH
GFIMKLKGLKIMLKTWSKTTFGSIAQQKNQLTSELAILDELEEEGLLNTVQHAQRKTIKTQLLEVVAKEEIHWRQRCKSKWLLEGDNNTSFFHRIVAAKRRRNTITEILS
TTGISLFDEKDIETEFLSFFDSKVFEEASGLKINHQKSEILGINLEDSELTNLASKYNCKKGS
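
As a quick consistency check between the CDS and protein sequences listed above, the following Python sketch translates the first 30 and last 15 nucleotides of the CDS. It assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly; the names `CODON_TABLE` and `translate` are illustrative only.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1; '*' marks a stop codon, trailing partial codons are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[pos:pos + 3]] for pos in range(0, usable, 3))


# First 30 nt and last 15 nt of the CDS sequence listed above.
cds_start = "ATGGCCGCCTCTCCCCACCCACAACACATC"
cds_end = "AAAAAGGGATCTTAG"

print(translate(cds_start))  # MAASPHPQHI
print(translate(cds_end))    # KKGS*
```

Both fragments agree with the protein sequence, which begins MAASPHPQHI and ends KKGS; the trailing `*` corresponds to the TAG stop codon.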