; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026386 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026386
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon protein
Genome locationchr10:35889116..35891587
RNA-Seq ExpressionLag0026386
SyntenyLag0026386
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR024752 - Myb/SANT-like domain
IPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN33754.1 retrotransposon protein [Cucumis melo subsp. melo]1.6e-19456.96Show/hide
Query:  LIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTAS
        +IHESDL CR+STRMDRR FAILC LL+ VAGLS+TE+VDVEEMVAMFLH++AHDVKNRV+  +F RSGETVSRHFN VL AVLRL++ L+K+P PVT++
Subjt:  LIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTAS

Query:  CTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEG
        C D RWK F+NCLGALDGTYIKVNV   DRP +RTRKGEIATNVL VCD KGDF +VL GWEGSAADSR+LRDAIS+ NGL VPKGYYYLCDAGYPNAEG
Subjt:  CTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEG

Query:  FLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSA-S
        FLAPY+G+RYHL +WRGA NAPT  KE+FNM+HSS RNVIERAFG+LKGRW ILRGKSYYP+++Q  TI AC LLHNLINREM      ++ DE DS  +
Subjt:  FLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSA-S

Query:  ITTDGENINFIETSDEWSRWRDELAIQMFSNWELR-------------GWRSDNGTFKAGYLGQLENLMREKLPGHDVPAQSNIDSRVRTLKKQYQAIAE
         TT  E+I +IET++EWS+WRD+LA  MF++W+ R             GW+SDNGTF+ GYL QL  +M EKL G  V A + ID R++TLK+ +QAIAE
Subjt:  ITTDGENINFIETSDEWSRWRDELAIQMFSNWELR-------------GWRSDNGTFKAGYLGQLENLMREKLPGHDVPAQSNIDSRVRTLKKQYQAIAE

Query:  MLGPGCSGFGWNDEFKCIEAEKETFDLW--GHPIAKGMRNKPFPHFEDLAYVFGKDRATGKVSEVMGEMGSNMPDE------------------------
        MLGP CSGFGWNDE KCI AEKE FD W    P AKG+ N PFP++++L YVFG+DRATG+ +E   ++GSN P                          
Subjt:  MLGPGCSGFGWNDEFKCIEAEKETFDLW--GHPIAKGMRNKPFPHFEDLAYVFGKDRATGKVSEVMGEMGSNMPDE------------------------

Query:  -EVDLSASRRSPCPL-RTCQRMNLGVRQAVKVSQLEGRHVGA-RGREQLKSIAEWPEKKRVTEADFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATE
         + D+ ASR S     +T    +   R + +   +E  H+   +  EQL+ IAEWP +    +   R +    L E+PEL +  R  L   L + M    
Subjt:  -EVDLSASRRSPCPL-RTCQRMNLGVRQAVKVSQLEGRHVGA-RGREQLKSIAEWPEKKRVTEADFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATE

Query:  SFFSIPAALKLEYCELLL
         F  +P   +  +C +LL
Subjt:  SFFSIPAALKLEYCELLL

ADN34114.1 retrotransposon protein [Cucumis melo subsp. melo]2.6e-20054.04Show/hide
Query:  LDPDELVAILTAFTAAQTTILLLLDLYMNDHRRIEHQSPYIRHHIRQLAFFRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMF
        +D  EL +I+ AF A+Q  +LL+L+L  ND +RI H     RH IRQLA+FR+IH                         T+AGL++TEVVDVEEMVAMF
Subjt:  LDPDELVAILTAFTAAQTTILLLLDLYMNDHRRIEHQSPYIRHHIRQLAFFRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVC
        LHI+AHDVK+RV+  +F RSGET+SRHFN VL AV+RLH+ LLKKP+PV   CTD RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+ATNVL VC
Subjt:  LHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLK
        DTKGDF +VL GWEGSAADSR+LRDA+SRPN L VPKGYYYL D GYPNAEGFLAPYRG+RYHL +WRG  NAP+T KEFFNM+H S RNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIETSDEWSRWRDELAIQMFSN--------W---------
        GRWAILRGKSYYPV +Q  TI ACCLLHNLINREM + +I D +DEVDS   TT  ++I++IETS+EWS+WRD LA ++ ++        W         
Subjt:  GRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIETSDEWSRWRDELAIQMFSN--------W---------

Query:  -------ELRGWRSDNGTFKAGYLGQLENLMREKLPGHDVPAQSNIDSRVRTLKKQYQAIAEMLGPGCSGFGWNDEFKCIEAEKETFDLWGHPIAKGMRN
                  GWRSDNGTF+ GYL QL  +M  K+PG ++ A S IDSR++ +K+ + A+AEM GP CSGFGWNDE KCI AEKE FD W HP AKG+ N
Subjt:  -------ELRGWRSDNGTFKAGYLGQLENLMREKLPGHDVPAQSNIDSRVRTLKKQYQAIAEMLGPGCSGFGWNDEFKCIEAEKETFDLWGHPIAKGMRN

Query:  KPFPHFEDLAYVFGKDRATGKVSEVMGEMGSN------------MPDEEVDLSAS---RRSPCPLRTCQRMNLGVRQAVKVSQLEGRHVGAR--------
        K F H+++L+YVFGKDRATG  +E   ++GSN            MPD +     S     SP  L   +   +  R+ V       R   A         
Subjt:  KPFPHFEDLAYVFGKDRATGKVSEVMGEMGSN------------MPDEEVDLSAS---RRSPCPLRTCQRMNLGVRQAVKVSQLEGRHVGAR--------

Query:  ----GREQLKSIAEWPEKKRVTEADFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATESFFSIPAALKLEYCELLLHKH
            G EQL  IAEWP  +R      R +IV HL  +PEL    R +LM IL  N+   ++F  +P  +K  YC L+L ++
Subjt:  ----GREQLKSIAEWPEKKRVTEADFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATESFFSIPAALKLEYCELLLHKH

KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa]6.5e-20456.16Show/hide
Query:  LDPDELVAILTAFTAAQTTILLLLDLYMNDHRRIEHQSPYIRHHIRQLAFFRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMF
        +D  EL +I+ AF A+Q  +LL+L+L  ND +RI H     RH IRQLA+FR+IH SDL CR+STRMDRRCFAILC LL+T+AGL++TEVVDVEEMVAMF
Subjt:  LDPDELVAILTAFTAAQTTILLLLDLYMNDHRRIEHQSPYIRHHIRQLAFFRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVC
        LHI+AHDVKNRV+  +F RSGET+SRHFN VL AV+RLHD LLKKP+PV   CTD RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+ATNVL V 
Subjt:  LHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLK
        DTKGDF +VL GWEGSAADSR+LRDA+SRPN L VPKGYYYL DAGYPNAEGFLAPYRG+RYHL +WRG  NAP+T KEFFNM+HSS RNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIETSDEWSRWRDELAIQMFSNWELRGWRSDNGTFKAGYL
        GRWAILRGKSY+PV +Q  TI ACCLLHNLINREM + +I D +  + S+S                W++  +   +++ +     GWRSDNGTF+ GYL
Subjt:  GRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIETSDEWSRWRDELAIQMFSNWELRGWRSDNGTFKAGYL

Query:  GQLENLMREKLPGHDVPAQSNIDSRVRTLKKQYQAIAEMLGPGCSGFGWNDEFKCIEAEKETFDLWGHPIAKGMRNKPFPHFEDLAYVFGKDRATGKVSE
         QL  +M  K+PG ++ A S IDSR++ +K+ + A+AEM GP CSGFGWNDE KCI AEKE FD W HP AKG+ NK F H+++L+YVFGKDRATG  +E
Subjt:  GQLENLMREKLPGHDVPAQSNIDSRVRTLKKQYQAIAEMLGPGCSGFGWNDEFKCIEAEKETFDLWGHPIAKGMRNKPFPHFEDLAYVFGKDRATGKVSE

Query:  VMGEMGSNMPD----------EEVDLS-----ASRRSPCPLRTCQRMNLGVRQAVKVSQLEGRHVGAR------------GREQLKSIAEWPEKKRVTEA
           ++GSN P            + D S         SP  L   +   +  R+ V       R   A             G EQL  IAEWP  +R    
Subjt:  VMGEMGSNMPD----------EEVDLS-----ASRRSPCPLRTCQRMNLGVRQAVKVSQLEGRHVGAR------------GREQLKSIAEWPEKKRVTEA

Query:  DFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATESFFSIPAALKLEYCELLLHKH
          R +IV  L  +PEL    R +LM IL  N+   ++F  +P  +K  YC ++L ++
Subjt:  DFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATESFFSIPAALKLEYCELLLHKH

KAA0046727.1 retrotransposon protein [Cucumis melo var. makuwa]4.6e-16572.8Show/hide
Query:  LDPDELVAILTAFTAAQTTILLLLDLYMNDHRRIEHQSPYIRHHIRQLAFFRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMF
        +D  EL +I+ AF A+Q  +LL+L+L  ND +RI H     RH IRQLA+FR+IH SDL CR+STRMDRRCFAILC LL+T+AGL++TEVVDVEEMVAMF
Subjt:  LDPDELVAILTAFTAAQTTILLLLDLYMNDHRRIEHQSPYIRHHIRQLAFFRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVC
        LHI+AHDVKNRV+  +F RSGET+SRHFN VL AV+RLHD LLKKP+PV   CTD RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+ATNVL VC
Subjt:  LHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLK
        DTKGDF +VL GWEGSAADSR+LRDA+SRPN L VPKGYYYL DAGYPNAEGFLAPYRG+RYHL +WRG  NAP+T KEFFNM+HSS RNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIETSDEWSRWRDELAIQMFSNWELR
        GRWAILRGKSYYPV +Q  TI ACCLLHNLINREM + +I D +DEVDS   TT  ++I++IETS+EWS+WRD LA +MF+ WELR
Subjt:  GRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIETSDEWSRWRDELAIQMFSNWELR

KAA0062747.1 retrotransposon protein [Cucumis melo var. makuwa]1.6e-18959.55Show/hide
Query:  MDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQ---N
        MDRRCFAILC LL+T AGL  TEV+DVEEMVAMFLHI+AH VKNR++  +F RSGETVSRHFN VL A  RLHD LLKKP+PVT SCTDPRWKWF+   N
Subjt:  MDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQ---N

Query:  CLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYH
        CL + +GTYIKVNV   DRPRYRTRKGE+ATNVL  CDTKGDF FVL GWEGSAADSR+LRDAISR NGL VPKGYYYLCDAGYPNAEGFLAPYRGERYH
Subjt:  CLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYH

Query:  LSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIE
        LS+WRG  NAPTT +EFFNM+HSS+RNVIERAFGLLKG WAILRGKSYYPV +Q  TI ACCLLHNLINREM + EI D+LDE DS   TT G+ IN+IE
Subjt:  LSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIE

Query:  TSDEWSRWRDELAIQMFSNWELRGWRSDNGTFKAGYLGQLENLMREKLP----GHDVPAQSNIDSRVRTLKKQYQAIAEMLGPGCSGFGWNDEFKCIEAE
         S+EWS WRD+LA  MFS+WEL+           G   + +NL+         G   P   + +              EM GP CSGFGWN+EF+CI AE
Subjt:  TSDEWSRWRDELAIQMFSNWELRGWRSDNGTFKAGYLGQLENLMREKLP----GHDVPAQSNIDSRVRTLKKQYQAIAEMLGPGCSGFGWNDEFKCIEAE

Query:  KETFDLW--GHPIAKGMRNKPFPHFEDLAYVFGKDRATGKVSEVMGEMGSNMP---DEEVDLSASRRSPCPLRTCQRMNLGVRQAVKVSQLEGRHVGARG
        ++ FD W   HP  KG+ +K FP+++DL+YVFGKDRATG  SE   ++GSN+P   ++ + L  S     P    Q +++   +   +     R V   G
Subjt:  KETFDLW--GHPIAKGMRNKPFPHFEDLAYVFGKDRATGKVSEVMGEMGSNMP---DEEVDLSASRRSPCPLRTCQRMNLGVRQAVKVSQLEGRHVGARG

Query:  REQLKSIAEWPEKKRVTEADFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATESFFSIPAALKLEYCELLLHKH
         EQLK+IA+W ++KR  E + R ++V  L ++PEL +Q R KLM ILF +++A   F SIP  LKLEYC +LL  +
Subjt:  REQLKSIAEWPEKKRVTEADFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATESFFSIPAALKLEYCELLLHKH

TrEMBL top hitse value%identityAlignment
A0A5A7SWD8 Retrotransposon protein3.1e-20456.16Show/hide
Query:  LDPDELVAILTAFTAAQTTILLLLDLYMNDHRRIEHQSPYIRHHIRQLAFFRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMF
        +D  EL +I+ AF A+Q  +LL+L+L  ND +RI H     RH IRQLA+FR+IH SDL CR+STRMDRRCFAILC LL+T+AGL++TEVVDVEEMVAMF
Subjt:  LDPDELVAILTAFTAAQTTILLLLDLYMNDHRRIEHQSPYIRHHIRQLAFFRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVC
        LHI+AHDVKNRV+  +F RSGET+SRHFN VL AV+RLHD LLKKP+PV   CTD RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+ATNVL V 
Subjt:  LHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLK
        DTKGDF +VL GWEGSAADSR+LRDA+SRPN L VPKGYYYL DAGYPNAEGFLAPYRG+RYHL +WRG  NAP+T KEFFNM+HSS RNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIETSDEWSRWRDELAIQMFSNWELRGWRSDNGTFKAGYL
        GRWAILRGKSY+PV +Q  TI ACCLLHNLINREM + +I D +  + S+S                W++  +   +++ +     GWRSDNGTF+ GYL
Subjt:  GRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIETSDEWSRWRDELAIQMFSNWELRGWRSDNGTFKAGYL

Query:  GQLENLMREKLPGHDVPAQSNIDSRVRTLKKQYQAIAEMLGPGCSGFGWNDEFKCIEAEKETFDLWGHPIAKGMRNKPFPHFEDLAYVFGKDRATGKVSE
         QL  +M  K+PG ++ A S IDSR++ +K+ + A+AEM GP CSGFGWNDE KCI AEKE FD W HP AKG+ NK F H+++L+YVFGKDRATG  +E
Subjt:  GQLENLMREKLPGHDVPAQSNIDSRVRTLKKQYQAIAEMLGPGCSGFGWNDEFKCIEAEKETFDLWGHPIAKGMRNKPFPHFEDLAYVFGKDRATGKVSE

Query:  VMGEMGSNMPD----------EEVDLS-----ASRRSPCPLRTCQRMNLGVRQAVKVSQLEGRHVGAR------------GREQLKSIAEWPEKKRVTEA
           ++GSN P            + D S         SP  L   +   +  R+ V       R   A             G EQL  IAEWP  +R    
Subjt:  VMGEMGSNMPD----------EEVDLS-----ASRRSPCPLRTCQRMNLGVRQAVKVSQLEGRHVGAR------------GREQLKSIAEWPEKKRVTEA

Query:  DFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATESFFSIPAALKLEYCELLLHKH
          R +IV  L  +PEL    R +LM IL  N+   ++F  +P  +K  YC ++L ++
Subjt:  DFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATESFFSIPAALKLEYCELLLHKH

A0A5A7TXW1 Retrotransposon protein2.2e-16572.8Show/hide
Query:  LDPDELVAILTAFTAAQTTILLLLDLYMNDHRRIEHQSPYIRHHIRQLAFFRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMF
        +D  EL +I+ AF A+Q  +LL+L+L  ND +RI H     RH IRQLA+FR+IH SDL CR+STRMDRRCFAILC LL+T+AGL++TEVVDVEEMVAMF
Subjt:  LDPDELVAILTAFTAAQTTILLLLDLYMNDHRRIEHQSPYIRHHIRQLAFFRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVC
        LHI+AHDVKNRV+  +F RSGET+SRHFN VL AV+RLHD LLKKP+PV   CTD RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+ATNVL VC
Subjt:  LHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLK
        DTKGDF +VL GWEGSAADSR+LRDA+SRPN L VPKGYYYL DAGYPNAEGFLAPYRG+RYHL +WRG  NAP+T KEFFNM+HSS RNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIETSDEWSRWRDELAIQMFSNWELR
        GRWAILRGKSYYPV +Q  TI ACCLLHNLINREM + +I D +DEVDS   TT  ++I++IETS+EWS+WRD LA +MF+ WELR
Subjt:  GRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIETSDEWSRWRDELAIQMFSNWELR

A0A5D3DG22 Retrotransposon protein7.5e-19059.55Show/hide
Query:  MDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQ---N
        MDRRCFAILC LL+T AGL  TEV+DVEEMVAMFLHI+AH VKNR++  +F RSGETVSRHFN VL A  RLHD LLKKP+PVT SCTDPRWKWF+   N
Subjt:  MDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQ---N

Query:  CLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYH
        CL + +GTYIKVNV   DRPRYRTRKGE+ATNVL  CDTKGDF FVL GWEGSAADSR+LRDAISR NGL VPKGYYYLCDAGYPNAEGFLAPYRGERYH
Subjt:  CLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYH

Query:  LSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIE
        LS+WRG  NAPTT +EFFNM+HSS+RNVIERAFGLLKG WAILRGKSYYPV +Q  TI ACCLLHNLINREM + EI D+LDE DS   TT G+ IN+IE
Subjt:  LSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIE

Query:  TSDEWSRWRDELAIQMFSNWELRGWRSDNGTFKAGYLGQLENLMREKLP----GHDVPAQSNIDSRVRTLKKQYQAIAEMLGPGCSGFGWNDEFKCIEAE
         S+EWS WRD+LA  MFS+WEL+           G   + +NL+         G   P   + +              EM GP CSGFGWN+EF+CI AE
Subjt:  TSDEWSRWRDELAIQMFSNWELRGWRSDNGTFKAGYLGQLENLMREKLP----GHDVPAQSNIDSRVRTLKKQYQAIAEMLGPGCSGFGWNDEFKCIEAE

Query:  KETFDLW--GHPIAKGMRNKPFPHFEDLAYVFGKDRATGKVSEVMGEMGSNMP---DEEVDLSASRRSPCPLRTCQRMNLGVRQAVKVSQLEGRHVGARG
        ++ FD W   HP  KG+ +K FP+++DL+YVFGKDRATG  SE   ++GSN+P   ++ + L  S     P    Q +++   +   +     R V   G
Subjt:  KETFDLW--GHPIAKGMRNKPFPHFEDLAYVFGKDRATGKVSEVMGEMGSNMP---DEEVDLSASRRSPCPLRTCQRMNLGVRQAVKVSQLEGRHVGARG

Query:  REQLKSIAEWPEKKRVTEADFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATESFFSIPAALKLEYCELLLHKH
         EQLK+IA+W ++KR  E + R ++V  L ++PEL +Q R KLM ILF +++A   F SIP  LKLEYC +LL  +
Subjt:  REQLKSIAEWPEKKRVTEADFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATESFFSIPAALKLEYCELLLHKH

E5GBB2 Retrotransposon protein7.8e-19556.96Show/hide
Query:  LIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTAS
        +IHESDL CR+STRMDRR FAILC LL+ VAGLS+TE+VDVEEMVAMFLH++AHDVKNRV+  +F RSGETVSRHFN VL AVLRL++ L+K+P PVT++
Subjt:  LIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTAS

Query:  CTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEG
        C D RWK F+NCLGALDGTYIKVNV   DRP +RTRKGEIATNVL VCD KGDF +VL GWEGSAADSR+LRDAIS+ NGL VPKGYYYLCDAGYPNAEG
Subjt:  CTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEG

Query:  FLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSA-S
        FLAPY+G+RYHL +WRGA NAPT  KE+FNM+HSS RNVIERAFG+LKGRW ILRGKSYYP+++Q  TI AC LLHNLINREM      ++ DE DS  +
Subjt:  FLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSA-S

Query:  ITTDGENINFIETSDEWSRWRDELAIQMFSNWELR-------------GWRSDNGTFKAGYLGQLENLMREKLPGHDVPAQSNIDSRVRTLKKQYQAIAE
         TT  E+I +IET++EWS+WRD+LA  MF++W+ R             GW+SDNGTF+ GYL QL  +M EKL G  V A + ID R++TLK+ +QAIAE
Subjt:  ITTDGENINFIETSDEWSRWRDELAIQMFSNWELR-------------GWRSDNGTFKAGYLGQLENLMREKLPGHDVPAQSNIDSRVRTLKKQYQAIAE

Query:  MLGPGCSGFGWNDEFKCIEAEKETFDLW--GHPIAKGMRNKPFPHFEDLAYVFGKDRATGKVSEVMGEMGSNMPDE------------------------
        MLGP CSGFGWNDE KCI AEKE FD W    P AKG+ N PFP++++L YVFG+DRATG+ +E   ++GSN P                          
Subjt:  MLGPGCSGFGWNDEFKCIEAEKETFDLW--GHPIAKGMRNKPFPHFEDLAYVFGKDRATGKVSEVMGEMGSNMPDE------------------------

Query:  -EVDLSASRRSPCPL-RTCQRMNLGVRQAVKVSQLEGRHVGA-RGREQLKSIAEWPEKKRVTEADFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATE
         + D+ ASR S     +T    +   R + +   +E  H+   +  EQL+ IAEWP +    +   R +    L E+PEL +  R  L   L + M    
Subjt:  -EVDLSASRRSPCPL-RTCQRMNLGVRQAVKVSQLEGRHVGA-RGREQLKSIAEWPEKKRVTEADFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATE

Query:  SFFSIPAALKLEYCELLL
         F  +P   +  +C +LL
Subjt:  SFFSIPAALKLEYCELLL

E5GCB5 Retrotransposon protein1.2e-20054.04Show/hide
Query:  LDPDELVAILTAFTAAQTTILLLLDLYMNDHRRIEHQSPYIRHHIRQLAFFRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMF
        +D  EL +I+ AF A+Q  +LL+L+L  ND +RI H     RH IRQLA+FR+IH                         T+AGL++TEVVDVEEMVAMF
Subjt:  LDPDELVAILTAFTAAQTTILLLLDLYMNDHRRIEHQSPYIRHHIRQLAFFRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMF

Query:  LHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVC
        LHI+AHDVK+RV+  +F RSGET+SRHFN VL AV+RLH+ LLKKP+PV   CTD RW+WF+NCLGALDGTYIKVNV   DR RYRTRKGE+ATNVL VC
Subjt:  LHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVC

Query:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLK
        DTKGDF +VL GWEGSAADSR+LRDA+SRPN L VPKGYYYL D GYPNAEGFLAPYRG+RYHL +WRG  NAP+T KEFFNM+H S RNVIERAFG+LK
Subjt:  DTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLK

Query:  GRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIETSDEWSRWRDELAIQMFSN--------W---------
        GRWAILRGKSYYPV +Q  TI ACCLLHNLINREM + +I D +DEVDS   TT  ++I++IETS+EWS+WRD LA ++ ++        W         
Subjt:  GRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPDELDEVDSASITTDGENINFIETSDEWSRWRDELAIQMFSN--------W---------

Query:  -------ELRGWRSDNGTFKAGYLGQLENLMREKLPGHDVPAQSNIDSRVRTLKKQYQAIAEMLGPGCSGFGWNDEFKCIEAEKETFDLWGHPIAKGMRN
                  GWRSDNGTF+ GYL QL  +M  K+PG ++ A S IDSR++ +K+ + A+AEM GP CSGFGWNDE KCI AEKE FD W HP AKG+ N
Subjt:  -------ELRGWRSDNGTFKAGYLGQLENLMREKLPGHDVPAQSNIDSRVRTLKKQYQAIAEMLGPGCSGFGWNDEFKCIEAEKETFDLWGHPIAKGMRN

Query:  KPFPHFEDLAYVFGKDRATGKVSEVMGEMGSN------------MPDEEVDLSAS---RRSPCPLRTCQRMNLGVRQAVKVSQLEGRHVGAR--------
        K F H+++L+YVFGKDRATG  +E   ++GSN            MPD +     S     SP  L   +   +  R+ V       R   A         
Subjt:  KPFPHFEDLAYVFGKDRATGKVSEVMGEMGSN------------MPDEEVDLSAS---RRSPCPLRTCQRMNLGVRQAVKVSQLEGRHVGAR--------

Query:  ----GREQLKSIAEWPEKKRVTEADFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATESFFSIPAALKLEYCELLLHKH
            G EQL  IAEWP  +R      R +IV HL  +PEL    R +LM IL  N+   ++F  +P  +K  YC L+L ++
Subjt:  ----GREQLKSIAEWPEKKRVTEADFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATESFFSIPAALKLEYCELLLHKH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein1.1e-3635.47Show/hide
Query:  FRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPK---
        +R + +    C +  RM   CF  LC +L+T   L  T  + +EE VAMFL I  H+   R V  +F R+ ETV R F  VL A   L    ++ P    
Subjt:  FRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPK---

Query:  ----PVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKG-YYYL
            P         W +F   +GA+DGT++ V V    +  Y  R    + N++A+CD K  FT++  G  GS  D+ VL+ A    +   +P    YYL
Subjt:  ----PVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKG-YYYL

Query:  CDAGYPNAEGFLAPYRGE-----RYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGR
         D+GYPN +G LAPYR       RYH+S++   G  P    E FN  H+S R+VIER F + K +
Subjt:  CDAGYPNAEGFLAPYRGE-----RYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGR

AT5G12010.1 unknown protein3.6e-1926.49Show/hide
Query:  RESTRMDRRCFAILCTLLKTVAGLSNT---EVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKK--PKPVTASCTDP
        +++ RM +  F ++C  L +     +T     + V + VA+ +  +A     R+V  +F   G  +S     VL     + DVL+ K    P   S  + 
Subjt:  RESTRMDRRCFAILCTLLKTVAGLSNT---EVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKK--PKPVTASCTDP

Query:  RWKW-----FQNCLGALDGTYI-----KVNVVVVDRPRY--RTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAI--SRPNGLTVPKGYYYL
        R ++       N +G++  T+I     K++V      R+  R +K   +  + AV + KG FT +  GW GS  D +VL  ++   R N   + KG +  
Subjt:  RWKW-----FQNCLGALDGTYI-----KVNVVVVDRPRY--RTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAI--SRPNGLTVPKGYYYL

Query:  CDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPD
           G+P  +  L PY  +              T  +  FN + S  + V + AFG LKGRWA L+ ++   ++     + ACC+LHN+   EM + ++  
Subjt:  CDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGDREIPD

Query:  EL
        EL
Subjt:  EL

AT5G28730.1 unknown protein2.8e-1931.16Show/hide
Query:  IHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASC
        I+ +++ C+   RM    F  LC +L    GL ++  + ++E VA+FL I A +   R +  +F  + ET+ R F+ VL A+ RL    ++  K      
Subjt:  IHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASC

Query:  TDPR-------WKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAIS-RPNGLTVPKGYYYLCDA
           R       W +  + LG                          + NVLA+CD    FT+   G  GS  D+RVL  AIS  P     P   YYL D+
Subjt:  TDPR-------WKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAIS-RPNGLTVPKGYYYLCDA

Query:  GYPNAEGFLAPYRGE
        GY N  G+LAPYR E
Subjt:  GYPNAEGFLAPYRGE

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)6.6e-2936.84Show/hide
Query:  FTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAI
        F +VL GWEGSA DSRVL DA+ +          +YL D G+ N   FLAP+RG RYHL ++ G    P TP E FN+ H S RNVIER FG+ K R+AI
Subjt:  FTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAI

Query:  LRGKSYYPVRIQGWTIAACCLLHNLINRE--MGDREIPDEL-DEVDSASITTDGENINFIETS----------DEWSRWRDELAIQMFSN
         +    +  + Q   +  C  LHN + +E    + + PDE+ +E D  +   +  N N I+            +  + WR  +A  M+ +
Subjt:  LRGKSYYPVRIQGWTIAACCLLHNLINRE--MGDREIPDEL-DEVDSASITTDGENINFIETS----------DEWSRWRDELAIQMFSN

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)4.0e-5033.91Show/hide
Query:  FRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRL-HDVLLKKPKPV
        +++++  + +C E+ RMD+  F  LC LL+T   L +T  + +E  +A+FL I+ H+++ R V   F  SGET+SRHFN VL+AV+ +  D         
Subjt:  FRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVAHDVKNRVVCTQFARSGETVSRHFNTVLHAVLRL-HDVLLKKPKPV

Query:  TASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPN
        T    DP   +F++C+G +D  +I V V V ++  +R   G +  NVLA       F +VL GWEGSA+D +VL  A++R N L VP+G YY+ D  YPN
Subjt:  TASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEGSAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPN

Query:  AEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGD----REIPDE--
          GF+APY G          + N+    KE FN  H      I R FG LK R+ IL     YP++ Q   + A C LHN +  E  D    R   +E  
Subjt:  AEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAILRGKSYYPVRIQGWTIAACCLLHNLINREMGD----REIPDE--

Query:  LDEVDSASITTDGENINFI--------ETSDEWSRWRDELAIQMFSNW
         +  +   +  + E +  +        E  ++  R RDE+A ++++++
Subjt:  LDEVDSASITTDGENINFI--------ETSDEWSRWRDELAIQMFSNW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCTGATGAACTAGATCCCGATGAACTAGTGGCCATATTGACAGCCTTCACCGCAGCTCAAACTACAATTCTCCTGCTATTAGATCTATATATGAACGACCATAG
GAGAATAGAACACCAATCACCCTACATCCGACACCACATTAGGCAACTAGCATTCTTCAGGCTTATACACGAGTCTGACTTACGTTGTCGGGAGAGTACAAGGATGGACA
GAAGATGTTTTGCCATACTTTGCACCTTATTAAAAACAGTTGCCGGTTTATCCAATACGGAGGTCGTAGATGTAGAAGAGATGGTTGCCATGTTCTTGCACATTGTAGCC
CACGATGTTAAGAACCGAGTAGTTTGTACGCAGTTCGCTAGGTCTGGTGAGACAGTTTCTAGGCATTTCAACACCGTCCTTCATGCGGTGTTACGATTGCATGATGTTCT
CTTAAAAAAACCTAAGCCAGTCACAGCCTCTTGTACGGATCCAAGGTGGAAATGGTTTCAGAATTGCCTCGGTGCGTTAGATGGAACATACATCAAGGTGAATGTTGTTG
TTGTTGATCGCCCGAGGTATAGAACAAGGAAAGGTGAAATTGCAACGAACGTGCTTGCTGTCTGTGATACAAAAGGAGACTTCACATTCGTCTTACCAGGGTGGGAAGGG
TCTGCCGCTGATTCCCGGGTTCTTAGAGATGCAATATCAAGACCAAACGGACTCACAGTTCCGAAGGGCTATTATTATCTGTGCGATGCTGGGTACCCTAACGCAGAGGG
TTTCCTGGCACCGTATAGAGGGGAACGTTACCACCTCTCTAAATGGCGTGGTGCAGGGAATGCACCAACTACTCCAAAAGAATTCTTTAACATGGAGCATTCATCTACGA
GGAACGTGATTGAGAGGGCATTTGGTTTGTTGAAAGGAAGGTGGGCTATCCTCCGAGGGAAATCGTACTATCCAGTTCGAATTCAAGGGTGGACCATCGCAGCATGCTGC
TTACTTCACAATCTTATTAATAGAGAGATGGGTGACCGTGAAATTCCTGATGAGCTGGATGAGGTGGATTCTGCTTCTATTACAACTGATGGTGAGAATATCAATTTCAT
TGAGACTTCCGACGAATGGAGCCGGTGGAGGGATGAGTTGGCAATACAGATGTTTTCGAATTGGGAGTTACGCGGTTGGAGGTCCGATAATGGGACATTCAAAGCTGGGT
ATTTGGGGCAGTTGGAGAATTTGATGAGGGAGAAACTGCCTGGACATGACGTTCCAGCACAGAGCAACATCGACTCTAGGGTTCGCACCTTAAAGAAACAATACCAAGCA
ATTGCGGAGATGTTGGGCCCTGGATGTAGCGGCTTTGGCTGGAATGATGAATTTAAATGCATCGAGGCAGAGAAGGAAACATTTGACTTGTGGGGGCATCCTATCGCAAA
AGGCATGCGTAACAAGCCTTTCCCGCATTTTGAAGACTTGGCATATGTCTTTGGGAAGGATCGAGCCACGGGGAAGGTGTCAGAGGTGATGGGCGAGATGGGATCGAACA
TGCCAGATGAGGAGGTAGACCTGAGTGCATCCCGCCGTTCACCATGCCCACTGCGAACATGTCAGAGGATGAACCTCGGGGTACGCCAGGCAGTCAAAGTTTCCCAACTG
GAAGGTCGTCACGTGGGAGCAAGAGGAAGAGAGCAACTGAAGTCAATTGCAGAGTGGCCTGAAAAAAAACGTGTCACAGAGGCTGACTTCCGAGATAAAATTGTCACCCA
CTTGATGGAGGTACCGGAATTGGATAACCAAAAGAGGGTCAAGCTCATGGATATCCTTTTCAATAATATGAAAGCGACAGAGAGCTTCTTCTCCATTCCGGCTGCTCTAA
AGTTGGAGTACTGTGAACTCCTCCTGCACAAACACGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCCTGATGAACTAGATCCCGATGAACTAGTGGCCATATTGACAGCCTTCACCGCAGCTCAAACTACAATTCTCCTGCTATTAGATCTATATATGAACGACCATAG
GAGAATAGAACACCAATCACCCTACATCCGACACCACATTAGGCAACTAGCATTCTTCAGGCTTATACACGAGTCTGACTTACGTTGTCGGGAGAGTACAAGGATGGACA
GAAGATGTTTTGCCATACTTTGCACCTTATTAAAAACAGTTGCCGGTTTATCCAATACGGAGGTCGTAGATGTAGAAGAGATGGTTGCCATGTTCTTGCACATTGTAGCC
CACGATGTTAAGAACCGAGTAGTTTGTACGCAGTTCGCTAGGTCTGGTGAGACAGTTTCTAGGCATTTCAACACCGTCCTTCATGCGGTGTTACGATTGCATGATGTTCT
CTTAAAAAAACCTAAGCCAGTCACAGCCTCTTGTACGGATCCAAGGTGGAAATGGTTTCAGAATTGCCTCGGTGCGTTAGATGGAACATACATCAAGGTGAATGTTGTTG
TTGTTGATCGCCCGAGGTATAGAACAAGGAAAGGTGAAATTGCAACGAACGTGCTTGCTGTCTGTGATACAAAAGGAGACTTCACATTCGTCTTACCAGGGTGGGAAGGG
TCTGCCGCTGATTCCCGGGTTCTTAGAGATGCAATATCAAGACCAAACGGACTCACAGTTCCGAAGGGCTATTATTATCTGTGCGATGCTGGGTACCCTAACGCAGAGGG
TTTCCTGGCACCGTATAGAGGGGAACGTTACCACCTCTCTAAATGGCGTGGTGCAGGGAATGCACCAACTACTCCAAAAGAATTCTTTAACATGGAGCATTCATCTACGA
GGAACGTGATTGAGAGGGCATTTGGTTTGTTGAAAGGAAGGTGGGCTATCCTCCGAGGGAAATCGTACTATCCAGTTCGAATTCAAGGGTGGACCATCGCAGCATGCTGC
TTACTTCACAATCTTATTAATAGAGAGATGGGTGACCGTGAAATTCCTGATGAGCTGGATGAGGTGGATTCTGCTTCTATTACAACTGATGGTGAGAATATCAATTTCAT
TGAGACTTCCGACGAATGGAGCCGGTGGAGGGATGAGTTGGCAATACAGATGTTTTCGAATTGGGAGTTACGCGGTTGGAGGTCCGATAATGGGACATTCAAAGCTGGGT
ATTTGGGGCAGTTGGAGAATTTGATGAGGGAGAAACTGCCTGGACATGACGTTCCAGCACAGAGCAACATCGACTCTAGGGTTCGCACCTTAAAGAAACAATACCAAGCA
ATTGCGGAGATGTTGGGCCCTGGATGTAGCGGCTTTGGCTGGAATGATGAATTTAAATGCATCGAGGCAGAGAAGGAAACATTTGACTTGTGGGGGCATCCTATCGCAAA
AGGCATGCGTAACAAGCCTTTCCCGCATTTTGAAGACTTGGCATATGTCTTTGGGAAGGATCGAGCCACGGGGAAGGTGTCAGAGGTGATGGGCGAGATGGGATCGAACA
TGCCAGATGAGGAGGTAGACCTGAGTGCATCCCGCCGTTCACCATGCCCACTGCGAACATGTCAGAGGATGAACCTCGGGGTACGCCAGGCAGTCAAAGTTTCCCAACTG
GAAGGTCGTCACGTGGGAGCAAGAGGAAGAGAGCAACTGAAGTCAATTGCAGAGTGGCCTGAAAAAAAACGTGTCACAGAGGCTGACTTCCGAGATAAAATTGTCACCCA
CTTGATGGAGGTACCGGAATTGGATAACCAAAAGAGGGTCAAGCTCATGGATATCCTTTTCAATAATATGAAAGCGACAGAGAGCTTCTTCTCCATTCCGGCTGCTCTAA
AGTTGGAGTACTGTGAACTCCTCCTGCACAAACACGGCTGA
Protein sequenceShow/hide protein sequence
MDPDELDPDELVAILTAFTAAQTTILLLLDLYMNDHRRIEHQSPYIRHHIRQLAFFRLIHESDLRCRESTRMDRRCFAILCTLLKTVAGLSNTEVVDVEEMVAMFLHIVA
HDVKNRVVCTQFARSGETVSRHFNTVLHAVLRLHDVLLKKPKPVTASCTDPRWKWFQNCLGALDGTYIKVNVVVVDRPRYRTRKGEIATNVLAVCDTKGDFTFVLPGWEG
SAADSRVLRDAISRPNGLTVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSKWRGAGNAPTTPKEFFNMEHSSTRNVIERAFGLLKGRWAILRGKSYYPVRIQGWTIAACC
LLHNLINREMGDREIPDELDEVDSASITTDGENINFIETSDEWSRWRDELAIQMFSNWELRGWRSDNGTFKAGYLGQLENLMREKLPGHDVPAQSNIDSRVRTLKKQYQA
IAEMLGPGCSGFGWNDEFKCIEAEKETFDLWGHPIAKGMRNKPFPHFEDLAYVFGKDRATGKVSEVMGEMGSNMPDEEVDLSASRRSPCPLRTCQRMNLGVRQAVKVSQL
EGRHVGARGREQLKSIAEWPEKKRVTEADFRDKIVTHLMEVPELDNQKRVKLMDILFNNMKATESFFSIPAALKLEYCELLLHKHG