; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0021655 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0021655
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr06:12170151..12172639
RNA-Seq ExpressionPI0021655
SyntenyPI0021655
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR025558 - Domain of unknown function DUF4283
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048521.1 uncharacterized protein E6C27_scaffold61G001420 [Cucumis melo var. makuwa]1.1e-16149.54Show/hide
Query:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP
        MLLRKW PSIVPE+FVF SVPVWIKLGRIPMELWTE+G+ V+AS +GKPL+LDLATKER RLS+ARVC+E++  + +P E+T++L+GV+  V V YEWKP
Subjt:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP

Query:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGG-------DK--------------
        R+CN C +F HS+GKCP+   + V +EV VS ++     E     CGEVVLESFKQLEEGEI+ SP+R  S     G       DK              
Subjt:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGG-------DK--------------

Query:  -------------KEDFTPVTRK----------------KRVLVSVRDKGKGKSMQALQNSF--DSLSDLSEGEN------------------WALALRG
                        F  V+R+                 ++ V  +      S + +   F   +L+DL  G                    W   +  
Subjt:  -------------KEDFTPVTRK----------------KRVLVSVRDKGKGKSMQALQNSF--DSLSDLSEGEN------------------WALALRG

Query:  LMAWPSQRVTVLPW-GISDHSSI---------------------LFYPDVE--------QQRRIISFRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLM
           W S  + +  +  I  HS                       L  P V+        + RR++SFRFFNHW+ED SF +VVS +W R  GVSPLVSL+
Subjt:  LMAWPSQRVTVLPW-GISDHSSI---------------------LFYPDVE--------QQRRIISFRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLM

Query:  RNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQEEASLRQKSRV-----------------------V
        RNL  LK  +RRHFGRHI+ LSEEVR AKEAMDRAQREV+R+P S   SR AG+AT+AFW+ +R EEASL QK R+                       +
Subjt:  RNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQEEASLRQKSRV-----------------------V

Query:  SYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNATAITLIPKRSGVD
         YREL PV++++VQFRW+EECC ALQ PI  EE+RRV+F+MDS KA GHDGFS  FFKG W+ V EDFCDV++HFFETCYLPLGVNAT ITLIPKR G +
Subjt:  SYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNATAITLIPKRSGVD

Query:  RMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC
         +E +R IS CNV+YKCISKILADRL VWLPSFISGNQSAF+ GRSI+DNILLC
Subjt:  RMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC

KAA0059841.1 reverse transcriptase [Cucumis melo var. makuwa]2.0e-13442.86Show/hide
Query:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP
        MLLRKW P IVPE+FVFNSV VWI+LG+IPMELWTE+G+AV+ASA+GKP+SLDL TKERRRLS+ARVCVE+EGG+++P ++T++L GV+ +V + YEWKP
Subjt:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP

Query:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGGDKKEDFTP----VTRKKRVLVSV
        R+CN C +FGHS  KC +   S+  +E  V  ++           CGEVVLESFKQLEE EI+ SP+R  S  +       + +P    V     VL  V
Subjt:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGGDKKEDFTP----VTRKKRVLVSV

Query:  RDKGKGKSMQALQNSFDSLSDLSEGENWA-----------------------------------------------------------------------
              K +  L NS D  S+   G+ W                                                                        
Subjt:  RDKGKGKSMQALQNSFDSLSDLSEGENWA-----------------------------------------------------------------------

Query:  ----------------------LALRG-------------------------------------LMAWPSQRVTVLPWGISDHSSILFYPDVEQQRRIIS
                              LA+R                                      L AWP+  V VLPWGISDHS ILFYP  +   +++S
Subjt:  ----------------------LALRG-------------------------------------LMAWPSQRVTVLPWGISDHSSILFYPDVEQQRRIIS

Query:  FRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLMRNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQE
        FRFFNHWVED SF +VV+ +W R  GVSPLVSLMRNLH LKP LRR FGRHI+ LSEEV +AKEAMDRAQR+VER+  S   SR A +AT+ FW+A+R E
Subjt:  FRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLMRNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQE

Query:  EASLRQKSRV-----------------------------------------------------------VSYRELYPVLEEVVQFRWTEECCHALQAPIR
        EASLRQKSR+                                                           + YREL PV+++++QF+W+EECC ALQ PI 
Subjt:  EASLRQKSRV-----------------------------------------------------------VSYRELYPVLEEVVQFRWTEECCHALQAPIR

Query:  CEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNAT
         EE+RRV+F+MDS KA G DGFS G FKG W+ VGEDFCDVV+HFFETCYLPLGVNAT
Subjt:  CEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNAT

KAA0062888.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa]3.0e-15442.26Show/hide
Query:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP
        MLLRKW   IVPE+FVFNSVPVWI+LGRIPMELWTE+ +A++AS +GKP++LDLATKE  RLS+ARVCV++EG  ++  E+T+NLRGV+ +V V YEWKP
Subjt:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP

Query:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGGDKKEDFTPVTRKKRVLVSVRDKG
        ++CN C + GHS GKCP+   S++ +E  VS ++P  G       C +VVLESFKQLEEGEI+ SP+R  S       K+++FT VTRKK  LVSVRD+G
Subjt:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGGDKKEDFTPVTRKKRVLVSVRDKG

Query:  KGKSMQALQNSFDSLSDLSEGENWALAL------------------------------------------------------------------------
        K   + A+ NSF SL ++ + + WAL +                                                                        
Subjt:  KGKSMQALQNSFDSLSDLSEGENWALAL------------------------------------------------------------------------

Query:  RGLMAWPSQRVTV--------------------------------------LPW------------------------------------------GISD
        R  + W   R +                                       L W                                           ++ 
Subjt:  RGLMAWPSQRVTV--------------------------------------LPW------------------------------------------GISD

Query:  HSSILFYPDVE------------------QQRRIISFRFFNHW---------VEDSSFSDVVSSVWVRRFGVSPLVSLMRNLHDLKPVLRRHFGRHIRDL
          + L  P V+                    R +++  + + W         VED SF +VV+ +W R  GVSPLVSLMRNL +LKP LRR FGRHI+ L
Subjt:  HSSILFYPDVE------------------QQRRIISFRFFNHW---------VEDSSFSDVVSSVWVRRFGVSPLVSLMRNLHDLKPVLRRHFGRHIRDL

Query:  SEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQEEASLRQKSRV---------------------------------------------
        +EEV +AKE MDRAQREVE +P S   SR  G+AT+AFW+A+R EEASLRQKSR+                                             
Subjt:  SEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQEEASLRQKSRV---------------------------------------------

Query:  --------------VSYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGV
                      + YREL+PV++++VQFRW+EECC ALQ PI  EE+RRV+F+MDS KA G DGFS GFFKGAW+ V EDFCDVV+HFFETCYLP+GV
Subjt:  --------------VSYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGV

Query:  NATAITLIPKRSGVDRMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC
        NAT ITLIPKR G ++ME +R ISCCNV+YKCISKILADRLRVWLPSFI  NQSAF+ GRSI+DNILLC
Subjt:  NATAITLIPKRSGVDRMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC

TYK28312.1 uncharacterized protein E5676_scaffold600G001370 [Cucumis melo var. makuwa]8.8e-16249.54Show/hide
Query:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP
        MLLRKW PSIVPE+FVF SVPVWIKLGRIPMELWTE+G+ V+AS +GKPL+LDLATKER RLS+ARVC+E++  + +P E+T++L+GV+  V V YEWKP
Subjt:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP

Query:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGG-------DK--------------
        R+CN C +F HS+GKCP+   + V +EV VS ++     E     CGEVVLESFKQLEEGEI+ SP+R  S     G       DK              
Subjt:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGG-------DK--------------

Query:  -------------KEDFTPVTRK----------------KRVLVSVRDKGKGKSMQALQNSF--DSLSDLSEGEN------------------WALALRG
                        F  V+R+                 ++ V  +      S + +   F   +L+DL  G                    W   +  
Subjt:  -------------KEDFTPVTRK----------------KRVLVSVRDKGKGKSMQALQNSF--DSLSDLSEGEN------------------WALALRG

Query:  LMAWPSQRVTVLPW-GISDHSSI---------------------LFYPDVE--------QQRRIISFRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLM
           W S  + +  +  I  HS                       L  P V+        + RR++SFRFFNHW+ED SF +VVS +W R  GVSPLVSL+
Subjt:  LMAWPSQRVTVLPW-GISDHSSI---------------------LFYPDVE--------QQRRIISFRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLM

Query:  RNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQEEASLRQKSRV-----------------------V
        RNL  LK  +RRHFGRHI+ LSEEVR AKEAMDRAQREV+R+P S   SR AG+AT+AFW+ +R EEASL QK R+                       +
Subjt:  RNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQEEASLRQKSRV-----------------------V

Query:  SYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNATAITLIPKRSGVD
         YREL PV++++VQFRW+EECC ALQ PI  EE+RRV+F+MDS KA GHDGFS  FFKG W+ V EDFCDV++HFFETCYLPLGVNAT ITLIPKR G +
Subjt:  SYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNATAITLIPKRSGVD

Query:  RMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC
         +E +R IS CNV+YKCISKILADRL VWLPSFISGNQSAF+ GRSI+DNILLC
Subjt:  RMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC

XP_031745634.1 uncharacterized protein LOC116406053 [Cucumis sativus]1.6e-13944.93Show/hide
Query:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP
        MLLRKW P IVPE+FVF+SV V IKLGRIP+ELWT++G+AV+ASAIGKPLS+DLATKERRRLS+AR+CVE+   + +P EVT+NLRG E  V VTYEWKP
Subjt:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP

Query:  RRCNSCHSFGHSAGKCPQKETSQ------VQKEVDVSMVVPSTGVEQTIVACGE---VVLESFKQLEEGEIQGSPSRQVSSTNNGGDKKEDFTPVTRKKR
        ++CN C SFGHS   CP+K  ++        KEV V  VVP+   ++ +  CGE   VVLESF+ +EEGEI      +    N      E+FTPV RK R
Subjt:  RRCNSCHSFGHSAGKCPQKETSQ------VQKEVDVSMVVPSTGVEQTIVACGE---VVLESFKQLEEGEIQGSPSRQVSSTNNGGDKKEDFTPVTRKKR

Query:  VLVSVRDKGKGKSMQALQNSFDSLSDLSEGENWALAL-----------RGLMAWPSQRVTVLPWG----ISDHSSI-----------LFYPDVEQ-----
         ++S+ D+GK ++  ++ NSF++L ++ +G+ W L++              M   S    V+P G    I  H  I            +  ++E+     
Subjt:  VLVSVRDKGKGKSMQALQNSFDSLSDLSEGENWALAL-----------RGLMAWPSQRVTVLPWG----ISDHSSI-----------LFYPDVEQ-----

Query:  ---------------------------------------------QRRIISFRFFNHW--------------------VED----------------SSF
                                                     +RR +      +W                    V D                +SF
Subjt:  ---------------------------------------------QRRIISFRFFNHW--------------------VED----------------SSF

Query:  SDVVSSVWVRRFGVSPLVSLMRNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVERSRDAGVAT--DAFWSAIRQEEASLRQKSRV-
         DVVSS W +   VSP+V+++RNL +LK +LRRHFGRHIR +SE+VRLA + MDRA+RE+E +  S E S  A +AT  +A  S I  +   L    +V 
Subjt:  SDVVSSVWVRRFGVSPLVSLMRNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVERSRDAGVAT--DAFWSAIRQEEASLRQKSRV-

Query:  ---------------VSYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLG
                       +SY EL   +EE+VQFRWTEECC ALQ+PI   E+RRV+F+MD  KA G DG+S GFFKGAW  VGE FCDVV+HFFET Y P G
Subjt:  ---------------VSYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLG

Query:  VNATAITLIPKRSGVDRMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC
        VN TAITLIPKR+G DR+E++  ISCC+V+YKCIS+ILADRLRVWLPSF+SGNQ AF+ GRSI+DNILLC
Subjt:  VNATAITLIPKRSGVDRMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC

TrEMBL top hitse value%identityAlignment
A0A1S3CRZ6 uncharacterized protein LOC1035041009.2e-12539.64Show/hide
Query:  MELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKPRRCNSCHSFGHSAGKCPQKETSQVQKEVDV
        MELWTE+G+AV+ASA+GKP+SLDL TKERRRLS+ARVCVE+EGG+++P ++T++L GV+ +V + YEWKPR+CN C +FGHS  KC +   S+  +E  V
Subjt:  MELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKPRRCNSCHSFGHSAGKCPQKETSQVQKEVDV

Query:  SMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGGDKKEDFTPVTRKKRVLVSVRDKGKGKSMQALQNSFDSLSDLSEGENWALAL--
          ++           CGEVVLESFKQLEE EI+ SP+R  S    GG K ++FT VTRKK  LVSVRD+GK   +  + NSF SL ++ + + WAL++  
Subjt:  SMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGGDKKEDFTPVTRKKRVLVSVRDKGKGKSMQALQNSFDSLSDLSEGENWALAL--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------RGLMAWPSQRVTVLPWGISD
                                                                                          L AWP+  V VLPWGISD
Subjt:  --------------------------------------------------------------------------------RGLMAWPSQRVTVLPWGISD

Query:  HSSILFYPDVEQQRRIISFRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLMRNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVER
        HS ILFYP  +   +++SFRFFNHWVED SF +VV+ +W R  GVSPLVSLMRNLH LKP LRR FGRHI+ LSEEV +AKEAMDRAQR+VER+  S   
Subjt:  HSSILFYPDVEQQRRIISFRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLMRNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVER

Query:  SRDAGVATDAFWSAIRQEEASLRQKSRV-----------------------------------------------------------VSYRELYPVLEEV
        SR A +AT+ FW+A+R EEASLRQKSR+                                                           + YREL PV++++
Subjt:  SRDAGVATDAFWSAIRQEEASLRQKSRV-----------------------------------------------------------VSYRELYPVLEEV

Query:  VQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNAT
        +QF+W+EECC ALQ PI  EE+RRV+F+MDS KA G DGFS G FKG W+ VGEDFCDVV+HFFETCYLPLGVNAT
Subjt:  VQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNAT

A0A5A7U4M4 Reverse transcriptase domain-containing protein5.5e-16249.54Show/hide
Query:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP
        MLLRKW PSIVPE+FVF SVPVWIKLGRIPMELWTE+G+ V+AS +GKPL+LDLATKER RLS+ARVC+E++  + +P E+T++L+GV+  V V YEWKP
Subjt:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP

Query:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGG-------DK--------------
        R+CN C +F HS+GKCP+   + V +EV VS ++     E     CGEVVLESFKQLEEGEI+ SP+R  S     G       DK              
Subjt:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGG-------DK--------------

Query:  -------------KEDFTPVTRK----------------KRVLVSVRDKGKGKSMQALQNSF--DSLSDLSEGEN------------------WALALRG
                        F  V+R+                 ++ V  +      S + +   F   +L+DL  G                    W   +  
Subjt:  -------------KEDFTPVTRK----------------KRVLVSVRDKGKGKSMQALQNSF--DSLSDLSEGEN------------------WALALRG

Query:  LMAWPSQRVTVLPW-GISDHSSI---------------------LFYPDVE--------QQRRIISFRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLM
           W S  + +  +  I  HS                       L  P V+        + RR++SFRFFNHW+ED SF +VVS +W R  GVSPLVSL+
Subjt:  LMAWPSQRVTVLPW-GISDHSSI---------------------LFYPDVE--------QQRRIISFRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLM

Query:  RNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQEEASLRQKSRV-----------------------V
        RNL  LK  +RRHFGRHI+ LSEEVR AKEAMDRAQREV+R+P S   SR AG+AT+AFW+ +R EEASL QK R+                       +
Subjt:  RNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQEEASLRQKSRV-----------------------V

Query:  SYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNATAITLIPKRSGVD
         YREL PV++++VQFRW+EECC ALQ PI  EE+RRV+F+MDS KA GHDGFS  FFKG W+ V EDFCDV++HFFETCYLPLGVNAT ITLIPKR G +
Subjt:  SYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNATAITLIPKRSGVD

Query:  RMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC
         +E +R IS CNV+YKCISKILADRL VWLPSFISGNQSAF+ GRSI+DNILLC
Subjt:  RMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC

A0A5A7V275 Reverse transcriptase9.8e-13542.86Show/hide
Query:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP
        MLLRKW P IVPE+FVFNSV VWI+LG+IPMELWTE+G+AV+ASA+GKP+SLDL TKERRRLS+ARVCVE+EGG+++P ++T++L GV+ +V + YEWKP
Subjt:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP

Query:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGGDKKEDFTP----VTRKKRVLVSV
        R+CN C +FGHS  KC +   S+  +E  V  ++           CGEVVLESFKQLEE EI+ SP+R  S  +       + +P    V     VL  V
Subjt:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGGDKKEDFTP----VTRKKRVLVSV

Query:  RDKGKGKSMQALQNSFDSLSDLSEGENWA-----------------------------------------------------------------------
              K +  L NS D  S+   G+ W                                                                        
Subjt:  RDKGKGKSMQALQNSFDSLSDLSEGENWA-----------------------------------------------------------------------

Query:  ----------------------LALRG-------------------------------------LMAWPSQRVTVLPWGISDHSSILFYPDVEQQRRIIS
                              LA+R                                      L AWP+  V VLPWGISDHS ILFYP  +   +++S
Subjt:  ----------------------LALRG-------------------------------------LMAWPSQRVTVLPWGISDHSSILFYPDVEQQRRIIS

Query:  FRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLMRNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQE
        FRFFNHWVED SF +VV+ +W R  GVSPLVSLMRNLH LKP LRR FGRHI+ LSEEV +AKEAMDRAQR+VER+  S   SR A +AT+ FW+A+R E
Subjt:  FRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLMRNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQE

Query:  EASLRQKSRV-----------------------------------------------------------VSYRELYPVLEEVVQFRWTEECCHALQAPIR
        EASLRQKSR+                                                           + YREL PV+++++QF+W+EECC ALQ PI 
Subjt:  EASLRQKSRV-----------------------------------------------------------VSYRELYPVLEEVVQFRWTEECCHALQAPIR

Query:  CEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNAT
         EE+RRV+F+MDS KA G DGFS G FKG W+ VGEDFCDVV+HFFETCYLPLGVNAT
Subjt:  CEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNAT

A0A5A7V5J2 Non-LTR retroelement reverse transcriptase-like protein1.5e-15442.26Show/hide
Query:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP
        MLLRKW   IVPE+FVFNSVPVWI+LGRIPMELWTE+ +A++AS +GKP++LDLATKE  RLS+ARVCV++EG  ++  E+T+NLRGV+ +V V YEWKP
Subjt:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP

Query:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGGDKKEDFTPVTRKKRVLVSVRDKG
        ++CN C + GHS GKCP+   S++ +E  VS ++P  G       C +VVLESFKQLEEGEI+ SP+R  S       K+++FT VTRKK  LVSVRD+G
Subjt:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGGDKKEDFTPVTRKKRVLVSVRDKG

Query:  KGKSMQALQNSFDSLSDLSEGENWALAL------------------------------------------------------------------------
        K   + A+ NSF SL ++ + + WAL +                                                                        
Subjt:  KGKSMQALQNSFDSLSDLSEGENWALAL------------------------------------------------------------------------

Query:  RGLMAWPSQRVTV--------------------------------------LPW------------------------------------------GISD
        R  + W   R +                                       L W                                           ++ 
Subjt:  RGLMAWPSQRVTV--------------------------------------LPW------------------------------------------GISD

Query:  HSSILFYPDVE------------------QQRRIISFRFFNHW---------VEDSSFSDVVSSVWVRRFGVSPLVSLMRNLHDLKPVLRRHFGRHIRDL
          + L  P V+                    R +++  + + W         VED SF +VV+ +W R  GVSPLVSLMRNL +LKP LRR FGRHI+ L
Subjt:  HSSILFYPDVE------------------QQRRIISFRFFNHW---------VEDSSFSDVVSSVWVRRFGVSPLVSLMRNLHDLKPVLRRHFGRHIRDL

Query:  SEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQEEASLRQKSRV---------------------------------------------
        +EEV +AKE MDRAQREVE +P S   SR  G+AT+AFW+A+R EEASLRQKSR+                                             
Subjt:  SEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQEEASLRQKSRV---------------------------------------------

Query:  --------------VSYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGV
                      + YREL+PV++++VQFRW+EECC ALQ PI  EE+RRV+F+MDS KA G DGFS GFFKGAW+ V EDFCDVV+HFFETCYLP+GV
Subjt:  --------------VSYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGV

Query:  NATAITLIPKRSGVDRMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC
        NAT ITLIPKR G ++ME +R ISCCNV+YKCISKILADRLRVWLPSFI  NQSAF+ GRSI+DNILLC
Subjt:  NATAITLIPKRSGVDRMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC

A0A5D3DXQ8 Reverse transcriptase domain-containing protein4.2e-16249.54Show/hide
Query:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP
        MLLRKW PSIVPE+FVF SVPVWIKLGRIPMELWTE+G+ V+AS +GKPL+LDLATKER RLS+ARVC+E++  + +P E+T++L+GV+  V V YEWKP
Subjt:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP

Query:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGG-------DK--------------
        R+CN C +F HS+GKCP+   + V +EV VS ++     E     CGEVVLESFKQLEEGEI+ SP+R  S     G       DK              
Subjt:  RRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGG-------DK--------------

Query:  -------------KEDFTPVTRK----------------KRVLVSVRDKGKGKSMQALQNSF--DSLSDLSEGEN------------------WALALRG
                        F  V+R+                 ++ V  +      S + +   F   +L+DL  G                    W   +  
Subjt:  -------------KEDFTPVTRK----------------KRVLVSVRDKGKGKSMQALQNSF--DSLSDLSEGEN------------------WALALRG

Query:  LMAWPSQRVTVLPW-GISDHSSI---------------------LFYPDVE--------QQRRIISFRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLM
           W S  + +  +  I  HS                       L  P V+        + RR++SFRFFNHW+ED SF +VVS +W R  GVSPLVSL+
Subjt:  LMAWPSQRVTVLPW-GISDHSSI---------------------LFYPDVE--------QQRRIISFRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLM

Query:  RNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQEEASLRQKSRV-----------------------V
        RNL  LK  +RRHFGRHI+ LSEEVR AKEAMDRAQREV+R+P S   SR AG+AT+AFW+ +R EEASL QK R+                       +
Subjt:  RNLHDLKPVLRRHFGRHIRDLSEEVRLAKEAMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQEEASLRQKSRV-----------------------V

Query:  SYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNATAITLIPKRSGVD
         YREL PV++++VQFRW+EECC ALQ PI  EE+RRV+F+MDS KA GHDGFS  FFKG W+ V EDFCDV++HFFETCYLPLGVNAT ITLIPKR G +
Subjt:  SYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNATAITLIPKRSGVD

Query:  RMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC
         +E +R IS CNV+YKCISKILADRL VWLPSFISGNQSAF+ GRSI+DNILLC
Subjt:  RMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC

SwissProt top hitse value%identityAlignment
P14381 Transposon TX1 uncharacterized 149 kDa protein3.4e-1528.57Show/hide
Query:  QEEASLRQKSRVVSYRELY---PVLEEVVQFRW------TEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFET
        ++  ++R ++R   Y+ L+   P+  +  +  W      +E     L+ PI  +E+ + +  M  +K+ G DG +  FF+  W+T+G DF  V+   F+ 
Subjt:  QEEASLRQKSRVVSYRELY---PVLEEVVQFRW------TEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFET

Query:  CYLPLGVNATAITLIPKRSGVDRMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILL
          LPL      ++L+PK+  +  ++N+R +S  +  YK ++K ++ RL+  L   I  +QS  V GR+I DN+ L
Subjt:  CYLPLGVNATAITLIPKRSGVDRMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.4e-1641.44Show/hide
Query:  LEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNATAITLIPKRSGVDRMENYRSI
        ++++  FR  +     L A    +EI   VFAM  +KA G D F+  FF  +W  V +     V  FF T +L    NATAITLIPK +GVD++  +R +
Subjt:  LEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNTVGEDFCDVVMHFFETCYLPLGVNATAITLIPKRSGVDRMENYRSI

Query:  SCCNVVYKCIS
        SCC VVYK I+
Subjt:  SCCNVVYKCIS

AT2G01050.1 zinc ion binding;nucleic acid binding4.4e-1030.65Show/hide
Query:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP
        +L++ W     P      + PVW++L  IP   +    +  IA  +G+PL +D+ T    +  FARVC+EV     L   V IN         V YE   
Subjt:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKP

Query:  RRCNSCHSFGHSAGKCPQKETSQV
        + C+SC  +GH    CP+    +V
Subjt:  RRCNSCHSFGHSAGKCPQKETSQV

AT2G07760.1 Zinc knuckle (CCHC-type) family protein2.3e-1130.33Show/hide
Query:  NSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEV-TINLRGVECSVPVTYEWKPRRCNSCHSFGHSAGKC
        +++PVW+ L  IP  L++  GI+ IAS +G P++      +   +S A + VEVE     P  +  ++ +G    V V Y W P +C  C   GH A +C
Subjt:  NSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEV-TINLRGVECSVPVTYEWKPRRCNSCHSFGHSAGKC

Query:  --PQKETSQVQKEVDVSMVVPS
          P     +V + V   ++ P+
Subjt:  --PQKETSQVQKEVDVSMVVPS

AT5G28823.1 FUNCTIONS IN: molecular_function unknown2.2e-0625.16Show/hide
Query:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLR-GVECSVPVTYEWK
        M +  W P    E      +P W+ L  IP +L++  GI  IAS IG+ +       +  ++  A++ VEV+     P  V +    G    V V Y W 
Subjt:  MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLR-GVECSVPVTYEWK

Query:  PRRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESF
        P +C             P  E S V           +  ++  IV+ G  +L+ F
Subjt:  PRRCNSCHSFGHSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESF

AT5G32613.1 Zinc knuckle (CCHC-type) family protein3.1e-0828.12Show/hide
Query:  WIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTI-NLRGVECSVPVTYEWKPRRCNSCHSFGHSAGKCP----
        W  L  +P +L++  GI+VIAS IG+PL  + +      +   +V V    G  LP  + + +++G    V VTY   P +C +C  +GH   +C     
Subjt:  WIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTI-NLRGVECSVPVTYEWKPRRCNSCHSFGHSAGKCP----

Query:  -----QKETSQVQKEVDVSMVVPSTGVE
             +K+     KEV + ++   T  E
Subjt:  -----QKETSQVQKEVDVSMVVPSTGVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCTCCGCAAATGGGTTCCAAGTATTGTCCCTGAAACCTTTGTTTTTAATTCTGTTCCTGTGTGGATCAAACTGGGTCGTATTCCCATGGAGTTGTGGACTGAGTC
AGGTATTGCAGTCATTGCTAGTGCTATTGGTAAACCTCTTTCTTTAGATTTGGCCACTAAGGAGAGACGTAGACTGTCGTTTGCTAGGGTGTGTGTTGAAGTAGAAGGGG
GTGCTGATTTGCCTACTGAGGTCACAATTAATTTGAGGGGTGTGGAATGCAGTGTTCCGGTTACTTATGAGTGGAAACCCCGTAGGTGTAATTCATGTCATTCGTTTGGT
CATTCTGCTGGTAAGTGCCCTCAGAAGGAGACATCTCAGGTGCAAAAGGAGGTGGATGTGAGCATGGTTGTTCCTAGTACGGGTGTTGAGCAGACTATTGTGGCCTGTGG
GGAGGTGGTGTTAGAATCCTTTAAGCAGTTAGAGGAAGGTGAGATTCAGGGTTCTCCGAGTAGACAAGTGTCTTCGACCAATAATGGTGGGGATAAGAAAGAGGATTTTA
CTCCGGTGACTCGTAAAAAGAGAGTTTTGGTTTCAGTGAGAGACAAGGGGAAAGGGAAGAGTATGCAGGCTTTGCAGAACTCTTTCGATAGTCTTTCTGATTTGAGTGAG
GGGGAAAATTGGGCGTTGGCCTTACGGGGGTTAATGGCTTGGCCTAGTCAGCGTGTTACTGTTTTGCCTTGGGGAATTTCTGACCATTCCTCTATTTTATTCTATCCTGA
TGTTGAGCAGCAGAGGCGTATTATTTCGTTTCGTTTCTTTAATCATTGGGTGGAGGATTCGTCTTTCAGTGATGTGGTGTCTTCTGTTTGGGTGAGAAGGTTTGGTGTGT
CTCCGTTAGTGAGTCTTATGCGAAATTTGCATGATCTGAAACCTGTGCTTCGTAGACATTTTGGTAGACATATCAGGGACCTTAGTGAGGAGGTGCGCTTGGCTAAAGAG
GCTATGGATAGGGCCCAGCGAGAGGTTGAGCGGGATCCTGGGTCTGTGGAGAGGAGTCGTGATGCTGGTGTTGCGACTGATGCCTTTTGGTCTGCTATCCGTCAGGAAGA
AGCCTCTCTCCGTCAGAAATCACGGGTGGTTAGTTATAGAGAGCTCTATCCTGTATTGGAGGAGGTGGTTCAGTTTAGGTGGACGGAGGAGTGTTGTCATGCGTTACAGG
CTCCGATTAGGTGTGAGGAGATCAGGAGGGTGGTTTTTGCTATGGATAGCAGTAAGGCTCTTGGTCATGATGGTTTTTCGACGGGTTTCTTCAAAGGTGCTTGGAACACG
GTTGGTGAGGATTTTTGTGATGTTGTGATGCATTTCTTTGAGACGTGTTATCTGCCTCTAGGGGTTAATGCTACTGCAATCACCCTCATTCCCAAACGTAGTGGGGTTGA
TCGTATGGAGAATTATAGGTCTATTTCGTGTTGTAATGTGGTTTACAAGTGCATTTCTAAGATTTTGGCGGATAGGCTTCGTGTGTGGCTTCCTTCTTTTATCAGTGGTA
ACCAGTCTGCCTTTGTTGTTGGGAGGAGTATTGTTGATAACATTCTGCTTTGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCCTCCGCAAATGGGTTCCAAGTATTGTCCCTGAAACCTTTGTTTTTAATTCTGTTCCTGTGTGGATCAAACTGGGTCGTATTCCCATGGAGTTGTGGACTGAGTC
AGGTATTGCAGTCATTGCTAGTGCTATTGGTAAACCTCTTTCTTTAGATTTGGCCACTAAGGAGAGACGTAGACTGTCGTTTGCTAGGGTGTGTGTTGAAGTAGAAGGGG
GTGCTGATTTGCCTACTGAGGTCACAATTAATTTGAGGGGTGTGGAATGCAGTGTTCCGGTTACTTATGAGTGGAAACCCCGTAGGTGTAATTCATGTCATTCGTTTGGT
CATTCTGCTGGTAAGTGCCCTCAGAAGGAGACATCTCAGGTGCAAAAGGAGGTGGATGTGAGCATGGTTGTTCCTAGTACGGGTGTTGAGCAGACTATTGTGGCCTGTGG
GGAGGTGGTGTTAGAATCCTTTAAGCAGTTAGAGGAAGGTGAGATTCAGGGTTCTCCGAGTAGACAAGTGTCTTCGACCAATAATGGTGGGGATAAGAAAGAGGATTTTA
CTCCGGTGACTCGTAAAAAGAGAGTTTTGGTTTCAGTGAGAGACAAGGGGAAAGGGAAGAGTATGCAGGCTTTGCAGAACTCTTTCGATAGTCTTTCTGATTTGAGTGAG
GGGGAAAATTGGGCGTTGGCCTTACGGGGGTTAATGGCTTGGCCTAGTCAGCGTGTTACTGTTTTGCCTTGGGGAATTTCTGACCATTCCTCTATTTTATTCTATCCTGA
TGTTGAGCAGCAGAGGCGTATTATTTCGTTTCGTTTCTTTAATCATTGGGTGGAGGATTCGTCTTTCAGTGATGTGGTGTCTTCTGTTTGGGTGAGAAGGTTTGGTGTGT
CTCCGTTAGTGAGTCTTATGCGAAATTTGCATGATCTGAAACCTGTGCTTCGTAGACATTTTGGTAGACATATCAGGGACCTTAGTGAGGAGGTGCGCTTGGCTAAAGAG
GCTATGGATAGGGCCCAGCGAGAGGTTGAGCGGGATCCTGGGTCTGTGGAGAGGAGTCGTGATGCTGGTGTTGCGACTGATGCCTTTTGGTCTGCTATCCGTCAGGAAGA
AGCCTCTCTCCGTCAGAAATCACGGGTGGTTAGTTATAGAGAGCTCTATCCTGTATTGGAGGAGGTGGTTCAGTTTAGGTGGACGGAGGAGTGTTGTCATGCGTTACAGG
CTCCGATTAGGTGTGAGGAGATCAGGAGGGTGGTTTTTGCTATGGATAGCAGTAAGGCTCTTGGTCATGATGGTTTTTCGACGGGTTTCTTCAAAGGTGCTTGGAACACG
GTTGGTGAGGATTTTTGTGATGTTGTGATGCATTTCTTTGAGACGTGTTATCTGCCTCTAGGGGTTAATGCTACTGCAATCACCCTCATTCCCAAACGTAGTGGGGTTGA
TCGTATGGAGAATTATAGGTCTATTTCGTGTTGTAATGTGGTTTACAAGTGCATTTCTAAGATTTTGGCGGATAGGCTTCGTGTGTGGCTTCCTTCTTTTATCAGTGGTA
ACCAGTCTGCCTTTGTTGTTGGGAGGAGTATTGTTGATAACATTCTGCTTTGTTAG
Protein sequenceShow/hide protein sequence
MLLRKWVPSIVPETFVFNSVPVWIKLGRIPMELWTESGIAVIASAIGKPLSLDLATKERRRLSFARVCVEVEGGADLPTEVTINLRGVECSVPVTYEWKPRRCNSCHSFG
HSAGKCPQKETSQVQKEVDVSMVVPSTGVEQTIVACGEVVLESFKQLEEGEIQGSPSRQVSSTNNGGDKKEDFTPVTRKKRVLVSVRDKGKGKSMQALQNSFDSLSDLSE
GENWALALRGLMAWPSQRVTVLPWGISDHSSILFYPDVEQQRRIISFRFFNHWVEDSSFSDVVSSVWVRRFGVSPLVSLMRNLHDLKPVLRRHFGRHIRDLSEEVRLAKE
AMDRAQREVERDPGSVERSRDAGVATDAFWSAIRQEEASLRQKSRVVSYRELYPVLEEVVQFRWTEECCHALQAPIRCEEIRRVVFAMDSSKALGHDGFSTGFFKGAWNT
VGEDFCDVVMHFFETCYLPLGVNATAITLIPKRSGVDRMENYRSISCCNVVYKCISKILADRLRVWLPSFISGNQSAFVVGRSIVDNILLC