; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041493 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041493
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposable element protein
Genome locationchr13:18919252..18927792
RNA-Seq ExpressionLag0041493
SyntenyLag0041493
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7034152.1 unnamed protein product [Microthlaspi erraticum]3.9e-12239.06Show/hide
Query:  RGNLGPRDEMPLTYILEVELFDVWGIDFMGPF-PPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKL
        +G +   DEMPL  ILEVE+FDVWGIDFMGPF P SNG+ YIL+A DYVSKW+EAI C   DAK V +  ++ IF RFG PR ++SD G+HF+N +   L
Subjt:  RGNLGPRDEMPLTYILEVELFDVWGIDFMGPF-PPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKL

Query:  FAKHEIKHRIATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRVFKVNGQRVKHYWG
           H +KH++ATPYHPQ +GQ E+SNR+IK+ILEK V  +RKDW+ RLDEALWAYRTAYKTP+G                                    
Subjt:  FAKHEIKHRIATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRVFKVNGQRVKHYWG

Query:  EKFRSKYPSLKKTAAKNSSSLLKNDHEHHHHDCTGILLDVYVEWLVGTQISK-DFSLKRHSVFTEREQNEREGEREGERRETESDFPKRKFLPLPIDQEG
           RS +  L                  +  DC    L V VE+     +   +F +K                             +R+ + L   +E 
Subjt:  EKFRSKYPSLKKTAAKNSSSLLKNDHEHHHHDCTGILLDVYVEWLVGTQISK-DFSLKRHSVFTEREQNEREGEREGERRETESDFPKRKFLPLPIDQEG

Query:  NCTPYFCNKVFNVK-KDAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDFFQMSSFLLFRCVSGAEAKEILEQCHSSPYGGH
            Y  ++++ ++ K A  +LIR           +KD K  ++V             LL  S +S        +  RC++  + + +LE CH S YGGH
Subjt:  NCTPYFCNKVFNVK-KDAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDFFQMSSFLLFRCVSGAEAKEILEQCHSSPYGGH

Query:  FSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW---------------------VDYVSKWVEAIACHQSDAK
        F+  +TA ++L  G +WPTLFKDA  +  +CD CQR+G +  RDEMPL  ILEVE+FDVW                     VDYVSKW+EAI C   DAK
Subjt:  FSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW---------------------VDYVSKWVEAIACHQSDAK

Query:  TVARFLQSHIFARFGTPRALVSDE---------------------------------------------EKVVHPSRKDWSFRLDEALWAYRTAYKTPLG
         V +  ++ IF RFG PR ++SD                                              EK V  +RKDW+ RLDEALWAYRTAYKTP+G
Subjt:  TVARFLQSHIFARFGTPRALVSDE---------------------------------------------EKVVHPSRKDWSFRLDEALWAYRTAYKTPLG

Query:  MSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKG
         SP+ L+YGK CHLP+E+E+K  WA+K LNFD+  A   R++ L ELEE R  +YEN+++YK +TK  HDK I+ K+F  G
Subjt:  MSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKG

GEY48025.1 reverse transcriptase domain-containing protein [Tanacetum cinerariifolium]1.7e-11734.99Show/hide
Query:  MPLTYILEVELFDVWGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFAKHEIKHRI
        MP   I   E+FDVWGIDFMGPFP   GN YIL+A DY+SKWVEA A   NDA+ V + L+S +FAR+G+PRA++SD GTHF N+  TK+  K+ + HR+
Subjt:  MPLTYILEVELFDVWGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFAKHEIKHRI

Query:  ATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRV-----FKVNGQRVKH--------
        +T YHPQ +GQ E+SNR +K ILE+ +  +R  WS +LD+ALW +RT YKTP+G        P+  +  +D  D  V     F ++  R  H        
Subjt:  ATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRV-----FKVNGQRVKH--------

Query:  ------YW------------------GEK--FRSKYPSLKKTAAKNSSSLLK-----NDHEHHHHDCTGI-----LLDVYVEWLVGTQISKDFSLKRHSV
              Y+                   EK  F +    L   A  ++ ++L+          H     GI      +DV  +    T +   +S   H+ 
Subjt:  ------YW------------------GEK--FRSKYPSLKKTAAKNSSSLLK-----NDHEHHHHDCTGI-----LLDVYVEWLVGTQISKDFSLKRHSV

Query:  FTEREQNEREGEREGERRETESDFP--------------KRKFLPLPI---------------------------DQEGNCTP-YFCNKVFNVKK-----
        F  R   +            E + P              K+K    PI                             E +  P ++ +K  N  K     
Subjt:  FTEREQNEREGEREGERRETESDFP--------------KRKFLPLPI---------------------------DQEGNCTP-YFCNKVFNVKK-----

Query:  --------DAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRLD-PSSSLLEQSAISDFF--------------------------------------
                DAK RL+ W+LLLQE D ++ D KG+EN+ ADHLSRL+ P  ++L+   I++ F                                      
Subjt:  --------DAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRLD-PSSSLLEQSAISDFF--------------------------------------

Query:  QMSSFL-------LFRCVSGAEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--
        Q  SFL       + RCV G EA EILE CH+ P GGH     TA +I   GFFWPT++KDAH F K CD+CQR+     RDEMP   I   E+ DVW  
Subjt:  QMSSFL-------LFRCVSGAEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--

Query:  ------------------VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDE----------------------EKVVHPSRKD-------
                          VDY+SKWVEA A   +DA+ + +FL+S +FARFG+PRA++SD                           HP           
Subjt:  ------------------VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDE----------------------EKVVHPSRKD-------

Query:  ----------------WSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKE
                        WS +LD+ALWA+RTAYKTP+G +PY+LVYGKACHLP++LEHK +W LK+ NFDLS AG  + +QLNEL E    +YEN+ +YKE
Subjt:  ----------------WSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKE

Query:  KTKLWHDKKIKSKEFVKGQKDEKDGRVFKVNGQRVKHYWGEEF
        KTK  HD KIK++ F  G +        K+   ++K  W   F
Subjt:  KTKLWHDKKIKSKEFVKGQKDEKDGRVFKVNGQRVKHYWGEEF

XP_031101830.1 uncharacterized protein LOC116005731 [Ipomoea triloba]1.5e-14241.72Show/hide
Query:  GNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFA
        GN+  R EMPL+ IL V+LFDVWGIDFMGPFP S G  YIL+A DYVSKWVEA+A   ND+K V +FL+  IF RFGTPRA++SDEGTHF N +   L  
Subjt:  GNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFA

Query:  KHEIKHRIATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLG--PF-----------VVVEVFPHGAITLQD---PKDGR
        K+ I HR+ATPYHPQ +GQ E+SNR+IK ILEK V+PSRKDW+++L++ALWAYRTAYKTP+G  P+           V +E   + AI   +      G 
Subjt:  KHEIKHRIATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLG--PF-----------VVVEVFPHGAITLQD---PKDGR

Query:  VFKVNGQRVKHYWGEKFRSKYPSLKKTAAKNSSSLLKNDHEHHHHDCTGILLDVYVEWLVG--------------------TQISKDFSLKRHSVFTERE
        V K+N   +     E + +     ++T A +  S+L+ + E        +L +  +    G                     ++    + ++  V  +R 
Subjt:  VFKVNGQRVKHYWGEKFRSKYPSLKKTAAKNSSSLLKNDHEHHHHDCTGILLDVYVEWLVG--------------------TQISKDFSLKRHSVFTERE

Query:  QNEREGEREGERRE------TESD-----FPKRKFLPLPIDQEGNC-TPYFCNKVFNVKKDAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRLDPS
        +   EGE      E      TE +     F   KF    I  +    + +   K    KKDAKPRLIRWILLLQE DL I+DKKG ENV+ADHLSRL  +
Subjt:  QNEREGEREGERRE------TESD-----FPKRKFLPLPIDQEGNC-TPYFCNKVFNVKKDAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRLDPS

Query:  SSLLEQS--AISDFFQMSSFLLFR-----CVSGAEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLT
        ++    S   I+D F     L  +     C+   E + IL   H+   GGHFSG++TA+++L  GF+WPTLFKDA  F  +CD CQR GN+  R EMPL+
Subjt:  SSLLEQS--AISDFFQMSSFLLFR-----CVSGAEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLT

Query:  YILEVELFDVW--------------------VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDE--------------------------
         IL V+LFDVW                    VDYVSKWVEA+A   +D+K V +FL+  IF RFGTPRA++SDE                          
Subjt:  YILEVELFDVW--------------------VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDE--------------------------

Query:  -------------------EKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEF
                           EK V+PSRKDW+ +L++ALWAYRTAYKTP+GMSPYRLV+GKACHLP+ELEH+ +WA+K LNFDL  AG ++          
Subjt:  -------------------EKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEF

Query:  RQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGRVFKVNGQRVKHYWGEE
          FS+   ++    T                      G  FKVNGQR+K Y+  E
Subjt:  RQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGRVFKVNGQRVKHYWGEE

XP_034197290.1 LOW QUALITY PROTEIN: uncharacterized protein LOC117612734 [Prunus dulcis]2.1e-12028.89Show/hide
Query:  GNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFA
        GN+  R+E+PL  IL VELFDVWGIDFMGPFP S G  YIL+A DYVSKWVEAIA   ND K V +FL+ +IF RFGT RA++SD G+HF N +   L  
Subjt:  GNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFA

Query:  KHEIKHRIATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTP----------------------------------------
        K+ I HR++TPYHPQ +GQ EISNREIK I+EKVV+ +RKDW+ +L++ALWAYRTAYKTP                                        
Subjt:  KHEIKHRIATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTP----------------------------------------

Query:  ----------------------------------------------------------------LGPFVVVEVFPHGAITLQDPKDGRVFKVNGQRVKHY
                                                                        LGPF VV V P+GA+ +Q+PKDG  FKVNGQR+K +
Subjt:  ----------------------------------------------------------------LGPFVVVEVFPHGAITLQDPKDGRVFKVNGQRVKHY

Query:  -------------------------------------WGEKFR------------------SKY------------------------------------
                                             W  + R                   KY                                    
Subjt:  -------------------------------------WGEKFR------------------SKY------------------------------------

Query:  ----------------------------------------PSLKK-----TAAKNSSSLL----------------------KNDH-------------E
                                                P++K+     T  KN ++ L                        DH              
Subjt:  ----------------------------------------PSLKK-----TAAKNSSSLL----------------------KNDH-------------E

Query:  HHHHDC----------------------------------------------------------------------------------------------
         H H C                                                                                              
Subjt:  HHHHDC----------------------------------------------------------------------------------------------

Query:  -------------TGILL----------------DVYVEWLVGTQISKDFSLKRHSVFTER------------------------EQNEREGEREGERRE
                      GI+L                D+       T + K  S   H+ F  R                        +Q+        ++  
Subjt:  -------------TGILL----------------DVYVEWLVGTQISKDFSLKRHSVFTER------------------------EQNEREGEREGERRE

Query:  T-------------------ESDFPKRKFLPLPIDQEGNCTPYFCNKVFN---------------------------------------------VKKDA
        T                    SD+     L   +D++ +   Y+ ++  N                                              KKDA
Subjt:  T-------------------ESDFPKRKFLPLPIDQEGNCTPYFCNKVFN---------------------------------------------VKKDA

Query:  KPRLIRWILLLQELDLEIKDKKGSENVIADHLSRL----------------DPSSSLLEQSAISDF-------------------FQMSSF---------
        KPRLIRWILLLQE DLEIKDKKGSENV+ADHLSRL                 P   L    A++                      ++S+F         
Subjt:  KPRLIRWILLLQELDLEIKDKKGSENVIADHLSRL----------------DPSSSLLEQSAISDF-------------------FQMSSF---------

Query:  -----------------LLFRCVSGAEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELF
                         ++ RCV   + K ILE CHS   GGHF  ++TA ++L  GFFWPTLFKDA+ F   CD CQR GNL  R++MPLT IL +++F
Subjt:  -----------------LLFRCVSGAEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELF

Query:  DVW--------------------VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDE----------------------------------
        DVW                    VDYVSKWVEAIA   +DAK V  FL+ +IF RFGTPRA++SD                                   
Subjt:  DVW--------------------VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDE----------------------------------

Query:  -----------EKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENA
                   EK V+ +RKDWS RLD+ALWAYRTAYKTP+GMSPYRLV+GK CHLP+ELEH+ +WA+K  NFD+  AG  R LQLNELEE R  +YENA
Subjt:  -----------EKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENA

Query:  KMYKEKTKLWHDKKIKSKEFVKGQK-------------------------------------DEKDGRVFKVNGQRVKHYWGEEFQSKYPS
        K+YKEKTK +HDKKI  K F KGQK                                     + KDG  FKVNG R+K Y+   F +   S
Subjt:  KMYKEKTKLWHDKKIKSKEFVKGQK-------------------------------------DEKDGRVFKVNGQRVKHYWGEEFQSKYPS

XP_038974948.1 uncharacterized protein LOC120106130 [Phoenix dactylifera]2.2e-12532.47Show/hide
Query:  GNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFA
        G L  R+ MPL  IL +E+FD WGIDFMGPFPPS G +YI++A DYVSKWVEA  C  ND KTV +FL+ ++ +RFGTPR ++SD GTHF N     L  
Subjt:  GNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFA

Query:  KHEIKHRIATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLG--PFVVVEVFP-HGAITLQD-------------PKDGR
        K+ + H+I+T YHPQ +GQ E++NREIK ILEK V+P RKDWSLRL +ALWAYRTA+KTPLG  P+ +V   P H  + L+              P+ G 
Subjt:  KHEIKHRIATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLG--PFVVVEVFP-HGAITLQD-------------PKDGR

Query:  VFKVNGQRVKHYWGEKF-RSKYPSLKKTAAKNSSSLLKNDHEHHH---------------------------------------------------HDCT
        + K     ++    E +  SK    K  A  + + L K  H H                                                      +C 
Subjt:  VFKVNGQRVKHYWGEKF-RSKYPSLKKTAAKNSSSLLKNDHEHHH---------------------------------------------------HDCT

Query:  GILLD---------------------------VYVEW-----------LVGTQIS----------------------------------------KDFS-
         + +D                           + + W           ++G  +S                                        ++FS 
Subjt:  GILLD---------------------------VYVEW-----------LVGTQIS----------------------------------------KDFS-

Query:  --------LKRHSVFTEREQNERE----------------------------------GEREGERRE----------------------TESD-----FP
                L R ++F   ++ +                                    G   G+RRE                      TE +     F 
Subjt:  --------LKRHSVFTEREQNERE----------------------------------GEREGERRE----------------------TESD-----FP

Query:  KRKFLPLPIDQEGNC-TPYFCNKVFNVKKDAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRL--------------DPSSSLLEQSAISDFFQMSS
          KF    I       T +   K    KKDAK RLIRWILLLQE +L IKDKKG EN +ADHLSRL               P   L   +++  F  + +
Subjt:  KRKFLPLPIDQEGNC-TPYFCNKVFNVKKDAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRL--------------DPSSSLLEQSAISDFFQMSS

Query:  FL---------------------------------------LFRCVSGAEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDAC
        +L                                       + RCV   E   +LE CHS   GGHFS ++TA +IL CGF+WP+LF+D + + + C+ C
Subjt:  FL---------------------------------------LFRCVSGAEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDAC

Query:  QRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDE-----------
        Q+ G L  R+ MPL  IL +E+FD W                    VDYVSKWVEA  C  +D KTV +FL+ ++ +RFGTPR ++SD            
Subjt:  QRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDE-----------

Query:  ----------------------------------EKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSR
                                          EK V+P RKDWS RL +ALWAYRTA+KTPLGMSPYRLV+GK CHLP+ELEH+ +WA+K  NFDL  
Subjt:  ----------------------------------EKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSR

Query:  AGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK-------------------------------------DEKDGRVFKVNGQRV
        AG +R  Q+ ELEE R  +YEN+K+YK K K +HD+ I  K F   Q+                                     D +DGR+ KVNGQR+
Subjt:  AGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK-------------------------------------DEKDGRVFKVNGQRV

Query:  K
        K
Subjt:  K

TrEMBL top hitse value%identityAlignment
A0A2K3LHD8 Integrase catalytic domain-containing protein1.1e-9845.76Show/hide
Query:  KKDAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDFF---------------------------------QMSSF-------
        K+++KPRL+RWILLLQE DLEI+DKKGSEN +ADHLSRL+      E+ AI D F                                 Q   F       
Subjt:  KKDAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDFF---------------------------------QMSSF-------

Query:  --------------LLFRCVSGAEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW
                      LL RCV   E +++L  CH S YGGHFSG RTA ++L  G FWPTLFKDA  + K+CD CQR GN+  R+EMP   ILEVE+FDVW
Subjt:  --------------LLFRCVSGAEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW

Query:  --------------------VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDE-------------------------------------
                            VDYVSKWVEAIA H +DA+ V  FL+ +IF+RFG PRAL+SDE                                     
Subjt:  --------------------VDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDE-------------------------------------

Query:  --------EKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMY
                EK V+ SRKDWS +LD+ALWAYRTA+KTP+GMSP+++VYGKACHLPLELEHK  WA K LNFDLS+AG  R+LQL+EL+EFR ++YENAK++
Subjt:  --------EKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMY

Query:  KEKTKLWHDKKIKSKEFVKGQKDEKDGRVFKVNGQRVKHYWGEEFQSK
        KEKTK WHD+KI+ KEF +GQ         ++   ++K  W   F+ K
Subjt:  KEKTKLWHDKKIKSKEFVKGQKDEKDGRVFKVNGQRVKHYWGEEFQSK

A0A5B6VWJ0 Retroelement pol polyprotein-like2.6e-10049.63Show/hide
Query:  KKDAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDFFQMSSFLLF------------------RCVSGAEAKEILEQCHSSP
        KKDAKPRLIRW+LLLQE DLEI+D++G EN +ADHLSRL+P        +I + F   + L +                  RCV+  +  +IL  CHS+P
Subjt:  KKDAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDFFQMSSFLLF------------------RCVSGAEAKEILEQCHSSP

Query:  YGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEAIACHQS
         GGHF G RT  ++L  GFFWPTLFKDA+ + K CD CQR GN+  R+EMP T I+E ELFDVW                    V YVSKWVEA A    
Subjt:  YGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEAIACHQS

Query:  DAKTVARFLQSHIFARFGTPRALVSDE---------------------------------------------EKVVHPSRKDWSFRLDEALWAYRTAYKT
        DAK V +FLQ H+F RFGTPRA++SDE                                             EKVV  +++DWS RLD+ALWAY+  YKT
Subjt:  DAKTVARFLQSHIFARFGTPRALVSDE---------------------------------------------EKVVHPSRKDWSFRLDEALWAYRTAYKT

Query:  PLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGRVFKVNGQR
        PLGMSPYRLV+GKACHLPLELEH+ +WAL++LN DL  A   RMLQLNELEEFR FSYENAK+ KE+ K WHDK I+ +EF  G+     G  F+VN QR
Subjt:  PLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGRVFKVNGQR

Query:  VKHYWGE
        +KHY+GE
Subjt:  VKHYWGE

A0A699JX63 Reverse transcriptase domain-containing protein6.7e-10435.03Show/hide
Query:  RGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLF
        +G +  +DEMP  +I   E+FDVWGIDFM PFP S GN YIL+A DY+SKW EA A   NDA+ V RFL+S +F+RFGTP+A++SD GTHF N+  +++ 
Subjt:  RGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLF

Query:  AKHEIKHRIATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRVFKVNGQRVKHYWGE
        AK  + H ++  YHPQ +GQ E++NR  K ILE+ V  +R  WS +L++ALWA+RTAYKT +G                                     
Subjt:  AKHEIKHRIATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRVFKVNGQRVKHYWGE

Query:  KFRSKYPSLKKTAAKNSSSLLKNDHEHHHHDCTGILLDVYVEWLVGTQISKDFSLKRHSVFTEREQNEREGEREGERRETESDFPKRKFLPLPIDQEGNC
                                       CT   L                                              + K  +LPL ++ +   
Subjt:  KFRSKYPSLKKTAAKNSSSLLKNDHEHHHHDCTGILLDVYVEWLVGTQISKDFSLKRHSVFTEREQNEREGEREGERRETESDFPKRKFLPLPIDQEGNC

Query:  TPYFCNKVFNVKKDAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDFFQMSSFLLFRCVSGAEAKEILEQCHSSPYGGHFSG
          Y+  K  N   D K       L L EL+ E++D+    ++I    ++                                  +IL+ CHS P GGH+  
Subjt:  TPYFCNKVFNVKKDAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDFFQMSSFLLFRCVSGAEAKEILEQCHSSPYGGHFSG

Query:  QRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEAIACHQSDAKTVAR
          TA ++   GF+WPT++KDA    K CD+CQ +G +  RDEMP   I   E+FDVW                    VDY+SKWVEA A   +DA+ V +
Subjt:  QRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEAIACHQSDAKTVAR

Query:  FLQSHIFARFGTPRALVSDEEKVVH-----------PSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGA
        FL+  +F+RFGTP+A++SD E + H            +R  WS +L+++LWA+RT +KTP+G +PYRLVYGK+CHLPLEL+HK FWALK  NFDL  AG 
Subjt:  FLQSHIFARFGTPRALVSDEEKVVH-----------PSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGA

Query:  IRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGRVFKVNGQRVKHYWGEEF
         R LQLNE  E R  +Y+N+ +YKE+TK  H+ K K++ F  G +        K+   ++K  W   F
Subjt:  IRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGRVFKVNGQRVKHYWGEEF

A0A6D2JA68 Uncharacterized protein1.9e-12239.06Show/hide
Query:  RGNLGPRDEMPLTYILEVELFDVWGIDFMGPF-PPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKL
        +G +   DEMPL  ILEVE+FDVWGIDFMGPF P SNG+ YIL+A DYVSKW+EAI C   DAK V +  ++ IF RFG PR ++SD G+HF+N +   L
Subjt:  RGNLGPRDEMPLTYILEVELFDVWGIDFMGPF-PPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKL

Query:  FAKHEIKHRIATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRVFKVNGQRVKHYWG
           H +KH++ATPYHPQ +GQ E+SNR+IK+ILEK V  +RKDW+ RLDEALWAYRTAYKTP+G                                    
Subjt:  FAKHEIKHRIATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRVFKVNGQRVKHYWG

Query:  EKFRSKYPSLKKTAAKNSSSLLKNDHEHHHHDCTGILLDVYVEWLVGTQISK-DFSLKRHSVFTEREQNEREGEREGERRETESDFPKRKFLPLPIDQEG
           RS +  L                  +  DC    L V VE+     +   +F +K                             +R+ + L   +E 
Subjt:  EKFRSKYPSLKKTAAKNSSSLLKNDHEHHHHDCTGILLDVYVEWLVGTQISK-DFSLKRHSVFTEREQNEREGEREGERRETESDFPKRKFLPLPIDQEG

Query:  NCTPYFCNKVFNVK-KDAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDFFQMSSFLLFRCVSGAEAKEILEQCHSSPYGGH
            Y  ++++ ++ K A  +LIR           +KD K  ++V             LL  S +S        +  RC++  + + +LE CH S YGGH
Subjt:  NCTPYFCNKVFNVK-KDAKPRLIRWILLLQELDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDFFQMSSFLLFRCVSGAEAKEILEQCHSSPYGGH

Query:  FSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW---------------------VDYVSKWVEAIACHQSDAK
        F+  +TA ++L  G +WPTLFKDA  +  +CD CQR+G +  RDEMPL  ILEVE+FDVW                     VDYVSKW+EAI C   DAK
Subjt:  FSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW---------------------VDYVSKWVEAIACHQSDAK

Query:  TVARFLQSHIFARFGTPRALVSDE---------------------------------------------EKVVHPSRKDWSFRLDEALWAYRTAYKTPLG
         V +  ++ IF RFG PR ++SD                                              EK V  +RKDW+ RLDEALWAYRTAYKTP+G
Subjt:  TVARFLQSHIFARFGTPRALVSDE---------------------------------------------EKVVHPSRKDWSFRLDEALWAYRTAYKTPLG

Query:  MSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKG
         SP+ L+YGK CHLP+E+E+K  WA+K LNFD+  A   R++ L ELEE R  +YEN+++YK +TK  HDK I+ K+F  G
Subjt:  MSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKG

A0A6P8CB75 LOW QUALITY PROTEIN: uncharacterized protein LOC1161937468.0e-10527.35Show/hide
Query:  GNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFA
        GN+  R E+P   IL +ELFDVWGIDFMGPFP S  N YIL+A DYVSKWVEA+A   NDA+ V RFL+ +IF+RFG PRA++SD G+HF N    KL +
Subjt:  GNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFA

Query:  KHEIKHRIATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLG--PFVVV-----------EVFPHGAI-----TLQDPKD
        K+ + H+IATPYHPQ  GQ E+SNR+IK ILEK V+ SRKDWSL+LD+ALWAYRTA+KTP+G  P+ +V           E   + AI      LQ   +
Subjt:  KHEIKHRIATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLG--PFVVV-----------EVFPHGAI-----TLQDPKD

Query:  GRVFKVN---------------GQRVKHY-----------------------------------------------------------------------
         R+ ++N                +R K +                                                                       
Subjt:  GRVFKVN---------------GQRVKHY-----------------------------------------------------------------------

Query:  ----------WGEKFRSKYPSLKKTAAK-NSSSLLKNDHEHHHHDCTGILL-------------------------------------------------
                  +G+ F S   +L     +   ++LL N  ++H     GI+L                                                 
Subjt:  ----------WGEKFRSKYPSLKKTAAK-NSSSLLKNDHEHHHHDCTGILL-------------------------------------------------

Query:  -------------------------------------------------------DVYVEWLVGTQISK-------------------------------
                                                               D  V  ++G +  K                               
Subjt:  -------------------------------------------------------DVYVEWLVGTQISK-------------------------------

Query:  ---DFSLKR----------------HSVFTEREQN---------------------------------------------EREGEREG---ERRETESDF
            F+ +R                 S+F++  +N                                              REG   G    ++  E D 
Subjt:  ---DFSLKR----------------HSVFTEREQN---------------------------------------------EREGEREG---ERRETESDF

Query:  PKRKF---LPLPIDQEG---------------------------------------NCTP----------------------------------------
         K +    LP P   +G                                       NC                                          
Subjt:  PKRKF---LPLPIDQEG---------------------------------------NCTP----------------------------------------

Query:  ----------YFCNKVFN---------------------------------------------VKKDAKPRLIRWILLLQELDLEIKDKKGSENVIADHL
                  Y+ ++  N                                              K DAKPRLIRWILLLQE DLEI+D KG+ENV+ADHL
Subjt:  ----------YFCNKVFN---------------------------------------------VKKDAKPRLIRWILLLQELDLEIKDKKGSENVIADHL

Query:  SRLDPS------------------------------------------SSLLEQSAISD-----------FFQMSSFLLFRCVSGAEAKEILEQCHSSPY
        SRL+                                            SS  ++  + D           F   +  ++ RCV   E   I++ CHS   
Subjt:  SRLDPS------------------------------------------SSLLEQSAISD-----------FFQMSSFLLFRCVSGAEAKEILEQCHSSPY

Query:  GGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEAIACHQSD
        GGHF  +RTA +IL CGF+WP +F D   +   C  CQR GN+  R E+P   IL +ELFDVW                    VDYVSKWVEA+A   +D
Subjt:  GGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVW--------------------VDYVSKWVEAIACHQSD

Query:  AKTVARFLQSHIFARFGTPRALVSDE---------------------------------------------EKVVHPSRKDWSFRLDEALWAYRTAYKTP
        A+ V RFL+ +IF+RFG PRA++SD                                              EK V+ SRKDWS +LD+ALWAYRTA+KTP
Subjt:  AKTVARFLQSHIFARFGTPRALVSDE---------------------------------------------EKVVHPSRKDWSFRLDEALWAYRTAYKTP

Query:  LGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK---------------
        +GMSPY++VYGK+CHLP+ELEHK +WA+K LNFDL  AG  R+LQLN +   R+ +YENA++YKE+ K WHD+ I  +EF+ GQK               
Subjt:  LGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK---------------

Query:  ---------------------DEKDGRVFKVNGQRVKHYWGEE
                               +D R FKVNG  +KHY+  E
Subjt:  ---------------------DEKDGRVFKVNGQRVKHYWGEE

SwissProt top hitse value%identityAlignment
A1Z651 Gag-Pol polyprotein1.8e-1634.78Show/hide
Query:  WGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFAKHEIKHRIATPYHPQANGQAEI
        W +DF    P   G  Y+L+  D  S WVEA    +  AK VS+ L   IF RFG P+ L SD G  F + +   +     I  ++   Y PQ++GQ E 
Subjt:  WGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFAKHEIKHRIATPYHPQANGQAEI

Query:  SNREIKSILEKVVHPS-RKDWSLRLDEALW-AYRTAYKTPLGPFVVVEVFPHGAITLQDPK
         NR IK  L K+   S  +DW L L  AL+ A  T     L P+ ++   P   +   DP+
Subjt:  SNREIKSILEKVVHPS-RKDWSLRLDEALW-AYRTAYKTPLGPFVVVEVFPHGAITLQDPK

P03355 Gag-Pol polyprotein3.0e-1633.94Show/hide
Query:  WGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFAKHEIKHRIATPYHPQANGQAEI
        W IDF    P   G  Y+L+  D  S W+EA    +  AK V++ L   IF RFG P+ L +D G  FV+ +   +     I  ++   Y PQ++GQ E 
Subjt:  WGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFAKHEIKHRIATPYHPQANGQAEI

Query:  SNREIKSILEKV-VHPSRKDWSLRLDEALW-AYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRV
         NR IK  L K+ +    +DW L L  AL+ A  T     L P+ ++   P   +   DP   RV
Subjt:  SNREIKSILEKV-VHPSRKDWSLRLDEALW-AYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRV

P08361 Gag-Pol polyprotein3.0e-1633.94Show/hide
Query:  WGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFAKHEIKHRIATPYHPQANGQAEI
        W IDF    P   G  Y+L+  D  S W+EA    +  AK V++ L   IF RFG P+ L +D G  FV+ +   +     I  ++   Y PQ++GQ E 
Subjt:  WGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFAKHEIKHRIATPYHPQANGQAEI

Query:  SNREIKSILEKV-VHPSRKDWSLRLDEALW-AYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRV
         NR IK  L K+ +    +DW L L  AL+ A  T     L P+ ++   P   +   DP   RV
Subjt:  SNREIKSILEKV-VHPSRKDWSLRLDEALW-AYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRV

P26809 Gag-Pol polyprotein3.0e-1633.33Show/hide
Query:  WGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFAKHEIKHRIATPYHPQANGQAEI
        W IDF    P   G  Y+L+  D  S WVEA    +  AK V++ L   IF RFG P+ L +D G  FV+ +   +     +  ++   Y PQ++GQ E 
Subjt:  WGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFAKHEIKHRIATPYHPQANGQAEI

Query:  SNREIKSILEKV-VHPSRKDWSLRLDEALW-AYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRV
         NR IK  L K+ +    +DW L L  AL+ A  T     L P+ ++   P   +   DP   +V
Subjt:  SNREIKSILEKV-VHPSRKDWSLRLDEALW-AYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRV

Q2F7J0 Gag-Pol polyprotein1.8e-1634.78Show/hide
Query:  WGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFAKHEIKHRIATPYHPQANGQAEI
        W +DF    P   G  Y+L+  D  S WVEA    +  AK VS+ L   IF RFG P+ L SD G  F + +   +     I  ++   Y PQ++GQ E 
Subjt:  WGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFAKHEIKHRIATPYHPQANGQAEI

Query:  SNREIKSILEKVVHPS-RKDWSLRLDEALW-AYRTAYKTPLGPFVVVEVFPHGAITLQDPK
         NR IK  L K+   S  +DW L L  AL+ A  T     L P+ ++   P   +   DP+
Subjt:  SNREIKSILEKVVHPS-RKDWSLRLDEALW-AYRTAYKTPLGPFVVVEVFPHGAITLQDPK

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein3.1e-1661.4Show/hide
Query:  ILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWVDYVSK
        +L  GF+WPT FKDAH F   CDACQR+GN   R+EMP  +ILEVE+FDVW  Y  K
Subjt:  ILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWVDYVSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGGAAACCTAGGGCCTAGAGATGAAATGCCTCTTACTTACATTTTGGAAGTAGAGTTATTTGATGTATGGGGTATTGACTTCATGGGGCCATTTCCCCCTTCTAA
TGGCAATGTTTACATCTTATTGGCATTTGATTACGTGTCCAAGTGGGTGGAGGCCATCGCATGCCATCAGAATGATGCCAAGACAGTGTCAAGGTTTCTTCAATCGCACA
TCTTTGCGCGGTTTGGGACACCTAGAGCTCTAGTAAGTGATGAGGGAACACATTTTGTTAATAATATCTTAACTAAGCTGTTTGCTAAGCATGAAATTAAGCATAGGATA
GCTACCCCTTATCACCCACAAGCAAATGGTCAAGCTGAGATTAGCAATAGGGAAATAAAATCTATTCTAGAGAAAGTGGTCCATCCATCTAGGAAGGATTGGTCTCTTAG
GTTGGATGAGGCTCTTTGGGCCTACAGGACAGCTTATAAGACTCCTCTAGGACCATTTGTTGTGGTTGAAGTTTTCCCCCATGGAGCAATTACTTTGCAGGATCCAAAAG
ATGGGAGAGTGTTCAAAGTGAATGGACAGCGTGTGAAACATTATTGGGGAGAGAAGTTCCGGTCGAAATATCCTTCCCTAAAAAAAACAGCAGCCAAGAACTCCTCTTCT
TTGCTGAAAAATGATCACGAACACCACCACCACGATTGCACCGGTATTCTCTTAGATGTGTATGTTGAGTGGCTTGTGGGAACCCAGATCTCTAAGGATTTTTCTCTGAA
AAGACACTCTGTATTCACAGAGCGAGAGCAGAACGAGAGAGAGGGCGAGAGAGAGGGGGAGAGGCGTGAGACTGAGAGTGATTTTCCAAAAAGAAAATTCCTCCCCTTAC
CTATCGATCAGGAGGGTAACTGCACGCCCTATTTCTGCAATAAGGTATTTAATGTCAAGAAAGATGCAAAGCCTAGACTAATTCGTTGGATTTTATTACTGCAGGAATTG
GACTTGGAGATAAAGGACAAGAAGGGATCGGAGAATGTCATTGCAGATCATTTATCTCGTCTTGATCCATCCTCATCTTTGCTGGAGCAATCTGCCATTTCCGATTTTTT
CCAGATGAGCAGCTTTTTGCTGTTCAGGTGTGTTTCAGGTGCTGAAGCAAAGGAAATCCTGGAGCAATGTCACTCTTCGCCGTATGGAGGTCATTTCAGCGGTCAGAGGA
CAGCCATGAGGATTTTGCATTGCGGATTTTTCTGGCCTACGTTATTCAAGGATGCCCATTGGTTCTACAAGCAATGTGACGCTTGCCAAAGGAGAGGAAACTTGGGACCT
AGAGATGAAATGCCTCTTACTTACATTTTAGAAGTTGAACTATTCGATGTATGGGTTGATTATGTGTCCAAGTGGGTGGAGGCCATTGCATGTCATCAGAGTGATGCCAA
GACAGTAGCAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGGGCTCTAGTGAGTGATGAGGAGAAAGTAGTCCATCCATCTCGAAAGGATTGGTCCT
TTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCTTATAAGACTCCTCTAGGTATGTCTCCCTATAGGTTAGTATATGGGAAAGCTTGCCATTTACCATTAGAGCTA
GAGCATAAGACGTTTTGGGCTTTGAAAAAGTTAAATTTTGATCTGAGTCGTGCAGGAGCAATAAGAATGCTGCAGCTTAATGAGTTAGAGGAATTTCGCCAATTTTCTTA
CGAGAATGCGAAAATGTATAAGGAGAAGACTAAGCTGTGGCATGACAAGAAAATAAAGTCTAAGGAGTTTGTAAAGGGTCAAAAAGATGAAAAAGATGGGAGAGTATTCA
AGGTGAATGGACAGCGTGTGAAGCATTATTGGGGTGAGGAGTTTCAGTCGAAATATCCTTCCCTAAGGAATCATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCTCA
GATTATGCTGCTGAGCGACTGGAAGGAGCAAATTCTATGCTGCAGCAAAACTGGGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGGAAACCTAGGGCCTAGAGATGAAATGCCTCTTACTTACATTTTGGAAGTAGAGTTATTTGATGTATGGGGTATTGACTTCATGGGGCCATTTCCCCCTTCTAA
TGGCAATGTTTACATCTTATTGGCATTTGATTACGTGTCCAAGTGGGTGGAGGCCATCGCATGCCATCAGAATGATGCCAAGACAGTGTCAAGGTTTCTTCAATCGCACA
TCTTTGCGCGGTTTGGGACACCTAGAGCTCTAGTAAGTGATGAGGGAACACATTTTGTTAATAATATCTTAACTAAGCTGTTTGCTAAGCATGAAATTAAGCATAGGATA
GCTACCCCTTATCACCCACAAGCAAATGGTCAAGCTGAGATTAGCAATAGGGAAATAAAATCTATTCTAGAGAAAGTGGTCCATCCATCTAGGAAGGATTGGTCTCTTAG
GTTGGATGAGGCTCTTTGGGCCTACAGGACAGCTTATAAGACTCCTCTAGGACCATTTGTTGTGGTTGAAGTTTTCCCCCATGGAGCAATTACTTTGCAGGATCCAAAAG
ATGGGAGAGTGTTCAAAGTGAATGGACAGCGTGTGAAACATTATTGGGGAGAGAAGTTCCGGTCGAAATATCCTTCCCTAAAAAAAACAGCAGCCAAGAACTCCTCTTCT
TTGCTGAAAAATGATCACGAACACCACCACCACGATTGCACCGGTATTCTCTTAGATGTGTATGTTGAGTGGCTTGTGGGAACCCAGATCTCTAAGGATTTTTCTCTGAA
AAGACACTCTGTATTCACAGAGCGAGAGCAGAACGAGAGAGAGGGCGAGAGAGAGGGGGAGAGGCGTGAGACTGAGAGTGATTTTCCAAAAAGAAAATTCCTCCCCTTAC
CTATCGATCAGGAGGGTAACTGCACGCCCTATTTCTGCAATAAGGTATTTAATGTCAAGAAAGATGCAAAGCCTAGACTAATTCGTTGGATTTTATTACTGCAGGAATTG
GACTTGGAGATAAAGGACAAGAAGGGATCGGAGAATGTCATTGCAGATCATTTATCTCGTCTTGATCCATCCTCATCTTTGCTGGAGCAATCTGCCATTTCCGATTTTTT
CCAGATGAGCAGCTTTTTGCTGTTCAGGTGTGTTTCAGGTGCTGAAGCAAAGGAAATCCTGGAGCAATGTCACTCTTCGCCGTATGGAGGTCATTTCAGCGGTCAGAGGA
CAGCCATGAGGATTTTGCATTGCGGATTTTTCTGGCCTACGTTATTCAAGGATGCCCATTGGTTCTACAAGCAATGTGACGCTTGCCAAAGGAGAGGAAACTTGGGACCT
AGAGATGAAATGCCTCTTACTTACATTTTAGAAGTTGAACTATTCGATGTATGGGTTGATTATGTGTCCAAGTGGGTGGAGGCCATTGCATGTCATCAGAGTGATGCCAA
GACAGTAGCAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGGGCTCTAGTGAGTGATGAGGAGAAAGTAGTCCATCCATCTCGAAAGGATTGGTCCT
TTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCTTATAAGACTCCTCTAGGTATGTCTCCCTATAGGTTAGTATATGGGAAAGCTTGCCATTTACCATTAGAGCTA
GAGCATAAGACGTTTTGGGCTTTGAAAAAGTTAAATTTTGATCTGAGTCGTGCAGGAGCAATAAGAATGCTGCAGCTTAATGAGTTAGAGGAATTTCGCCAATTTTCTTA
CGAGAATGCGAAAATGTATAAGGAGAAGACTAAGCTGTGGCATGACAAGAAAATAAAGTCTAAGGAGTTTGTAAAGGGTCAAAAAGATGAAAAAGATGGGAGAGTATTCA
AGGTGAATGGACAGCGTGTGAAGCATTATTGGGGTGAGGAGTTTCAGTCGAAATATCCTTCCCTAAGGAATCATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCTCA
GATTATGCTGCTGAGCGACTGGAAGGAGCAAATTCTATGCTGCAGCAAAACTGGGAGTAG
Protein sequenceShow/hide protein sequence
MRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVYILLAFDYVSKWVEAIACHQNDAKTVSRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLFAKHEIKHRI
ATPYHPQANGQAEISNREIKSILEKVVHPSRKDWSLRLDEALWAYRTAYKTPLGPFVVVEVFPHGAITLQDPKDGRVFKVNGQRVKHYWGEKFRSKYPSLKKTAAKNSSS
LLKNDHEHHHHDCTGILLDVYVEWLVGTQISKDFSLKRHSVFTEREQNEREGEREGERRETESDFPKRKFLPLPIDQEGNCTPYFCNKVFNVKKDAKPRLIRWILLLQEL
DLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDFFQMSSFLLFRCVSGAEAKEILEQCHSSPYGGHFSGQRTAMRILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGP
RDEMPLTYILEVELFDVWVDYVSKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLEL
EHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGRVFKVNGQRVKHYWGEEFQSKYPSLRNHFAAAELGFAECS
DYAAERLEGANSMLQQNWE