CuGenDBv2

Lag0035992 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035992
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr3:36088173..36094221
RNA-Seq Expression: Lag0035992
Synteny: Lag0035992
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology
GenBank top hits (e value, % identity, alignment)
KAF5443558.1 hypothetical protein F2P56_036105, partial [Juglans regia] | e value: 2.3e-97 | identity: 34.2%
Query:  MITICWNVRGLGSPWAFRSVCNELKRLNPRLCFLSETKCSSAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSIK-WKNTSW
        M  +CWN RGLG+P   R + + +   +P L FL ETK  +  +   K RLH + CF VD VG SGGL LLWK D+ V +++FS+HHID  I+      W
Subjt:  MITICWNVRGLGSPWAFRSVCNELKRLNPRLCFLSETKCSSAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSIK-WKNTSW

Query:  RFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAGEQI--------
        RFTG+YG P+   R+ TWNL+RR+ +  D PW+VGGDFNE+L   EK G  PR    ++ FR++I  C L DL F G  YTWCN R     I        
Subjt:  RFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAGEQI--------

Query:  GN-----------------------------------------------WSDPESSHQALN----------------QKLQRCARTLKEWGYRKNKARWA
        GN                                               W   E   Q +N                + +++C   L EW    NK  + 
Subjt:  GN-----------------------------------------------WSDPESSHQALN----------------QKLQRCARTLKEWGYRKNKARWA

Query:  NI----RQVRDKIKTIYDR-SLPIDFATVHSLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIWFHHRASYRRKRNMISGVEDADG---RWHTESGQIY
        N+       +  +K I DR SL  D   V    R +   L  EE+ WKQRSR  WL+ GD+NT +FH +A+ RRKRN+I G++DA G     H     I 
Subjt:  NI----RQVRDKIKTIYDR-SLPIDFATVHSLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIWFHHRASYRRKRNMISGVEDADG---RWHTESGQIY

Query:  -----------QRHWDMVGPQTVTECLAILNRERSIKDWNHTNIVLI------PKVPNP-----------------------------------------
                   QRH + V      +    +N E  +K ++   +          K P P                                         
Subjt:  -----------QRHWDMVGPQTVTECLAILNRERSIKDWNHTNIVLI------PKVPNP-----------------------------------------

Query:  --------RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVRRNRAM-GIAISPSSP
                RVEWSF+  +M K+GF   ++ LV +C++   FS+LINGE  G   P+RGLRQGD L PYLFLLC+E L+A+L    RN  + G+ +   +P
Subjt:  --------RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVRRNRAM-GIAISPSSP

Query:  KISHLFFADDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYLGLPSSFQRSKSKDFKD
         +SHL FADDS+IF KA V+     + ++  +E ASGQ +N +K+ + FSKNV +     L S+  ++T      YLGLP+   RSKS  F D
Subjt:  KISHLFFADDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYLGLPSSFQRSKSKDFKD

XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis] | e value: 2.6e-96 | identity: 32.1%
Query:  MITICWNVRGLGSPWAFRSVCNELKRLNPRLCFLSETKCSSAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSIKWKN-TSW
        M  I WNV+GLG    FR     L+ L P++ FLSETK  +  +   +  L +  CFVVDR GL GGL LLW  DV + ++++S HHID  I  +N +SW
Subjt:  MITICWNVRGLGSPWAFRSVCNELKRLNPRLCFLSETKCSSAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSIKWKN-TSW

Query:  RFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAGEQIGNWSDPES
        R T +YG P++  + HTW+L+RR+      PW+  GDFNEI    EK G   R+   +  FR  +  C LLDL   G  +TW NRR   + +        
Subjt:  RFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAGEQIGNWSDPES

Query:  SHQALNQKLQRCARTLKEWGYRKNKARWANIRQVRDKIKTIYDRSLPIDFA-TVHSLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIWFHHRASYRRK
             N+K +     L+ W  ++   R   + Q+++K+K+I       D    +   E  +D++L +EE++WKQRSR +WLK GD+NT +FH +AS RRK
Subjt:  SHQALNQKLQRCARTLKEWGYRKNKARWANIRQVRDKIKTIYDRSLPIDFA-TVHSLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIWFHHRASYRRK

Query:  RNMISGVEDADGRWHTESGQI------------------------------------------------------------------------YQRHWDM
        +N I G+ D  G+W  +S ++                                                                        +Q+HW  
Subjt:  RNMISGVEDADGRWHTESGQI------------------------------------------------------------------------YQRHWDM

Query:  VGPQTVTECLAILNRERSIKDWNHTNIVLIPKVPNP----------------------------------------------------------------
        V    +T CL ILN + ++   NHT I LIPK   P                                                                
Subjt:  VGPQTVTECLAILNRERSIKDWNHTNIVLIPKVPNP----------------------------------------------------------------

Query:  --------------------RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVRRNR
                            RVEW+F+   M+KLGF   WI+L   CI+  SFS+LING  +G   P RGLRQG PL PYLFLLC+E  S ML    +N+
Subjt:  --------------------RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVRRNR

Query:  AMGIAISPSSPKISHLFFADDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYLGLPSSFQRSKSK
         +      SS  ISHL FADDSL+F +A+ E+    K I   +  ASGQ  N +KS + FS NV +     +  I ++  VS    YLGLPS   R K  
Subjt:  AMGIAISPSSPKISHLFFADDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYLGLPSSFQRSKSK

Query:  DFKD
         F +
Subjt:  DFKD

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] | e value: 4.0e-113 | identity: 27.92%
Query:  SWRFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAGEQI------
        S RFTG YG P    R  TW L+RRI N D +PW++GGD N ILW+ E S     D   I+ FR+I+D+C L D+ F G I+TWCN R AG+Q+      
Subjt:  SWRFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAGEQI------

Query:  ---------------GNWSDPESSHQALNQKLQRCARTLKEWGYRKNKARWANIRQVRDKIKTIYDRSLPIDFATVHSLERHLDSLLLEEELYWKQRSRE
                       G+WS+   ++ + +  +Q  +  L+ WG       +  I+  +  I   Y++ LP+DF  +H+LE  L  LL  EE++WKQRSRE
Subjt:  ---------------GNWSDPESSHQALNQKLQRCARTLKEWGYRKNKARWANIRQVRDKIKTIYDRSLPIDFATVHSLERHLDSLLLEEELYWKQRSRE

Query:  NWLKWG-----------------DRNTIWFHHR--ASYRRKRNMISGVEDADGRWHTESG---QIYQRHWDMVGPQTVTECLAILNRERSIKDWNHTNIV
        +WLKWG                  R T   + +  A Y ++   ++  +    +     G     YQ +W +VGP+T+  CL  LN    IK WN T I 
Subjt:  NWLKWG-----------------DRNTIWFHHR--ASYRRKRNMISGVEDADGRWHTESG---QIYQRHWDMVGPQTVTECLAILNRERSIKDWNHTNIV

Query:  LIPKVPNP------------------------------------------------------------------------------------RVEWSFIH
        LIPK+  P                                                                                    RVEW+++ 
Subjt:  LIPKVPNP------------------------------------------------------------------------------------RVEWSFIH

Query:  AVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVRRN-RAMGIAISPSSPKISHLFFADDSLIFMK
         +M K+GF   WI  + +CIS   FSI +NG   G F PSRG+RQGDPL PYLFLLC+E LSA++     + R  GI    ++  I+HL FADDSLIF++
Subjt:  AVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVRRN-RAMGIAISPSSPKISHLFFADDSLIFMK

Query:  ASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYLGLPSSFQRS------------------------------
        +   E    + ++  + RASGQCIN  KS + FS NV  +   YL  IL +K VS+ G+YLGLPS F R                               
Subjt:  ASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYLGLPSSFQRS------------------------------

Query:  --------------------------KSKDFKDC------------------------------------------------------------------
                                  K K FKD                                                                   
Subjt:  --------------------------KSKDFKDC------------------------------------------------------------------

Query:  ----ILPSGQWDIPKLSNFLIDEDIQEIRRIPI-SMSMSDRWIWHFDKFGCYTVKSGYKLGMTKTVEASPSDVD---------------------VCSSL
            I   G WD+  +S+   +ED   I  +PI S ++ D W+WH+DK G Y+V+SGYKL M     A+ +  +                     +  S 
Subjt:  ----ILPSGQWDIPKLSNFLIDEDIQEIRRIPI-SMSMSDRWIWHFDKFGCYTVKSGYKLGMTKTVEASPSDVD---------------------VCSSL

Query:  RNH-----HVLVDDIRPL-----CSETIEDTSHALFMCSRASEVWSTL-RYRDIVRSDTIMDIQDRWTNIRKVESTTTIEQICIGAWAIWNDRNSRLHQS
          H     ++L+  I  L     C +  E   HA F C RA ++W TL  +   + ++  +   + W+++ +      +    I  W IWNDRNS +H  
Subjt:  RNH-----HVLVDDIRPL-----CSETIEDTSHALFMCSRASEVWSTL-RYRDIVRSDTIMDIQDRWTNIRKVESTTTIEQICIGAWAIWNDRNSRLHQS

Query:  PIPPVGVRCEW--------------------------IVEYGR---------------------VGAVLRTKMGELVVLMQCRIPLSSSPLCAEAVAVLE
         + PV  +CEW                          +V+Y R                      G ++R     LV     R+P   SPL AE   +LE
Subjt:  PIPPVGVRCEW--------------------------IVEYGR---------------------VGAVLRTKMGELVVLMQCRIPLSSSPLCAEAVAVLE

Query:  GLRIISLRHIRKVNVCTDSLSLISILRKNERCPADCFPVVTDIMCLIGSFEKITFCHINREYNLLSHELARLGADFP--TRVWSRNFPEWASDLAR
        GL+  +  +   + V +DSL  I ++R       D    V +I  L   F  I+F H +R+ N  +H LA+ G   P  T  W  NFP W  DL +
Subjt:  GLRIISLRHIRKVNVCTDSLSLISILRKNERCPADCFPVVTDIMCLIGSFEKITFCHINREYNLLSHELARLGADFP--TRVWSRNFPEWASDLAR

XP_024172304.2 uncharacterized protein LOC112178381 [Rosa chinensis] | e value: 1.1e-94 | identity: 31.12%
Query:  MITICWNVRGLGSPWAFRSVCNELKRLNPRLCFLSETKCSSAVLNKLKVRLHYSGCFVVD---------RVGLSGGLCLLWKDDVDVSIRNFSIHHIDVS
        M  +CWN +G+G+PW    +   +    P + FLSETKC +  ++K++ +L Y   F VD         RV  +GGLCLLWK+ +DV++  FS +HIDV 
Subjt:  MITICWNVRGLGSPWAFRSVCNELKRLNPRLCFLSETKCSSAVLNKLKVRLHYSGCFVVD---------RVGLSGGLCLLWKDDVDVSIRNFSIHHIDVS

Query:  I--KWKNTSWRFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAGE
        I        WRFTG+YG     LR  TW LI +I  N+  PW++GGDFNEIL   EK G PPR  R ++ FR  ++ C L DL F G  +TW  +R  GE
Subjt:  I--KWKNTSWRFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAGE

Query:  QI----------GNWSD------------PESSH--------------------------------------------------QALNQKLQRCARTLKE
        +I           +WSD             +S H                                                  Q +  ++++  + L  
Subjt:  QI----------GNWSD------------PESSH--------------------------------------------------QALNQKLQRCARTLKE

Query:  WGYRKNKARWANIRQVRDKIKTIYDRSLPIDFATVH-SLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIWFHHRASYRRKRNMISGVEDADGRWHTES
        W  +K     A I ++R K+   YD+SL          LE  L+ LL  E  YW+QRSR  WL  GD NT +FHHRAS R+KRN ISG+ + DG W TE 
Subjt:  WGYRKNKARWANIRQVRDKIKTIYDRSLPIDFATVH-SLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIWFHHRASYRRKRNMISGVEDADGRWHTES

Query:  GQI----------------------------------------------------------------------YQRHWDMVGPQTVTECLAILNRERSIK
          +                                                                      YQR+W +VG   +      +N E  ++
Subjt:  GQI----------------------------------------------------------------------YQRHWDMVGPQTVTECLAILNRERSIK

Query:  DWNHTNIVLIPKVPN------------------------------------------------------------------------------------P
        + N T + LIPKV                                                                                       
Subjt:  DWNHTNIVLIPKVPN------------------------------------------------------------------------------------P

Query:  RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGF-VRRNRAMGIAISPSSPKISHLFFA
        RVEW FI AVM  +GF   WI  +  C++  S+S L+NGE RG+  P+RGLRQGD + PYLFLLC+E LS ML +   ++R  GIAI+  +P I+HLFFA
Subjt:  RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGF-VRRNRAMGIAISPSSPKISHLFFA

Query:  DDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYLGLPSSFQRSKSKDFK
        DDS +FMKA  EE    K I+  +E ASGQ +N  KS+I FSKNV       L+ +  ++ V     YLGLP+    SK++ F+
Subjt:  DDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYLGLPSSFQRSKSKDFK

XP_028068804.1 uncharacterized protein LOC114271378 [Camellia sinensis] | e value: 2.3e-100 | identity: 28.12%
Query:  MITICWNVRGLGSPWAFRSVCNELKRLNPRLCFLSETKCSSAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSIKWKN--TS
        M  + WN RGLG+P   R++   LK   P + FL ETKC S  +   + +L  +    VDR+GLSGGL L W+  V VSIRN+S  H+D  ++      +
Subjt:  MITICWNVRGLGSPWAFRSVCNELKRLNPRLCFLSETKCSSAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSIKWKN--TS

Query:  WRFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAG---EQIGNWS
        WRFTG YG P+   ++ +W L+RR+   DD PW+V  DFNEIL   EK G   R    I  F+  +  C L DL F G  +TWCN R +G   E++    
Subjt:  WRFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAG---EQIGNWS

Query:  DPESSHQALNQKLQRCARTLKEWGYRK-------NKARWANIRQVRDKIKTIYDRSLPIDFATVHSLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIW
            +  A+  K +    T     +          +  W+ + +VR +++ +   S   ++     L   +D LL  EE  W QR+R NWLK GDRNT +
Subjt:  DPESSHQALNQKLQRCARTLKEWGYRK-------NKARWANIRQVRDKIKTIYDRSLPIDFATVHSLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIW

Query:  FHHRASYRRKRNMISGVEDADGRWHT-----------------ESGQ-----------------------------------------------------
        FH +A  R K+  I G++D   RW +                 ++GQ                                                     
Subjt:  FHHRASYRRKRNMISGVEDADGRWHT-----------------ESGQ-----------------------------------------------------

Query:  IYQRHWDMVGPQTVTECLAILNRERSIKDWNHTNIVLIPKVPNP--------------------------------------------------------
         YQR W +VG Q     L +LN  +++   N T IVLIPKV +P                                                        
Subjt:  IYQRHWDMVGPQTVTECLAILNRERSIKDWNHTNIVLIPKVPNP--------------------------------------------------------

Query:  ----------------------------RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAM
                                    RVEWSF+  VME++GF   ++D +  CIS  S+S+L+NG     F P+RGLRQGDPL PYLF+LC+E LSA+
Subjt:  ----------------------------RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAM

Query:  LGFVR-RNRAMGIAISPSSPKISHLFFADDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYLGLP
        +       +  G+A+   SP++SHL FADDSL+F  A+++E    + I+  +E  SGQ INL+KS ICFSKNV  D    L + + +  + + G YLGLP
Subjt:  LGFVR-RNRAMGIAISPSSPKISHLFFADDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYLGLP

Query:  SSFQRSKSKDF---KDCI-------------------------------------LPSG-----------------------QWDIPKLSNFLIDEDIQE
            RSK   F   KD +                                     LP G                        WD+  L   L++ D++ 
Subjt:  SSFQRSKSKDF---KDCI-------------------------------------LPSG-----------------------QWDIPKLSNFLIDEDIQE

Query:  IRRIPI-SMSMSDRWIWHFDKFGCYTVKSGYKLGMT---KTVEASPSDVD-------------------------------VCSSLRNHHVLVDDIRPLC
        IR+IP+    + D+ +WH+ + G ++V+  Y L M    + V  S SDV                                V S LR  H+L +D    C
Subjt:  IRRIPI-SMSMSDRWIWHFDKFGCYTVKSGYKLGMT---KTVEASPSDVD-------------------------------VCSSLRNHHVLVDDIRPLC

Query:  SETIEDTSHALFMCSRASEVWSTLRYRDIVRSDTIMDIQDRWTNIRKVESTTTIEQICIGAWAIWNDRNSRLHQSPIPPVGVRCEWIVEYGR
         +  E   H L+ C RA EVW       ++      D       + +   + T+       W +W+ RN  L  + I       + + +  R
Subjt:  SETIEDTSHALFMCSRASEVWSTLRYRDIVRSDTIMDIQDRWTNIRKVESTTTIEQICIGAWAIWNDRNSRLHQSPIPPVGVRCEWIVEYGR

TrEMBL top hits (e value, % identity, alignment)
A0A2N9F9E4 Reverse transcriptase domain-containing protein | e value: 4.3e-97 | identity: 27.83%
Query:  QGIQVELLEDGISKSWIPIQYERLPEFCFYCGIVGHQHKDCSQFYSA-----DRSHSVVFNYGEWLHFDPKGVVLQSLPVPDVVEHNNVPRFGDVPVTIP
        +G ++ L   G  + W+  QYERLP FC++CGI  H  KDC  + ++     +++      YG WL    + V  +                  V VT+ 
Subjt:  QGIQVELLEDGISKSWIPIQYERLPEFCFYCGIVGHQHKDCSQFYSA-----DRSHSVVFNYGEWLHFDPKGVVLQSLPVPDVVEHNNVPRFGDVPVTIP

Query:  SLNFIAKHNRSTCISIYDQPAQSQLPVKSPSFDKSTRGAAWYSLSGGLHPLDKGKAVMVEENSVGILHGWNKIKNNSWRASSSIDPQVQASDFSNSAVSA
                                        +  TRGA                         G   G                   + S  SN   +A
Subjt:  SLNFIAKHNRSTCISIYDQPAQSQLPVKSPSFDKSTRGAAWYSLSGGLHPLDKGKAVMVEENSVGILHGWNKIKNNSWRASSSIDPQVQASDFSNSAVSA

Query:  QSAINSQPTVTHNDTLRFGTFDFLEHYQRKLKDFNVGFNEKFNSPAFREFNKEKSLNESDVDPKSPINIGPGSLDGGPTFNAVSFTQPVLCKKKLNLDRV
        Q A + QP    +D     T + L       +D ++       S  F +  +E  L   +  P  P  +G    +                 K   L+  
Subjt:  QSAINSQPTVTHNDTLRFGTFDFLEHYQRKLKDFNVGFNEKFNSPAFREFNKEKSLNESDVDPKSPINIGPGSLDGGPTFNAVSFTQPVLCKKKLNLDRV

Query:  LQGGSSSLDEQDSKFESLDGVGLRIGL---KKEGHVDLTPKPKAVGSGCSWKKRARV-GFVPSGLNLSVLEEFNKSPPATMI-----TICWNVRGLGSPW
        +  GS +   Q +  +    +GLR  L     +G +  T KPKA     +WKK+AR  G   +   +  + E   S  A  +         N RG     
Subjt:  LQGGSSSLDEQDSKFESLDGVGLRIGL---KKEGHVDLTPKPKAVGSGCSWKKRARV-GFVPSGLNLSVLEEFNKSPPATMI-----TICWNVRGLGSPW

Query:  AFRSVCNELKRLNPRLCFLSETKCSSAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSIK-WKNTSWRFTGIYGQPDTSLRF
         F++  + ++   P   FL+ET  +   L K++  LH+S   VV      GGL L W DD +++I+++S  HID  I+   + +WR TG+YG P+T  R 
Subjt:  AFRSVCNELKRLNPRLCFLSETKCSSAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSIK-WKNTSWRFTGIYGQPDTSLRF

Query:  HTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRR-----------------------------
         TW L+R +      PW   GDFNEI+  +E  GR PR  R ++ FR  +D CGL++L+F G  YTWCN R                             
Subjt:  HTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRR-----------------------------

Query:  ------------------------------------------QAGEQIGNWSDPESSHQALNQKLQRCARTLKEWGYRKNKARWANIRQVRDKIK-----
                                                  +  +Q   +S   +    + QKL+ C++ L EW    ++  + NI++  D +K     
Subjt:  ------------------------------------------QAGEQIGNWSDPESSHQALNQKLQRCARTLKEWGYRKNKARWANIRQVRDKIK-----

Query:  ----TIYDRSLPIDFATVHSLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIWFHHRASYRRKRNMISGVEDADGRWHTESGQI---------------
             I DR        ++ L R L+ LL +EE  W+QRSR +WL  GDRNT +FH +AS RR+RN IS + D  GRW+  + +I               
Subjt:  ----TIYDRSLPIDFATVHSLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIWFHHRASYRRKRNMISGVEDADGRWHTESGQI---------------

Query:  -------------------------------------------------------YQRHWDMVGPQTVTECLAILNRERSIKDWNHTNIVLIPKVPNP--
                                                               YQ++W +VG    T  L+ LN  R +K  NHT I LIPKV  P  
Subjt:  -------------------------------------------------------YQRHWDMVGPQTVTECLAILNRERSIKDWNHTNIVLIPKVPNP--

Query:  -------------------------------RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESL
                                       RVEWS++ AVMEK+GF  KWI L+ ECIS  S+S+LINGE  GN +PSRGLRQGDPL PYLFLLC+E L
Subjt:  -------------------------------RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESL

Query:  SAMLGFVRRN-RAMGIAISPSSPKISHLFFADDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYL
         +++     +    G+A+    PKI+HLFFADDSL+F KA+        SI+  +ERASGQ +N DK+ I FSK++   T   +   L +  +     YL
Subjt:  SAMLGFVRRN-RAMGIAISPSSPKISHLFFADDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYL

Query:  GLPSSFQRSKSKDF
        GLPS   R+K++ F
Subjt:  GLPSSFQRSKSKDF

A0A2N9FD73 Uncharacterized protein | e value: 3.8e-101 | identity: 28.52%
Query:  KSWIPIQYERLPEFCFYCGIVGHQHKDCSQFYSADRSHSVVFNYGEWLHFDPKGVVLQSLPVPDVVEHNNVPRFGDVPVTIPSLNFIAKHNRSTCISIYD
        + W+  +YERL  FCF CG +GH+ + C   +S          YGEWL     G     L VP  ++  +       PV     N   +   S  + +  
Subjt:  KSWIPIQYERLPEFCFYCGIVGHQHKDCSQFYSADRSHSVVFNYGEWLHFDPKGVVLQSLPVPDVVEHNNVPRFGDVPVTIPSLNFIAKHNRSTCISIYD

Query:  QPAQSQLPVKSPSFDKSTRGAAWYSLSGGLHPLDKGKAVMVEENSVGILHGWNKIKNNSWRASSSIDPQVQASDFSNSAVSAQSAINSQPTVTHNDTLRF
        +  ++   ++ P  D  T                        E     L+G +KI+    +   +   +  +     S V  +  I SQ   T+      
Subjt:  QPAQSQLPVKSPSFDKSTRGAAWYSLSGGLHPLDKGKAVMVEENSVGILHGWNKIKNNSWRASSSIDPQVQASDFSNSAVSAQSAINSQPTVTHNDTLRF

Query:  GTFDFLEHYQRKLKDFNVGFNEKFNSPAFREFNKEKSLNESDVDPKSPINIGPGSLDGGPTFNAVSFTQPVLCKKKLNLDRVLQGGSSSLDEQDSKFESL
                Y RK K+     ++    P       ++   E  VDP SP  I    ++  P    V   Q   C        V +G  +    + +   + 
Subjt:  GTFDFLEHYQRKLKDFNVGFNEKFNSPAFREFNKEKSLNESDVDPKSPINIGPGSLDGGPTFNAVSFTQPVLCKKKLNLDRVLQGGSSSLDEQDSKFESL

Query:  DGVGLRIGLKKEGHVDLTPKPKAV--------GSGCSWKKRARVGFVPSGLNLSVLEEFNK------SPPATMITICWNVRGLGSPWAFRSVCNELKRLN
        DG G+ I   KEGHV    K  A+         +   WK+  +   + +  N  ++E F        +PP  M  + WN RGLG+P+A R++ + +K   
Subjt:  DGVGLRIGLKKEGHVDLTPKPKAV--------GSGCSWKKRARVGFVPSGLNLSVLEEFNK------SPPATMITICWNVRGLGSPWAFRSVCNELKRLN

Query:  PRLCFLSETKCSSAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSIKWKN-TSWRFTGIYGQPDTSLRFHTWNLIRRIYNND
        P + FL ETK     + +++V L Y+  F V  VG SGGL LLWK+ + + I+NFS HHID  I +K+  +WR TG YG+P+   R+ +W L+  +    
Subjt:  PRLCFLSETKCSSAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSIKWKN-TSWRFTGIYGQPDTSLRFHTWNLIRRIYNND

Query:  DTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAGEQI----------GNWS----------------------
          PW+  GDFNEIL  EEK G   + +R + +FRD++  C L+D+ + G  +TW N R  G  +            WS                      
Subjt:  DTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAGEQI----------GNWS----------------------

Query:  -------------------------DPE----------------SSHQALNQKLQRCARTLKEWGYRKNKARWANIRQVRDKIKTI-YDRSLPIDFATVH
                                 +PE                S    L +K++ C   L +W  +        +R   D I+ +  D         + 
Subjt:  -------------------------DPE----------------SSHQALNQKLQRCARTLKEWGYRKNKARWANIRQVRDKIKTI-YDRSLPIDFATVH

Query:  SLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIWFHHRASYRRKRNMISGVEDADGRWHTESGQI-------YQRHWDMVGPQTVTECLAILN------
        +L+  ++SLLL +E +WKQRSR+ WL  GD+NT +FH  AS RRK N + G+ ++ G+W T++ ++       +Q  +    P  + E +  +       
Subjt:  SLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIWFHHRASYRRKRNMISGVEDADGRWHTESGQI-------YQRHWDMVGPQTVTECLAILN------

Query:  -RERSIKDWNHTNIVLIPKVPN-----------PRVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCS
           + +K +    +  + K  +            RVEW F+  +M+KLGF  +W +LV ECI   S+++LINGE +G   P+RG+RQGDPL PYLFLLC+
Subjt:  -RERSIKDWNHTNIVLIPKVPN-----------PRVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCS

Query:  ESLSAMLGFVRRNRAM-GIAISPSSPKISHLFFADDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLG
        E  SA+L    R++ + GI+IS   P++SHL FADDSL+F +A+  E      ++  +E +SGQ INL+K+ I FSKN  ++T   + SI   + V    
Subjt:  ESLSAMLGFVRRNRAM-GIAISPSSPKISHLFFADDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLG

Query:  SYLGLPSSFQRSKSKDF
         YLG+P+   RSK + F
Subjt:  SYLGLPSSFQRSKSKDF

A0A6J1DX30 uncharacterized protein LOC111024874 | e value: 1.9e-113 | identity: 27.92%
Query:  SWRFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAGEQI------
        S RFTG YG P    R  TW L+RRI N D +PW++GGD N ILW+ E S     D   I+ FR+I+D+C L D+ F G I+TWCN R AG+Q+      
Subjt:  SWRFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAGEQI------

Query:  ---------------GNWSDPESSHQALNQKLQRCARTLKEWGYRKNKARWANIRQVRDKIKTIYDRSLPIDFATVHSLERHLDSLLLEEELYWKQRSRE
                       G+WS+   ++ + +  +Q  +  L+ WG       +  I+  +  I   Y++ LP+DF  +H+LE  L  LL  EE++WKQRSRE
Subjt:  ---------------GNWSDPESSHQALNQKLQRCARTLKEWGYRKNKARWANIRQVRDKIKTIYDRSLPIDFATVHSLERHLDSLLLEEELYWKQRSRE

Query:  NWLKWG-----------------DRNTIWFHHR--ASYRRKRNMISGVEDADGRWHTESG---QIYQRHWDMVGPQTVTECLAILNRERSIKDWNHTNIV
        +WLKWG                  R T   + +  A Y ++   ++  +    +     G     YQ +W +VGP+T+  CL  LN    IK WN T I 
Subjt:  NWLKWG-----------------DRNTIWFHHR--ASYRRKRNMISGVEDADGRWHTESG---QIYQRHWDMVGPQTVTECLAILNRERSIKDWNHTNIV

Query:  LIPKVPNP------------------------------------------------------------------------------------RVEWSFIH
        LIPK+  P                                                                                    RVEW+++ 
Subjt:  LIPKVPNP------------------------------------------------------------------------------------RVEWSFIH

Query:  AVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVRRN-RAMGIAISPSSPKISHLFFADDSLIFMK
         +M K+GF   WI  + +CIS   FSI +NG   G F PSRG+RQGDPL PYLFLLC+E LSA++     + R  GI    ++  I+HL FADDSLIF++
Subjt:  AVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVRRN-RAMGIAISPSSPKISHLFFADDSLIFMK

Query:  ASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYLGLPSSFQRS------------------------------
        +   E    + ++  + RASGQCIN  KS + FS NV  +   YL  IL +K VS+ G+YLGLPS F R                               
Subjt:  ASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYLGLPSSFQRS------------------------------

Query:  --------------------------KSKDFKDC------------------------------------------------------------------
                                  K K FKD                                                                   
Subjt:  --------------------------KSKDFKDC------------------------------------------------------------------

Query:  ----ILPSGQWDIPKLSNFLIDEDIQEIRRIPI-SMSMSDRWIWHFDKFGCYTVKSGYKLGMTKTVEASPSDVD---------------------VCSSL
            I   G WD+  +S+   +ED   I  +PI S ++ D W+WH+DK G Y+V+SGYKL M     A+ +  +                     +  S 
Subjt:  ----ILPSGQWDIPKLSNFLIDEDIQEIRRIPI-SMSMSDRWIWHFDKFGCYTVKSGYKLGMTKTVEASPSDVD---------------------VCSSL

Query:  RNH-----HVLVDDIRPL-----CSETIEDTSHALFMCSRASEVWSTL-RYRDIVRSDTIMDIQDRWTNIRKVESTTTIEQICIGAWAIWNDRNSRLHQS
          H     ++L+  I  L     C +  E   HA F C RA ++W TL  +   + ++  +   + W+++ +      +    I  W IWNDRNS +H  
Subjt:  RNH-----HVLVDDIRPL-----CSETIEDTSHALFMCSRASEVWSTL-RYRDIVRSDTIMDIQDRWTNIRKVESTTTIEQICIGAWAIWNDRNSRLHQS

Query:  PIPPVGVRCEW--------------------------IVEYGR---------------------VGAVLRTKMGELVVLMQCRIPLSSSPLCAEAVAVLE
         + PV  +CEW                          +V+Y R                      G ++R     LV     R+P   SPL AE   +LE
Subjt:  PIPPVGVRCEW--------------------------IVEYGR---------------------VGAVLRTKMGELVVLMQCRIPLSSSPLCAEAVAVLE

Query:  GLRIISLRHIRKVNVCTDSLSLISILRKNERCPADCFPVVTDIMCLIGSFEKITFCHINREYNLLSHELARLGADFP--TRVWSRNFPEWASDLAR
        GL+  +  +   + V +DSL  I ++R       D    V +I  L   F  I+F H +R+ N  +H LA+ G   P  T  W  NFP W  DL +
Subjt:  GLRIISLRHIRKVNVCTDSLSLISILRKNERCPADCFPVVTDIMCLIGSFEKITFCHINREYNLLSHELARLGADFP--TRVWSRNFPEWASDLAR

A0A803PGT3 Uncharacterized protein | e value: 1.7e-93 | identity: 29.82%
Query:  PATMITICWNVRGLGSPWAFRSVCNELKRLNPRLCFLSETKCSSAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSIK-WKN
        P+ M  + WNV+GLG+PW  +++C+                         K  L + GCFVV   G SGGL LLWK+  +V++++F++ HID  ++    
Subjt:  PATMITICWNVRGLGSPWAFRSVCNELKRLNPRLCFLSETKCSSAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSIK-WKN

Query:  TSWRFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAG--------
         SWRFTG YG PD   R  +W L+ R+ +     WV GGDFNEI+  +EK G   ++  L+++FR  I  C   ++  NG  +TWCN RQ          
Subjt:  TSWRFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAG--------

Query:  -----------------------------------------------------------EQIG-----------NWSDPESSHQALNQKLQRCARTLKEW
                                                                   E+ G           NW  P      L  ++  C +TLK+W
Subjt:  -----------------------------------------------------------EQIG-----------NWSDPESSHQALNQKLQRCARTLKEW

Query:  GYRKNKARWANIRQVRDKIKTIYDRSLPIDFATVHSLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIWFHHRASYRRKRNMISGVEDADGRWHTESGQ
           K     A  +++++++  +   S   D+ T   +E  L+    ++E+ WKQRSR  WL  GDRNT +FHH+AS R+K+NMI G+ D   RW  E  +
Subjt:  GYRKNKARWANIRQVRDKIKTIYDRSLPIDFATVHSLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIWFHHRASYRRKRNMISGVEDADGRWHTESGQ

Query:  I------------------------------------------------------------------------YQRHWDMVGPQTVTECLAILNRERSIK
        I                                                                        Y  HW  VG + +T CL +LN  +   
Subjt:  I------------------------------------------------------------------------YQRHWDMVGPQTVTECLAILNRERSIK

Query:  DWNHTNIVLIPKVPNP----------------------RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLF
          N T + LIPKV  P                      RVEW F+  +M+ LG+  +WI  V  C++  +FSIL+NGEARG+  P RGLRQGDPL P+LF
Subjt:  DWNHTNIVLIPKVPNP----------------------RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLF

Query:  LLCSESLSAMLGFV-RRNRAMGIAISPSSPKISHLFFADDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTV
        L+CSE LS +L    R N+  G+       +++HL +ADDSLIF+ A+ EE    K ++  +   SGQCIN +KS +C  + +S      L++ + +  +
Subjt:  LLCSESLSAMLGFV-RRNRAMGIAISPSSPKISHLFFADDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTV

Query:  SNLGSYLGLPSSFQRSKSKDF
         N   YLGLP+   R+K + F
Subjt:  SNLGSYLGLPSSFQRSKSKDF

A0A803Q9W0 Uncharacterized protein | e value: 3.8e-101 | identity: 29.47%
Query:  LKKEGHVDLTPKPKAVGSGC------SWKKRARVGFVPSGLNLSVLEEFNK----SPPATMITICWNVRGLGSPWAFRSVCNELKRLNPRLCFLSETKCS
        L KE   +L   P +  +G       + ++R +  FV    NL       +      P TM ++ WNV+GLG+PW   ++ N +K   P + FLSET+  
Subjt:  LKKEGHVDLTPKPKAVGSGC------SWKKRARVGFVPSGLNLSVLEEFNK----SPPATMITICWNVRGLGSPWAFRSVCNELKRLNPRLCFLSETKCS

Query:  SAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSI-KWKNTSWRFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNE
        S  L  +++RL + GCF VD  G SGGL LLWK+   V + +F+  HID  I K ++ +WRFTG YG PD S R H+W L++RI  N + PW+ GGDFNE
Subjt:  SAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSI-KWKNTSWRFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNE

Query:  ILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQA-----------------------------------------------GEQIG
        I    EK G   +   L++NF   ID+C L ++ + G  +TWCN R A                                               G+Q  
Subjt:  ILWDEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQA-----------------------------------------------GEQIG

Query:  N-----------WSDPE----------------SSHQALNQKLQRCARTLKEWGYRKNKARWANIRQVRDKIKTIYDRSLPIDFATVHSLERHLDSLLLE
                    W+D E                +S   L++ L  C   L +W   + K   A I+ ++ +++   +   P  F  +  +E+ L+  L +
Subjt:  N-----------WSDPE----------------SSHQALNQKLQRCARTLKEWGYRKNKARWANIRQVRDKIKTIYDRSLPIDFATVHSLERHLDSLLLE

Query:  EELYWKQRSRENWLKWGDRNTIWFHHRASYRRKRNMISGVEDADGRWH--------------------TESGQ---------------------------
        EEL+WKQRSR  WL  GDRNT +FH +A+ RRK+N I+G+ D +  W                     T+ GQ                           
Subjt:  EELYWKQRSRENWLKWGDRNTIWFHHRASYRRKRNMISGVEDADGRWH--------------------TESGQ---------------------------

Query:  -------------------------IYQRHWDMVGPQTVTECLAILNRERSIKDWNHTNIVLIPKVPNP-------------------------------
                                  Y++HW+++G      CL ILN  +  +  N T + LIPK+  P                               
Subjt:  -------------------------IYQRHWDMVGPQTVTECLAILNRERSIKDWNHTNIVLIPKVPNP-------------------------------

Query:  -----------------------------------------------------RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYP
                                                             RVEW F+  +M  LG+  +W+D +  CI   SFSIL+NG+  G  +P
Subjt:  -----------------------------------------------------RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYP

Query:  SRGLRQGDPLPPYLFLLCSESLSAMLGFVRR-NRAMGIAISPSSPKISHLFFADDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSS
        SRGLRQGDPL PY+FLLCSE LS ++    R NR  G+     + K+SHLFFADDS IF+ A+  +    KSI+ ++   SGQ IN DKS++C  K ++ 
Subjt:  SRGLRQGDPLPPYLFLLCSESLSAMLGFVRR-NRAMGIAISPSSPKISHLFFADDSLIFMKASVEEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSS

Query:  DTTTYLSSILRMKTVSNLGSYLGLPSSFQRSKSKDFKD
             L++IL +K V     YLG+P+S  + K + F+D
Subjt:  DTTTYLSSILRMKTVSNLGSYLGLPSSFQRSKSKDFKD

SwissProt top hits (e value, % identity, alignment)
P92555 Uncharacterized mitochondrial protein AtMg01250 | e value: 1.8e-12 | identity: 54.41%
Query:  LINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVR-RNRAMGIAISPSSPKISHLFFADDS
        +ING  +G   PSRGLRQGDPL PYLF+LC+E LS +    + + R  GI +S +SP+I+HL FADD+
Subjt:  LINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVR-RNRAMGIAISPSSPKISHLFFADDS

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) | e value: 9.2e-04 | identity: 27.42%
Query:  DLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVRRNRAMGIAISPSSPKISHLFFADDSLIFMKASVEEFGCFKSIMV
        D +   I+ A  +I++ G      Y   G++QGDPL P LF +  + L   L     +   G +++P+  KI+ L FADD L+     ++      +   
Subjt:  DLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVRRNRAMGIAISPSSPKISHLFFADDSLIFMKASVEEFGCFKSIMV

Query:  DFERASGQCINLDKSQICFSKNVS
         F R  G  +N +K     +  VS
Subjt:  DFERASGQCINLDKSQICFSKNVS

Arabidopsis top hits    e value    %identity    Alignment
AT3G09510.1 Ribonuclease H-like superfamily protein    7.7e-06    25
Query:  WDIPKLSNFLIDEDIQEIRRIPISMSMS-DRWIWHFDKFGCYTVKSGYKL----GMTKTVEASP--SDVDVCSSLRNHHVL-------------------
        WD  K+S F+   D   I RI ++ S   D+ IW+++  G YTV+SGY L      T     +P    +D+ + + N  ++                   
Subjt:  WDIPKLSNFLIDEDIQEIRRIPISMSMS-DRWIWHFDKFGCYTVKSGYKL----GMTKTVEASP--SDVDVCSSLRNHHVL-------------------

Query:  --------VDDIRPLCSETIEDTSHALFMCSRASEVWSTLRYRDIVRSDTIM-DIQDRWTNIRKVESTTTIEQI--CIGAWAIW
                +D   P C    E  +HALF C  A+  W  L    ++R+  +  D ++  +NI      TT+      +  W IW
Subjt:  --------VDDIRPLCSETIEDTSHALFMCSRASEVWSTLRYRDIVRSDTIM-DIQDRWTNIRKVESTTTIEQI--CIGAWAIW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)    1.3e-13    54.41
Query:  LINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVR-RNRAMGIAISPSSPKISHLFFADDS
        +ING  +G   PSRGLRQGDPL PYLF+LC+E LS +    + + R  GI +S +SP+I+HL FADD+
Subjt:  LINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVR-RNRAMGIAISPSSPKISHLFFADDS


Sequences
CDS sequence
ATGAGGCGGTTTGCGATGTTTTTGGATGGATTTTCCGTCATTCCGCGATTTTTGTTTCAGATTTTATTGCAAATGTTTACGTTTTACATTCAGCTAGGCTGGGAAGGATT
TTCATCCGGGTCAGAGCTGCTAAATCAGGTTGTTCGAGGAATGGGGGTTTCATCACCTCTACACTTACAATCTTTTCTCTTGGATTCCTGTGTTTCTGTTATCTGTTTCC
TTGTTCGTTGTTTCCGTATGGATCCCAATTCACTTGTACAAGATTGGTCCAAGCTCAACCTTACTCAAGAGGAAAATGAGGTTGCAGTCATGGCTGATCGGGAAGTTGTG
GAGCGAATGAGACTGACTTTAGGTTGCTGTTTATTGGGAAAATTACAATCCCATCGTTTTTTAGCCGCTAAAGTCATGCGTAAGACTTTTGCATCGGCTTGGAGAGTTGT
CCAAGGGATTCAGGTGGAATTGTTAGAAGATGGGATTAGTAAGAGCTGGATTCCCATTCAATACGAAAGATTACCAGAGTTCTGTTTCTATTGCGGGATTGTTGGTCATC
AACACAAGGACTGTAGCCAATTTTATTCAGCGGACAGGTCCCATAGTGTGGTGTTTAATTATGGTGAATGGCTACACTTTGATCCTAAAGGCGTAGTTTTGCAAAGTTTA
CCAGTGCCGGATGTGGTGGAGCATAACAATGTTCCACGTTTTGGTGATGTCCCTGTGACGATTCCATCCCTCAATTTCATTGCCAAGCATAATCGTTCAACATGTATCTC
GATTTATGATCAGCCTGCGCAGTCCCAATTACCAGTTAAAAGTCCCTCTTTTGATAAATCGACCAGAGGCGCGGCGTGGTACTCGTTGTCTGGTGGCCTTCATCCTCTGG
ACAAAGGAAAAGCTGTAATGGTCGAGGAGAACTCAGTCGGTATTCTACACGGTTGGAACAAGATCAAGAATAATAGTTGGCGGGCTTCTTCATCCATCGATCCCCAGGTT
CAAGCGAGCGATTTCTCTAATTCGGCGGTCTCTGCGCAGTCGGCAATCAATTCACAGCCGACGGTTACTCACAACGATACGTTACGATTTGGAACGTTTGACTTTCTGGA
GCATTATCAACGGAAGTTAAAGGATTTTAATGTGGGTTTTAATGAGAAGTTTAATTCGCCGGCGTTTCGGGAATTCAATAAAGAAAAATCTCTTAACGAGTCTGATGTCG
ATCCGAAATCACCTATAAATATTGGGCCTGGTTCTCTGGATGGTGGGCCCACTTTCAATGCAGTTTCTTTCACTCAACCAGTGCTTTGCAAGAAAAAATTAAACCTGGAC
AGAGTGCTTCAAGGCGGTTCGAGTTCATTGGACGAGCAAGACTCTAAATTTGAGAGCCTGGATGGAGTGGGCCTTCGGATTGGGCTGAAAAAGGAGGGGCATGTGGACCT
GACACCAAAACCCAAAGCTGTTGGATCGGGTTGTTCCTGGAAGAAAAGGGCTCGCGTAGGTTTTGTCCCATCTGGTTTGAATCTATCGGTGCTTGAAGAATTCAATAAGT
CCCCGCCAGCAACCATGATTACTATATGTTGGAATGTCCGTGGATTGGGGAGTCCTTGGGCGTTCCGTAGTGTCTGTAACGAGTTAAAACGTTTAAATCCCCGTCTTTGT
TTCCTGTCAGAGACCAAATGTTCTTCTGCTGTTTTGAATAAATTGAAAGTTCGTTTGCATTATTCCGGGTGTTTTGTTGTCGATAGAGTTGGGTTGAGCGGGGGTCTATG
CTTGTTGTGGAAGGATGATGTTGATGTGTCTATTCGTAATTTTTCAATTCATCATATTGATGTATCTATTAAATGGAAAAATACAAGTTGGAGGTTTACTGGTATTTATG
GCCAACCAGATACGTCCTTACGCTTTCATACCTGGAATTTGATTCGTAGAATCTATAATAATGATGATACACCATGGGTTGTTGGTGGAGATTTTAATGAAATTTTGTGG
GATGAAGAGAAATCAGGCAGACCTCCTAGGGATATGAGGCTAATTCAAAATTTTCGTGATATTATTGACTCCTGCGGACTTTTGGATCTGAAATTCAATGGTGATATTTA
TACTTGGTGTAATAGGCGTCAAGCAGGGGAGCAAATAGGGAATTGGTCAGATCCGGAAAGTAGTCATCAAGCACTCAACCAAAAATTGCAAAGGTGCGCTCGGACATTAA
AAGAGTGGGGATACAGGAAGAACAAGGCACGATGGGCCAACATTCGTCAGGTGAGGGACAAAATCAAAACTATCTATGATAGGTCATTACCAATTGACTTCGCAACGGTG
CATAGCCTAGAGCGTCATCTGGATAGTTTATTACTCGAGGAGGAACTGTATTGGAAACAACGATCACGAGAGAATTGGCTAAAATGGGGGGACCGCAATACTATATGGTT
CCATCACCGAGCATCCTATAGGCGTAAAAGGAACATGATTAGTGGGGTGGAGGACGCAGATGGGAGGTGGCATACGGAGTCTGGCCAGATTTACCAGCGACACTGGGATA
TGGTTGGTCCTCAGACGGTGACTGAATGCTTAGCAATCCTTAATCGAGAGCGTTCGATTAAAGACTGGAATCACACTAATATTGTTCTCATTCCAAAGGTCCCAAATCCA
AGAGTCGAGTGGTCCTTCATTCATGCTGTTATGGAAAAGTTGGGTTTCCCATGCAAATGGATAGATCTGGTTAAAGAATGTATTTCTATGGCCTCTTTTTCTATCCTTAT
TAATGGGGAGGCAAGGGGGAATTTTTATCCTTCGAGAGGACTAAGACAGGGTGATCCTCTGCCACCGTATTTGTTTTTGCTTTGTTCGGAAAGCTTGTCTGCTATGCTTG
GTTTTGTTAGGAGGAACAGGGCCATGGGAATTGCTATATCCCCATCATCCCCAAAAATTTCCCATCTATTTTTTGCAGACGACAGTCTCATTTTCATGAAGGCTTCAGTG
GAGGAATTTGGTTGTTTTAAAAGCATTATGGTTGACTTCGAACGTGCTTCGGGTCAATGTATTAATTTGGATAAGTCACAAATTTGTTTTTCGAAGAATGTTTCAAGTGA
TACAACAACCTATCTCAGTTCAATTTTGAGAATGAAGACTGTATCAAATTTGGGATCCTACCTTGGTCTGCCGTCATCTTTCCAACGTAGTAAGAGCAAGGATTTCAAAG
ACTGTATCCTGCCATCTGGTCAGTGGGACATTCCAAAGCTTTCTAATTTCCTCATAGATGAGGATATTCAGGAGATTAGAAGGATTCCAATCAGTATGTCGATGTCGGAT
AGATGGATTTGGCATTTTGATAAATTTGGGTGTTATACGGTCAAAAGCGGCTACAAATTGGGTATGACTAAGACAGTAGAGGCGTCACCGTCTGATGTCGATGTTTGCTC
AAGCTTACGGAATCATCACGTCCTAGTGGATGATATTCGTCCGTTATGTTCGGAGACAATCGAGGATACATCTCATGCTCTGTTTATGTGCTCTAGAGCGTCAGAAGTCT
GGTCGACTCTAAGATATAGGGATATCGTAAGATCGGATACAATTATGGACATCCAGGATCGTTGGACTAATATTAGAAAGGTTGAATCAACTACGACCATTGAACAGATT
TGCATAGGAGCCTGGGCCATCTGGAACGATCGGAATAGTCGGCTTCATCAGAGTCCAATTCCTCCGGTGGGGGTCCGTTGTGAATGGATAGTAGAGTATGGTAGAGTGGG
GGCAGTTCTGCGTACAAAGATGGGAGAATTGGTAGTTCTTATGCAATGCAGGATTCCGTTATCATCATCCCCCTTATGTGCGGAGGCGGTAGCGGTTCTTGAAGGTCTTC
GAATAATTTCTCTACGGCATATTCGTAAGGTTAATGTTTGCACTGACTCTCTATCGTTGATCTCCATTCTCCGTAAGAATGAGCGATGTCCAGCGGATTGTTTTCCTGTT
GTGACAGACATTATGTGTCTCATAGGATCTTTTGAAAAGATTACTTTTTGTCATATTAACCGTGAGTATAACTTGTTGTCGCATGAGTTAGCTCGTTTAGGTGCGGATTT
TCCTACTCGAGTTTGGAGTAGGAACTTCCCAGAGTGGGCTTCAGATCTAGCGAGAGGTGAAATGTCTTGTTTTGTATCCCCTAAAGGGAATTTCCGTTCTTGA
mRNA sequence
ATGAGGCGGTTTGCGATGTTTTTGGATGGATTTTCCGTCATTCCGCGATTTTTGTTTCAGATTTTATTGCAAATGTTTACGTTTTACATTCAGCTAGGCTGGGAAGGATT
TTCATCCGGGTCAGAGCTGCTAAATCAGGTTGTTCGAGGAATGGGGGTTTCATCACCTCTACACTTACAATCTTTTCTCTTGGATTCCTGTGTTTCTGTTATCTGTTTCC
TTGTTCGTTGTTTCCGTATGGATCCCAATTCACTTGTACAAGATTGGTCCAAGCTCAACCTTACTCAAGAGGAAAATGAGGTTGCAGTCATGGCTGATCGGGAAGTTGTG
GAGCGAATGAGACTGACTTTAGGTTGCTGTTTATTGGGAAAATTACAATCCCATCGTTTTTTAGCCGCTAAAGTCATGCGTAAGACTTTTGCATCGGCTTGGAGAGTTGT
CCAAGGGATTCAGGTGGAATTGTTAGAAGATGGGATTAGTAAGAGCTGGATTCCCATTCAATACGAAAGATTACCAGAGTTCTGTTTCTATTGCGGGATTGTTGGTCATC
AACACAAGGACTGTAGCCAATTTTATTCAGCGGACAGGTCCCATAGTGTGGTGTTTAATTATGGTGAATGGCTACACTTTGATCCTAAAGGCGTAGTTTTGCAAAGTTTA
CCAGTGCCGGATGTGGTGGAGCATAACAATGTTCCACGTTTTGGTGATGTCCCTGTGACGATTCCATCCCTCAATTTCATTGCCAAGCATAATCGTTCAACATGTATCTC
GATTTATGATCAGCCTGCGCAGTCCCAATTACCAGTTAAAAGTCCCTCTTTTGATAAATCGACCAGAGGCGCGGCGTGGTACTCGTTGTCTGGTGGCCTTCATCCTCTGG
ACAAAGGAAAAGCTGTAATGGTCGAGGAGAACTCAGTCGGTATTCTACACGGTTGGAACAAGATCAAGAATAATAGTTGGCGGGCTTCTTCATCCATCGATCCCCAGGTT
CAAGCGAGCGATTTCTCTAATTCGGCGGTCTCTGCGCAGTCGGCAATCAATTCACAGCCGACGGTTACTCACAACGATACGTTACGATTTGGAACGTTTGACTTTCTGGA
GCATTATCAACGGAAGTTAAAGGATTTTAATGTGGGTTTTAATGAGAAGTTTAATTCGCCGGCGTTTCGGGAATTCAATAAAGAAAAATCTCTTAACGAGTCTGATGTCG
ATCCGAAATCACCTATAAATATTGGGCCTGGTTCTCTGGATGGTGGGCCCACTTTCAATGCAGTTTCTTTCACTCAACCAGTGCTTTGCAAGAAAAAATTAAACCTGGAC
AGAGTGCTTCAAGGCGGTTCGAGTTCATTGGACGAGCAAGACTCTAAATTTGAGAGCCTGGATGGAGTGGGCCTTCGGATTGGGCTGAAAAAGGAGGGGCATGTGGACCT
GACACCAAAACCCAAAGCTGTTGGATCGGGTTGTTCCTGGAAGAAAAGGGCTCGCGTAGGTTTTGTCCCATCTGGTTTGAATCTATCGGTGCTTGAAGAATTCAATAAGT
CCCCGCCAGCAACCATGATTACTATATGTTGGAATGTCCGTGGATTGGGGAGTCCTTGGGCGTTCCGTAGTGTCTGTAACGAGTTAAAACGTTTAAATCCCCGTCTTTGT
TTCCTGTCAGAGACCAAATGTTCTTCTGCTGTTTTGAATAAATTGAAAGTTCGTTTGCATTATTCCGGGTGTTTTGTTGTCGATAGAGTTGGGTTGAGCGGGGGTCTATG
CTTGTTGTGGAAGGATGATGTTGATGTGTCTATTCGTAATTTTTCAATTCATCATATTGATGTATCTATTAAATGGAAAAATACAAGTTGGAGGTTTACTGGTATTTATG
GCCAACCAGATACGTCCTTACGCTTTCATACCTGGAATTTGATTCGTAGAATCTATAATAATGATGATACACCATGGGTTGTTGGTGGAGATTTTAATGAAATTTTGTGG
GATGAAGAGAAATCAGGCAGACCTCCTAGGGATATGAGGCTAATTCAAAATTTTCGTGATATTATTGACTCCTGCGGACTTTTGGATCTGAAATTCAATGGTGATATTTA
TACTTGGTGTAATAGGCGTCAAGCAGGGGAGCAAATAGGGAATTGGTCAGATCCGGAAAGTAGTCATCAAGCACTCAACCAAAAATTGCAAAGGTGCGCTCGGACATTAA
AAGAGTGGGGATACAGGAAGAACAAGGCACGATGGGCCAACATTCGTCAGGTGAGGGACAAAATCAAAACTATCTATGATAGGTCATTACCAATTGACTTCGCAACGGTG
CATAGCCTAGAGCGTCATCTGGATAGTTTATTACTCGAGGAGGAACTGTATTGGAAACAACGATCACGAGAGAATTGGCTAAAATGGGGGGACCGCAATACTATATGGTT
CCATCACCGAGCATCCTATAGGCGTAAAAGGAACATGATTAGTGGGGTGGAGGACGCAGATGGGAGGTGGCATACGGAGTCTGGCCAGATTTACCAGCGACACTGGGATA
TGGTTGGTCCTCAGACGGTGACTGAATGCTTAGCAATCCTTAATCGAGAGCGTTCGATTAAAGACTGGAATCACACTAATATTGTTCTCATTCCAAAGGTCCCAAATCCA
AGAGTCGAGTGGTCCTTCATTCATGCTGTTATGGAAAAGTTGGGTTTCCCATGCAAATGGATAGATCTGGTTAAAGAATGTATTTCTATGGCCTCTTTTTCTATCCTTAT
TAATGGGGAGGCAAGGGGGAATTTTTATCCTTCGAGAGGACTAAGACAGGGTGATCCTCTGCCACCGTATTTGTTTTTGCTTTGTTCGGAAAGCTTGTCTGCTATGCTTG
GTTTTGTTAGGAGGAACAGGGCCATGGGAATTGCTATATCCCCATCATCCCCAAAAATTTCCCATCTATTTTTTGCAGACGACAGTCTCATTTTCATGAAGGCTTCAGTG
GAGGAATTTGGTTGTTTTAAAAGCATTATGGTTGACTTCGAACGTGCTTCGGGTCAATGTATTAATTTGGATAAGTCACAAATTTGTTTTTCGAAGAATGTTTCAAGTGA
TACAACAACCTATCTCAGTTCAATTTTGAGAATGAAGACTGTATCAAATTTGGGATCCTACCTTGGTCTGCCGTCATCTTTCCAACGTAGTAAGAGCAAGGATTTCAAAG
ACTGTATCCTGCCATCTGGTCAGTGGGACATTCCAAAGCTTTCTAATTTCCTCATAGATGAGGATATTCAGGAGATTAGAAGGATTCCAATCAGTATGTCGATGTCGGAT
AGATGGATTTGGCATTTTGATAAATTTGGGTGTTATACGGTCAAAAGCGGCTACAAATTGGGTATGACTAAGACAGTAGAGGCGTCACCGTCTGATGTCGATGTTTGCTC
AAGCTTACGGAATCATCACGTCCTAGTGGATGATATTCGTCCGTTATGTTCGGAGACAATCGAGGATACATCTCATGCTCTGTTTATGTGCTCTAGAGCGTCAGAAGTCT
GGTCGACTCTAAGATATAGGGATATCGTAAGATCGGATACAATTATGGACATCCAGGATCGTTGGACTAATATTAGAAAGGTTGAATCAACTACGACCATTGAACAGATT
TGCATAGGAGCCTGGGCCATCTGGAACGATCGGAATAGTCGGCTTCATCAGAGTCCAATTCCTCCGGTGGGGGTCCGTTGTGAATGGATAGTAGAGTATGGTAGAGTGGG
GGCAGTTCTGCGTACAAAGATGGGAGAATTGGTAGTTCTTATGCAATGCAGGATTCCGTTATCATCATCCCCCTTATGTGCGGAGGCGGTAGCGGTTCTTGAAGGTCTTC
GAATAATTTCTCTACGGCATATTCGTAAGGTTAATGTTTGCACTGACTCTCTATCGTTGATCTCCATTCTCCGTAAGAATGAGCGATGTCCAGCGGATTGTTTTCCTGTT
GTGACAGACATTATGTGTCTCATAGGATCTTTTGAAAAGATTACTTTTTGTCATATTAACCGTGAGTATAACTTGTTGTCGCATGAGTTAGCTCGTTTAGGTGCGGATTT
TCCTACTCGAGTTTGGAGTAGGAACTTCCCAGAGTGGGCTTCAGATCTAGCGAGAGGTGAAATGTCTTGTTTTGTATCCCCTAAAGGGAATTTCCGTTCTTGA
Protein sequence
MRRFAMFLDGFSVIPRFLFQILLQMFTFYIQLGWEGFSSGSELLNQVVRGMGVSSPLHLQSFLLDSCVSVICFLVRCFRMDPNSLVQDWSKLNLTQEENEVAVMADREVV
ERMRLTLGCCLLGKLQSHRFLAAKVMRKTFASAWRVVQGIQVELLEDGISKSWIPIQYERLPEFCFYCGIVGHQHKDCSQFYSADRSHSVVFNYGEWLHFDPKGVVLQSL
PVPDVVEHNNVPRFGDVPVTIPSLNFIAKHNRSTCISIYDQPAQSQLPVKSPSFDKSTRGAAWYSLSGGLHPLDKGKAVMVEENSVGILHGWNKIKNNSWRASSSIDPQV
QASDFSNSAVSAQSAINSQPTVTHNDTLRFGTFDFLEHYQRKLKDFNVGFNEKFNSPAFREFNKEKSLNESDVDPKSPINIGPGSLDGGPTFNAVSFTQPVLCKKKLNLD
RVLQGGSSSLDEQDSKFESLDGVGLRIGLKKEGHVDLTPKPKAVGSGCSWKKRARVGFVPSGLNLSVLEEFNKSPPATMITICWNVRGLGSPWAFRSVCNELKRLNPRLC
FLSETKCSSAVLNKLKVRLHYSGCFVVDRVGLSGGLCLLWKDDVDVSIRNFSIHHIDVSIKWKNTSWRFTGIYGQPDTSLRFHTWNLIRRIYNNDDTPWVVGGDFNEILW
DEEKSGRPPRDMRLIQNFRDIIDSCGLLDLKFNGDIYTWCNRRQAGEQIGNWSDPESSHQALNQKLQRCARTLKEWGYRKNKARWANIRQVRDKIKTIYDRSLPIDFATV
HSLERHLDSLLLEEELYWKQRSRENWLKWGDRNTIWFHHRASYRRKRNMISGVEDADGRWHTESGQIYQRHWDMVGPQTVTECLAILNRERSIKDWNHTNIVLIPKVPNP
RVEWSFIHAVMEKLGFPCKWIDLVKECISMASFSILINGEARGNFYPSRGLRQGDPLPPYLFLLCSESLSAMLGFVRRNRAMGIAISPSSPKISHLFFADDSLIFMKASV
EEFGCFKSIMVDFERASGQCINLDKSQICFSKNVSSDTTTYLSSILRMKTVSNLGSYLGLPSSFQRSKSKDFKDCILPSGQWDIPKLSNFLIDEDIQEIRRIPISMSMSD
RWIWHFDKFGCYTVKSGYKLGMTKTVEASPSDVDVCSSLRNHHVLVDDIRPLCSETIEDTSHALFMCSRASEVWSTLRYRDIVRSDTIMDIQDRWTNIRKVESTTTIEQI
CIGAWAIWNDRNSRLHQSPIPPVGVRCEWIVEYGRVGAVLRTKMGELVVLMQCRIPLSSSPLCAEAVAVLEGLRIISLRHIRKVNVCTDSLSLISILRKNERCPADCFPV
VTDIMCLIGSFEKITFCHINREYNLLSHELARLGADFPTRVWSRNFPEWASDLARGEMSCFVSPKGNFRS