CuGenDBv2

Lcy06g010910 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy06g010910
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr06:12297487..12300037
RNA-Seq Expression: Lcy06g010910
Synteny: Lcy06g010910
Gene Ontology terms:
  GO:0090304 - nucleic acid metabolic process (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR002156 - Ribonuclease H domain
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR036397 - Ribonuclease H superfamily
  IPR043502 - DNA/RNA polymerase superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
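
The genome location above is given as a `chromosome:start..end` string. As a minimal sketch of how such a string could be parsed and the genomic span computed, the following assumes the coordinates are 1-based and inclusive (the record does not state the convention); the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr06:12297487..12300037'-style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr06:12297487..12300037")
print(chrom, start, end, span_length(start, end))
# prints: Chr06 12297487 12300037 2551
```

Under that assumption the gene spans 2551 bp on Chr06; if the coordinates were 0-based half-open instead, the length would be one less.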


Homology
GenBank top hits | e value | % identity | Alignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] | 1.8e-98 | 35.54 | (below)
Query:  TISNCLSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKG
        T+  CL+ LN+ + ++ WN T I LIPK +  R +SD+RPIS CNVSYKI++K I NRLK V+  +I + QS F+P R+ISDN+I+GHE LH +   + G
Subjt:  TISNCLSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKG

Query:  KTGFATLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKR-YLAG
          G A LK+D+SKA+DRVEW+Y   IM K+GF+  W++ +++CI+T  FSI +NG   G  +PSRGIRQGDPLSPYLFL+C EGLSAL++       L G
Subjt:  KTGFATLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKR-YLAG

Query:  VSIARSCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLGSYLGLPSSFHR-------
        +    +   I+HL FADDSL+FL+ +  E    + +L  Y +ASGQC+N SKS + FS  V  + +QYL  IL++K+    G+YLGLPS F R       
Subjt:  VSIARSCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLGSYLGLPSSFHR-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------EFISPSLQWDVAKLNLYLTWEDVEVIQCLQKSS-SAPDKWIWHYDRRGEYSVKSGYKLSMMSGQEASLSGIGLE
                                   FI+    WDV  ++     ED ++I  +  SS +  D W+WHYD+RG YSV+SGYKL M     A+ +     
Subjt:  --------------------------EFISPSLQWDVAKLNLYLTWEDVEVIQCLQKSS-SAPDKWIWHYDRRGEYSVKSGYKLSMMSGQEASLSGIGLE

Query:  TSWWNKLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVWDILFP
         + WN +W++ VP+K+K+F+WRS H  IPT  NL    +     C +C +   +  HA F C RA ++W  LFP
Subjt:  TSWWNKLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVWDILFP

XP_023908235.1 uncharacterized protein LOC112019924 [Quercus suber] | 2.7e-86 | 35.31 | (below)
Query:  LSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFA
        L  LN+       NHT IVLIPK ++   +SD+RPIS CNV YKI++KV+ NRLK VL +II   QS F+PGR I+DN++L +E LH +  ++KGK G+ 
Subjt:  LSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFA

Query:  TLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSA-RKRYLAGVSIAR
         LK+D+SKAYDRVEW +   IM +LGF  +W++ VM C+TT++FS+L+NG  FG ++PSRGIRQGDPLSPYLFL+CTEG ++LLD A     L GV+I R
Subjt:  TLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSA-RKRYLAGVSIAR

Query:  SCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLGSYLGLP-----SSFH--------
        + PKI++L FADDSL+F +V  VE      ILQ Y KASGQ +N+ KS+++FS       +Q    IL +K      SYLGLP     S +H        
Subjt:  SCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLGSYLGLP-----SSFH--------

Query:  -----------------REFI-------SPSLQWDVAKLNLYL-------------------------TWEDVEVIQCLQKSSSA------------PDK
                         +E +        P+    V KL L L                          W  +     + +S                D+
Subjt:  -----------------REFI-------SPSLQWDVAKLNLYL-------------------------TWEDVEVIQCLQKSSSA------------PDK

Query:  WI----------------------------------------------------------------WHYDRRGEYSVKSGYK----LSMMSGQEASLSGI
        WI                                                                W +++ G ++VKS YK    LS       S SG 
Subjt:  WI----------------------------------------------------------------WHYDRRGEYSVKSGYK----LSMMSGQEASLSGI

Query:  GLETSWWNKLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVW
        G ++  W  LW++R+P+K+K+F WR+ H+ +PT  NL    + V+++CP+C     +T HAL++C  A ++W
Subjt:  GLETSWWNKLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVW

XP_028068804.1 uncharacterized protein LOC114271378 [Camellia sinensis] | 2.0e-86 | 38.29 | (below)
Query:  LSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFA
        L +LN  + V A N T IVLIPK +  + +S +RPIS CNV YK+V+KV+ANR++ +L DII   QS F+ GR ISDNM+   E+ HFL++KR GK G  
Subjt:  LSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFA

Query:  TLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSAR-KRYLAGVSIAR
         LK+DMSKAYDRVEWS+   +M ++GF+  +V  ++ CI++ ++S+L+NG       P+RG+RQGDPLSPYLF++C EGLSAL+  A  +  L GV++ R
Subjt:  TLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSAR-KRYLAGVSIAR

Query:  SCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLGSYLGLPSSFHR------------
          P++SHL FADDSL+F      E    + IL  YE  SGQ +N+ KS + FS  V  D +  L N + +      G YLGLP    R            
Subjt:  SCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLGSYLGLPSSFHR------------

Query:  --------------------------------------------EFISPSLQ--------------WDVAKLNLYLTWEDVEVIQCLQKSS-SAPDKWIW
                                                    E I   LQ              WDV  L   L   DVE I+ +       PDK +W
Subjt:  --------------------------------------------EFISPSLQ--------------WDVAKLNLYLTWEDVEVIQCLQKSS-SAPDKWIW

Query:  HYDRRGEYSVKSGYKLSM-MSGQEASLSGIGLE-TSW-WNKLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRA
        HY R G +SV+  Y L M +SG+E   SG  ++  SW W ++W +R+P+K+K+F W+   N +P    L   H+   + C  C +   T  H L++C+RA
Subjt:  HYDRRGEYSVKSGYKLSM-MSGQEASLSGIGLE-TSW-WNKLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRA

Query:  HEVW
         EVW
Subjt:  HEVW

XP_042950314.1 uncharacterized protein LOC122282427 [Carya illinoinensis] | 2.4e-87 | 36.46 | (below)
Query:  LSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFA
        L  LNS       N T I LIPK +   LVSDYRPIS CNV YK+ +KVI NRLK  L  II E Q  F+ GR ISDN+++ +E++++LR+KR GK G+ 
Subjt:  LSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFA

Query:  TLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSAR-KRYLAGVSIAR
        ++K+DMSKAYDRVEW +   IM KLG    +V L+M+CI T TFS+L+NG + G I P+RGIRQGDPLSPYLFL CTEGL ALL +A  +R L GV I R
Subjt:  TLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSAR-KRYLAGVSIAR

Query:  SCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLG---SYLGLPSSFHREFISPSLQW
          P ++HL FADDS++F +         + +L  YE ASGQ VN  K++M FS  V A+ ++ L     ++++ SLG   SY        R+++    +W
Subjt:  SCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLG---SYLGLPSSFHREFISPSLQW

Query:  DVAKLNLYLTWEDV----------------------------------------------------EVIQCLQKSSSAPDKWIWHYDRRGEYSVKSGYKL
         V    +   W+D                                                     E+++    S+   D+WIW  +  G ++VKS YKL
Subjt:  DVAKLNLYLTWEDV----------------------------------------------------EVIQCLQKSSSAPDKWIWHYDRRGEYSVKSGYKL

Query:  --SMMSGQEASLSGIGLETSWWNKLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVWDILFPSMNAAQMI
          + M+      S    ++  W K+W M+VP K+K+F WR   + +PT+VNL    V    +C  C +     +HAL  C     VW   FP     Q +
Subjt:  --SMMSGQEASLSGIGLETSWWNKLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVWDILFPSMNAAQMI

Query:  VSTVSTSPLGAEAVTVLEGLRLAKALDVHRVTVFSDSLTLIESVNEEIQCDSSIASTLWDIKAIR
               P G   V V +G   A         V  D+   I     +I+ +   A+T+  I  +R
Subjt:  VSTVSTSPLGAEAVTVLEGLRLAKALDVHRVTVFSDSLTLIESVNEEIQCDSSIASTLWDIKAIR

XP_042988748.1 uncharacterized protein LOC122316282 [Carya illinoinensis] | 7.0e-87 | 39.49 | (below)
Query:  LSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFA
        L+ILN+   ++  N T I LIPK + ++ V+D+RPIS CNV YKIV KVI+N+LK VL +II   Q+ F+PGR ISDN+++ +E+LH +  + +GK G+ 
Subjt:  LSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFA

Query:  TLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLS-ALLDSARKRYLAGVSIAR
          K+DMSK Y+RVEW + + +MAKLGF+ S + L+ +C ++ ++SIL+NGE   F KP+RG+RQGDPLSPYLF++CTE LS +L    +KR ++ V + +
Subjt:  TLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLS-ALLDSARKRYLAGVSIAR

Query:  SCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPAD-SRQYLSNILSMKVFDSLGSYLGLPS--------SFH---R
           +ISHLFF DDSL+F K  ++E+     IL  YE ASGQ +N  KS+++FS   P + +++ +  I  +K   +   YLGLPS        +FH    
Subjt:  SCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPAD-SRQYLSNILSMKVFDSLGSYLGLPS--------SFH---R

Query:  EFISPSLQWDVAKLNLYLTWEDVEVIQCLQKS-SSAPDKWIWHYDRRGEYSVKSGYKL--SMMSGQEASLSGIGLETSWWNKLWRMRVPSKVKLFVWRSF
           +    W    L++    E V++I+ +  S  +  D+  W     G+++VKS Y +  ++ +  E + S    + S+W  +W+M VP+ V++F+WR++
Subjt:  EFISPSLQWDVAKLNLYLTWEDVEVIQCLQKS-SSAPDKWIWHYDRRGEYSVKSGYKL--SMMSGQEASLSGIGLETSWWNKLWRMRVPSKVKLFVWRSF

Query:  HNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDH
        + ++PT+ NL+   +  + +CP+CQ E     H
Subjt:  HNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDH

TrEMBL top hits | e value | % identity | Alignment
A0A2N9FJ03 Reverse transcriptase domain-containing protein | 1.1e-90 | 36.03 | (below)
Query:  ISNCLSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGK
        IS  LS LNS   +++ NHT I LIPK ++   V+D+RPIS CNV YK+V+KV+ANRLKL+L  +I E QS F+PGR I+DN+++  E LH + H + G+
Subjt:  ISNCLSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGK

Query:  TGFATLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKR-YLAGV
         G   LK+DMSKAYDRVEW Y   +M K+GFH  W+ L+ +CI+T ++SIL+NGE  G I PSRG+RQGDPLSPYLFL+C EG+ +L+  A     + GV
Subjt:  TGFATLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKR-YLAGV

Query:  SIARSCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLGSYLGLP-------------
        S+ R  PKI+HLFFADDSL+F K         + IL+ YE+ASGQ VN  K+T+FFS  VP  ++  + + L + +      YLGLP             
Subjt:  SIARSCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLGSYLGLP-------------

Query:  -------------SSFHREF--------------------------------------------------------------------------------
                     S F+R F                                                                                
Subjt:  -------------SSFHREF--------------------------------------------------------------------------------

Query:  ISPSLQWDVAKLNLYLTWEDVEVIQCLQKSSSAP-DKWIWHYDRRGEYSVKSGYK--LSMMSGQEASLSGIGLETSWWNKLWRMRVPSKVKLFVWRSFHN
        + P ++WD A ++      D E I+ +  S  AP DK+ W     G YSVKSGY+  + + + Q    S   +    W  +W +R+P K + F WR+  +
Subjt:  ISPSLQWDVAKLNLYLTWEDVEVIQCLQKSSSAP-DKWIWHYDRRGEYSVKSGYK--LSMMSGQEASLSGIGLETSWWNKLWRMRVPSKVKLFVWRSFHN

Query:  SIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVWD
        ++PT VNL   H+P++ +C  C+       HA++ C     VW+
Subjt:  SIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVWD

A0A2N9FR17 Reverse transcriptase domain-containing protein | 1.9e-90 | 40.86 | (below)
Query:  LSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFA
        LS LNS   + + NHT I LIPK+++   V+++RPIS CNV YK+++KV+ANRLK++L  I+ + QS F+ GR I+DN+++  E LH++ H + G+ G  
Subjt:  LSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFA

Query:  TLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSA-RKRYLAGVSIAR
         LK+DM+KAYDRVEW +   IM+++GFH  W+ L+ +CI+T ++SIL+NGE  G+IKPSRG+RQGDPLSPYLFL+C EGL +L++ A R   + GVS+ R
Subjt:  TLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSA-RKRYLAGVSIAR

Query:  SCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVF----------DSLGSYLGLPSSFHREF
          PKI+HLFFADD+L+F K         + IL  YEKASGQ VN  K+T++FS   P  S+  +   L  K F           + GSY        RE 
Subjt:  SCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVF----------DSLGSYLGLPSSFHREF

Query:  IS-PSLQWDVAKLNLYLTWEDVEVIQCLQKSS-SAPDKWIWHYDRRGEYSVKSGYKLSMMSGQEASLSGIGLETSWWNKLWRMRVPSKVKLFVWRSFHNS
        I+ P + WD + ++      DVE I+ +  S+    DK IW  +  G+YSV+SGY+  ++  ++ +L G                         R+   +
Subjt:  IS-PSLQWDVAKLNLYLTWEDVEVIQCLQKSS-SAPDKWIWHYDRRGEYSVKSGYKLSMMSGQEASLSGIGLETSWWNKLWRMRVPSKVKLFVWRSFHNS

Query:  IPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVWD
        +PT +NL   H+P+   C +C ++   T HAL+ C +   VWD
Subjt:  IPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVWD

A0A2N9HPU0 Uncharacterized protein | 8.6e-91 | 37.33 | (below)
Query:  LSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFA
        LS LNS   +   NHT I LIPK ++   ++++RPIS CNV+YK+++KVIANRLK +L  II E QS F+PGR I+DN+++  E LH +   + GK G  
Subjt:  LSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFA

Query:  TLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKR-YLAGVSIAR
         +K+DMSKAYDRVEWS+   IM K+GFH  WV L+M CI+T ++SIL+NGE  GF+KPSRGIRQGDPLSPYLFL+C EGL  L+ +A++R  L G+S+ R
Subjt:  TLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKR-YLAGVSIAR

Query:  SCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLGSYLGLPSSFHREFIS--------
        + PKI+HLFFADDSL+F K    E    + IL  YEKASGQ VN  K+T+FFS   P   +  + N L + +      YLGL S   R  ++        
Subjt:  SCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLGSYLGLPSSFHREFIS--------

Query:  -----------------------------PSLQWDVAKLNLYLTWEDVEVI-------------------QCLQKSS-----------------------
                                     P+      KL + L  +D+E +                   QCL K+                        
Subjt:  -----------------------------PSLQWDVAKLNLYLTWEDVEVI-------------------QCLQKSS-----------------------

Query:  --------------SAPDKWIWHYDRRGEYSVKSGYKLSMMSGQE--ASLSGIGLETSWWNKLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEI
                      S  D   W   + G YSV+SGY+  +       AS S        W K+W + VP K+++F WR+  +S+P+ + LW   V  +  
Subjt:  --------------SAPDKWIWHYDRRGEYSVKSGYKLSMMSGQE--ASLSGIGLETSWWNKLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEI

Query:  CPVCQEEMGTTDHALFQCVRAHEVW
        C +C  +     HAL+ C      W
Subjt:  CPVCQEEMGTTDHALFQCVRAHEVW

A0A6J1DX30 uncharacterized protein LOC111024874 | 8.6e-99 | 35.54 | (below)
Query:  TISNCLSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKG
        T+  CL+ LN+ + ++ WN T I LIPK +  R +SD+RPIS CNVSYKI++K I NRLK V+  +I + QS F+P R+ISDN+I+GHE LH +   + G
Subjt:  TISNCLSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKG

Query:  KTGFATLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKR-YLAG
          G A LK+D+SKA+DRVEW+Y   IM K+GF+  W++ +++CI+T  FSI +NG   G  +PSRGIRQGDPLSPYLFL+C EGLSAL++       L G
Subjt:  KTGFATLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKR-YLAG

Query:  VSIARSCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLGSYLGLPSSFHR-------
        +    +   I+HL FADDSL+FL+ +  E    + +L  Y +ASGQC+N SKS + FS  V  + +QYL  IL++K+    G+YLGLPS F R       
Subjt:  VSIARSCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLGSYLGLPSSFHR-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------EFISPSLQWDVAKLNLYLTWEDVEVIQCLQKSS-SAPDKWIWHYDRRGEYSVKSGYKLSMMSGQEASLSGIGLE
                                   FI+    WDV  ++     ED ++I  +  SS +  D W+WHYD+RG YSV+SGYKL M     A+ +     
Subjt:  --------------------------EFISPSLQWDVAKLNLYLTWEDVEVIQCLQKSS-SAPDKWIWHYDRRGEYSVKSGYKLSMMSGQEASLSGIGLE

Query:  TSWWNKLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVWDILFP
         + WN +W++ VP+K+K+F+WRS H  IPT  NL    +     C +C +   +  HA F C RA ++W  LFP
Subjt:  TSWWNKLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVWDILFP

A0A803P5H2 Uncharacterized protein | 2.4e-93 | 39.64 | (below)
Query:  LSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFA
        L +LN+    +++N T I LIPK +  + + D+RPIS CNV+YKI++K++A R K VL+ +I E QS F+  R I+DN+++  EM+H L+H+ +G  GFA
Subjt:  LSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFA

Query:  TLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARK-RYLAGVSIAR
         LK+DMSKA+DRVEWS+ + +M K+GF   W+ L+M C+ T++FS L+NGE  G + P RG+RQGDPLSPYLFLIC+EGLS LL    +   L G++++R
Subjt:  TLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARK-RYLAGVSIAR

Query:  SCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLG------------------SYLGL
          P I+HL FADDSL+F +      G  K  L  Y +ASGQ +N  KS + +        R+ LS  L +K+ D  G                   + G 
Subjt:  SCPKISHLFFADDSLVFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLG------------------SYLGL

Query:  PSSFHREFISPSLQWDVAKLNLYLTWEDVEVIQCLQKS-SSAPDKWIWHYDRRGEYSVKSGYKLSMMSGQEASLSGIGLETSWWNKLWRMRVPSKVKLFV
         S+   ++I+ + +WD+  L+   +  D++ I  +  S +S  D+W WHYD  G+Y+VKSGY L+     + + S    + +WW   W + +PSKV++F 
Subjt:  PSSFHREFISPSLQWDVAKLNLYLTWEDVEVIQCLQKS-SSAPDKWIWHYDRRGEYSVKSGYKLSMMSGQEASLSGIGLETSWWNKLWRMRVPSKVKLFV

Query:  WRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVW
        WR  ++++P   NL++  V  +  C +C     +  HALF C  A  VW
Subjt:  WRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVW

SwissProt top hits | e value | % identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | 8.8e-24 | 29.66 | (below)
Query:  NIVLIPK-SRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGK-TGFATLKIDMSKAYDRVE
        +I+LIPK  R      ++RPIS  N+  KI+ K++ANR++  +  +I   Q  FIPG     N+    + ++ ++H  + K      + ID  KA+D+++
Subjt:  NIVLIPK-SRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGK-TGFATLKIDMSKAYDRVE

Query:  WSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKRYLAGVSIARSCPKISHLFFADDSL
          +    + KLG    ++K++       T +I++NG+         G RQG PLSP LF I  E L+  +   +++ + G+ + +   K+S   FADD +
Subjt:  WSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKRYLAGVSIARSCPKISHLFFADDSL

Query:  VFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMF
        V+L+   V   +   ++ ++ K SG  +NV KS  F
Subjt:  VFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMF

P08548 LINE-1 reverse transcriptase homolog | 6.3e-22 | 30.51 | (below)
Query:  NIVLIPK-SRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKT-GFATLKIDMSKAYDRVE
        NI LIPK  +      +YRPIS  N+  KI+ K++ NR++  +  II   Q  FIPG   S       + ++ ++H  K K      L ID  KA+D ++
Subjt:  NIVLIPK-SRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKT-GFATLKIDMSKAYDRVE

Query:  WSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKRYLAGVSIARSCPKISHLFFADDSL
          +    + K+G   +++KL+    +  T +I++NG          G RQG PLSP LF I  E L+  +    ++ + G+ I     K+S   FADD +
Subjt:  WSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKRYLAGVSIARSCPKISHLFFADDSL

Query:  VFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMF
        V+L+           ++++Y   SG  +N  KS  F
Subjt:  VFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMF

P11369 LINE-1 retrotransposable element ORF2 protein | 2.0e-20 | 28.63 | (below)
Query:  IVLIPK-SRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFATLKIDMSKAYDRVEWS
        I LIPK  +    + ++RPIS  N+  KI+ K++ANR++  +  II   Q  FIPG     N+     ++H++ +K K K     + +D  KA+D+++  
Subjt:  IVLIPK-SRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFATLKIDMSKAYDRVEWS

Query:  YFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKRYLAGVSIARSCPKISHLFFADDSLVF
        +   ++ + G    ++ ++    +    +I +NGE    I    G RQG PLSPYLF I  E L+  +   +++ + G+ I +   KIS L  ADD +V+
Subjt:  YFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKRYLAGVSIARSCPKISHLFFADDSLVF

Query:  LKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMF
        +           +++  + +  G  +N +KS  F
Subjt:  LKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMF

P14381 Transposon TX1 uncharacterized 149 kDa protein | 3.7e-22 | 31.3 | (below)
Query:  LIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFATLKIDMSKAYDRVEWSYFS
        L+PK    RL+ ++RP+S  +  YKIV K I+ RLK VL ++I   QS  +PGR+I DN+ L  ++LHF R  R G    A L +D  KA+DRV+  Y  
Subjt:  LIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFATLKIDMSKAYDRVEWSYFS

Query:  YIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKRYLAGVSIARSCPKISHLFFADDSLVFLKV
          +    F   +V  +     ++   + +N      +   RG+RQG PLS  L+ +  E    LL   RKR L G+ +     ++    +ADD ++ +  
Subjt:  YIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKRYLAGVSIARSCPKISHLFFADDSLVFLKV

Query:  VAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILS--MKVFDSLGSYL
          V+    +   + Y  AS   +N SKS+      +  D        +S   K+   LG YL
Subjt:  VAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILS--MKVFDSLGSYL

P92555 Uncharacterized mitochondrial protein AtMg01250 | 1.8e-13 | 51.47 | (below)
Query:  LMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKR-YLAGVSIARSCPKISHLFFADDS
        ++NG   G + PSRG+RQGDPLSPYLF++CTE LS L   A+++  L G+ ++ + P+I+HL FADD+
Subjt:  LMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKR-YLAGVSIARSCPKISHLFFADDS

Arabidopsis top hits | e value | % identity | Alignment
AT2G02650.1 Ribonuclease H-like superfamily protein | 3.2e-05 | 30.16 | (below)
Query:  LWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVW
        +W++ V  K+K F+WR    ++ T   L + ++  + IC  C  E  T  H +F C     VW
Subjt:  LWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVW

AT3G09510.1 Ribonuclease H-like superfamily protein | 1.4e-13 | 28.57 | (below)
Query:  WDVAKLNLYLTWEDVEVIQCLQ-KSSSAPDKWIWHYDRRGEYSVKSGYKL------SMMSGQEASLSGIGLETSWWNKLWRMRVPSKVKLFVWRSFHNSI
        WD +K++ ++   D   I  +    S  PDK IW+Y+  GEY+V+SGY L      + +         I L+T    ++W + +  K+K F+WR+   ++
Subjt:  WDVAKLNLYLTWEDVEVIQCLQ-KSSSAPDKWIWHYDRRGEYSVKSGYKL------SMMSGQEASLSGIGLETSWWNKLWRMRVPSKVKLFVWRSFHNSI

Query:  PTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVWDILFPSMNAAQMI
         T   L    + ++  CP C  E  + +HALF C  A   W +   S+   Q++
Subjt:  PTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVWDILFPSMNAAQMI

AT3G25270.1 Ribonuclease H-like superfamily protein | 2.9e-06 | 31.25 | (below)
Query:  KLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVW
        K+W+++   K+K F+W+    ++ T  NL   H+  +  C  C +E  T+ H  F C  A +VW
Subjt:  KLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVW

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 1.2e-12 | 33.33 | (below)
Query:  IANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFATLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMN
        +  RLK ++ ++I   Q++FIPGR  +DN++   E +H +R K KG  G+  LK+D+ KAYDR+ W Y    +   GF   W+  + +    STF     
Subjt:  IANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFATLKIDMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMN

Query:  GESFGFIKPSRGIRQGD
            G    S+  R  D
Subjt:  GESFGFIKPSRGIRQGD

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | 1.3e-14 | 51.47 | (below)
Query:  LMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKR-YLAGVSIARSCPKISHLFFADDS
        ++NG   G + PSRG+RQGDPLSPYLF++CTE LS L   A+++  L G+ ++ + P+I+HL FADD+
Subjt:  LMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKR-YLAGVSIARSCPKISHLFFADDS


Sequences
CDS sequence
ATGACGATATCAAATTGTCTGTCTATTCTGAATTCAGACGAGCCGGTGCAGGCTTGGAACCACACGAATATTGTGCTAATTCCAAAGTCTCGTCATGCAAGGTTAGTTTC
TGACTATCGCCCAATTAGCTTCTGCAATGTTTCCTATAAAATTGTTACTAAGGTCATAGCCAACAGACTTAAGTTAGTGCTAAATGATATCATAGATGAGTGTCAATCGA
CATTTATTCCTGGTAGATCAATCTCTGATAATATGATATTGGGGCATGAAATGTTACATTTCCTCCGTCATAAGCGAAAAGGAAAAACTGGGTTTGCTACCTTAAAAATT
GATATGAGTAAAGCTTATGATAGGGTAGAATGGTCGTATTTTAGCTATATCATGGCCAAGCTTGGATTTCATGCTAGTTGGGTTAAATTGGTTATGAAGTGTATCACGAC
GTCTACATTTTCCATTCTAATGAATGGAGAATCATTTGGTTTTATTAAGCCATCTCGTGGAATTAGGCAAGGTGATCCTTTATCTCCTTACCTATTCTTAATCTGTACGG
AAGGTCTCTCTGCCTTATTGGATTCAGCTAGGAAAAGATATTTGGCCGGGGTGTCAATTGCGAGATCTTGTCCCAAAATTTCCCATCTATTTTTTGCAGATGATAGCCTA
GTGTTCCTCAAAGTTGTGGCTGTTGAATTTGGGCATTTTAAATCCATTTTGCAGGACTATGAGAAGGCATCTGGTCAGTGTGTTAATGTAAGCAAATCGACGATGTTTTT
TTCGGCAATTGTTCCAGCAGATTCCAGGCAGTACCTTAGTAATATTCTCTCGATGAAGGTGTTTGATTCCTTGGGATCATATCTTGGACTACCTTCATCGTTTCATCGAG
AGTTTATCTCACCATCATTGCAATGGGATGTTGCGAAACTTAACCTGTATCTGACTTGGGAGGATGTCGAGGTGATCCAATGCCTTCAGAAAAGTAGTTCTGCCCCGGAT
AAATGGATTTGGCATTATGATAGAAGAGGAGAGTATTCTGTGAAGAGTGGGTATAAGCTCAGTATGATGAGCGGTCAAGAGGCTTCTTTGTCAGGAATAGGGCTCGAGAC
GAGTTGGTGGAACAAACTTTGGAGGATGAGGGTGCCTAGTAAGGTGAAACTTTTTGTCTGGAGATCATTCCATAATTCAATTCCAACCATGGTCAATCTTTGGAATCACC
ATGTACCTGTCAATGAGATTTGTCCAGTCTGTCAAGAGGAGATGGGGACTACAGACCATGCCCTCTTTCAGTGTGTGAGAGCTCATGAGGTTTGGGATATCCTTTTTCCT
TCTATGAATGCGGCTCAGATGATAGTTTCTACTGTTAGTACCTCTCCATTAGGGGCGGAGGCGGTGACAGTTCTAGAAGGGCTTCGTTTGGCAAAAGCATTAGATGTGCA
TCGAGTTACGGTTTTTTCGGATTCCTTGACATTAATAGAATCTGTTAATGAGGAGATCCAATGTGACTCTAGCATTGCTTCGACACTTTGGGATATCAAAGCTATTAGAA
ATTCTTTCGAAAAGGTGAATTTTAATTTCGTTGGGGAGTAA
mRNA sequence
ATGACGATATCAAATTGTCTGTCTATTCTGAATTCAGACGAGCCGGTGCAGGCTTGGAACCACACGAATATTGTGCTAATTCCAAAGTCTCGTCATGCAAGGTTAGTTTC
TGACTATCGCCCAATTAGCTTCTGCAATGTTTCCTATAAAATTGTTACTAAGGTCATAGCCAACAGACTTAAGTTAGTGCTAAATGATATCATAGATGAGTGTCAATCGA
CATTTATTCCTGGTAGATCAATCTCTGATAATATGATATTGGGGCATGAAATGTTACATTTCCTCCGTCATAAGCGAAAAGGAAAAACTGGGTTTGCTACCTTAAAAATT
GATATGAGTAAAGCTTATGATAGGGTAGAATGGTCGTATTTTAGCTATATCATGGCCAAGCTTGGATTTCATGCTAGTTGGGTTAAATTGGTTATGAAGTGTATCACGAC
GTCTACATTTTCCATTCTAATGAATGGAGAATCATTTGGTTTTATTAAGCCATCTCGTGGAATTAGGCAAGGTGATCCTTTATCTCCTTACCTATTCTTAATCTGTACGG
AAGGTCTCTCTGCCTTATTGGATTCAGCTAGGAAAAGATATTTGGCCGGGGTGTCAATTGCGAGATCTTGTCCCAAAATTTCCCATCTATTTTTTGCAGATGATAGCCTA
GTGTTCCTCAAAGTTGTGGCTGTTGAATTTGGGCATTTTAAATCCATTTTGCAGGACTATGAGAAGGCATCTGGTCAGTGTGTTAATGTAAGCAAATCGACGATGTTTTT
TTCGGCAATTGTTCCAGCAGATTCCAGGCAGTACCTTAGTAATATTCTCTCGATGAAGGTGTTTGATTCCTTGGGATCATATCTTGGACTACCTTCATCGTTTCATCGAG
AGTTTATCTCACCATCATTGCAATGGGATGTTGCGAAACTTAACCTGTATCTGACTTGGGAGGATGTCGAGGTGATCCAATGCCTTCAGAAAAGTAGTTCTGCCCCGGAT
AAATGGATTTGGCATTATGATAGAAGAGGAGAGTATTCTGTGAAGAGTGGGTATAAGCTCAGTATGATGAGCGGTCAAGAGGCTTCTTTGTCAGGAATAGGGCTCGAGAC
GAGTTGGTGGAACAAACTTTGGAGGATGAGGGTGCCTAGTAAGGTGAAACTTTTTGTCTGGAGATCATTCCATAATTCAATTCCAACCATGGTCAATCTTTGGAATCACC
ATGTACCTGTCAATGAGATTTGTCCAGTCTGTCAAGAGGAGATGGGGACTACAGACCATGCCCTCTTTCAGTGTGTGAGAGCTCATGAGGTTTGGGATATCCTTTTTCCT
TCTATGAATGCGGCTCAGATGATAGTTTCTACTGTTAGTACCTCTCCATTAGGGGCGGAGGCGGTGACAGTTCTAGAAGGGCTTCGTTTGGCAAAAGCATTAGATGTGCA
TCGAGTTACGGTTTTTTCGGATTCCTTGACATTAATAGAATCTGTTAATGAGGAGATCCAATGTGACTCTAGCATTGCTTCGACACTTTGGGATATCAAAGCTATTAGAA
ATTCTTTCGAAAAGGTGAATTTTAATTTCGTTGGGGAGTAA
Protein sequence
MTISNCLSILNSDEPVQAWNHTNIVLIPKSRHARLVSDYRPISFCNVSYKIVTKVIANRLKLVLNDIIDECQSTFIPGRSISDNMILGHEMLHFLRHKRKGKTGFATLKI
DMSKAYDRVEWSYFSYIMAKLGFHASWVKLVMKCITTSTFSILMNGESFGFIKPSRGIRQGDPLSPYLFLICTEGLSALLDSARKRYLAGVSIARSCPKISHLFFADDSL
VFLKVVAVEFGHFKSILQDYEKASGQCVNVSKSTMFFSAIVPADSRQYLSNILSMKVFDSLGSYLGLPSSFHREFISPSLQWDVAKLNLYLTWEDVEVIQCLQKSSSAPD
KWIWHYDRRGEYSVKSGYKLSMMSGQEASLSGIGLETSWWNKLWRMRVPSKVKLFVWRSFHNSIPTMVNLWNHHVPVNEICPVCQEEMGTTDHALFQCVRAHEVWDILFP
SMNAAQMIVSTVSTSPLGAEAVTVLEGLRLAKALDVHRVTVFSDSLTLIESVNEEIQCDSSIASTLWDIKAIRNSFEKVNFNFVGE
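
The CDS and protein sequences listed above should correspond codon by codon. A minimal sketch of how that could be checked is given here, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; the `translate` helper is hypothetical and written only for illustration. The two short fragments used in the example are the first 18 and last 12 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGACGATATCAAATTGT"))  # start of the CDS  -> MTISNC
print(translate("GTTGGGGAGTAA"))        # end of the CDS    -> VGE
```

Both fragments agree with the start (`MTISNC`) and end (`VGE`) of the listed protein. Passing the full CDS to the same function would allow the whole record to be compared against the protein sequence.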