; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025887 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025887
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDNA-directed DNA polymerase
Genome locationchr10:23041178..23050371
RNA-Seq ExpressionLag0025887
SyntenyLag0025887
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8501049.1 hypothetical protein CXB51_003148 [Gossypium anomalum]8.3e-10549.89Show/hide
Query:  LGCSICDGSHTYDQCPSNPKSFFYVGP-QDNAQNNPFPNAYNSSWGNHPSFWSSNVGASKLPQQQGSNNSGPQLFNANNDQVLQGSFQGNFQRNNNACGG
        + C  C   H +++CPSNP+S +Y+G    N    P P          P+F S  V   + P Q  S+NS   L  A                       
Subjt:  LGCSICDGSHTYDQCPSNPKSFFYVGP-QDNAQNNPFPNAYNSSWGNHPSFWSSNVGASKLPQQQGSNNSGPQLFNANNDQVLQGSFQGNFQRNNNACGG

Query:  VSGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEAPR--GKEYCHALTLGDGKQVPSNTTTHALEDPKKCLEEPNTVKSSKEGVVKEKEEE
                 AK     +NLE Q+GQL +EL+N+  G LP++T+  R  GKE+C ALTL   + V                 EPN +++ KE V  +  EE
Subjt:  VSGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEAPR--GKEYCHALTLGDGKQVPSNTTTHALEDPKKCLEEPNTVKSSKEGVVKEKEEE

Query:  KTK--------------EPDAYKPPIPFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSK
         T                   YKPP P+PQRLQK+  +  FKKFL+VLKQLHINIPLVEALE+MP+Y KF+KDILS KR+  EFE V LT EC+  L  K
Subjt:  KTK--------------EPDAYKPPIPFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSK

Query:  IPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPII
        +PPKLKDPG FTIPC+IG    G+ALCDLGASINLMP+S+FK+LGIG+ RPTTVTLQLADRS+ H EGKI+DVLV+VDKFIFPADF+ILD+EAD+EVPII
Subjt:  IPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPII

Query:  LGRPFLATGQTSIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNALEE
        LGRPFLATG+T IDVQKGELTM   D+QVTFN F S++FPD +++C+ + +
Subjt:  LGRPFLATGQTSIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNALEE

XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]3.1e-10449.35Show/hide
Query:  CSICDGSHTYDQCP------SNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSF-WSSNVGASKLPQQQG-----SNNSGPQLFNA---NNDQVLQGSFQ
        C +C   H   QCP      +   S  YVG   N QNNPF N YN  W NHP+F W++NV    +P +Q        N  PQ         D +LQ    
Subjt:  CSICDGSHTYDQCP------SNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSF-WSSNVGASKLPQQQG-----SNNSGPQLFNA---NNDQVLQGSFQ

Query:  GNFQRNNNACGGVSGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEA-PRG--KEYCHALTLGDGKQVPSNT-TTHALEDPKKCLEEPNTV
          + +  +A   +   S+S  A        LE+Q+GQL S + N+  G+LP+NTE  P+   +E+C A+TL  GK++  NT       DP+K L E  +V
Subjt:  GNFQRNNNACGGVSGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEA-PRG--KEYCHALTLGDGKQVPSNT-TTHALEDPKKCLEEPNTV

Query:  KSSKEGVVKEKEEEKTKEPDAYKPPIPFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSK
         S+       K +  T +   Y PP PFPQRLQK+  D  F+KF++V K+L INIP  EALE+M SY KF+KDILS KR+  EFE V LT ECSAIL  K
Subjt:  KSSKEGVVKEKEEEKTKEPDAYKPPIPFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSK

Query:  IPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPII
        +PPKLKDPGSFTIPC+IG    G+ALCDLGAS+NLMPLS+F +LG+G+ +PT+V LQLADRS+ +P G +EDVLVKVDKFIFPADFI+LD E D ++P++
Subjt:  IPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPII

Query:  LGRPFLATGQTSIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNALEEKVGHPEEEVALKQ
        LGRPFLATG+T IDVQKGELTMR  DEQVTFN FS++KF +  E C ++    G  +  + L+Q
Subjt:  LGRPFLATGQTSIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNALEEKVGHPEEEVALKQ

XP_024022975.1 uncharacterized protein LOC112092050 [Morus notabilis]3.4e-10650.11Show/hide
Query:  EVAQPHLAALGCSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSF-WSSNVGASKLPQQQGS-------NNSGPQLFNANNDQVLQ
        E  Q     + C  C   H +++CPSNP S  Y+G      NNP+ N+YN  W  H +F WS+   +S     + +        N   +L N      L+
Subjt:  EVAQPHLAALGCSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSF-WSSNVGASKLPQQQGS-------NNSGPQLFNANNDQVLQ

Query:  GSFQGNFQRNNNACGGVSGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEAPR--GKEYCHALTLGDGKQVPSNTTTHALEDPKKCLEEPN
           +    +N       S    +  A      +NLE Q+GQL + L N+S GTLP++TE PR  G E+C A+ L  GK++         +D  K   EP+
Subjt:  GSFQGNFQRNNNACGGVSGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEAPR--GKEYCHALTLGDGKQVPSNTTTHALEDPKKCLEEPN

Query:  TV--KSSKEGVVKEKEEE----------KTKEPDAYKPPI--PFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREF
        ++  +  +E  VK+   +          K    D  K  +  PFPQR QK+     F KFLNVLKQLHINIPLVEALE+MP+YAKF+KD+L+ KR++ EF
Subjt:  TV--KSSKEGVVKEKEEE----------KTKEPDAYKPPI--PFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREF

Query:  EIVVLTNECSAILSSKIPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPA
        E + LT ECS IL SK+PPKL DPGSFTIPCSIG    GRALCDLGASINLMP+S FK+LGI +ARPTTVTLQ ADRS V+PEGKIEDVLV+VDKFIF  
Subjt:  EIVVLTNECSAILSSKIPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPA

Query:  DFIILDYEADREVPIILGRPFLATGQTSIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNAL
        DFI+LDYE D+EVPIILGRPFLATG+T IDVQKGELTMR +DEQVTFN F +++F D VE+C A+
Subjt:  DFIILDYEADREVPIILGRPFLATGQTSIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNAL

XP_024028757.1 uncharacterized protein LOC112093792 [Morus notabilis]9.4e-11749.4Show/hide
Query:  HQRETLAAQHCDARAMREIFSKIEVAQPHLA----ALGCSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSFWSSNVGASKLPQ--
        H+ + + A      ++  I   + VA P  A    AL C  C   H+++ CPSNP+S  YV    N  NNP+ N+YN  W  HP+F  SN  A+ +P   
Subjt:  HQRETLAAQHCDARAMREIFSKIEVAQPHLA----ALGCSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSFWSSNVGASKLPQ--

Query:  --------QQGSNNSGPQLFNANNDQVLQGS-------FQGNFQRNNNACGGVSGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEAPRG-
                 Q  +   P    +N  Q  Q S        +    RN++   G +    S +A      + LE Q+GQL + L N+  G+LP++T+ PR  
Subjt:  --------QQGSNNSGPQLFNANNDQVLQGS-------FQGNFQRNNNACGGVSGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEAPRG-

Query:  -----KEYCHALTLGDGKQVPSNT-----TTHALEDPKKCLEEPNTVKSSKEGVVKEKEEEKTKEPDAYKPPIPFPQRLQKKNDDTNFKKFLNVLKQLHI
             KE+C A+TL +G+++   T     T H+    ++  + P     S++ VV +    K K+    +PP PFPQR Q +  D  F++FL+VLKQLHI
Subjt:  -----KEYCHALTLGDGKQVPSNT-----TTHALEDPKKCLEEPNTVKSSKEGVVKEKEEEKTKEPDAYKPPIPFPQRLQKKNDDTNFKKFLNVLKQLHI

Query:  NIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSKIPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTT
        NIPLVEALE+MPSY KF+KDIL+ KR+  EFE V LT ECSAIL +++PPKLKDPGSFTIPCSIG   +G+ALCDLGASINLMP+S+F++LGIG+  PTT
Subjt:  NIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSKIPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTT

Query:  VTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPIILGRPFLATGQTSIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNAL
        VTLQLADRS  HPEGKIEDVLV+VDKFIFPADFI+LDYEAD+EVPIILGRPFLATG+T IDVQKGELTMR +D+QVTFN F +++F D+VEEC+A+
Subjt:  VTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPIILGRPFLATGQTSIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNAL

XP_024963059.1 uncharacterized protein LOC112503227 [Cynara cardunculus var. scolymus]5.2e-10751.22Show/hide
Query:  CSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSF-WSSNVG--------ASKLPQQQGSNNSGPQLFNANNDQVLQG-SFQGNFQR
        C  C  +H+Y+ C  NP S  YVG  +  +N+PF   YN +W NHP+F  + N G            P      + G +   +N+     G S Q +  R
Subjt:  CSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSF-WSSNVG--------ASKLPQQQGSNNSGPQLFNANNDQVLQG-SFQGNFQR

Query:  NNNACGGVSGASSSNSA---KPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEAPRGKEYCHALTLGDGKQVPSNTTTHALEDPKKC-LEEPNTVKSSKE
          NA  G     S       K     +N E QLGQ  ++LKN+   T P++TE P  KE+  A+TL   K +  +  T   E PK   L   +T +S  +
Subjt:  NNNACGGVSGASSSNSA---KPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEAPRGKEYCHALTLGDGKQVPSNTTTHALEDPKKC-LEEPNTVKSSKE

Query:  GV-----VKEKEEEKTKEPDAYKPPIPFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSK
               +   +++KT   D +   +PFP R++ KN D  FKKFL++ KQLHINIPLVEALEKMPSY KFLKDIL+ KR+  EFE + LTNECSA+L+ K
Subjt:  GV-----VKEKEEEKTKEPDAYKPPIPFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSK

Query:  IPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPII
        IPPKLKDPGSFTIPCSIGG +VG ALCDLGASINLMPLS+F RLGIG+ RPTTVTLQLADRS+ +P+GKIED+LVKVDKF FPADF++LDYEA R V II
Subjt:  IPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPII

Query:  LGRPFLATGQTSIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNAL
        LGRPFL  G+T IDVQKGELTMR ND+ VTFN F +LKF  ++E+C+++
Subjt:  LGRPFLATGQTSIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNAL

TrEMBL top hitse value%identityAlignment
A0A2G9GK35 Reverse transcriptase2.2e-10349.09Show/hide
Query:  CSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSF-WSSNVGASKLPQ-QQGSNNSGPQLFNANNDQVLQGSFQGNFQRNNNACGGV
        C  C  SH  DQCP + +S  +V      QNNP+ N YN  W  HP+F W++N G    P+ QQG      Q        + +   Q             
Subjt:  CSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSF-WSSNVGASKLPQ-QQGSNNSGPQLFNANNDQVLQGSFQGNFQRNNNACGGV

Query:  SGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEA-PR--GKEYCHALTLGDGKQVPSNTTTHALEDPKKCLEEPNTVKSSKEGVVKEKEEE
          ++++N        + +E Q+GQL + + ++  G+LP+NTE  PR  GK  C A+TL +G+++            ++ ++EP T    KE + +EKE+E
Subjt:  SGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEA-PR--GKEYCHALTLGDGKQVPSNTTTHALEDPKKCLEEPNTVKSSKEGVVKEKEEE

Query:  KTKEPDAYKPPI---PFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSKIPPKLKDPGSF
             +  KP     PFPQRLQK+  +  F KFL V K+LHINIP  EALE+MPSY KF+KDILS KR+  ++E V LT ECSAI+ +K+PPKLKDPGSF
Subjt:  KTKEPDAYKPPI---PFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSKIPPKLKDPGSF

Query:  TIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPIILGRPFLATGQT
        TIPC+IG    GRALCDLGASINLMP S+++ LG+G+A+PT++TLQLADRS+ +P+G IED+LVKVDKFIFPADF++LD E D EVPIILGRPFLATG+T
Subjt:  TIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPIILGRPFLATGQT

Query:  SIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNAL
         IDVQKGELTMR  D+Q+TFN F ++KFP++ +EC A+
Subjt:  SIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNAL

A0A2G9HYA0 Reverse transcriptase3.7e-10348.86Show/hide
Query:  CSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSF-WSSNVGASKLPQ-QQGSNNSGPQLFNANNDQVLQGSFQGNFQRNNNACGGV
        C  C   H  DQCP + +S  +V      QNNP+ N YN  W  HP+F W++N G    P+ QQG      Q        + +   Q             
Subjt:  CSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSF-WSSNVGASKLPQ-QQGSNNSGPQLFNANNDQVLQGSFQGNFQRNNNACGGV

Query:  SGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEA-PR--GKEYCHALTLGDGKQVPSNTTTHALEDPKKCLEEPNTVKSSKEGVVKEKEEE
          ++++N        + +E Q+GQL + + ++  G+LP+NTE  PR  GK  C A+TL +G+++            ++ ++EP T    KE + +EKE+E
Subjt:  SGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEA-PR--GKEYCHALTLGDGKQVPSNTTTHALEDPKKCLEEPNTVKSSKEGVVKEKEEE

Query:  KTKEPDAYKPPI---PFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSKIPPKLKDPGSF
             +  KP     PFPQRLQK+  +  F KFL V K+LHINIP  EALE+MPSY KF+KDILS KR+  ++E V LT ECSAI+ +K+PPKLKDPGSF
Subjt:  KTKEPDAYKPPI---PFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSKIPPKLKDPGSF

Query:  TIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPIILGRPFLATGQT
        TIPC+IG    GRALCDLGASINLMP S+++ LG+G+A+PT++TLQLADRS+ +P+G IED+LVKVDKFIFPADF++LD E D EVPIILGRPFLATG+T
Subjt:  TIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPIILGRPFLATGQT

Query:  SIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNAL
         IDVQKGELTMR  D+Q+TFN F ++KFP++ +EC A+
Subjt:  SIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNAL

A0A2G9HYD8 Reverse transcriptase2.3e-10048.72Show/hide
Query:  SHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSF-WSSNVGASKLPQ-QQGSNNSGPQLFNANNDQVLQGSFQGNFQRNNNACGGVSGASSS
        SH  DQCP + +S  +V      QNNP+ N YN  W  HP+F W++N G    P+ QQG      Q        + +   Q                   
Subjt:  SHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSF-WSSNVGASKLPQ-QQGSNNSGPQLFNANNDQVLQGSFQGNFQRNNNACGGVSGASSS

Query:  NSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEA-PR--GKEYCHALTLGDGKQVPSNTTTHALEDPKKCLEEPNTVKSSKEGVVKEKEEEKTKEPD
          A      + +E Q+GQL + + ++  G+LP+NTE  PR  GK  C A+TL +G+++            ++ +++P T    KE + KEKE+E     +
Subjt:  NSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEA-PR--GKEYCHALTLGDGKQVPSNTTTHALEDPKKCLEEPNTVKSSKEGVVKEKEEEKTKEPD

Query:  AYKPPI---PFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSKIPPKLKDPGSFTIPCSI
          KP     PFPQ+LQK+  +  F KFL V K+LHINIP  EALE+MPSY KF+KDILS KR+  ++E   LT EC+AI+ +K+PPKLKDPGSFTIPC+I
Subjt:  AYKPPI---PFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSKIPPKLKDPGSFTIPCSI

Query:  GGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPIILGRPFLATGQTSIDVQK
        G    GRALCDLGASINLMP S+++ LG+G+A+PT++TLQLADRS+ +P+G IED+LVKVDKFIFPADF++LD E D EVPIILGRPFLATG+T IDVQK
Subjt:  GGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPIILGRPFLATGQTSIDVQK

Query:  GELTMRFNDEQVTFNAFSSLKFPDKVEEC
        GELTMR  D+Q+TFN F ++KFP++ +EC
Subjt:  GELTMRFNDEQVTFNAFSSLKFPDKVEEC

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129454.1e-10248.42Show/hide
Query:  CSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSF-WSSNVGASK----LPQ--QQGSNNSGPQLFNANNDQVLQGSFQGNFQRNNN
        C +C  SH+YDQCP N +S  +VG  +  QNNP+ N YN  W NHP+F WS+N G S     +P   QQ +    P+  +   + +LQ      +    +
Subjt:  CSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSF-WSSNVGASK----LPQ--QQGSNNSGPQLFNANNDQVLQGSFQGNFQRNNN

Query:  ACGGVSGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEA-PRGKEYCHALTLGDGKQVPSNTTTHALEDPKKCLEEPNTVKSSKEGVVKEK
        A     GAS           +NLE Q+GQL + + N+  G+LP++T+  P+GKE C A+TL  GK++       A+E   + +++    ++  E  +++K
Subjt:  ACGGVSGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEA-PRGKEYCHALTLGDGKQVPSNTTTHALEDPKKCLEEPNTVKSSKEGVVKEK

Query:  EEEKTKE---PDAYKPPIPFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSKIPPKLKDP
        +++K +         PP PFPQRLQK+  +  F+KFLNV K+LHINIP  EALE+MPSY KFLKDILS KR+  EFE V LT ECSAIL +K+PPKLKDP
Subjt:  EEEKTKE---PDAYKPPIPFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSKIPPKLKDP

Query:  GSFTIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPIILGRPFLAT
        GSFTIPC+IG +   +AL DLGASINLMP S+F++LG+G+ +PT+VTLQLADRS V+P G IEDVLVKVDKFIFP DF+ILD E DR++PIILGRPFLAT
Subjt:  GSFTIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPIILGRPFLAT

Query:  GQTSIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNALE
            IDV++G+++ +  +E V FN F++ K P     C+ +E
Subjt:  GQTSIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNALE

A0A6J1DY39 uncharacterized protein LOC1110256531.2e-10143.49Show/hide
Query:  LAALGCSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSFWSSNVGASK-------------------------LPQQ--QGSNNSG
        +A   C  C   H  + CPSNP S +YVG  +  + NP+ N YN  W  HP+F  S  G+S                           P Q  Q  N   
Subjt:  LAALGCSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSFWSSNVGASK-------------------------LPQQ--QGSNNSG

Query:  PQLFNANNDQVL--------QGSFQGNFQRNNNACGGVSGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEAPR--GKEYCHALTLG----
        P   N +N ++L          + +    R +     +         +     + LE+QLGQL +E++ +  G+LP++TE PR  GKE+C+++       
Subjt:  PQLFNANNDQVL--------QGSFQGNFQRNNNACGGVSGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEAPR--GKEYCHALTLG----

Query:  -DGKQVPSNTTTHALEDPKKCLEEPNTVKSSKEGVVKEKEEEKTKEPDAYKPPIPFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLK
         +G ++P + ++H+    K     P+ +      V    +   +      +PP PFPQRL +KN D NF+KFL++LKQLHINIP VEALE+MP+YAKF+K
Subjt:  -DGKQVPSNTTTHALEDPKKCLEEPNTVKSSKEGVVKEKEEEKTKEPDAYKPPIPFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLK

Query:  DILSNKRQWREFEIVVLTNECSAILSSKIPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIED
        DI++ K++  E+E V LT   S +  SK+PPKLKDPGSFTIPC IGG DVGRALCDLGASINLMPLS+FK+  IGKA PTTVTLQLADRS+  PEGKIED
Subjt:  DILSNKRQWREFEIVVLTNECSAILSSKIPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIED

Query:  VLVKVDKFIFPADFIILDYEADREVPIILGRPFLATGQTSIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNALE----------EKVGHPEEEVAL
        VLVKVDKFIFP DFIILD EAD++VPIILGRPFLATG+T IDV+KGELTMR +D++VTFN   ++K+ D +EEC  +           + + + E E  L
Subjt:  VLVKVDKFIFPADFIILDYEADREVPIILGRPFLATGQTSIDVQKGELTMRFNDEQVTFNAFSSLKFPDKVEECNALE----------EKVGHPEEEVAL

Query:  KQKAKKGKGVELGSSASKKKISSRQNTTTNGATVVKEKHKSCQPL
        ++  K+G    + ++A K                 KEK KS QPL
Subjt:  KQKAKKGKGVELGSSASKKKISSRQNTTTNGATVVKEKHKSCQPL

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003101.1e-1451.25Show/hide
Query:  SIPTYFMSRFRIPKTLCDELHLMMARFWWGSTDTKRKIHWKKWAQMCLPKE-RGGLNFRDLEPFNKALLAKQKWKRKRQP
        ++P Y MS FR+ K LC +L   M  FWW S + KRKI W  W ++C  KE  GGL FRDL  FN+ALLAKQ ++   QP
Subjt:  SIPTYFMSRFRIPKTLCDELHLMMARFWWGSTDTKRKIHWKKWAQMCLPKE-RGGLNFRDLEPFNKALLAKQKWKRKRQP

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.7e-0731.25Show/hide
Query:  MPFVDTLVIIKSIAQSIPTYFMSRFRIPKTLCDELHLMMARFWWGSTDTKRKIHWKKWAQMCLPKERGGLNFRDLEPFNK
        + F   L +I S+  S+  ++MS FR+P     E+  + + F W   +   K     W+ +C PK+ GGL  R L+  NK
Subjt:  MPFVDTLVIIKSIAQSIPTYFMSRFRIPKTLCDELHLMMARFWWGSTDTKRKIHWKKWAQMCLPKERGGLNFRDLEPFNK

AT4G29090.1 Ribonuclease H-like superfamily protein6.5e-1540.51Show/hide
Query:  SIPTYFMSRFRIPKTLCDELHLMMARFWWGSTDTKRKIHWKKWAQMCLPKERGGLNFRDLEPFNKALLAKQKWKRKRQP
        ++PTY M+ F +PKT+C ++  ++A FWW +    + +HWK W  +   K  GG+ F+D+E FN ALL KQ W+   +P
Subjt:  SIPTYFMSRFRIPKTLCDELHLMMARFWWGSTDTKRKIHWKKWAQMCLPKERGGLNFRDLEPFNKALLAKQKWKRKRQP

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.7e-1651.25Show/hide
Query:  SIPTYFMSRFRIPKTLCDELHLMMARFWWGSTDTKRKIHWKKWAQMCLPKE-RGGLNFRDLEPFNKALLAKQKWKRKRQP
        ++P Y MS FR+ K LC +L   M  FWW S + KRKI W  W ++C  KE  GGL FRDL  FN+ALLAKQ ++   QP
Subjt:  SIPTYFMSRFRIPKTLCDELHLMMARFWWGSTDTKRKIHWKKWAQMCLPKE-RGGLNFRDLEPFNKALLAKQKWKRKRQP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AGAAATTTCCAGCGAAGAAATCAAGAAGGAATCGCTGTGTTTTCGTTCGTTGGAGCGTCGTTGGCGAAGAACAGTCAAGTCGACAATGAAGAGGTGTTGGACCATTCAAG
GGTCTGGCAGGCCACAATTAGAATTGTCTTCAGAATTGCAGAACTCGTCTCAAAAAATGGGGCAAAGACTTTCTGGAGCGAGGTTGGTGATATTACTTCTTTAAATTGCT
TGGAAAATTTAAATCGCATACGATCTGCTCGTGTTTGGAATGATATTCACATTGCTTTGATCCCTAAGAAGATCAATATTGATAAATCAGCTATATACTTCTCTTCGAAT
TTGAATCGTGAATTTCGTATGGTGATTTCGAATTTGGTGGGTATGCCTTTTGTGGATACTCTTGTCATAATTAAGAGTATTGCTCAATCAATTCCAACGTACTTTATGAG
CCGCTTCCGTATTCCAAAGACACTTTGTGATGAACTACATTTAATGATGGCTCGATTCTGGTGGGGATCAACGGATACTAAAAGGAAGATTCATTGGAAGAAATGGGCTC
AAATGTGCCTTCCAAAGGAGCGAGGGGGTCTAAACTTTCGAGATTTGGAGCCTTTTAACAAAGCTTTATTGGCAAAACAGAAGTGGAAGAGAAAAAGACAACCATTTCAG
AGTTGGAGGAACTGGTGGCCTCGACGAATTCGTAAGGCCGAAAATTTGAACTTCATTGCTCGATCGATTCTCAAGGAGACAAGACCTATGCCAAATACCAACAGGACACC
TAAGAAGAGGGAGGAGGACATGACGCCTACGGTTATGGACAAAGAGAATACAGTCTTACAAAATAACATTGTCCAGTTGTATACGGACTATGTTGTACGGTCGCAGGCAG
ACAGGTCTGGGTATGGGGTGGTGGTTATCGGCATGGATGGGCAACATATGCAAATTTCTTGTGCACACGTTTATTCCGACTCCACGAATGAAATCAGGATGATTATAGGA
GATATTCATATTACATCTGATGTGTATCATTGGATAATGCAAATTCGAAGGATGTCTGAGTCTTTTGAATTTTTATCCTTTAGTCATGTGTTTAGAGATTGTAATAGGAG
GGAAGATTCTCTTGTGAAACATGCCTTAGCTCAACGCAATTCTATGCTTTGGTTAAAGAATGTCTCTCAGGAGCAAGCATCAATGAGTGCTTGTCCTCATTGCCATTGTC
CTCACCAGAGCTTTGGAAGCAATTCGGGGCTTAAACGAGCGAAACCAGAGCGTAAAATGACCATTCTGCCCCTAGAGCTTCATCAGCGTCGAGACGCTCGTCATCAGAGA
GAGACGCTTGCGGCACAGCATTGCGACGCTAGGGCAATGCGAGAAATATTTTCCAAAATTGAAGTCGCACAGCCGCATCTTGCTGCGTTAGGATGTTCCATCTGCGACGG
TAGCCACACGTATGATCAGTGCCCATCAAATCCAAAATCATTTTTCTATGTGGGACCCCAAGATAACGCACAGAATAATCCCTTTCCAAACGCCTACAATTCTAGTTGGG
GAAATCATCCTAGTTTCTGGAGTAGTAATGTTGGCGCATCAAAGCTACCGCAGCAACAAGGGAGTAACAATTCTGGCCCTCAACTCTTTAACGCAAACAATGATCAAGTT
TTGCAGGGATCATTCCAAGGAAATTTCCAGCGCAACAACAATGCATGCGGTGGAGTCTCTGGCGCATCATCTTCTAACTCTGCGAAGCCAGGCTGCATCACTCAAAACCT
TGAAGTTCAATTGGGGCAATTAACATCAGAATTGAAGAACAAGTCAGTAGGAACTCTTCCAAACAACACTGAAGCGCCAAGAGGAAAAGAATACTGTCATGCGTTAACCT
TGGGCGATGGTAAACAAGTTCCAAGTAATACAACTACTCATGCACTAGAAGATCCGAAGAAGTGTCTAGAAGAACCCAACACGGTTAAATCAAGCAAAGAAGGCGTTGTG
AAGGAAAAAGAAGAAGAAAAGACAAAAGAGCCTGACGCATACAAGCCACCTATTCCCTTTCCGCAACGCCTACAAAAAAAGAATGATGACACAAACTTCAAGAAATTTCT
GAATGTGTTGAAGCAGTTGCATATCAACATTCCGCTAGTTGAGGCGCTTGAAAAAATGCCAAGCTACGCAAAGTTCTTAAAGGACATCCTTTCGAACAAGCGCCAGTGGA
GAGAATTCGAGATCGTTGTGCTAACAAATGAGTGTAGTGCAATCCTCAGTAGCAAAATTCCCCCAAAATTGAAGGACCCAGGAAGCTTCACTATTCCATGTTCCATTGGA
GGAATTGATGTGGGGAGAGCCCTATGCGACCTTGGCGCAAGCATTAACTTAATGCCTTTGTCAGTATTCAAACGCCTTGGTATTGGAAAAGCACGTCCAACAACAGTCAC
TCTCCAGCTTGCAGATAGGTCGGTGGTGCATCCAGAAGGAAAAATTGAAGATGTACTGGTAAAAGTGGATAAATTTATTTTTCCGGCTGACTTTATCATTCTTGATTATG
AAGCAGATAGAGAAGTCCCAATCATTTTGGGAAGACCTTTCTTAGCTACCGGACAAACTTCGATTGATGTGCAGAAAGGCGAGCTAACCATGCGGTTCAATGATGAACAA
GTAACTTTTAATGCTTTCAGCTCATTAAAATTCCCTGATAAAGTAGAAGAGTGCAATGCGTTAGAAGAAAAAGTTGGACATCCTGAAGAAGAAGTTGCGCTCAAACAGAA
GGCGAAAAAAGGAAAAGGAGTTGAGTTGGGGAGTAGCGCATCAAAGAAAAAAATCTCATCGCGTCAAAACACAACTACGAATGGCGCTACTGTAGTAAAGGAAAAGCACA
AAAGCTGTCAACCTTTGTCTTGGATAGAATGCAAACTTCGAGGGAAAACCCAACATTGCAAGGTGTGGCAGAAAGTCCGACAGCGCAAATCAAGAATAAAATCTCATGAA
GGAAAAGAGATGTAG
mRNA sequenceShow/hide mRNA sequence
AGAAATTTCCAGCGAAGAAATCAAGAAGGAATCGCTGTGTTTTCGTTCGTTGGAGCGTCGTTGGCGAAGAACAGTCAAGTCGACAATGAAGAGGTGTTGGACCATTCAAG
GGTCTGGCAGGCCACAATTAGAATTGTCTTCAGAATTGCAGAACTCGTCTCAAAAAATGGGGCAAAGACTTTCTGGAGCGAGGTTGGTGATATTACTTCTTTAAATTGCT
TGGAAAATTTAAATCGCATACGATCTGCTCGTGTTTGGAATGATATTCACATTGCTTTGATCCCTAAGAAGATCAATATTGATAAATCAGCTATATACTTCTCTTCGAAT
TTGAATCGTGAATTTCGTATGGTGATTTCGAATTTGGTGGGTATGCCTTTTGTGGATACTCTTGTCATAATTAAGAGTATTGCTCAATCAATTCCAACGTACTTTATGAG
CCGCTTCCGTATTCCAAAGACACTTTGTGATGAACTACATTTAATGATGGCTCGATTCTGGTGGGGATCAACGGATACTAAAAGGAAGATTCATTGGAAGAAATGGGCTC
AAATGTGCCTTCCAAAGGAGCGAGGGGGTCTAAACTTTCGAGATTTGGAGCCTTTTAACAAAGCTTTATTGGCAAAACAGAAGTGGAAGAGAAAAAGACAACCATTTCAG
AGTTGGAGGAACTGGTGGCCTCGACGAATTCGTAAGGCCGAAAATTTGAACTTCATTGCTCGATCGATTCTCAAGGAGACAAGACCTATGCCAAATACCAACAGGACACC
TAAGAAGAGGGAGGAGGACATGACGCCTACGGTTATGGACAAAGAGAATACAGTCTTACAAAATAACATTGTCCAGTTGTATACGGACTATGTTGTACGGTCGCAGGCAG
ACAGGTCTGGGTATGGGGTGGTGGTTATCGGCATGGATGGGCAACATATGCAAATTTCTTGTGCACACGTTTATTCCGACTCCACGAATGAAATCAGGATGATTATAGGA
GATATTCATATTACATCTGATGTGTATCATTGGATAATGCAAATTCGAAGGATGTCTGAGTCTTTTGAATTTTTATCCTTTAGTCATGTGTTTAGAGATTGTAATAGGAG
GGAAGATTCTCTTGTGAAACATGCCTTAGCTCAACGCAATTCTATGCTTTGGTTAAAGAATGTCTCTCAGGAGCAAGCATCAATGAGTGCTTGTCCTCATTGCCATTGTC
CTCACCAGAGCTTTGGAAGCAATTCGGGGCTTAAACGAGCGAAACCAGAGCGTAAAATGACCATTCTGCCCCTAGAGCTTCATCAGCGTCGAGACGCTCGTCATCAGAGA
GAGACGCTTGCGGCACAGCATTGCGACGCTAGGGCAATGCGAGAAATATTTTCCAAAATTGAAGTCGCACAGCCGCATCTTGCTGCGTTAGGATGTTCCATCTGCGACGG
TAGCCACACGTATGATCAGTGCCCATCAAATCCAAAATCATTTTTCTATGTGGGACCCCAAGATAACGCACAGAATAATCCCTTTCCAAACGCCTACAATTCTAGTTGGG
GAAATCATCCTAGTTTCTGGAGTAGTAATGTTGGCGCATCAAAGCTACCGCAGCAACAAGGGAGTAACAATTCTGGCCCTCAACTCTTTAACGCAAACAATGATCAAGTT
TTGCAGGGATCATTCCAAGGAAATTTCCAGCGCAACAACAATGCATGCGGTGGAGTCTCTGGCGCATCATCTTCTAACTCTGCGAAGCCAGGCTGCATCACTCAAAACCT
TGAAGTTCAATTGGGGCAATTAACATCAGAATTGAAGAACAAGTCAGTAGGAACTCTTCCAAACAACACTGAAGCGCCAAGAGGAAAAGAATACTGTCATGCGTTAACCT
TGGGCGATGGTAAACAAGTTCCAAGTAATACAACTACTCATGCACTAGAAGATCCGAAGAAGTGTCTAGAAGAACCCAACACGGTTAAATCAAGCAAAGAAGGCGTTGTG
AAGGAAAAAGAAGAAGAAAAGACAAAAGAGCCTGACGCATACAAGCCACCTATTCCCTTTCCGCAACGCCTACAAAAAAAGAATGATGACACAAACTTCAAGAAATTTCT
GAATGTGTTGAAGCAGTTGCATATCAACATTCCGCTAGTTGAGGCGCTTGAAAAAATGCCAAGCTACGCAAAGTTCTTAAAGGACATCCTTTCGAACAAGCGCCAGTGGA
GAGAATTCGAGATCGTTGTGCTAACAAATGAGTGTAGTGCAATCCTCAGTAGCAAAATTCCCCCAAAATTGAAGGACCCAGGAAGCTTCACTATTCCATGTTCCATTGGA
GGAATTGATGTGGGGAGAGCCCTATGCGACCTTGGCGCAAGCATTAACTTAATGCCTTTGTCAGTATTCAAACGCCTTGGTATTGGAAAAGCACGTCCAACAACAGTCAC
TCTCCAGCTTGCAGATAGGTCGGTGGTGCATCCAGAAGGAAAAATTGAAGATGTACTGGTAAAAGTGGATAAATTTATTTTTCCGGCTGACTTTATCATTCTTGATTATG
AAGCAGATAGAGAAGTCCCAATCATTTTGGGAAGACCTTTCTTAGCTACCGGACAAACTTCGATTGATGTGCAGAAAGGCGAGCTAACCATGCGGTTCAATGATGAACAA
GTAACTTTTAATGCTTTCAGCTCATTAAAATTCCCTGATAAAGTAGAAGAGTGCAATGCGTTAGAAGAAAAAGTTGGACATCCTGAAGAAGAAGTTGCGCTCAAACAGAA
GGCGAAAAAAGGAAAAGGAGTTGAGTTGGGGAGTAGCGCATCAAAGAAAAAAATCTCATCGCGTCAAAACACAACTACGAATGGCGCTACTGTAGTAAAGGAAAAGCACA
AAAGCTGTCAACCTTTGTCTTGGATAGAATGCAAACTTCGAGGGAAAACCCAACATTGCAAGGTGTGGCAGAAAGTCCGACAGCGCAAATCAAGAATAAAATCTCATGAA
GGAAAAGAGATGTAG
Protein sequenceShow/hide protein sequence
RNFQRRNQEGIAVFSFVGASLAKNSQVDNEEVLDHSRVWQATIRIVFRIAELVSKNGAKTFWSEVGDITSLNCLENLNRIRSARVWNDIHIALIPKKINIDKSAIYFSSN
LNREFRMVISNLVGMPFVDTLVIIKSIAQSIPTYFMSRFRIPKTLCDELHLMMARFWWGSTDTKRKIHWKKWAQMCLPKERGGLNFRDLEPFNKALLAKQKWKRKRQPFQ
SWRNWWPRRIRKAENLNFIARSILKETRPMPNTNRTPKKREEDMTPTVMDKENTVLQNNIVQLYTDYVVRSQADRSGYGVVVIGMDGQHMQISCAHVYSDSTNEIRMIIG
DIHITSDVYHWIMQIRRMSESFEFLSFSHVFRDCNRREDSLVKHALAQRNSMLWLKNVSQEQASMSACPHCHCPHQSFGSNSGLKRAKPERKMTILPLELHQRRDARHQR
ETLAAQHCDARAMREIFSKIEVAQPHLAALGCSICDGSHTYDQCPSNPKSFFYVGPQDNAQNNPFPNAYNSSWGNHPSFWSSNVGASKLPQQQGSNNSGPQLFNANNDQV
LQGSFQGNFQRNNNACGGVSGASSSNSAKPGCITQNLEVQLGQLTSELKNKSVGTLPNNTEAPRGKEYCHALTLGDGKQVPSNTTTHALEDPKKCLEEPNTVKSSKEGVV
KEKEEEKTKEPDAYKPPIPFPQRLQKKNDDTNFKKFLNVLKQLHINIPLVEALEKMPSYAKFLKDILSNKRQWREFEIVVLTNECSAILSSKIPPKLKDPGSFTIPCSIG
GIDVGRALCDLGASINLMPLSVFKRLGIGKARPTTVTLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYEADREVPIILGRPFLATGQTSIDVQKGELTMRFNDEQ
VTFNAFSSLKFPDKVEECNALEEKVGHPEEEVALKQKAKKGKGVELGSSASKKKISSRQNTTTNGATVVKEKHKSCQPLSWIECKLRGKTQHCKVWQKVRQRKSRIKSHE
GKEM