; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015166 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015166
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr12:8279630..8285287
RNA-Seq ExpressionLag0015166
SyntenyLag0015166
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033746.1 retrotransposon gag protein [Cucumis melo var. makuwa]8.3e-16053.11Show/hide
Query:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP
        MP GYQPPKFQQFDGKGNPKQHIAHF+ETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE                                   GEP
Subjt:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP

Query:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKN--------IEATIEESMVV
        V+DYINRWRALSLDCKD+LTE+S+VEMCTQGMHWELLYILQGIKPRTFEEL+TRAHDMELSIA+ G KD L+   K+ +KN            + ESM+V
Subjt:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKN--------IEATIEESMVV

Query:  NTTPLELSSKEKHKKIEKQ-QGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILR
          TPL+  SK K  K E+   G+E+R  TL+ERQKK+YPFP+SDV +MLEQL++ +LI+LPECKRPE+  KVDDPNYC YHRVISHPVEKCFVLKELIL+
Subjt:  NTTPLELSSKEKHKKIEKQ-QGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILR

Query:  LAKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQ-------
        LA+E KIELD++EVAQ+NH A+   T S +    L+DQ   L+QFGT EPI + F  + +   SQ+      +E +    +T KK+++   VQ       
Subjt:  LAKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQ-------

Query:  ---KGCCSSERVNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNF---HPVKKVEEHFVSSC-----------YINIEEDGISKEIKQRTSVFD
           KG  S ++  K +R K   KPK  + +D+   +  R + L E   R+F   HP K +E   V++C           Y + EE   S EIKQRT VF 
Subjt:  ---KGCCSSERVNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNF---HPVKKVEEHFVSSC-----------YINIEEDGISKEIKQRTSVFD

Query:  RIEPSTTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSV
        RI+P T ++SVFQR+SMA   EE+Q  TS   R S F+RLS+ST +K RPST+ FDRLK+ + Q++R++ S + K F E ++D K+HS + SRMKRK SV
Subjt:  RIEPSTTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSV

Query:  LINTKGSLKVKPNLVILTNPTDEGSDQ
         INT+GSL VKP  +I TNP +EG ++
Subjt:  LINTKGSLKVKPNLVILTNPTDEGSDQ

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]7.7e-16655.32Show/hide
Query:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP
        MPNGYQPPKFQQFDGKGNPKQH+AHFI+TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPE+I +WEQLE++FLN FYSTR  VSM ELT+ +Q+KGE 
Subjt:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP

Query:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKNIEAT-------IEESMVVN
        V+DYINRWRALSLDCKDRLTE+S+VEMCTQGMHW LLYILQGIKPRTFEELATRAHDMELSIA+RG KD L+   + D+  ++ T       I+ESMVV+
Subjt:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKNIEAT-------IEESMVVN

Query:  TTPLELSSKEKHKKIE-KQQGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRL
         TPL+  SK K  KIE K  G+E+R  TLKERQ+K+YPFP+SDV +MLEQLL+ +LI+LPECKRPE+  KVDDPNYC YHRVISHPVEKCFVLKELIL+L
Subjt:  TTPLELSSKEKHKKIE-KQQGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRL

Query:  AKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQKGCCSSER
        A+E KIELD++EVAQ+NHA                                               + + +N  KG       K ++   +++    +E 
Subjt:  AKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQKGCCSSER

Query:  VNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNFHPVKKVEEHFVSSCYINIEEDGISKEIKQRTSVFDRIEPSTTQTSVFQRMSMAAAREEDQ
        +        PR    ++  +   V  C   ++ E           V+ ++ SS  +N      S EI QRTSVFDRI+PSTT++SVFQR+S+A   EE+Q
Subjt:  VNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNFHPVKKVEEHFVSSCYINIEEDGISKEIKQRTSVFDRIEPSTTQTSVFQRMSMAAAREEDQ

Query:  SLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSVLINTKGSLKVKPNLVILTNPTDEGS
          T   TR S  +RLS+ST +K RPST+ FDRLK+ + Q++R++ SS+ K F E ++D K+HS + SRMKRK  V INT+GSL VKP  +I TNPT+EG 
Subjt:  SLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSVLINTKGSLKVKPNLVILTNPTDEGS

Query:  DQ
        +Q
Subjt:  DQ

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]6.1e-17157.28Show/hide
Query:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP
        MPNGYQPPKFQQFDGKGNPKQH+AHFIETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPE+I +WEQLE++FLN FYSTRR VSM ELT+ +Q+KGE 
Subjt:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP

Query:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKNIEAT-------IEESMVVN
        V+DYINRWRALSLDCKDRLTE+S+VEMCTQGMHW LLYILQGIKPRTFEELATRAHDMELSI +RG KD L+   + D+  +  T       I+ESMVV+
Subjt:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKNIEAT-------IEESMVVN

Query:  TTPLELSSKEKHKKIE-KQQGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRL
         TPL+  SK K  KIE K  G+E+R  TLKERQ+K+YPF +SDV +MLEQLL+ +LI+LP+CKRP++ EKVDDPNYC YHRVISHPVEKCFVLKELIL+L
Subjt:  TTPLELSSKEKHKKIE-KQQGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRL

Query:  AKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQ--------
        A+E KIEL+++EVAQ+NH AI   T +      L DQ   L+QFGT    +I F   ++   SQ+      ++D+G   +TR+K ++ + +Q        
Subjt:  AKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQ--------

Query:  --KGCCSSERVNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNF---HPVKKVEEHFVSSC-----------YINIEEDGISKEIKQRTSVFDR
          KG  S ++  K +R K    PK  + +D+  ++  R +TL E   R+F   HP + +E   V++C           Y + +E     EI QRTSVFDR
Subjt:  --KGCCSSERVNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNF---HPVKKVEEHFVSSC-----------YINIEEDGISKEIKQRTSVFDR

Query:  IEPSTTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSVL
        I+PSTT++SVFQR+SMA   EE+Q  T   TR S F+RLS+S S+K RPST+ FDRLK+ + Q++R++ S + K F E ++D K+HS + SRMKRK  V 
Subjt:  IEPSTTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSVL

Query:  INTK
        INTK
Subjt:  INTK

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]6.4e-15256.01Show/hide
Query:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP
        MP GYQPPKFQQFDGKGNPKQH+AHF+ETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE+I SWEQLEKEFLN FYSTRRTVSM ELT+ KQRKGEP
Subjt:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP

Query:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKD-------EKNIEATIEESMVVN
        V+DYINRWRALSLDCKDRLTE+S+VEMCTQGMHW LLYILQGIKPRTFEELATRAHDMELSIASRG KD L+  +KKD       EK +++T +ESMVVN
Subjt:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKD-------EKNIEATIEESMVVN

Query:  TTPLELSSKEKHKKIEKQQGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRLA
        TTPL+ S  ++ +  +K  G+ERR LTLKERQ+K+YPFP+SD+ +MLEQLL+ +LI+LPECKRPE+  KVDDPNYC YHRVISHPVEKCFVLKELILRLA
Subjt:  TTPLELSSKEKHKKIEKQQGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRLA

Query:  KEGKIELDLEEVAQSNHAAI---ATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQKGCCS-
        +E +IELDLEEVAQ+NHA +   +  + S++ F    +Q   LVQFGT EPI + F  E+  +  Q     +  +D+G  ++T +KK++    Q+   S 
Subjt:  KEGKIELDLEEVAQSNHAAI---ATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQKGCCS-

Query:  -----SERVNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNFHPVKKVEEHFVSSCYI------------NIEEDGISKEIKQRTSVFDRIE-P
               +  K++++K   K KL    D +  R  RLVTL +   ++F    + E+  V +C+             ++E +G+SK++  R +V D +  P
Subjt:  -----SERVNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNFHPVKKVEEHFVSSCYI------------NIEEDGISKEIKQRTSVFDRIE-P

Query:  STTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTS
          T+T +   +  + A     S T  +T  S    +S+  S
Subjt:  STTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTS

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]2.9e-15256.01Show/hide
Query:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP
        MP GYQPPKFQQFDGKGNPKQH+AHF+ETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE+I SWEQLEKEFLN FYSTRRTVSM ELT+ KQRKGEP
Subjt:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP

Query:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKD-------EKNIEATIEESMVVN
        V+DYINRWRALSLDCKDRLTE+S+VEMCTQGMHW LLYILQGIKPRTFEELATRAHDMELSIASRG KD L+  +KKD       EK +++T++ESMVVN
Subjt:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKD-------EKNIEATIEESMVVN

Query:  TTPLELSSKEKHKKIEKQQGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRLA
        TTPL+ S  ++ +  +K  G+ERR LTLKERQ+K+YPFP+SD+ +MLEQLL+ +LI+LPECKRPE+  KVDDPNYC YHRVISHPVEKCFVLKELILRLA
Subjt:  TTPLELSSKEKHKKIEKQQGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRLA

Query:  KEGKIELDLEEVAQSNHAAI---ATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQKGCCS-
        +E +IELDLEEVAQ+NHA +   +  + S++ F    +Q   LVQFGT EPI + F  E+  +  Q     +  +D+G  ++T +KK++    Q+   S 
Subjt:  KEGKIELDLEEVAQSNHAAI---ATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQKGCCS-

Query:  -----SERVNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNFHPVKKVEEHFVSSCYI------------NIEEDGISKEIKQRTSVFDRIE-P
               +  K++++K   K KL    D +  R  RLVTL +   ++F    + E+  V +C+             ++E +G+SK++  R +V D +  P
Subjt:  -----SERVNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNFHPVKKVEEHFVSSCYI------------NIEEDGISKEIKQRTSVFDRIE-P

Query:  STTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTS
          T+T +   +  + A     S T  +T  S    +S+  S
Subjt:  STTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTS

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein2.9e-15052.33Show/hide
Query:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP
        MP GYQP KFQQFDGKGNPKQHI HF+ETCENAG+RGD LV+QFVR+LKGNAF+                          TRR VSM ELT+  QRKGEP
Subjt:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP

Query:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKNIEAT-------IEESMVVN
        V+DYINRWRALSLDCKD+LTE+S+VEMCTQGMHWELLYILQGIKPRTFEELATRAHDM+LSIA+RG KD L+   + D+  I  T       + ESM+V 
Subjt:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKNIEAT-------IEESMVVN

Query:  TTPLELSSKEKHKKIEK-QQGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRL
         TPL+  SK K  K ++   G+E+R  TL+ERQKK+YPFP+SDV +MLEQL++ +LI+LPECKRPE++ KVDDPNYC YHRVISH VEKCFVLKELI +L
Subjt:  TTPLELSSKEKHKKIEK-QQGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRL

Query:  AKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQKGCCSSER
        A+E KIELD++EVAQ+NH A+   T S      L+DQ   L+QFGT EPI + F  + +   SQ+                   K+E          SE 
Subjt:  AKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQKGCCSSER

Query:  VNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNFHPVKKVEEHFVSSCYINIEEDGISKEIKQRTSVFDRIEPSTTQTSVFQRMSMAAAREEDQ
          +      PR    +   +   V  C   ++ E                V + Y + EE   S EIKQRTSVFD I+P TT++SVFQR+SMA  +EE+Q
Subjt:  VNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNFHPVKKVEEHFVSSCYINIEEDGISKEIKQRTSVFDRIEPSTTQTSVFQRMSMAAAREEDQ

Query:  SLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSVLINTKGSLKVKPNLVILTNPTDEGS
          T    + S F+RLS+S S+K RPST  FDRLK+ + Q++R++ + + K F E ++D K+HS + SRMKRK SV INT+GSL VKP  +I TNP +EG 
Subjt:  SLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSVLINTKGSLKVKPNLVILTNPTDEGS

Query:  DQ
        ++
Subjt:  DQ

A0A5A7SUW1 Retrotransposon gag protein4.0e-16053.11Show/hide
Query:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP
        MP GYQPPKFQQFDGKGNPKQHIAHF+ETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE                                   GEP
Subjt:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP

Query:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKN--------IEATIEESMVV
        V+DYINRWRALSLDCKD+LTE+S+VEMCTQGMHWELLYILQGIKPRTFEEL+TRAHDMELSIA+ G KD L+   K+ +KN            + ESM+V
Subjt:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKN--------IEATIEESMVV

Query:  NTTPLELSSKEKHKKIEKQ-QGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILR
          TPL+  SK K  K E+   G+E+R  TL+ERQKK+YPFP+SDV +MLEQL++ +LI+LPECKRPE+  KVDDPNYC YHRVISHPVEKCFVLKELIL+
Subjt:  NTTPLELSSKEKHKKIEKQ-QGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILR

Query:  LAKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQ-------
        LA+E KIELD++EVAQ+NH A+   T S +    L+DQ   L+QFGT EPI + F  + +   SQ+      +E +    +T KK+++   VQ       
Subjt:  LAKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQ-------

Query:  ---KGCCSSERVNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNF---HPVKKVEEHFVSSC-----------YINIEEDGISKEIKQRTSVFD
           KG  S ++  K +R K   KPK  + +D+   +  R + L E   R+F   HP K +E   V++C           Y + EE   S EIKQRT VF 
Subjt:  ---KGCCSSERVNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNF---HPVKKVEEHFVSSC-----------YINIEEDGISKEIKQRTSVFD

Query:  RIEPSTTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSV
        RI+P T ++SVFQR+SMA   EE+Q  TS   R S F+RLS+ST +K RPST+ FDRLK+ + Q++R++ S + K F E ++D K+HS + SRMKRK SV
Subjt:  RIEPSTTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSV

Query:  LINTKGSLKVKPNLVILTNPTDEGSDQ
         INT+GSL VKP  +I TNP +EG ++
Subjt:  LINTKGSLKVKPNLVILTNPTDEGSDQ

A0A5A7TZU9 Ribonuclease H2.7e-14860.08Show/hide
Query:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP
        MP+GYQPPKFQQFDGKGNPKQH+AHFIETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEPE+I SWEQLE++FLN FYSTRR VSM ELT+ KQRKGEP
Subjt:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP

Query:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKNIEAT-------IEESMVVN
        V+DYINRWRALSLDCKDRLTE+S+VEMCTQGMHW LLYILQGIKPRTFEELATRAHDMELSIA+RG+ DLL+  ++K++K +++T        +E+MVV+
Subjt:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKNIEAT-------IEESMVVN

Query:  TTPLELSSKEKHKKIEKQQG-NERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRL
        TTPL+L SKE  KK+EK+Q   E+R  TLKERQ+K+YPFP+SD+P+ML+QLL+ +LI+LPECKRP E+ +V+DPNYC YHRVISHPVEKCFVLKELIL+L
Subjt:  TTPLELSSKEKHKKIEKQQG-NERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRL

Query:  AKEGKIELDLEEVAQSNHAAIATNTCSQM-CFGSLHDQEVHLVQFGTIEPIAIWFHHEVL------GDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQK
        A + KIEL+L++VAQ+NHAA+   + S++   GS       L+QFG++EP+ I+   E L       D  ++    V+N ++G T++TR+KK+++   QK
Subjt:  AKEGKIELDLEEVAQSNHAAIATNTCSQM-CFGSLHDQEVHLVQFGTIEPIAIWFHHEVL------GDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQK

Query:  GC------CSSERVNKHQRRKNPRK--PKLEEKRDQSPVRHCRLVTLKESFLRNFHPVKKVEEHFVSSCYINIEEDGISKEIKQRT
                 S  +  +   RKNPRK  P +EE    S  R  R + LK+ F +NF P++ V  H  S+     EED       + T
Subjt:  GC------CSSERVNKHQRRKNPRK--PKLEEKRDQSPVRHCRLVTLKESFLRNFHPVKKVEEHFVSSCYINIEEDGISKEIKQRT

A0A5A7TZU9 Ribonuclease H1.1e-1339.51Show/hide
Query:  SSCYINIEEDGISKEIK----QRTSVFDRIEPSTTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTS--RKCRPSTT----VFDRLKVRSSQRE
        ++C+I +EE   S+E K    QR+SVFDRI  S  + SVFQR+S + A++ +Q  T + TR S FQRL+ S    R   P+ T     F RL V  ++ +
Subjt:  SSCYINIEEDGISKEIK----QRTSVFDRIEPSTTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTS--RKCRPSTT----VFDRLKVRSSQRE

Query:  RKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSVLINTKGSLKVKPNLVILTNPTDEGSDQD
        +K   S       V  D+++ S   SRMKRK  V +NT+GSLKVK + V+ T P D   + +
Subjt:  RKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSVLINTKGSLKVKPNLVILTNPTDEGSDQD

A0A5A7URH1 Ty3-gypsy retrotransposon protein3.8e-16655.32Show/hide
Query:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP
        MPNGYQPPKFQQFDGKGNPKQH+AHFI+TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPE+I +WEQLE++FLN FYSTR  VSM ELT+ +Q+KGE 
Subjt:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP

Query:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKNIEAT-------IEESMVVN
        V+DYINRWRALSLDCKDRLTE+S+VEMCTQGMHW LLYILQGIKPRTFEELATRAHDMELSIA+RG KD L+   + D+  ++ T       I+ESMVV+
Subjt:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKNIEAT-------IEESMVVN

Query:  TTPLELSSKEKHKKIE-KQQGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRL
         TPL+  SK K  KIE K  G+E+R  TLKERQ+K+YPFP+SDV +MLEQLL+ +LI+LPECKRPE+  KVDDPNYC YHRVISHPVEKCFVLKELIL+L
Subjt:  TTPLELSSKEKHKKIE-KQQGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRL

Query:  AKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQKGCCSSER
        A+E KIELD++EVAQ+NHA                                               + + +N  KG       K ++   +++    +E 
Subjt:  AKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQKGCCSSER

Query:  VNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNFHPVKKVEEHFVSSCYINIEEDGISKEIKQRTSVFDRIEPSTTQTSVFQRMSMAAAREEDQ
        +        PR    ++  +   V  C   ++ E           V+ ++ SS  +N      S EI QRTSVFDRI+PSTT++SVFQR+S+A   EE+Q
Subjt:  VNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNFHPVKKVEEHFVSSCYINIEEDGISKEIKQRTSVFDRIEPSTTQTSVFQRMSMAAAREEDQ

Query:  SLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSVLINTKGSLKVKPNLVILTNPTDEGS
          T   TR S  +RLS+ST +K RPST+ FDRLK+ + Q++R++ SS+ K F E ++D K+HS + SRMKRK  V INT+GSL VKP  +I TNPT+EG 
Subjt:  SLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSVLINTKGSLKVKPNLVILTNPTDEGS

Query:  DQ
        +Q
Subjt:  DQ

A0A5D3BX77 Retrotransposon gag protein3.0e-17157.28Show/hide
Query:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP
        MPNGYQPPKFQQFDGKGNPKQH+AHFIETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPE+I +WEQLE++FLN FYSTRR VSM ELT+ +Q+KGE 
Subjt:  MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEP

Query:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKNIEAT-------IEESMVVN
        V+DYINRWRALSLDCKDRLTE+S+VEMCTQGMHW LLYILQGIKPRTFEELATRAHDMELSI +RG KD L+   + D+  +  T       I+ESMVV+
Subjt:  VVDYINRWRALSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKNIEAT-------IEESMVVN

Query:  TTPLELSSKEKHKKIE-KQQGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRL
         TPL+  SK K  KIE K  G+E+R  TLKERQ+K+YPF +SDV +MLEQLL+ +LI+LP+CKRP++ EKVDDPNYC YHRVISHPVEKCFVLKELIL+L
Subjt:  TTPLELSSKEKHKKIE-KQQGNERRHLTLKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRL

Query:  AKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQ--------
        A+E KIEL+++EVAQ+NH AI   T +      L DQ   L+QFGT    +I F   ++   SQ+      ++D+G   +TR+K ++ + +Q        
Subjt:  AKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQEVHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQ--------

Query:  --KGCCSSERVNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNF---HPVKKVEEHFVSSC-----------YINIEEDGISKEIKQRTSVFDR
          KG  S ++  K +R K    PK  + +D+  ++  R +TL E   R+F   HP + +E   V++C           Y + +E     EI QRTSVFDR
Subjt:  --KGCCSSERVNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNF---HPVKKVEEHFVSSC-----------YINIEEDGISKEIKQRTSVFDR

Query:  IEPSTTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSVL
        I+PSTT++SVFQR+SMA   EE+Q  T   TR S F+RLS+S S+K RPST+ FDRLK+ + Q++R++ S + K F E ++D K+HS + SRMKRK  V 
Subjt:  IEPSTTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHNDKKLHSTISSRMKRKFSVL

Query:  INTK
        INTK
Subjt:  INTK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTAATGGATACCAGCCACCGAAGTTTCAACAATTCGACGGGAAGGGCAATCCCAAGCAGCACATTGCTCATTTCATTGAGACGTGTGAAAATGCTGGTACTAGAGG
AGATCTACTAGTCAAGCAGTTCGTACGAACACTGAAGGGAAATGCTTTCGATTGGTATACAGACCTTGAGCCTGAAACTATTGGTAGTTGGGAGCAACTTGAGAAGGAGT
TCCTCAACCACTTCTACAGCACAAGACGAACCGTCAGTATGACGGAGCTCACCAGTGCTAAACAACGGAAAGGTGAGCCAGTTGTTGACTACATCAACCGTTGGAGAGCT
CTGAGCCTAGACTGCAAGGACCGACTCACCGAAGTGTCTTCTGTTGAGATGTGCACTCAAGGCATGCACTGGGAACTCCTCTACATCCTTCAAGGTATAAAGCCTCGCAC
CTTTGAAGAACTAGCAACTCGCGCTCACGATATGGAGTTAAGTATTGCTAGTCGAGGAGATAAAGACCTTCTACTCTCTAATATGAAGAAAGACGAAAAGAACATTGAAG
CAACTATAGAAGAATCTATGGTTGTAAATACAACCCCTCTCGAGTTGTCTTCTAAAGAAAAACACAAGAAGATTGAAAAGCAACAAGGAAATGAAAGACGTCACTTGACT
TTGAAGGAAAGACAGAAAAAGATCTATCCTTTTCCAAATTCTGACGTCCCTAACATGTTAGAACAACTACTGAAAACGCGGTTGATAGAACTCCCCGAGTGCAAACGACC
AGAAGAGATTGAAAAGGTCGATGATCCCAACTACTGCATGTATCATCGAGTTATCAGCCACCCAGTAGAAAAATGTTTTGTCCTGAAAGAGTTAATTCTAAGGTTAGCTA
AAGAAGGAAAAATCGAATTGGACCTTGAAGAAGTAGCTCAATCAAATCATGCTGCAATAGCAACAAACACTTGCAGTCAAATGTGTTTTGGTTCGCTTCATGATCAAGAG
GTGCACCTAGTCCAATTTGGAACCATTGAACCTATAGCTATTTGGTTTCATCATGAGGTATTAGGAGACAAATCTCAAGATCCTGTCAATCTTGTCAATAATGAAGATAA
AGGAAGTACTATCATGACTCGTAAGAAGAAACAAGAACGACATCTCGTTCAAAAAGGATGTTGTTCATCAGAAAGAGTGAATAAACATCAAAGACGGAAGAATCCAAGGA
AACCTAAACTTGAGGAAAAGAGAGATCAAAGTCCTGTTCGACATTGTCGACTAGTAACCTTAAAAGAATCGTTTCTGAGAAATTTTCATCCCGTCAAGAAAGTGGAAGAA
CACTTTGTAAGTTCATGTTATATCAACATAGAAGAAGATGGTATATCAAAGGAAATCAAGCAAAGGACTTCAGTCTTTGATCGCATTGAACCTTCAACTACTCAAACTTC
GGTCTTTCAAAGAATGAGTATGGCTGCAGCAAGAGAAGAAGATCAAAGCTTGACGTCCGCCATCACTCGACATTCAGTTTTCCAAAGACTAAGCGTCTCTACATCAAGAA
AGTGTCGACCTTCAACAACTGTTTTTGATCGCCTCAAAGTAAGAAGCAGCCAACGTGAAAGAAAGGTGGACAGTTCAGAAGTTAAATTATTCGATGAAGTACACAATGAC
AAGAAGCTTCATAGTACCATTTCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAACACAAAAGGTTCCTTGAAGGTGAAACCAAATCTTGTCATTTTGACAAATCC
TACAGATGAAGGATCTGATCAAGACTATGGTTCTCACGCGTTCCGCTACAGTTCCTTCTCTCCAAGTACGAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCACGTTGCTTC
ACTGCAGTTCCTTTCTCCAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGCTGCAGTTCCTTCTCTCCAAGTTCAAAGGTTCTCACGCGTTCTGC
TACAGTTCCTTCTCTCCAAGTGTAAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCACGTTGCTTCACTACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGC
CGTAGTTCCTTCTCTCCAAGTACGAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCTACGTTGCTTCCTGCAGTTCCTTCCTCCAAATTCGAAGGTTTCGAAGGTTCTCACG
CGTTGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAAGTTCTCATGCGCTTCGCTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCT
CCAAGTTCGAAGGTTCCTCACGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTGCTCCTTT
TCCAAGTTTGAAGGTGCTTCTCTCTGTTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAATGTGCTTCTCTCCACCCCTCCTTTTG
AAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCATCGTTGCTTCTTTTCAAATGTTTGGCGGCGGTTGACGGCCTCGTTCCACTCCATTTTCAAATGTTG
GCAGTTGACGGCGTCCGCTGCGCTTCATCTTCAAATACAGGTGGTGAAATCACTGCAATTGAATCTGATGACGACCGTTGTAGGAGTCTGGTGACCACCCTTGCAGGTTA
CTCAGATCACCCAATAAAATGGGACTGGTCTAGCAGGAGTGCATCACTGAAGGAGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGACTGGT
CTAGCATGAGTGCATCACTGTTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATCACTGTAGGCAAATCTGGAGTGCATCACTGAAGGCGAATCTG
GTGACTACTCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGAGTGCATCACTGAAGGCGAATCTAGTGACTACTCCTGCAGGTTACTCAGATCACCCAATAA
AATAGGGGACTGGTCTAGCAGGAGTGAAATAACTGCAATCGAATTTGGTGACGACTGTTGCACGATGCTTACACTCTCAAGCAATAAAGAAATAGATTGCAGTAATATGT
GTGCAAATAAAAAGTACCAACACCATCGCCGCAAACCCATTTTATTTTACTTAACCTCCGCTTATCAAAACGCCGACAACTCTCAGGAAAGCAGCATCCTTACTTTCTTC
ACCGTAAAACCTTCGATCAATGTCCAATGGAGACCTATGGCGGTGGTGTATTCGCACGAACGCGGACTCGGCAACGACAACATTTTTGTTCTCTTCCTTCCGATTTCTTG
CAATCTAACCGCGACGAGTCTTTTCCCTTCCTCTGACTGTCGTTCGACGTTAGTGGAAAGCTCAAGTGGGACACAAACAACGGCGGGGTATTCAACAACTCTACTGTTTG
CACAAACCCACAATGATGGAACCTCCTCCCTCGATGCAACAAAACTGATGATCGACGATGTTGAGCCCATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTAATGGATACCAGCCACCGAAGTTTCAACAATTCGACGGGAAGGGCAATCCCAAGCAGCACATTGCTCATTTCATTGAGACGTGTGAAAATGCTGGTACTAGAGG
AGATCTACTAGTCAAGCAGTTCGTACGAACACTGAAGGGAAATGCTTTCGATTGGTATACAGACCTTGAGCCTGAAACTATTGGTAGTTGGGAGCAACTTGAGAAGGAGT
TCCTCAACCACTTCTACAGCACAAGACGAACCGTCAGTATGACGGAGCTCACCAGTGCTAAACAACGGAAAGGTGAGCCAGTTGTTGACTACATCAACCGTTGGAGAGCT
CTGAGCCTAGACTGCAAGGACCGACTCACCGAAGTGTCTTCTGTTGAGATGTGCACTCAAGGCATGCACTGGGAACTCCTCTACATCCTTCAAGGTATAAAGCCTCGCAC
CTTTGAAGAACTAGCAACTCGCGCTCACGATATGGAGTTAAGTATTGCTAGTCGAGGAGATAAAGACCTTCTACTCTCTAATATGAAGAAAGACGAAAAGAACATTGAAG
CAACTATAGAAGAATCTATGGTTGTAAATACAACCCCTCTCGAGTTGTCTTCTAAAGAAAAACACAAGAAGATTGAAAAGCAACAAGGAAATGAAAGACGTCACTTGACT
TTGAAGGAAAGACAGAAAAAGATCTATCCTTTTCCAAATTCTGACGTCCCTAACATGTTAGAACAACTACTGAAAACGCGGTTGATAGAACTCCCCGAGTGCAAACGACC
AGAAGAGATTGAAAAGGTCGATGATCCCAACTACTGCATGTATCATCGAGTTATCAGCCACCCAGTAGAAAAATGTTTTGTCCTGAAAGAGTTAATTCTAAGGTTAGCTA
AAGAAGGAAAAATCGAATTGGACCTTGAAGAAGTAGCTCAATCAAATCATGCTGCAATAGCAACAAACACTTGCAGTCAAATGTGTTTTGGTTCGCTTCATGATCAAGAG
GTGCACCTAGTCCAATTTGGAACCATTGAACCTATAGCTATTTGGTTTCATCATGAGGTATTAGGAGACAAATCTCAAGATCCTGTCAATCTTGTCAATAATGAAGATAA
AGGAAGTACTATCATGACTCGTAAGAAGAAACAAGAACGACATCTCGTTCAAAAAGGATGTTGTTCATCAGAAAGAGTGAATAAACATCAAAGACGGAAGAATCCAAGGA
AACCTAAACTTGAGGAAAAGAGAGATCAAAGTCCTGTTCGACATTGTCGACTAGTAACCTTAAAAGAATCGTTTCTGAGAAATTTTCATCCCGTCAAGAAAGTGGAAGAA
CACTTTGTAAGTTCATGTTATATCAACATAGAAGAAGATGGTATATCAAAGGAAATCAAGCAAAGGACTTCAGTCTTTGATCGCATTGAACCTTCAACTACTCAAACTTC
GGTCTTTCAAAGAATGAGTATGGCTGCAGCAAGAGAAGAAGATCAAAGCTTGACGTCCGCCATCACTCGACATTCAGTTTTCCAAAGACTAAGCGTCTCTACATCAAGAA
AGTGTCGACCTTCAACAACTGTTTTTGATCGCCTCAAAGTAAGAAGCAGCCAACGTGAAAGAAAGGTGGACAGTTCAGAAGTTAAATTATTCGATGAAGTACACAATGAC
AAGAAGCTTCATAGTACCATTTCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAACACAAAAGGTTCCTTGAAGGTGAAACCAAATCTTGTCATTTTGACAAATCC
TACAGATGAAGGATCTGATCAAGACTATGGTTCTCACGCGTTCCGCTACAGTTCCTTCTCTCCAAGTACGAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCACGTTGCTTC
ACTGCAGTTCCTTTCTCCAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGCTGCAGTTCCTTCTCTCCAAGTTCAAAGGTTCTCACGCGTTCTGC
TACAGTTCCTTCTCTCCAAGTGTAAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCACGTTGCTTCACTACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGC
CGTAGTTCCTTCTCTCCAAGTACGAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCTACGTTGCTTCCTGCAGTTCCTTCCTCCAAATTCGAAGGTTTCGAAGGTTCTCACG
CGTTGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAAGTTCTCATGCGCTTCGCTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCT
CCAAGTTCGAAGGTTCCTCACGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTGCTCCTTT
TCCAAGTTTGAAGGTGCTTCTCTCTGTTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAATGTGCTTCTCTCCACCCCTCCTTTTG
AAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCATCGTTGCTTCTTTTCAAATGTTTGGCGGCGGTTGACGGCCTCGTTCCACTCCATTTTCAAATGTTG
GCAGTTGACGGCGTCCGCTGCGCTTCATCTTCAAATACAGGTGGTGAAATCACTGCAATTGAATCTGATGACGACCGTTGTAGGAGTCTGGTGACCACCCTTGCAGGTTA
CTCAGATCACCCAATAAAATGGGACTGGTCTAGCAGGAGTGCATCACTGAAGGAGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGACTGGT
CTAGCATGAGTGCATCACTGTTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATCACTGTAGGCAAATCTGGAGTGCATCACTGAAGGCGAATCTG
GTGACTACTCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGAGTGCATCACTGAAGGCGAATCTAGTGACTACTCCTGCAGGTTACTCAGATCACCCAATAA
AATAGGGGACTGGTCTAGCAGGAGTGAAATAACTGCAATCGAATTTGGTGACGACTGTTGCACGATGCTTACACTCTCAAGCAATAAAGAAATAGATTGCAGTAATATGT
GTGCAAATAAAAAGTACCAACACCATCGCCGCAAACCCATTTTATTTTACTTAACCTCCGCTTATCAAAACGCCGACAACTCTCAGGAAAGCAGCATCCTTACTTTCTTC
ACCGTAAAACCTTCGATCAATGTCCAATGGAGACCTATGGCGGTGGTGTATTCGCACGAACGCGGACTCGGCAACGACAACATTTTTGTTCTCTTCCTTCCGATTTCTTG
CAATCTAACCGCGACGAGTCTTTTCCCTTCCTCTGACTGTCGTTCGACGTTAGTGGAAAGCTCAAGTGGGACACAAACAACGGCGGGGTATTCAACAACTCTACTGTTTG
CACAAACCCACAATGATGGAACCTCCTCCCTCGATGCAACAAAACTGATGATCGACGATGTTGAGCCCATGTGA
Protein sequenceShow/hide protein sequence
MPNGYQPPKFQQFDGKGNPKQHIAHFIETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPETIGSWEQLEKEFLNHFYSTRRTVSMTELTSAKQRKGEPVVDYINRWRA
LSLDCKDRLTEVSSVEMCTQGMHWELLYILQGIKPRTFEELATRAHDMELSIASRGDKDLLLSNMKKDEKNIEATIEESMVVNTTPLELSSKEKHKKIEKQQGNERRHLT
LKERQKKIYPFPNSDVPNMLEQLLKTRLIELPECKRPEEIEKVDDPNYCMYHRVISHPVEKCFVLKELILRLAKEGKIELDLEEVAQSNHAAIATNTCSQMCFGSLHDQE
VHLVQFGTIEPIAIWFHHEVLGDKSQDPVNLVNNEDKGSTIMTRKKKQERHLVQKGCCSSERVNKHQRRKNPRKPKLEEKRDQSPVRHCRLVTLKESFLRNFHPVKKVEE
HFVSSCYINIEEDGISKEIKQRTSVFDRIEPSTTQTSVFQRMSMAAAREEDQSLTSAITRHSVFQRLSVSTSRKCRPSTTVFDRLKVRSSQRERKVDSSEVKLFDEVHND
KKLHSTISSRMKRKFSVLINTKGSLKVKPNLVILTNPTDEGSDQDYGSHAFRYSSFSPSTKVLSSKSKVLTLLHCSSFLQVRSSFLQVRRFSCALLQFLLSKFKGSHAFC
YSSFSPSVKVLSSKSKVLTLLHYSSFLQVRRFSRCFAVVPSLQVRRFSPPSRRFSTLLPAVPSSKFEGFEGSHALRFVAVLSSKFESSHALRCYLPPSSKVLSRAAAVPS
PSSKVPHASLAPSPSSKVLLSTAPSPSSKALLSAAPFPSLKVLLSVAPSPSSKALLSTAPSPSSNVLLSTPPFEGSPLRFSFSKFEGSSLLLFKCLAAVDGLVPLHFQML
AVDGVRCASSSNTGGEITAIESDDDRCRSLVTTLAGYSDHPIKWDWSSRSASLKENLVTTPAGYSDHPIKWDWSSMSASLLLRSPNKMGTGLAGVHHCRQIWSASLKANL
VTTPAGYSDHPIKWGLECITEGESSDYSCRLLRSPNKIGDWSSRSEITAIEFGDDCCTMLTLSSNKEIDCSNMCANKKYQHHRRKPILFYLTSAYQNADNSQESSILTFF
TVKPSINVQWRPMAVVYSHERGLGNDNIFVLFLPISCNLTATSLFPSSDCRSTLVESSSGTQTTAGYSTTLLFAQTHNDGTSSLDATKLMIDDVEPM