CuGenDBv2

Lag0035726 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035726
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ty3-gypsy retrotransposon protein
Genome location: chr3:28770337..28777283
RNA-Seq Expression: Lag0035726
Synteny: Lag0035726
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e-value: 3.5e-133 | identity: 54.36%
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE
        MI + I+ QYGGP Q   LY KPYTKRIDNLRM  GYQPPKFQQFDG+GNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPESID+WE
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE

Query:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ
        +LER+FLNRFYSTR  V M ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TFEELAT AHDMELSIA+R  +
Subjt:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ

Query:  DLLLPNVRNEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKC------
        D L+P  R++    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+  MLEQLLE QLI+LP+C      
Subjt:  DLLLPNVRNEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKC------

Query:  ------------------------------------KIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFS
                                            KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    +
Subjt:  ------------------------------------KIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFS

Query:  KTFHKKEKENLATFYCIDVEEVDNSKKSEQRTSVFDHIKPPTTRPSVFQRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSKPSTSIFDRLKVTNSQ
         +  + +    ++      +EV+NS +  QRTSVFD IKP TTR SVFQR+S+   EEENQC     TR S  +RLS+ST KK +PSTS FDRLK+TN Q
Subjt:  KTFHKKEKENLATFYCIDVEEVDNSKKSEQRTSVFDHIKPPTTRPSVFQRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSKPSTSIFDRLKVTNSQ

Query:  SKRKMDNLEMKLFDEVNNDKKPQSSILS
         +R+M + + K F E N+D K  S + S
Subjt:  SKRKMDNLEMKLFDEVNNDKKPQSSILS

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e-value: 6.9e-129 | identity: 47.71%
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE
        MI + I+ QYGGP Q   LYSKPYTKRIDNLRM  GYQPPKFQQFDG+GNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPESID+WE
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE

Query:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ
        +LER+FLNRFYSTRR V M ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TFEELAT AHDMELSI +R  +
Subjt:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ

Query:  DLLLPNVRNEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKC------
        D L+P  R++     +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+  MLEQLLE QLI+LPKC      
Subjt:  DLLLPNVRNEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKC------

Query:  ------------------------------------KIELDLDEVAQSN-----------------------------------LATIKGKSKHQRKKD-
                                            KIEL++DEVAQ+N                                   + TI  ++K    KD 
Subjt:  ------------------------------------KIELDLDEVAQSN-----------------------------------LATIKGKSKHQRKKD-

Query:  --------------PKKLQ-----------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKEKENLATFYCIDVEEVDNS
                      P  +Q                  K +R+KK             F Q ++ + L +   ++F   H +E   + T +   + EV+N+
Subjt:  --------------PKKLQ-----------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKEKENLATFYCIDVEEVDNS

Query:  KKS----------EQRTSVFDHIKPPTTRPSVFQRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSKPSTSIFDRLKVTNSQSKRKMDNLEMKLFDE
          S           QRTSVFD IKP TTR SVFQR+SM   EEENQC     TR S F+RLS+S SKK++PSTS FDRLK+TN Q +R+M +L+ K F E
Subjt:  KKS----------EQRTSVFDHIKPPTTRPSVFQRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSKPSTSIFDRLKVTNSQSKRKMDNLEMKLFDE

Query:  VNNDKKPQSSILSLPTLKSSRVLTLYAIALFLL
         N+D K  S + S    K    +   A  +F+L
Subjt:  VNNDKKPQSSILSLPTLKSSRVLTLYAIALFLL

XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus] | e-value: 3.9e-116 | identity: 73.47%
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE
        MITN IRAQYGGP+Q S +YSKPYTKRIDNLRM +GYQPPKFQQFDG+GNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPESI+SWE
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE

Query:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ
        +LE+EFLNRFYSTRRTV M ELTNTKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TFEELAT AHDMELSIASR  +
Subjt:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ

Query:  DLLLPNVRNEGRN-------DEETIEESMVVNTT-LPKSSSKEKR---QTNGA--HHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKCK
        D L+P V+ + +         + T +ESMVVNTT L  S  KE R   + +G+    LTLKERQ+K+YPFPD+DI  MLEQLLE QLI+LP+CK
Subjt:  DLLLPNVRNEGRN-------DEETIEESMVVNTT-LPKSSSKEKR---QTNGA--HHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKCK

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus] | e-value: 5.1e-116 | identity: 73.13%
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE
        MIT+ IRAQYGGP+Q S +YSKPYTKRIDNLRM +GYQPPKFQQFDG+GNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPESI+SWE
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE

Query:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ
        +LE+EFLNRFYSTRRTV M ELTNTKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TFEELAT AHDMELSIASR  +
Subjt:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ

Query:  DLLLPNVRNEGRN-------DEETIEESMVVNTT-LPKSSSKEKR---QTNGA--HHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKCK
        D L+P V+ + +         + T++ESMVVNTT L  S  KE R   + +G+    LTLKERQ+K+YPFPD+DI  MLEQLLE QLI+LP+CK
Subjt:  DLLLPNVRNEGRN-------DEETIEESMVVNTT-LPKSSSKEKR---QTNGA--HHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKCK

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus] | e-value: 3.9e-116 | identity: 73.47%
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE
        MITN IRAQYGGP+Q S +YSKPYTKRIDNLRM +GYQPPKFQQFDG+GNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPESI+SWE
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE

Query:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ
        +LE+EFLNRFYSTRRTV M ELTNTKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TFEELAT AHDMELSIASR  +
Subjt:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ

Query:  DLLLPNVRNEGRN-------DEETIEESMVVNTT-LPKSSSKEKR---QTNGA--HHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKCK
        D L+P V+ + +         + T +ESMVVNTT L  S  KE R   + +G+    LTLKERQ+K+YPFPD+DI  MLEQLLE QLI+LP+CK
Subjt:  DLLLPNVRNEGRN-------DEETIEESMVVNTT-LPKSSSKEKR---QTNGA--HHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKCK

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7TZU9 Ribonuclease H | e-value: 4.2e-116 | identity: 70.49%
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE
        MI N I+ QYGGP Q   LYSKPYTKRIDN+RM  GYQPPKFQQFDG+GNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE

Query:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ
        +LER+FLNRFYSTRR V M ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TFEELAT AHDMELSIA+R N 
Subjt:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ

Query:  DLLLPNVRNEGRNDEET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKCKIELDLDE
        DLL+P VR E +  + T        +E+MVV+TT    + K    EKRQ  G     TLKERQ+K+YPFPD+D+P ML+QLLE QLI+LP+CK   ++  
Subjt:  DLLLPNVRNEGRNDEET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKCKIELDLDE

Query:  VAQSN
        V   N
Subjt:  VAQSN

A0A5A7TZU9 Ribonuclease H | e-value: 1.3e+00 | identity: 33.33%
Query:  KLQPKRKRSKK--FSQPQQLVMLNKSFSKTFH--KKEKENLATFYCIDVEEVDNSKKSE----QRTSVFDHIKPPTTRPSVFQRMSMVATEEENQCSMST
        +L P +K+ +K  +S P     +    S+      K K  +A    I VEE  +S++ +    QR+SVFD I     RPSVFQR+S    ++ NQ S  +
Subjt:  KLQPKRKRSKK--FSQPQQLVMLNKSFSKTFH--KKEKENLATFYCIDVEEVDNSKKSE----QRTSVFDHIKPPTTRPSVFQRMSMVATEEENQCSMST

Query:  STRPSAFQRLSVSTSK----KSKPST--SIFDRLKVTNSQSKRKMDNLEMKLFDEVNNDKKPQSSILS
        STR SAFQRL+ S  K       P+T  S F RL V+ ++ ++K           V  D++ +S+  S
Subjt:  STRPSAFQRLSVSTSK----KSKPST--SIFDRLKVTNSQSKRKMDNLEMKLFDEVNNDKKPQSSILS

A0A5A7TZU9 Ribonuclease H | e-value: 1.4e-114 | identity: 63.77%
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE
        MITN IRAQYGG +Q SLLYSKPYTKRI++LRM  GYQPPKFQ FDG+GNPKQHIAHFVETCENAGTRGD LVKQFVRTLKGNAFDWYTDLEPE+I+SWE
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE

Query:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ
        +LE+EFLNRFYST+RTV M ELTN++QRKGE VV YINRWRA+SLDCKDRLTELS+VE+C QGMHWELLYIL+ IKP TFEELAT AHDMELSIA+R ++
Subjt:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ

Query:  DLLLPNVRNEGRNDEETIEESMVVNTTLPKSSSK------EKRQTNGAHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKC--------------
        D L+ ++  E ++ E+T  E   VNT  PK  SK      EK++ N    L+LKERQ+K+YPFP++DIPYMLEQLLE +LI LP+C              
Subjt:  DLLLPNVRNEGRNDEETIEESMVVNTTLPKSSSK------EKRQTNGAHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKC--------------

Query:  ----------------------------KIELDLDEVAQSNLATI
                                    KIELD +E+AQSN A +
Subjt:  ----------------------------KIELDLDEVAQSNLATI

A0A5A7URH1 Ty3-gypsy retrotransposon protein | e-value: 1.7e-133 | identity: 54.36%
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE
        MI + I+ QYGGP Q   LY KPYTKRIDNLRM  GYQPPKFQQFDG+GNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPESID+WE
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE

Query:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ
        +LER+FLNRFYSTR  V M ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TFEELAT AHDMELSIA+R  +
Subjt:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ

Query:  DLLLPNVRNEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKC------
        D L+P  R++    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+  MLEQLLE QLI+LP+C      
Subjt:  DLLLPNVRNEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKC------

Query:  ------------------------------------KIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFS
                                            KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    +
Subjt:  ------------------------------------KIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFS

Query:  KTFHKKEKENLATFYCIDVEEVDNSKKSEQRTSVFDHIKPPTTRPSVFQRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSKPSTSIFDRLKVTNSQ
         +  + +    ++      +EV+NS +  QRTSVFD IKP TTR SVFQR+S+   EEENQC     TR S  +RLS+ST KK +PSTS FDRLK+TN Q
Subjt:  KTFHKKEKENLATFYCIDVEEVDNSKKSEQRTSVFDHIKPPTTRPSVFQRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSKPSTSIFDRLKVTNSQ

Query:  SKRKMDNLEMKLFDEVNNDKKPQSSILS
         +R+M + + K F E N+D K  S + S
Subjt:  SKRKMDNLEMKLFDEVNNDKKPQSSILS

A0A5A7VE63 Uncharacterized protein | e-value: 2.3e-114 | identity: 70.16%
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE
        MI N I+ QYGGP Q   LYSKPYTKRIDN+RM  GYQ PKFQQFDG+GNPKQH+AHF+ETCE AGTRGDLLVKQFV+TLKGNAFDWYTDLEPESIDSWE
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE

Query:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ
        +LER+FLNRFYSTRR V M ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TFEELAT AHDMELSIA+R N 
Subjt:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ

Query:  DLLLPNVRNEGRNDEET-------IEESMVVNTTLPKSSSKEKR----QTNG-AHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKCKIELDLDE
        DLL+P VR E +  + T        +E+MVV+TT  K  SKEK+    Q  G     TLKERQ+K+YPFPD+D+P ML+QLLE QLI+LP+CK   ++  
Subjt:  DLLLPNVRNEGRNDEET-------IEESMVVNTTLPKSSSKEKR----QTNG-AHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKCKIELDLDE

Query:  VAQSN
        V   N
Subjt:  VAQSN

A0A5A7VE63 Uncharacterized protein | e-value: 1.4e-02 | identity: 40.37%
Query:  KEKENLATFYCIDVEEVDNSKKSE----QRTSVFDHIKPPTTRPSVFQRMSMVATEEENQCSMSTSTRPSAFQRLSV------STSKKSKPSTSIFDRLK
        K K  +A    I +EE  +SK+ +    QR+SVFD I     RPSVFQR+S    ++ NQ S  +STR SAFQRL+       S S  S    S F RL 
Subjt:  KEKENLATFYCIDVEEVDNSKKSE----QRTSVFDHIKPPTTRPSVFQRMSMVATEEENQCSMSTSTRPSAFQRLSV------STSKKSKPSTSIFDRLK

Query:  VTNSQSKRK
        V+ ++ ++K
Subjt:  VTNSQSKRK

A0A5D3BX77 Retrotransposon gag protein | e-value: 3.3e-129 | identity: 47.71%
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE
        MI + I+ QYGGP Q   LYSKPYTKRIDNLRM  GYQPPKFQQFDG+GNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPESID+WE
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE

Query:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ
        +LER+FLNRFYSTRR V M ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TFEELAT AHDMELSI +R  +
Subjt:  KLEREFLNRFYSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQ

Query:  DLLLPNVRNEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKC------
        D L+P  R++     +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+  MLEQLLE QLI+LPKC      
Subjt:  DLLLPNVRNEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKC------

Query:  ------------------------------------KIELDLDEVAQSN-----------------------------------LATIKGKSKHQRKKD-
                                            KIEL++DEVAQ+N                                   + TI  ++K    KD 
Subjt:  ------------------------------------KIELDLDEVAQSN-----------------------------------LATIKGKSKHQRKKD-

Query:  --------------PKKLQ-----------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKEKENLATFYCIDVEEVDNS
                      P  +Q                  K +R+KK             F Q ++ + L +   ++F   H +E   + T +   + EV+N+
Subjt:  --------------PKKLQ-----------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKEKENLATFYCIDVEEVDNS

Query:  KKS----------EQRTSVFDHIKPPTTRPSVFQRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSKPSTSIFDRLKVTNSQSKRKMDNLEMKLFDE
          S           QRTSVFD IKP TTR SVFQR+SM   EEENQC     TR S F+RLS+S SKK++PSTS FDRLK+TN Q +R+M +L+ K F E
Subjt:  KKS----------EQRTSVFDHIKPPTTRPSVFQRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSKPSTSIFDRLKVTNSQSKRKMDNLEMKLFDE

Query:  VNNDKKPQSSILSLPTLKSSRVLTLYAIALFLL
         N+D K  S + S    K    +   A  +F+L
Subjt:  VNNDKKPQSSILSLPTLKSSRVLTLYAIALFLL

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTATATTCCAAACCTTATACTAAGAGGATTGATAACTTGAGAATGACAATCGGGTA
TCAGCCACCAAAATTTCAGCAATTCGATGGAAGGGGCAATCCTAAACAACATATTGCTCACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGATCTACTAGTCA
AACAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCAATAGACAGTTGGGAGAAACTCGAAAGAGAGTTCTTGAATCGCTTT
TACAGCACTAGAAGAACCGTTAGGATGTTCGAGCTCACCAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCCATAAGTCTAGATTG
CAAAGATCGTCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTTTACATCCTTAAAGGTATAAAGCCTCACACCTTTGAGGAGCTAG
CAACTCACGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTACTCCCTAATGTGAGGAATGAAGGAAGGAACGACGAAGAGACTATAGAAGAG
TCCATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCAAAAGAAAAACGACAAACAAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTT
CCCTGATGCCGACATCCCTTATATGCTGGAACAATTACTGGAAGCGCAACTGATAGAGCTTCCTAAGTGCAAAATTGAGCTCGACCTTGATGAAGTAGCCCAATCAAATC
TTGCTACAATCAAAGGAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATG
TTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGGAAAAAGAGAACCTTGCAACTTTCTACTGCATCGATGTAGAAGAAGTTGACAATTCCAAGAAAAGTGAACAAAG
GACTTCCGTCTTCGATCACATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGTTGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCA
CTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTAAACCTTCAACATCTATTTTTGATCGCCTCAAAGTAACAAACAGTCAATCTAAAAGAAAG
ATGGATAACTTGGAGATGAAACTTTTTGATGAAGTAAACAACGACAAGAAGCCTCAAAGTAGCATCCTGTCACTTCCTACTCTCAAAAGTTCAAGGGTCCTTACACTGTA
CGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTCTATGACTGCTACGTTGTTCC
TCTTCCAAGTCCGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTCGCTGTAGTTCCTTCTCCCCAAGTTCGA
AGGTTCTCACGCGCTTCGTTGCAGTTCCTTCTCTCCAAGGTCGAAGGTTCTCACGCGCTGCGTTGCAGTTCTTTCTCCCAAAGTTCGAAGGTTCACGCACTTCGCTGCAG
TTCCTTCTCCCAAATTCGAACGTTCTCACGCGTTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTTTATGTTGCTGTAGTTCATTCTTCCAAGTTCGAAGGTTCTCAC
ATCACTTTGTTGCAGTTCCTTCTTCCAAGTTCGAAGGTTCTCACCGTTTCGCTGCAGTTCCTTCCCCCAAAATTCAAAGGTCCTCACGCCGCTTCGCTCCAGTTCCTTCC
TCCAAGTTCGAAGGTTCTCACGAGCTTCGATGCAATTCCTTCTTCCAAGTTCGAAGGTTCTCACGCGTTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGAG
CTTCGCTGCAGTTCCTTCCTCCATGTTCGAAGGTTCTCACGCGCTTGCTGCAGTTCCTTCCTCCAAGTTCGAGGGTGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCA
AGTTTGAGGGTTCACATGCATTTTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTC
GCTGCAGTTCCTTCCTCCATGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGTTGCGTTCTTTCC
TCCCAAGTTCAAAGGTTCTCACGTTTCACACGTCGCTTCGCTGCGTTGTTTCCTCCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGTTTCACTGCGTTCCTTTCTCCA
AGTTCGAAGGTTCTCACGTCGCTTGTTCTAACGCGCTTCGCTGTAGTGTAGTTCCTTCCTCCAAATTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGC
TGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGCTACGTTCTTTCCTCCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGCTGCA
TTCTTTCCTCCCAAGTTCGAAGGTTCTAACGTTTCACACATCGCTTCGCTACGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCCGCTTCACTGCAGTTCCTTCCTCATA
GTTTGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGCTGCGTTCTTTCCTCCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGCT
GCGTTCCTTTCTCCAAGTTCGAAGGTTCTCACGAGCTTCGCTGCTGTTCCTTCCTCCAAGTTGGAAGGTTCTCACGAGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAG
TGTAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGCTGCGTTCTTTCCTCCCAAGTTCGAAGTGTAGTTCCTTCCTCGAAATTCGATTGTTCT
CACAAGCTTTGCTGCAGTTCCTTCTTCCAAGTTTGAAGGTTCTCACGAGCTTCGCTGTTGTTCCTTCCTCCAAGTTGGAAGGTTCTCACGAGCTTCGCTGCAGTTCCTTC
CTCCAAATTCGAAGGTTCTCACACGCTTCACTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCTTCACACGTCGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAAG
GTTCTCACGTCGCGCTTCCTTCCTCAATGTTCGAAGGTTCTCACGCGCTTTTCGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAAGGTTCTCACGCGCTTCACGCGCTTC
GCTGTAGTTCCTTCCTCCAAGTTCGAAGGTTTTCACGTTTCACACGTCGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAAGGTTCTCACATCGCTCTTCCTTCCTCAATG
TTTGAAGGTTCTCACGCGCTTTTCGCTTTGTCGCGCTTCCCTTCTAAAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAAGTTCGAAGGTTCTCTCA
CGCGCTTTTTCGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAAGGTTCTCACGCGCTTTTCGCTTCGTCGCACTTCCTTCCTCAAAGTTTGAAGGTTCTCACGCGCTTTT
CGCTTCGCTGCGTTCTTTCCTCCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCACTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCCGCTTCGTTGCAGT
TCCTTCACGTTTCACACGTCGCTTCGCTGTGTTCCTTCCTCCAAGTTAGAAGGTTCTCACGCCGCTTCGCTGCAGTTCCTTCCTCATAGTTTGAAGTTCCTTCCTCCAAG
TTCGAAGGTTCTCACGAGCTTCGCTGCAGTTCCTTCTTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTTCACAC
ATTGCTTCGCCGCTTTCTTTCCTCCCAAGTTAGAAGGTTCTCACGTTTTACACGTCGCTTCGCTGCGTTCTTTCCTCCTAATTTCGAAGGTTCTCACGTTTCACACGTCG
CTTCGCTGCGTTCCTTTCTCCAAGTTCGAAGGTTCTCACGCCACTTGTTCTCACGCGCTTCGCTGTAGTGTAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGAGCTTCG
CGGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAAGGTTCTCACGTCGCGCTTCCTTCCTCAAAGT
TCGAAGGTTCTCACGCGCTTTTCGCTTCGTCGCGCTGCTTCCTTCCTCAAAGTTCGAAGGTTCTCACGCGCTTTTCGCTTCGTCGCGCAGCTTCCTTCCTCAAAGTTCGA
AGGTTCTCACGCGCTTCCTTCCCCGCAAAGTTCGAAGGTTCTCTCACCCGCTTTTCGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAATGTTCTCACGCGCTTTTCGCTT
CGCTGCGTTCTTTCCTCCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCACTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGAGCTTCGCTGCAGTTCCTTC
TTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCTCGAAGGTTCTAACGTTTCACACATTGCTTCGCTGCGTTCTTTCCTCCCAAGTTAGA
AGGTTCTCACGTTTTACACGTCGCTTCGCTGCGTTCCTTTCTCCAAGTTCGAAGTTCCTTCCTCAAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGTCGCGCTTC
CTTCCTCAAAGTTCGAAGGTTCTCACGTGCTTTTCGCTGTAGTTCCTTCCTCAAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGTCGCGCTTCCTTCCTCAAAGT
TCGAAGGTTCTCACGTGCTTTTCGCTTTGTCGCGCTGCTTCCTTCCTCAAAGTTCGAAGGTTCTCACGCGCTTCCTTCCTCAAAGTTTGAAGGTTCTCTCACCCGCTTTT
CGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAAGGTTCTCTCACCCGCTTTTCGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAAGGTTCTCTCACCCGCTTTTCGCTTC
GTCGCGCTTCCTTCCTCAAAGTTCGAAGGTTCTCACCGTGCTTTCCAGTTCCTTCCTCAAAGTTCGAAGGTTCTCACGTGCTTTCCAGTTCCTTCCTCAAAGTTCGAAGG
TTCTCACGCGCTTTCCAGTTCCTTCCTCAAAGTTCGAAGGTGCTACGCTGTAG
mRNA sequence
ATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTATATTCCAAACCTTATACTAAGAGGATTGATAACTTGAGAATGACAATCGGGTA
TCAGCCACCAAAATTTCAGCAATTCGATGGAAGGGGCAATCCTAAACAACATATTGCTCACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGATCTACTAGTCA
AACAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCAATAGACAGTTGGGAGAAACTCGAAAGAGAGTTCTTGAATCGCTTT
TACAGCACTAGAAGAACCGTTAGGATGTTCGAGCTCACCAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCCATAAGTCTAGATTG
CAAAGATCGTCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTTTACATCCTTAAAGGTATAAAGCCTCACACCTTTGAGGAGCTAG
CAACTCACGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTACTCCCTAATGTGAGGAATGAAGGAAGGAACGACGAAGAGACTATAGAAGAG
TCCATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCAAAAGAAAAACGACAAACAAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTT
CCCTGATGCCGACATCCCTTATATGCTGGAACAATTACTGGAAGCGCAACTGATAGAGCTTCCTAAGTGCAAAATTGAGCTCGACCTTGATGAAGTAGCCCAATCAAATC
TTGCTACAATCAAAGGAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATG
TTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGGAAAAAGAGAACCTTGCAACTTTCTACTGCATCGATGTAGAAGAAGTTGACAATTCCAAGAAAAGTGAACAAAG
GACTTCCGTCTTCGATCACATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGTTGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCA
CTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTAAACCTTCAACATCTATTTTTGATCGCCTCAAAGTAACAAACAGTCAATCTAAAAGAAAG
ATGGATAACTTGGAGATGAAACTTTTTGATGAAGTAAACAACGACAAGAAGCCTCAAAGTAGCATCCTGTCACTTCCTACTCTCAAAAGTTCAAGGGTCCTTACACTGTA
CGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTCTATGACTGCTACGTTGTTCC
TCTTCCAAGTCCGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTCGCTGTAGTTCCTTCTCCCCAAGTTCGA
AGGTTCTCACGCGCTTCGTTGCAGTTCCTTCTCTCCAAGGTCGAAGGTTCTCACGCGCTGCGTTGCAGTTCTTTCTCCCAAAGTTCGAAGGTTCACGCACTTCGCTGCAG
TTCCTTCTCCCAAATTCGAACGTTCTCACGCGTTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTTTATGTTGCTGTAGTTCATTCTTCCAAGTTCGAAGGTTCTCAC
ATCACTTTGTTGCAGTTCCTTCTTCCAAGTTCGAAGGTTCTCACCGTTTCGCTGCAGTTCCTTCCCCCAAAATTCAAAGGTCCTCACGCCGCTTCGCTCCAGTTCCTTCC
TCCAAGTTCGAAGGTTCTCACGAGCTTCGATGCAATTCCTTCTTCCAAGTTCGAAGGTTCTCACGCGTTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGAG
CTTCGCTGCAGTTCCTTCCTCCATGTTCGAAGGTTCTCACGCGCTTGCTGCAGTTCCTTCCTCCAAGTTCGAGGGTGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCA
AGTTTGAGGGTTCACATGCATTTTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTC
GCTGCAGTTCCTTCCTCCATGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGTTGCGTTCTTTCC
TCCCAAGTTCAAAGGTTCTCACGTTTCACACGTCGCTTCGCTGCGTTGTTTCCTCCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGTTTCACTGCGTTCCTTTCTCCA
AGTTCGAAGGTTCTCACGTCGCTTGTTCTAACGCGCTTCGCTGTAGTGTAGTTCCTTCCTCCAAATTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGC
TGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGCTACGTTCTTTCCTCCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGCTGCA
TTCTTTCCTCCCAAGTTCGAAGGTTCTAACGTTTCACACATCGCTTCGCTACGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCCGCTTCACTGCAGTTCCTTCCTCATA
GTTTGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGCTGCGTTCTTTCCTCCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGCT
GCGTTCCTTTCTCCAAGTTCGAAGGTTCTCACGAGCTTCGCTGCTGTTCCTTCCTCCAAGTTGGAAGGTTCTCACGAGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAG
TGTAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGCTGCGTTCTTTCCTCCCAAGTTCGAAGTGTAGTTCCTTCCTCGAAATTCGATTGTTCT
CACAAGCTTTGCTGCAGTTCCTTCTTCCAAGTTTGAAGGTTCTCACGAGCTTCGCTGTTGTTCCTTCCTCCAAGTTGGAAGGTTCTCACGAGCTTCGCTGCAGTTCCTTC
CTCCAAATTCGAAGGTTCTCACACGCTTCACTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCTTCACACGTCGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAAG
GTTCTCACGTCGCGCTTCCTTCCTCAATGTTCGAAGGTTCTCACGCGCTTTTCGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAAGGTTCTCACGCGCTTCACGCGCTTC
GCTGTAGTTCCTTCCTCCAAGTTCGAAGGTTTTCACGTTTCACACGTCGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAAGGTTCTCACATCGCTCTTCCTTCCTCAATG
TTTGAAGGTTCTCACGCGCTTTTCGCTTTGTCGCGCTTCCCTTCTAAAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAAGTTCGAAGGTTCTCTCA
CGCGCTTTTTCGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAAGGTTCTCACGCGCTTTTCGCTTCGTCGCACTTCCTTCCTCAAAGTTTGAAGGTTCTCACGCGCTTTT
CGCTTCGCTGCGTTCTTTCCTCCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCACTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCCGCTTCGTTGCAGT
TCCTTCACGTTTCACACGTCGCTTCGCTGTGTTCCTTCCTCCAAGTTAGAAGGTTCTCACGCCGCTTCGCTGCAGTTCCTTCCTCATAGTTTGAAGTTCCTTCCTCCAAG
TTCGAAGGTTCTCACGAGCTTCGCTGCAGTTCCTTCTTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTTCACAC
ATTGCTTCGCCGCTTTCTTTCCTCCCAAGTTAGAAGGTTCTCACGTTTTACACGTCGCTTCGCTGCGTTCTTTCCTCCTAATTTCGAAGGTTCTCACGTTTCACACGTCG
CTTCGCTGCGTTCCTTTCTCCAAGTTCGAAGGTTCTCACGCCACTTGTTCTCACGCGCTTCGCTGTAGTGTAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGAGCTTCG
CGGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAAGGTTCTCACGTCGCGCTTCCTTCCTCAAAGT
TCGAAGGTTCTCACGCGCTTTTCGCTTCGTCGCGCTGCTTCCTTCCTCAAAGTTCGAAGGTTCTCACGCGCTTTTCGCTTCGTCGCGCAGCTTCCTTCCTCAAAGTTCGA
AGGTTCTCACGCGCTTCCTTCCCCGCAAAGTTCGAAGGTTCTCTCACCCGCTTTTCGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAATGTTCTCACGCGCTTTTCGCTT
CGCTGCGTTCTTTCCTCCCAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCACTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGAGCTTCGCTGCAGTTCCTTC
TTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCTCGAAGGTTCTAACGTTTCACACATTGCTTCGCTGCGTTCTTTCCTCCCAAGTTAGA
AGGTTCTCACGTTTTACACGTCGCTTCGCTGCGTTCCTTTCTCCAAGTTCGAAGTTCCTTCCTCAAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGTCGCGCTTC
CTTCCTCAAAGTTCGAAGGTTCTCACGTGCTTTTCGCTGTAGTTCCTTCCTCAAAGTTCGAAGGTTCTCACGTTTCACACGTCGCTTCGTCGCGCTTCCTTCCTCAAAGT
TCGAAGGTTCTCACGTGCTTTTCGCTTTGTCGCGCTGCTTCCTTCCTCAAAGTTCGAAGGTTCTCACGCGCTTCCTTCCTCAAAGTTTGAAGGTTCTCTCACCCGCTTTT
CGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAAGGTTCTCTCACCCGCTTTTCGCTTCGTCGCGCTTCCTTCCTCAAAGTTCGAAGGTTCTCTCACCCGCTTTTCGCTTC
GTCGCGCTTCCTTCCTCAAAGTTCGAAGGTTCTCACCGTGCTTTCCAGTTCCTTCCTCAAAGTTCGAAGGTTCTCACGTGCTTTCCAGTTCCTTCCTCAAAGTTCGAAGG
TTCTCACGCGCTTTCCAGTTCCTTCCTCAAAGTTCGAAGGTGCTACGCTGTAG
Protein sequence
MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMTIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEKLEREFLNRF
YSTRRTVRMFELTNTKQRKGELVVNYINRWRAISLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPHTFEELATHAHDMELSIASRENQDLLLPNVRNEGRNDEETIEE
SMVVNTTLPKSSSKEKRQTNGAHHLTLKERQKKIYPFPDADIPYMLEQLLEAQLIELPKCKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVM
LNKSFSKTFHKKEKENLATFYCIDVEEVDNSKKSEQRTSVFDHIKPPTTRPSVFQRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSKPSTSIFDRLKVTNSQSKRK
MDNLEMKLFDEVNNDKKPQSSILSLPTLKSSRVLTLYAIALFLLQVRRFFVVSCCVVPSPSSRVLSSMTATLFLFQVRRILCGALLHCSLFSQVRWFSRSFAVVPSPQVR
RFSRASLQFLLSKVEGSHALRCSSFSQSSKVHALRCSSFSQIRTFSRVSLQFLPPSSKVLCCCSSFFQVRRFSHHFVAVPSSKFEGSHRFAAVPSPKIQRSSRRFAPVPS
SKFEGSHELRCNSFFQVRRFSRVSLQFLPPSSKVLTSFAAVPSSMFEGSHALAAVPSSKFEGVLTRFAAVPSSKFEGSHAFCCSSFLQVRRFSCASLQFLPPSSKVLTRF
AAVPSSMFEGSHALRCSSFLQVRRFSRFTRRFVAFFPPKFKGSHVSHVASLRCFLPSSKVLTFHTSFHCVPFSKFEGSHVACSNALRCSVVPSSKFEVPSSKFEGSHALR
CSSFLQVRRFSRFTRRFATFFPPKFEGSHVSHVASLHSFLPSSKVLTFHTSLRYVPSSKFEGSHAASLQFLPHSLKFLPPSSKVLTFHTSLRCVLSSQVRRFSRFTRRFA
AFLSPSSKVLTSFAAVPSSKLEGSHELRCSSFLQVRSVVPSSKFEGSHVSHVASLRSFLPSSKCSSFLEIRLFSQALLQFLLPSLKVLTSFAVVPSSKLEGSHELRCSSF
LQIRRFSHASLQFLPPSSKVLTLHTSLRRASFLKVRRFSRRASFLNVRRFSRAFRFVALPSSKFEGSHALHALRCSSFLQVRRFSRFTRRFVALPSSKFEGSHIALPSSM
FEGSHALFALSRFPSKSSKVLTRFAAVPSSKVRRFSHALFRFVALPSSKFEGSHALFASSHFLPQSLKVLTRFSLRCVLSSQVRRFSRFTRRFTAFLPPSSKVLTPLRCS
SFTFHTSLRCVPSSKLEGSHAASLQFLPHSLKFLPPSSKVLTSFAAVPSSKFEGSHVLRCSSFLQVRRFSRFTHCFAAFFPPKLEGSHVLHVASLRSFLLISKVLTFHTS
LRCVPFSKFEGSHATCSHALRCSVVPSSKFEGSHELRGSSFLQVRRFSRFTRRFVALPSSKFEGSHVALPSSKFEGSHALFASSRCFLPQSSKVLTRFSLRRAASFLKVR
RFSRASFPAKFEGSLTRFSLRRASFLKVRMFSRAFRFAAFFPPKFEGSHVSHVASLRSFLQVRRFSRASLQFLLPSSKVLTRFAAVPSSKFSKVLTFHTLLRCVLSSQVR
RFSRFTRRFAAFLSPSSKFLPQSSKVLTFHTSLRRASFLKVRRFSRAFRCSSFLKVRRFSRFTRRFVALPSSKFEGSHVLFALSRCFLPQSSKVLTRFLPQSLKVLSPAF
RFVALPSSKFEGSLTRFSLRRASFLKVRRFSHPLFASSRFLPQSSKVLTVLSSSFLKVRRFSRAFQFLPQSSKVLTRFPVPSSKFEGATL