; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018194 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018194
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr5:18605629..18612224
RNA-Seq ExpressionLag0018194
SyntenyLag0018194
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]2.5e-13442.32Show/hide
Query:  MADPNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD
        MA+ ++   PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+IC TVKINGV+ED IRLRLFPFSL+DKAR 
Subjt:  MADPNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKT
        WLQS+ PGSI +W  +   FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+ GTL+SKT
Subjt:  WLQSITPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVD--KSIESAVALASR----------PQ---------------EETIEQVQYVSN------FNSTP
         E A  LLE+MA+N+YQWP+ER+  KK VAG+ E++   ++ + VA  S           PQ               E + EQVQYV+N       N  P
Subjt:  VENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVD--KSIESAVALASR----------PQ---------------EETIEQVQYVSN------FNSTP

Query:  THYHPNNRNHENFSYANNKNVL---NPPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP
         +YHP  RNHEN SY N KNVL   +PPGF  Q  E K  LED + +F+ E++ R  K +  +  I T  +   A +KN+E Q+GQL   ++   +G  P
Subjt:  THYHPNNRNHENFSYANNKNVL---NPPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP

Query:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPIPSPTVLVPKERKKKRRKRT--------
        +  E    E CKAIT+   +E E  P  E   TPT    G++++    +E     LE             PPI +P +  P+  +K++  +         
Subjt:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPIPSPTVLVPKERKKKRRKRT--------

Query:  -------------------IRIMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS-----FD----------------------IGE
                           ++ +K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DP SF++PC+     FD                      +GE
Subjt:  -------------------IRIMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS-----FD----------------------IGE

Query:  IKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIFKAVEDSK
        +K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGR +ID+++ ELT+RV  E+ +F   +  K
Subjt:  IKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIFKAVEDSK

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]3.0e-13542.46Show/hide
Query:  MADPNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD
        MA+ ++   PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+IC TVKINGV+ED IRLRLFPFSL+DKAR 
Subjt:  MADPNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKT
        WLQS+ PGSI +W  +   FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+ GTL+SKT
Subjt:  WLQSITPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVD--KSIESAVALASR----------PQ---------------EETIEQVQYVSN------FNSTP
         E A  LLE+MA+N+YQWP+ER+  KK VAG+ E++   ++ + VA  S           PQ               E + EQVQYV+N       N  P
Subjt:  VENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVD--KSIESAVALASR----------PQ---------------EETIEQVQYVSN------FNSTP

Query:  THYHPNNRNHENFSYANNKNVL---NPPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP
         +YHP  RNHEN SY N KNVL   +PPGF  Q  E K  LED + +F+ E++ R  K +  +  I T  +   A +KN+E Q+GQL   ++   +G  P
Subjt:  THYHPNNRNHENFSYANNKNVL---NPPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP

Query:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPIPSPTVLVPKERKKKRRKRT--------
        +  E    E CKAIT+   +E E  P  E   TPT    G++++    +E     LE             PPI +P +  P+  +K++  +         
Subjt:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPIPSPTVLVPKERKKKRRKRT--------

Query:  -------------------IRIMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS-----FD----------------------IGE
                           ++ +K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+     FD                      +GE
Subjt:  -------------------IRIMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS-----FD----------------------IGE

Query:  IKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIFKAVEDSK
        +K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGR +ID+++ ELT+RV  E+ +F   +  K
Subjt:  IKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIFKAVEDSK

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]7.9e-13643.13Show/hide
Query:  MADPNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD
        MA+ ++   PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+IC TVKINGV+ED IRLRLFPFSL+DKAR 
Subjt:  MADPNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKT
        WLQS+ PGSI +W  +   FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+ GTL+SKT
Subjt:  WLQSITPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVD--KSIESAVALASR----------PQ---------------EETIEQVQYVSN------FNSTP
         E A  LLE+MA+N+YQWP+ER+  KK VAG+ +++   ++ + VA  S           PQ               E + EQVQYV+N       N  P
Subjt:  VENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVD--KSIESAVALASR----------PQ---------------EETIEQVQYVSN------FNSTP

Query:  THYHPNNRNHENFSYANNKNVL---NPPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP
         +YHP  RNHEN SY N KNVL   +PPGF  Q  E K  LED + +F+ E++ R  K +  +  I T  +   AAIKNIE Q+GQL   ++   +G  P
Subjt:  THYHPNNRNHENFSYANNKNVL---NPPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP

Query:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPIPSPTVLVPKERKKKRRKRT--------
        +  E    E CKAIT+   +E E  P  E   TPT    G+++     DE     LE             PPI +P +  P+  +K++  +         
Subjt:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSSDEAEKPNLE-------------PPIPSPTVLVPKERKKKRRKRT--------

Query:  -------------------IRIMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSF----------DIG-----------------E
                           ++ +K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+           D+G                 E
Subjt:  -------------------IRIMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSF----------DIG-----------------E

Query:  IKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIFK
        +K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGR +ID+++ ELT+RV  E+ +FK
Subjt:  IKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIFK

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]1.1e-13242.3Show/hide
Query:  MADPNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD
        MA+  Q  +PR ++DY +P+     SGI    INANNFELK  LI M +   + GSP +DPN HL  FL+IC T+K+NGV+ED IRLRLFPFSL+DKAR 
Subjt:  MADPNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKT
        WLQS+ PGSIT+W  +   FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R CPQHG PDWLQVQ+FYNGL   T+TIVDAA+ GTL+SKT
Subjt:  WLQSITPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVD--KSIESAVALASR----------PQ---------------EETIEQVQYVSN------FNSTP
         E A +LLE+MA+N+YQWP+ER+  KK VAG+ E++   ++ + VA  S           PQ               E + EQVQY++N       N  P
Subjt:  VENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVD--KSIESAVALASR----------PQ---------------EETIEQVQYVSN------FNSTP

Query:  THYHPNNRNHENFSYANNKNVLN-PPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE
         +YHP  RNHENFSY N KNVL  PPGF  Q  E K  LED + +F+ E+     K +  +  I T  +   A +KN+E Q+GQL   ++   +G  P+ 
Subjt:  THYHPNNRNHENFSYANNKNVLN-PPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE

Query:  QEKPQMEYCKAITVHQ-EESEEEPESEDYETPTG-------------EAEEDTSSDEAEKPNL----EPPIPSPTVLVPKERKKKRRKRTI---------
         E    E CKAIT+    E E  P  E   TPT              E  EDT  +    P++     PPI S  +  P+  +K++  +           
Subjt:  QEKPQMEYCKAITVHQ-EESEEEPESEDYETPTG-------------EAEEDTSSDEAEKPNL----EPPIPSPTVLVPKERKKKRRKRTI---------

Query:  ------------------RIMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS-----FD----------------------IGEIK
                          + +K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+     FD                      +GE+K
Subjt:  ------------------RIMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS-----FD----------------------IGEIK

Query:  STPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIFKAVEDSK
         T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGR ++D+++ ELT+RV  E+  F   E  K
Subjt:  STPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIFKAVEDSK

XP_023929660.1 uncharacterized protein LOC112040975 [Quercus suber]2.8e-13343.48Show/hide
Query:  MADPNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD
        MA+  Q  +PR ++DY +P+     SGI +  INANNFEL   LI M +   + GSP +DPN HL  FL+IC  VK+NGV+ED IRLRLFPFSL+DKAR 
Subjt:  MADPNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKT
        WLQS+ PGSIT+W  +   FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R CPQHG  DWLQVQ+FYNGL   T+TIVDAA+ GTL+SKT
Subjt:  WLQSITPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVDKSIESAVALASR------------PQ---------------EETIEQVQYVSN------FNSTP
         E A +LLE+MA+N YQWP+ER+  KK VAG+ E++     A  +AS             PQ               E + E VQY++N       N  P
Subjt:  VENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVDKSIESAVALASR------------PQ---------------EETIEQVQYVSN------FNSTP

Query:  THYHPNNRNHENFSYANNKNVLN-PPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE
         +YHP  RNHENFSY N KNVL  PPGF  Q  E K  LED + +F+ E+  R  K +  +  I T  +   A +KN+E Q+GQL   ++   +G  P+ 
Subjt:  THYHPNNRNHENFSYANNKNVLN-PPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE

Query:  QEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEA-----EEDTSSDEAEKPNLEPPIPSP-------TVLVPKERKKKRRKRTIRIMKEWLAKK
         E    E CKAIT+    E E  P  E   TPT    G++     EE+   D   + ++ P I  P       T L   +   ++    ++ +K+ ++KK
Subjt:  QEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEA-----EEDTSSDEAEKPNLEPPIPSP-------TVLVPKERKKKRRKRTIRIMKEWLAKK

Query:  RKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS-----FD----------------------IGEIKSTPVKLQLADQSVVRPVGIVENVLIRVG
        R+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+     FD                      +GE+K T + LQLA++S+  P GI+E+VL++V 
Subjt:  RKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCS-----FD----------------------IGEIKSTPVKLQLADQSVVRPVGIVENVLIRVG

Query:  RFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIFKAVEDSK
        +F  P D  V+DM E+  +P+ILGRPFLATGR ++D+++ ELT+RVK E+ +F   E  K
Subjt:  RFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIFKAVEDSK

TrEMBL top hitse value%identityAlignment
A0A1S3UKD4 uncharacterized protein LOC1067662676.6e-9637.35Show/hide
Query:  QPEEPR-PIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQS
        + + PR  IRDY  P   G Q  IV  PI ANNFE+K  L+Q+ +   + G+ +EDPNSHL++FL IC T+K NGVS+DAI LRLFPFSL+DKA++WLQS
Subjt:  QPEEPR-PIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQS

Query:  ITPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKTVENA
        +  GSI+TW+ +   F+ K+FPP+K+ K+R EI +F QQ +E L+EAWER+KEL+RKCP H  P+WLQVQ FYNGL+P+ K ++DAA+ G+ + KT E A
Subjt:  ITPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKTVENA

Query:  RTLLEDMATNSYQWPSERSTPKKIVAGVFEVDKSIESAVALASRPQEETIEQ-VQYVSNFNSTPTHY------HPN-------------------NRNHE
           LE MA N+     +R   K  V  V  +D  +     L  +  + T +  +   +N N+    Y      H N                   N+   
Subjt:  RTLLEDMATNSYQWPSERSTPKKIVAGVFEVDKSIESAVALASRPQEETIEQ-VQYVSNFNSTPTHY------HPN-------------------NRNHE

Query:  NFSY---------ANNKNVLNPP------GFAPQTQENK--KLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGK
        NF+           +N    NPP         P  Q NK   LE+ +     ++S    +    +    T      A+I+N+E Q+GQL   +S  + G 
Subjt:  NFSY---------ANNKNVLNPP------GFAPQTQENK--KLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGK

Query:  APAEQEKPQMEYCKAITVHQE---ESEEEPESE-DYETPTGEAEEDTSSDEAEKPNLEPPIPSPTVLVPKERKKKRRKRTIRIMKEWLAKKRKEKKVDTV
         P++      E CKAI +  E   ESE+  E E + +    E  E+++  E E+ +LE  +PS                  + MK+ L+KKRK ++ +T+
Subjt:  APAEQEKPQMEYCKAITVHQE---ESEEEPESE-DYETPTGEAEEDTSSDEAEKPNLEPPIPSPTVLVPKERKKKRRKRTIRIMKEWLAKKRKEKKVDTV

Query:  YLASTCSTRVQQKVPEKVADPGSFSVPCS---------------------------FDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLY
         L   CS  +QQK+P K+ DP SF +PC                              IG++K T + LQLAD+S+  P GIVE+VL++V +F  P D  
Subjt:  YLASTCSTRVQQKVPEKVADPGSFSVPCS---------------------------FDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLY

Query:  VMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIFKAVE
        V+DM E+  +P+ILGRPFLATGR +ID+E+  L +RV +EK  F   E
Subjt:  VMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIFKAVE

A0A1U7Z951 uncharacterized protein LOC1045905688.6e-9635.86Show/hide
Query:  ADPNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD
        A+  +  E R + DY +P   G  S IV   I ANNF++K  +IQM ++   + G   EDPN+H+ +FL+IC T K NGVS+D +RLRLFPFSL+DK + 
Subjt:  ADPNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKT
        WL S+   SI+TWD +   FL K+FPP+K  K+R +I TF QQ  E L+E+WER+KELLRK P HG P WLQVQ FYN LT + KTI+DA A G++ +KT
Subjt:  WLQSITPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTPKK------------IVAGVFEVDKSIESAVALASRPQEETIE----------------------QVQYVSNF---
         E A  L+E+M  N+YQW SER+  ++            + A +  + K I       +  Q    E                      QV +V N    
Subjt:  VENARTLLEDMATNSYQWPSERSTPKK------------IVAGVFEVDKSIESAVALASRPQEETIE----------------------QVQYVSNF---

Query:  --NSTPTHYHPNNRNHENFSYANNKNVLNPPGFAPQTQENKKLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGK
          N  P  ++P  RNH N S+ N   V   P    Q +    LE+L+  FIA +  R    E              A+IKN+ETQ+GQL  ++S+  +G 
Subjt:  --NSTPTHYHPNNRNHENFSYANNKNVLNPPGFAPQTQENKKLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGK

Query:  APAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTVLVPKERKKKRRKRTIRIMKEWLAKKRKEKKVDTVYLAS
         P+  E    E  +AIT+   +  +E E +         EED     A+ P+                         + +KE LA K K   V TV +  
Subjt:  APAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTVLVPKERKKKRRKRTIRIMKEWLAKKRKEKKVDTVYLAS

Query:  TCSTRVQQKVPEKVADPGSFSVPCS---------------------------FDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDM
         CS  +  K+P+K+ DPGSF++PC+                             +GE + T V LQL D+S+  P GI+E+VL++V +F  P+D  V+DM
Subjt:  TCSTRVQQKVPEKVADPGSFSVPCS---------------------------FDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDM

Query:  IENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIFKAVEDSK
         E+  +P+ILGRPFLATG+  +D+++ +L++++++E+ IFK  +  K
Subjt:  IENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIFKAVEDSK

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129452.8e-11032.52Show/hide
Query:  PEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSI
        PE  R +RDY  P+ QG    I    INANNFE+K   IQM +    + G P++DPNSHL +FL+IC T K NGV++DAIRLRLFPFSL+DKA+ WL S+
Subjt:  PEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSI

Query:  TPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKTVENAR
          GSITTW+ L   FL KFFPPAKT K+R +I +F Q   E L+EAWERFKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAA G L+SK   +A 
Subjt:  TPGSITTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKTVENAR

Query:  TLLEDMATNSYQWPSERSTPKKIVAGVFEVDK--SIESAVALASRPQEE---------------------------TIEQVQYVSNFNSTPTH-----YH
         LLE+MA+N+YQWPSERS  +K V G +E+D   ++ + VA  S+  +                              E VQ+V NFN    +     Y+
Subjt:  TLLEDMATNSYQWPSERSTPKKIVAGVFEVDK--SIESAVALASRPQEE---------------------------TIEQVQYVSNFNSTPTH-----YH

Query:  PNNRNHENFSYANNKNVLN-----PPGF----APQTQENK-KLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGK
        P  RNH NFS++NN    N     PPGF     PQ  E K +LE+L+  +I+++              +  +    A+++N+ETQ               
Subjt:  PNNRNHENFSYANNKNVLN-----PPGF----APQTQENK-KLEDLVGAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGK

Query:  APAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTVLVPKERKKKRRKRTIRIMKEWLAKKRKEKKVDTVYLAS
                                                                                                            
Subjt:  APAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTVLVPKERKKKRRKRTIRIMKEWLAKKRKEKKVDTVYLAS

Query:  TCSTRVQQKVPEKVADPGSFSVPCSFDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERK
                                                                                                            
Subjt:  TCSTRVQQKVPEKVADPGSFSVPCSFDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERK

Query:  ELTIRVKNEKEIFKAVEDSKDEVLFMGYRKGLVFFLSFAFKLSRFQAIKQELIIIPVGQLASELKNRPVRTLPSNTKA-PKGKEQCHALTLCNGRQIALT
                                                                VGQLA+ + NRP  +LPS+T+  PKGKEQC A+TL +G++I   
Subjt:  ELTIRVKNEKEIFKAVEDSKDEVLFMGYRKGLVFFLSFAFKLSRFQAIKQELIIIPVGQLASELKNRPVRTLPSNTKA-PKGKEQCHALTLCNGRQIALT

Query:  GPAPTQEESTEAALSLNREPEHKGNAKVITTEPEKDEEKQPEKEPDTYKPPIHFLQKLRKRNDGAQFKKFLGVLRERHINIPLVEALEKVPGYAKFLKDI
             + E          E E       I  + + D++ + +       PP  F Q+L+K+    QF+KFL V ++ HINIP  EALE++P Y KFLKDI
Subjt:  GPAPTQEESTEAALSLNREPEHKGNAKVITTEPEKDEEKQPEKEPDTYKPPIHFLQKLRKRNDGAQFKKFLGVLRERHINIPLVEALEKVPGYAKFLKDI

Query:  LSNKRPWKEFETVALTNEWSTILASKIPPKLKDPRSFTIPCSIGGI-------DVGRALY--------------------------QSVVHPEGKIEDVL
        LS KR   EFETV LT E S IL +K+PPKLKDP SFTIPC+IG +       D+G ++                           +S V+P G IEDVL
Subjt:  LSNKRPWKEFETVALTNEWSTILASKIPPKLKDPRSFTIPCSIGGI-------DVGRALY--------------------------QSVVHPEGKIEDVL

Query:  VKVDKFIFPADFIILNYEADKDVPIILGRPFLATGQTLIDV
        VKVDKFIFP DF+IL+ E D+ +PIILGRPFLAT   +IDV
Subjt:  VKVDKFIFPADFIILNYEADKDVPIILGRPFLATGQTLIDV

A0A6J1DU19 uncharacterized protein LOC1110243611.5e-11143.87Show/hide
Query:  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITT
        IRDY QP F     GI+  PINANN ELK GLIQM R+  +RG+ TEDPN+HL  FLD+CGTVK+NGV +DAIRLRLFP SLQDK               
Subjt:  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITT

Query:  WDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKTVENARTLLEDMA
           +V AFL  FFPPAKT +LRTEI +F++   EQLFE WER+KELLRKCPQHG  +WLQ+Q+FYNGL   T+TI+DAAA GTLLS+T ENA  LL+DMA
Subjt:  WDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKTVENARTLLEDMA

Query:  TNSYQWPSERSTPKKIVAGVFEVDKSIESAVALASRPQEETIEQVQYVSNFNST---PTHYHPNNRNHENFSYANNKNVLNPPGFAPQTQENK-KLEDLV
         NS+QWPSERS  KK VAG++E+D+ + S  A           QVQ ++N  S    P   H N       +Y+  +  +    F     E K  LEDL+
Subjt:  TNSYQWPSERSTPKKIVAGVFEVDKSIESAVALASRPQEETIEQVQYVSNFNST---PTHYHPNNRNHENFSYANNKNVLNPPGFAPQTQENK-KLEDLV

Query:  GAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSSDE
        GAFI E  +R +++E  V  +   + G++ +IKN+E Q+GQ+   ++TM KGK P++ E    E+CKA+T+   +  +EPE +  E P    EE  + +E
Subjt:  GAFIAESSNRTTKLEEGVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSSDE

Query:  ----------AEKPNLEPPIPSPTVLVPKERKKKRRKRTIRIMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFDIGEIKSTPVK
                  A+KP        P  L   +   ++    +R MK+ +  KRK +  +TV L   CS  +Q+K+P+K+ DPGSF++PC+     I S+   
Subjt:  ----------AEKPNLEPPIPSPTVLVPKERKKKRRKRTIRIMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFDIGEIKSTPVK

Query:  LQLADQSV---VRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIF
          L D      + P+G++E+VL++V R   P D  V+   E+  +P+ILGR FLATG  +ID++   LT+RV  E  +F
Subjt:  LQLADQSV---VRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIF

A0A6P6XAQ1 Reverse transcriptase3.2e-11440.74Show/hide
Query:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSI
        R +RD+  P  QG Q+ IV   +NANNFE+K  LIQM +   Y G+ TEDPNSHL +FL+IC T+K NGVSEDAI+LRLFPFSL+DKA+ WLQS  P + 
Subjt:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSI

Query:  TTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKTVENARTLLED
        TTWD L  AFL KFFPP KT KLR +I +F QQ  E L+EAWER++EL R+CP HG PDWL VQ FYNGLT  TKT VDAAA G L+ KT E A+ L+E+
Subjt:  TTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKTVENARTLLED

Query:  MATNSYQWPSERSTPKKIVAGVFEVD------KSIESAVALASRP------------------------QEETIEQVQYVSNFNSTPTH------YHPNN
        MA N+YQW +ER   ++  AG+ EVD        +++ V + +R                            + EQVQY++N+N  P +      Y+P  
Subjt:  MATNSYQWPSERSTPKKIVAGVFEVD------KSIESAVALASRP------------------------QEETIEQVQYVSNFNSTPTH------YHPNN

Query:  RNHENFSY---ANNKNVLNPPGFAPQ--TQENKKLEDLVGAFIAESSN-RTTKLEEGVIAINTTVNGHSAAI----KNIETQLGQLVNVVSTMNKGKAPA
        RNH NF +    N +  +NPPGF  +    E+K   +L    +A +SN +  KL          + G    +    +N+E QLGQ+ N V+  N+G  P+
Subjt:  RNHENFSY---ANNKNVLNPPGFAPQ--TQENKKLEDLVGAFIAESSN-RTTKLEEGVIAINTTVNGHSAAI----KNIETQLGQLVNVVSTMNKGKAPA

Query:  EQEKPQMEYCKAITVHQEESEEEP-------ESEDYETPTGEAEEDTSSDEAEKPNLE---------PPIPSPTVLVPKERKKKRRKRTIRIMKEWLAKK
        + E    E+ KAIT+   +   EP       E E  E       ++ S +E  K  +E          PIP P   +P   K          +KE + KK
Subjt:  EQEKPQMEYCKAITVHQEESEEEP-------ESEDYETPTGEAEEDTSSDEAEKPNLE---------PPIPSPTVLVPKERKKKRRKRTIRIMKEWLAKK

Query:  RKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSF----------DIG-----------------EIKSTPVKLQLADQSVVRPVGIVENVLIRVG
        RK    +T+ L   CS  +Q K+P K+ DPGSF+VPC+           D+G                 E+K T + LQLAD+S+  P+GI+ENVLI+V 
Subjt:  RKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSF----------DIG-----------------EIKSTPVKLQLADQSVVRPVGIVENVLIRVG

Query:  RFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIF
        +F +P+D  V+DM E+ ++P+ILGRPFLAT   IID++R +   ++  E+  F
Subjt:  RFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERKELTIRVKNEKEIF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGACCCAAATCAACCTGAGGAGCCTAGGCCTATTAGAGACTACTTTCAGCCCGTGTTTCAGGGGCAACAATCGGGGATTGTCTATGCCCCGATTAATGCCAACAA
CTTTGAGCTGAAGACCGGTCTCATTCAGATGGCTCGAGATTGTGCTTATAGAGGATCACCCACTGAGGATCCAAATTCTCATCTTAAATCATTCTTAGACATTTGTGGGA
CGGTAAAAATTAATGGAGTCTCTGAGGATGCTATTCGCTTACGTTTATTTCCCTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACCCCTGGGAGCATC
ACCACCTGGGATGCTTTGGTCCATGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCA
GTTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAATTGTTTTATAATGGTTTAACTCCGAGTA
CAAAAACGATTGTTGATGCAGCTGCAGATGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCT
GAGCGGTCTACACCCAAAAAGATTGTTGCTGGAGTGTTTGAGGTTGATAAATCAATTGAATCAGCTGTTGCTTTAGCATCTAGACCTCAGGAGGAGACCATTGAACAGGT
TCAGTATGTATCAAATTTTAATTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCCAATAATAAGAATGTTCTTAACCCTCCTGGGT
TTGCCCCTCAAACTCAAGAAAATAAAAAGTTAGAGGATCTTGTTGGAGCTTTCATTGCAGAGTCAAGTAACAGGACAACCAAACTAGAGGAGGGAGTCATTGCTATCAAC
ACCACGGTGAATGGCCACAGTGCAGCCATCAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGCACCATGAATAAAGGTAAGGCCCCAGCTGAACAAGA
GAAACCCCAGATGGAGTATTGTAAGGCAATCACTGTTCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACA
CATCATCTGATGAGGCTGAAAAGCCTAACCTTGAGCCTCCTATTCCTTCTCCCACAGTGTTGGTTCCCAAGGAAAGGAAAAAAAAAAGAAGAAAAAGAACAATCAGGATT
ATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAAGTTGACACTGTCTATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAAGTACCTGAAAAAGTAGCAGA
TCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGATATAGGTGAGATTAAGTCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAG
AAAATGTGTTAATCAGAGTAGGTAGATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGGACGACCATTCCTCGCC
ACTGGGCGAGTGATTATAGATATTGAGCGCAAGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAGATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGTTGTTCAT
GGGCTACAGGAAAGGTCTTGTTTTCTTTCTCTCCTTTGCTTTCAAGCTTTCAAGATTCCAAGCTATCAAGCAAGAACTCATCATCATTCCAGTGGGACAACTTGCGTCAG
AATTAAAGAACAGACCAGTCAGGACATTGCCAAGCAATACAAAAGCTCCAAAAGGAAAAGAGCAATGTCACGCATTGACTCTCTGCAATGGCAGACAGATCGCACTAACT
GGCCCTGCACCAACGCAAGAAGAATCAACCGAAGCTGCGCTCAGCCTCAACCGTGAACCTGAGCACAAAGGTAACGCAAAGGTGATTACAACAGAGCCTGAGAAGGACGA
AGAGAAGCAACCAGAAAAAGAACCTGACACATACAAGCCACCGATACACTTTCTGCAGAAGTTAAGAAAGAGAAATGACGGAGCACAGTTCAAGAAATTTCTTGGTGTCT
TGCGAGAGCGTCACATCAATATTCCTTTGGTTGAAGCATTAGAAAAAGTTCCCGGTTATGCAAAATTCTTGAAGGATATTCTATCCAATAAAAGACCCTGGAAGGAATTC
GAAACGGTTGCGCTAACAAACGAATGGAGCACAATATTGGCGAGTAAAATTCCCCCAAAGTTGAAGGATCCTAGGAGTTTCACCATCCCATGCTCGATTGGAGGAATTGA
TGTAGGAAGGGCATTATACCAATCTGTAGTGCATCCCGAAGGAAAGATTGAAGATGTTCTGGTAAAAGTTGATAAGTTTATTTTTCCAGCTGACTTTATTATTCTCAACT
ATGAAGCAGATAAGGATGTCCCAATCATTTTGGGAAGACCATTTTTAGCAACTGGACAGACACTGATTGACGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGACCCAAATCAACCTGAGGAGCCTAGGCCTATTAGAGACTACTTTCAGCCCGTGTTTCAGGGGCAACAATCGGGGATTGTCTATGCCCCGATTAATGCCAACAA
CTTTGAGCTGAAGACCGGTCTCATTCAGATGGCTCGAGATTGTGCTTATAGAGGATCACCCACTGAGGATCCAAATTCTCATCTTAAATCATTCTTAGACATTTGTGGGA
CGGTAAAAATTAATGGAGTCTCTGAGGATGCTATTCGCTTACGTTTATTTCCCTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACCCCTGGGAGCATC
ACCACCTGGGATGCTTTGGTCCATGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCA
GTTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAATTGTTTTATAATGGTTTAACTCCGAGTA
CAAAAACGATTGTTGATGCAGCTGCAGATGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCT
GAGCGGTCTACACCCAAAAAGATTGTTGCTGGAGTGTTTGAGGTTGATAAATCAATTGAATCAGCTGTTGCTTTAGCATCTAGACCTCAGGAGGAGACCATTGAACAGGT
TCAGTATGTATCAAATTTTAATTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCCAATAATAAGAATGTTCTTAACCCTCCTGGGT
TTGCCCCTCAAACTCAAGAAAATAAAAAGTTAGAGGATCTTGTTGGAGCTTTCATTGCAGAGTCAAGTAACAGGACAACCAAACTAGAGGAGGGAGTCATTGCTATCAAC
ACCACGGTGAATGGCCACAGTGCAGCCATCAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGCACCATGAATAAAGGTAAGGCCCCAGCTGAACAAGA
GAAACCCCAGATGGAGTATTGTAAGGCAATCACTGTTCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACA
CATCATCTGATGAGGCTGAAAAGCCTAACCTTGAGCCTCCTATTCCTTCTCCCACAGTGTTGGTTCCCAAGGAAAGGAAAAAAAAAAGAAGAAAAAGAACAATCAGGATT
ATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAAGTTGACACTGTCTATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAAGTACCTGAAAAAGTAGCAGA
TCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGATATAGGTGAGATTAAGTCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAG
AAAATGTGTTAATCAGAGTAGGTAGATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGGACGACCATTCCTCGCC
ACTGGGCGAGTGATTATAGATATTGAGCGCAAGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAGATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGTTGTTCAT
GGGCTACAGGAAAGGTCTTGTTTTCTTTCTCTCCTTTGCTTTCAAGCTTTCAAGATTCCAAGCTATCAAGCAAGAACTCATCATCATTCCAGTGGGACAACTTGCGTCAG
AATTAAAGAACAGACCAGTCAGGACATTGCCAAGCAATACAAAAGCTCCAAAAGGAAAAGAGCAATGTCACGCATTGACTCTCTGCAATGGCAGACAGATCGCACTAACT
GGCCCTGCACCAACGCAAGAAGAATCAACCGAAGCTGCGCTCAGCCTCAACCGTGAACCTGAGCACAAAGGTAACGCAAAGGTGATTACAACAGAGCCTGAGAAGGACGA
AGAGAAGCAACCAGAAAAAGAACCTGACACATACAAGCCACCGATACACTTTCTGCAGAAGTTAAGAAAGAGAAATGACGGAGCACAGTTCAAGAAATTTCTTGGTGTCT
TGCGAGAGCGTCACATCAATATTCCTTTGGTTGAAGCATTAGAAAAAGTTCCCGGTTATGCAAAATTCTTGAAGGATATTCTATCCAATAAAAGACCCTGGAAGGAATTC
GAAACGGTTGCGCTAACAAACGAATGGAGCACAATATTGGCGAGTAAAATTCCCCCAAAGTTGAAGGATCCTAGGAGTTTCACCATCCCATGCTCGATTGGAGGAATTGA
TGTAGGAAGGGCATTATACCAATCTGTAGTGCATCCCGAAGGAAAGATTGAAGATGTTCTGGTAAAAGTTGATAAGTTTATTTTTCCAGCTGACTTTATTATTCTCAACT
ATGAAGCAGATAAGGATGTCCCAATCATTTTGGGAAGACCATTTTTAGCAACTGGACAGACACTGATTGACGTTTAG
Protein sequenceShow/hide protein sequence
MADPNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSI
TTWDALVHAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAADGTLLSKTVENARTLLEDMATNSYQWPS
ERSTPKKIVAGVFEVDKSIESAVALASRPQEETIEQVQYVSNFNSTPTHYHPNNRNHENFSYANNKNVLNPPGFAPQTQENKKLEDLVGAFIAESSNRTTKLEEGVIAIN
TTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSSDEAEKPNLEPPIPSPTVLVPKERKKKRRKRTIRI
MKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLA
TGRVIIDIERKELTIRVKNEKEIFKAVEDSKDEVLFMGYRKGLVFFLSFAFKLSRFQAIKQELIIIPVGQLASELKNRPVRTLPSNTKAPKGKEQCHALTLCNGRQIALT
GPAPTQEESTEAALSLNREPEHKGNAKVITTEPEKDEEKQPEKEPDTYKPPIHFLQKLRKRNDGAQFKKFLGVLRERHINIPLVEALEKVPGYAKFLKDILSNKRPWKEF
ETVALTNEWSTILASKIPPKLKDPRSFTIPCSIGGIDVGRALYQSVVHPEGKIEDVLVKVDKFIFPADFIILNYEADKDVPIILGRPFLATGQTLIDV