CuGenDBv2

Lag0038602 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038602
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr2:21212809..21221683
RNA-Seq Expression: Lag0038602
Synteny: Lag0038602
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits | e value | % identity
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis] | 1.5e-154 | 46.12
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRD
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+IC TVKINGV+ED  RLRLFPFSL+DK R 
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
        WLQS+ PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EA ER+K+L+R+CPQHG P+WLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSKRSAPKKIAAGVFEVDKSIE------------SAATLASRPQ---------------EETIEQVQYVSNFNSRGYNNNST
         E A  LLE+MA+N+YQWP++R+  KK+ AG+ E++                SA T    PQ               E + EQVQYV+N N   Y  N  
Subjt:  VENARTLLEDMATNSYQWPSKRSAPKKIAAGVFEVDKSIE------------SAATLASRPQ---------------EETIEQVQYVSNFNSRGYNNNST

Query:  PTHYHPNNRNHENFSYTNTKNVL---NLPGFALQTQENK-KLEDLVEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKA
        P +YHP  RNHEN SY NTKNVL   + PGF  Q  E K  LED + +F+ E++ R  K +  +  I T  +   A +KN+E Q+GQL   ++   +G  
Subjt:  PTHYHPNNRNHENFSYTNTKNVL---NLPGFALQTQENK-KLEDLVEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKA

Query:  PAEQEKTQMEYCKAIIVHQEK-VEEEPESEDYDTPTGEAEKDPSSDEAEKPE------------------PEPPIPSPTLMVLKDRKKKKKKKNNQVQFD
        P+  E    E CKAI +   K +E  P  E   TPT  A    S D+ E+ E                    PPI +P L   +  +K+K  K    QF 
Subjt:  PAEQEKTQMEYCKAIIVHQEK-VEEEPESEDYDTPTGEAEKDPSSDEAEKPE------------------PEPPIPSPTLMVLKDRKKKKKKKNNQVQFD

Query:  KFMNAFMSLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKAPEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFK
        KF++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K P+K+ DP SF++PC+ G   F R LCDLGAS N++P  + +
Subjt:  KFMNAFMSLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKAPEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFK

Query:  KLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIFKAVEDSK
        KL +GE+K T + LQLAD+S+  P GIIE+VL++V +F  P D  V+DM E+  +P+ILGRPFL TGR +ID+++ ELT+RV  E+ +F   +  K
Subjt:  KLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIFKAVEDSK

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis] | 3.7e-156 | 46.41
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRD
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+IC TVKINGV+ED  RLRLFPFSL+DK R 
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
        WLQS+ PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EA ER+K+L+R+CPQHG P+WLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSKRSAPKKIAAGVFEVDKSIE------------SAATLASRPQ---------------EETIEQVQYVSNFNSRGYNNNST
         E A  LLE+MA+N+YQWP++R+  KK+ AG+ E++                SA T    PQ               E + EQVQYV+N N   Y  N  
Subjt:  VENARTLLEDMATNSYQWPSKRSAPKKIAAGVFEVDKSIE------------SAATLASRPQ---------------EETIEQVQYVSNFNSRGYNNNST

Query:  PTHYHPNNRNHENFSYTNTKNVL---NLPGFALQTQENK-KLEDLVEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKA
        P +YHP  RNHEN SY NTKNVL   + PGF  Q  E K  LED + +F+ E++ R  K +  +  I T  +   A +KN+E Q+GQL   ++   +G  
Subjt:  PTHYHPNNRNHENFSYTNTKNVL---NLPGFALQTQENK-KLEDLVEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKA

Query:  PAEQEKTQMEYCKAIIVHQEK-VEEEPESEDYDTPTGEAEKDPSSDEAEKPE------------------PEPPIPSPTLMVLKDRKKKKKKKNNQVQFD
        P+  E    E CKAI +   K +E  P  E   TPT  A    S D+ E+ E                    PPI +P L   +  +K+K  K    QF 
Subjt:  PAEQEKTQMEYCKAIIVHQEK-VEEEPESEDYDTPTGEAEKDPSSDEAEKPE------------------PEPPIPSPTLMVLKDRKKKKKKKNNQVQFD

Query:  KFMNAFMSLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKAPEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFK
        KF++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K P+K+ DPGSF++PC+ G   F R LCDLGAS N++P S+ +
Subjt:  KFMNAFMSLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKAPEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFK

Query:  KLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIFKAVEDSK
        KL +GE+K T + LQLAD+S+  P GIIE+VL++V +F  P D  V+DM E+  +P+ILGRPFL TGR +ID+++ ELT+RV  E+ +F   +  K
Subjt:  KLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIFKAVEDSK

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis] | 1.5e-162 | 44.67
Query:  MRKVRELALVPLDPEIERTIHRIRRENRENIQMADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSF
        MR+ R   ++P+DPEIERT+  +RR   + + MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  F
Subjt:  MRKVRELALVPLDPEIERTIHRIRRENRENIQMADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSF

Query:  LDICGTVKINGVSEDANRLRLFPFSLQDKTRDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRKCPQHGYP
        L+IC TVKINGV+ED  RLRLFPFSL+DK R WLQS+ PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EA ER+K+L+R+CPQHG P
Subjt:  LDICGTVKINGVSEDANRLRLFPFSLQDKTRDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRKCPQHGYP

Query:  NWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKIAAGVFEVDKSIE------------SAATLASRPQ------
        +WLQVQ+FYNGL   T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP++R+  KK+ AG+ +++                SA T    PQ      
Subjt:  NWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKIAAGVFEVDKSIE------------SAATLASRPQ------

Query:  ---------EETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYTNTKNVL---NLPGFALQTQENK-KLEDLVEAFIAESSNRTTKLEEAVIAIN
                 E + EQVQYV+N N   Y  N  P +YHP  RNHEN SY NTKNVL   + PGF  Q  E K  LED + +F+ E++ R  K +  +  I 
Subjt:  ---------EETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYTNTKNVL---NLPGFALQTQENK-KLEDLVEAFIAESSNRTTKLEEAVIAIN

Query:  TTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEK-VEEEPESEDYDTPT----GEAEKDPSSDEAEKPEPE----------
        T  +   AAIKNIE Q+GQL   ++   +G  P+  E    E CKAI +   K +E  P  E   TPT    G+++     DE      E          
Subjt:  TTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEK-VEEEPESEDYDTPT----GEAEKDPSSDEAEKPEPE----------

Query:  ---PPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKAPEKVADPGSF
           PPI +P L   +  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K P+K+ DPGSF
Subjt:  ---PPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKAPEKVADPGSF

Query:  SVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVILGRPFLVTGRV
        ++PC+ G   F + LCDLGAS N++PLS+ +KL + E+K T + LQLAD+S+  P GIIE+VL++V +F  P D  V+DM E+  +P+ILGRPFL TGR 
Subjt:  SVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVILGRPFLVTGRV

Query:  IIDIERMELTVRVRNEKEIFKAV------EDSKGHFEVLVMGYKKGARKSTSVGFTEKKPLDARSTRRANDVKQALMGGNPSVRDVKR
        +ID+++ ELT+RV  E+ +FK        E+    F V V+  K+G  K     F E  P        AN +++AL        +V+R
Subjt:  IIDIERMELTVRVRNEKEIFKAV------EDSKGHFEVLVMGYKKGARKSTSVGFTEKKPLDARSTRRANDVKQALMGGNPSVRDVKR

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris] | 3.0e-150 | 45.63
Query:  YMRKVRELALVPLDPEIERTIHRIRR-ENRENIQMADQ-----NPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDP
        +MR+ R L L+ +DPE ERT   +R  +  E   MA+Q     N   + R IRDY +PV     SGI    I A NFELK GLI M +   + G+  EDP
Subjt:  YMRKVRELALVPLDPEIERTIHRIRR-ENRENIQMADQ-----NPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDP

Query:  NSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRK
        N+HL SFL+IC TVK+NGV+EDA RLRLF FSL+DK + W QS+  GSITTWD L Q FL K+FPP+K+ +LR EI  F+Q   E  +EA ERFK+LLR+
Subjt:  NSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRK

Query:  CPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKIAAGVFEVD-------------------------
        CPQHG+  W+Q+++FYNGL   T+T+VDAAAGG L++KT E A  LL+D+ATNSYQWPS+RS  KK+ AG+ EVD                         
Subjt:  CPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKIAAGVFEVD-------------------------

Query:  KSIESAATLASRPQEETI--EQVQYVS--NFNSR-GYNNNSTPTHYHPNNRNHENFSYTNTKNVLN-LPGFALQTQENK-KLEDLVEAFIAESSNRTTKL
        ++++S  + +S  QE  +  EQVQY+   N+N R GY  N    HYHP  RNHEN SY N +N L   PGF  Q  + K  LED++  FI+E+ +R  K 
Subjt:  KSIESAATLASRPQEETI--EQVQYVS--NFNSR-GYNNNSTPTHYHPNNRNHENFSYTNTKNVLN-LPGFALQTQENK-KLEDLVEAFIAESSNRTTKL

Query:  EEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEKVEEEPESEDYDTPTGEA-EKDPSSDEAEKPEPE------
        E  +  I T V+   A +KN+E Q+GQL  ++ +  KGK P++ E    E+C AI +   K+ EE + +    PT +    D    E +K E E      
Subjt:  EEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEKVEEEPESEDYDTPTGEA-EKDPSSDEAEKPEPE------

Query:  ---------PPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKAPEKV
                 PPI  P L       ++  KK    QF KF+  F  ++INIPFAE L +MP Y +F+KE ++ K+K ++ +T+ L   CS  + QK P K+
Subjt:  ---------PPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKAPEKV

Query:  ADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVILGRPF
         DPGSF++PC+ G  +F RALCD GAS N++PLS+FKKL +GE+K T + LQLAD+S+  P G+IE+VL++V +F LP+D  V+DM EN  +P+ILGRPF
Subjt:  ADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVILGRPF

Query:  LVTGRVIIDI
        L TGR +ID+
Subjt:  LVTGRVIIDI

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber] | 2.2e-153 | 45.28
Query:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRD
        MA+     +PR ++DY +P+     SGI    INANNFELK  LI M +   + GSP +DPN HL  FL+IC T+K+NGV+ED  RLRLFPFSL+DK R 
Subjt:  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
        WLQS+ PGSIT+W  + + FL KFFPPAKT +LR+EIG F Q   E L+EA ER+K+L+R CPQHG P+WLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSKRSAPKKIAAGVFEVDKSIESAATLASR------------PQ---------------EETIEQVQYVSNFNSRGYNNNST
         E A +LLE+MA+N+YQWP++R+  KK+ AG+ E++     +A +AS             PQ               E + EQVQY++N N   Y  N  
Subjt:  VENARTLLEDMATNSYQWPSKRSAPKKIAAGVFEVDKSIESAATLASR------------PQ---------------EETIEQVQYVSNFNSRGYNNNST

Query:  PTHYHPNNRNHENFSYTNTKNVLN-LPGFALQTQENK-KLEDLVEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPA
        P +YHP  RNHENFSY NTKNVL   PGF  Q  E K  LED + +F+ E+     K +  +  I T  +   A +KN+E Q+GQL   ++   +G  P+
Subjt:  PTHYHPNNRNHENFSYTNTKNVLN-LPGFALQTQENK-KLEDLVEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPA

Query:  EQEKTQMEYCKAIIVHQ-EKVEEEPESEDYDTPT--------GEAEKDPSSDEAEKPEPEPPIPS-----PTLMVLKDRKKKKKKKNNQVQFDKFMNAFM
          E    E CKAI +    ++E  P  E   TPT         + E++   ++  +    PP  S     P L       ++ +K+    QF KF++ F 
Subjt:  EQEKTQMEYCKAIIVHQ-EKVEEEPESEDYDTPT--------GEAEKDPSSDEAEKPEPEPPIPS-----PTLMVLKDRKKKKKKKNNQVQFDKFMNAFM

Query:  SLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKAPEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEI
         ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K P+K+ DPGSF++PC+ G   F + LCDLGAS N++PLS+++KL +GE+
Subjt:  SLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKAPEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEI

Query:  KSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIFKAVEDSK
        K T + LQLAD+S+  P GIIE+VL++V +F  P D  V+DM E+  +P+ILGRPFL TGR ++D+++ ELT+RV  E+  F   E  K
Subjt:  KSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIFKAVEDSK

TrEMBL top hits | e value | % identity
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 | 3.7e-138 | 42.19
Query:  MRKVRELALVPLDPEIERTIHRIRRENRE----NIQMADQN----------PPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-Y
        M++   L LVP DP+IERT  R RREN +    N  MA+ N           PE  R +RDY  P+ QG    I    INANNFE+K   IQM +    +
Subjt:  MRKVRELALVPLDPEIERTIHRIRRENRE----NIQMADQN----------PPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-Y

Query:  RGSPTEDPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACE
         G P++DPNSHL +FL+IC T K NGV++DA RLRLFPFSL+DK + WL S+  GSITTW+ L Q FL KFFPPAKT K+R +I +F Q   E L+EA E
Subjt:  RGSPTEDPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACE

Query:  RFKELLRKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKIAAGVFEVDK----SIESAA------
        RFKELLR+CP HG P+WLQVQ FYNGL  S KTI+DAAAGG L+SK   +A  LLE+MA+N+YQWPS+RS  +K A G +E+D     + + AA      
Subjt:  RFKELLRKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSKRSAPKKIAAGVFEVDK----SIESAA------

Query:  TLASRPQEETI-------------------EQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYTNTKNVLN-----LPGFALQT-----QENKKLEDL
        TL     + ++                   E VQ+V NFN +   NN     Y+P  RNH NFS++N     N      PGF  Q      ++  +LE+L
Subjt:  TLASRPQEETI-------------------EQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYTNTKNVLN-----LPGFALQT-----QENKKLEDL

Query:  VEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQE--KTQMEYCKAII---------VHQEKVEEEPESED----
        +  +I+++              +  +    A+++N+ETQ+GQL N ++   +G  P++ +      E C+AI          V+Q+ VE E E  D    
Subjt:  VEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQE--KTQMEYCKAII---------VHQEKVEEEPESED----

Query:  --YDTPTGEAEKDPSSDEAEKPEPEPPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYL
           +    + + D + ++       PP P P         ++ +K+  + QF KF+N F  L+INIPFAEALE MP Y +F+K+ L+KKRK  + +TV+L
Subjt:  --YDTPTGEAEKDPSSDEAEKPEPEPPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYL

Query:  ASTCSTRVQQKAPEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVM
           CS  +Q K P K+ DPGSF++PC+ G   F +AL DLGAS N++P S+F+KL +GE K T V LQLAD+S V P GIIE+VL++V +F  P+D  ++
Subjt:  ASTCSTRVQQKAPEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVM

Query:  DMVENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIFKAVEDSK
        DM E+  +P+ILGRPFL T   IID+   +++ +V  E   F     SK
Subjt:  DMVENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIFKAVEDSK

A0A6J1DU19 uncharacterized protein LOC111024361 | 3.0e-111 | 42.22
Query:  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRDWLQSITPGSITT
        IRDY QP F     GI+  PINANN ELK GLIQM R+  +RG+ TEDPN+HL  FLD+CGTVK+NGV +DA RLRLFP SLQDK               
Subjt:  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRDWLQSITPGSITT

Query:  WDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMA
           +VQAFL  FFPPAKT +LRTEI +F +   EQLFE  ER+KELLRKCPQHG   WLQ+Q+FYNGL   T+TI+DAAAGGTLLS+T ENA  LL+DMA
Subjt:  WDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMA

Query:  TNSYQWPSKRSAPKKIAAGVFEVDKSIESAATLASRPQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYTNTKNVLNLPGFALQTQENK-KL
         NS+QWPS+RS  KK+ AG++E+D+     A            QVQ ++N  S+     S P   H N       +Y+  +  +    F     E K  L
Subjt:  TNSYQWPSKRSAPKKIAAGVFEVDKSIESAATLASRPQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYTNTKNVLNLPGFALQTQENK-KL

Query:  EDLVEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEKVEEEPESEDYDTPTGEAEKDP
        EDL+ AFI E  +R +++E  V  +   + G++ +IKN+E Q+GQ+   ++TM KGK P++ E    E+CKA+ +   K  +EPE +  + P    E+  
Subjt:  EDLVEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEKVEEEPESEDYDTPTGEAEKDP

Query:  SSDEAEKPEPEPPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNIN-IPFAE-ALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKA
        + +E  K E  P +                      Q DK  ++ +S   N +P+ + ALE MP Y RFMK+ +  KRK +  +TV L   CS  +Q+K 
Subjt:  SSDEAEKPEPEPPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNIN-IPFAE-ALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKA

Query:  PEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVIL
        P+K+ DPGSF++PC+  + SF +ALCD+ AS N++PL                             G+IE+VL++V R   P D  V+   E+  +P+IL
Subjt:  PEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVIL

Query:  GRPFLVTGRVIIDIERMELTVRVRNEKEIF
        GR FL TG  +ID++   LT+RV  E  +F
Subjt:  GRPFLVTGRVIIDIERMELTVRVRNEKEIF

A0A6P6XAQ1 Reverse transcriptase | 2.4e-121 | 41.47
Query:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRDWLQSITPGSI
        R +RD+  P  QG Q+ IV   +NANNFE+K  LIQM +   Y G+ TEDPNSHL +FL+IC T+K NGVSEDA +LRLFPFSL+DK + WLQS  P + 
Subjt:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRDWLQSITPGSI

Query:  TTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLED
        TTWD L +AFL KFFPP KT KLR +I +F QQ  E L+EA ER++EL R+CP HG P+WL VQ FYNGLT  TKT VDAAAGG L+ KT E A+ L+E+
Subjt:  TTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLED

Query:  MATNSYQWPSKRSAPKKIAAGVFEVDK--------------------------SIESAATLASRPQEE----TIEQVQYVSNFNSRGYNNNSTPTHYHPN
        MA N+YQW ++R   ++  AG+ EVD                            + ++ T+     ++    + EQVQY++N+N R   NN     Y+P 
Subjt:  MATNSYQWPSKRSAPKKIAAGVFEVDK--------------------------SIESAATLASRPQEE----TIEQVQYVSNFNSRGYNNNSTPTHYHPN

Query:  NRNHENFSYT---NTKNVLNLPGFALQ--TQENKKLEDLVEAFIAESSN-RTTKLEEAVIAINTTVNGHSAAI----KNIETQLGQLVNVVSTMNKGKAP
         RNH NF +    N +  +N PGF  +    E+K   +L    +A +SN +  KL  A       + G    +    +N+E QLGQ+ N V+  N+G  P
Subjt:  NRNHENFSYT---NTKNVLNLPGFALQ--TQENKKLEDLVEAFIAESSN-RTTKLEEAVIAINTTVNGHSAAI----KNIETQLGQLVNVVSTMNKGKAP

Query:  AEQEKTQMEYCKAIIVHQEKVEEEPESEDYDTPTGEAEKDPSSDEAEKPEPEPPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNINIPFAEALE
        ++ E    E+ KAI +   K   EP          E EK  +   +E  E             K+ K K+K + N++Q +       +  I  P      
Subjt:  AEQEKTQMEYCKAIIVHQEKVEEEPESEDYDTPTGEAEKDPSSDEAEKPEPEPPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNINIPFAEALE

Query:  MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKAPEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSV
        +P Y +F+KE + KKRK    +T+ L   CS  +Q K P K+ DPGSF+VPC+ G   F +ALCDLGAS ++IPL++ ++L + E+K T + LQLAD+S+
Subjt:  MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKAPEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSV

Query:  VKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIF
          P+GI+ENVLI+V +F +P+D  V+DM E+ ++P+ILGRPFL T   IID++R +   ++  E+  F
Subjt:  VKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIF

A0A6P8DD93 uncharacterized protein LOC116206453 | 6.6e-119 | 39.01
Query:  MRKVRELALVPLDPEIERTIHRIRRENREN-----IQMADQNPPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE
        MR+ R   L+PLDPEIERT+HR+RRENR       ++MAD +   +     R +RDY  P   G  S I    I ANNFELK  LIQM +   + G P E
Subjt:  MRKVRELALVPLDPEIERTIHRIRRENREN-----IQMADQNPPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE

Query:  DPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELL
         P+ H+  FL  C TVK+N V++D  RL+LFPFSL+DK R W  S+   SITTW  L   FL++FFPPA+T +LR EI  F +   E L+EA ERFKE +
Subjt:  DPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELL

Query:  RKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSKRSAPK---------------KIAAGVFEVDK--SIESA
        RKCP HG P+ L +++FY  L  + +++VDAAAGG L+ K  + A  L+E+MA++++ W ++RS  +               +I+A   +V K  S  S 
Subjt:  RKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSKRSAPK---------------KIAAGVFEVDK--SIESA

Query:  AT-------LASRPQ------------EETIEQVQYVSNF---NSRGYNNNSTPTHYHPNNRNHENFSYTNTKNVLN-LPGFALQ--------TQENKKL
         T       L S P                 EQV +V+NF   N   Y+N      Y+P  RNH NFS+ N  N L   PGF  Q         Q   ++
Subjt:  AT-------LASRPQ------------EETIEQVQYVSNF---NSRGYNNNSTPTHYHPNNRNHENFSYTNTKNVLN-LPGFALQ--------TQENKKL

Query:  EDLVEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEKVEEEPESEDYDTPTGEAEKDP
        E+L+ +++ ++              +T +    A I+N+E Q+ Q+   +S    G  P+  E+   +   AI++   K E E  +    T     EKD 
Subjt:  EDLVEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEKVEEEPESEDYDTPTGEAEKDP

Query:  SSDEAEKPEPE--------PPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCST
           + E+P  +        PP+P P         ++ K++    QF KF++ F  L INIPFAEAL +MP Y RFMK+ L KKRK    + V L   CS 
Subjt:  SSDEAEKPEPE--------PPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCST

Query:  RVQQ---KAPEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMV
         +Q+     P K  D GSF+VPC+ G + F   L D GAS N++PLS+F+KL +GE K T V LQLAD+S+  P GI+ENVL++V +F  P+D  V++M 
Subjt:  RVQQ---KAPEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMV

Query:  ENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIFKAVEDSK
        E+  +P+ILGRPFL TG+ +ID+E+ +LT+RV NE+  F   +  K
Subjt:  ENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIFKAVEDSK

A0A6P8DKJ2 uncharacterized protein LOC116204231 | 3.3e-118 | 38.87
Query:  MRKVRELALVPLDPEIERTIHRIRRENREN-----IQMADQNPPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE
        MR+ R   L+PLDPEIERT+HR+RRENR       ++MAD +   +     R +RDY  P   G  S I    I ANNFELK  LIQM +   + G P E
Subjt:  MRKVRELALVPLDPEIERTIHRIRRENREN-----IQMADQNPPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE

Query:  DPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELL
         P+ H+  FL  C TVK+N V++D  RL+LFPFSL+DK R W  S+   SITTW  L   FL++FFPPA+T +LR EI  F +   E L+EA ERFKE +
Subjt:  DPNSHLKSFLDICGTVKINGVSEDANRLRLFPFSLQDKTRDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELL

Query:  RKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSKRSAPK---------------KIAAGVFEVDK--SIESA
        RKCP HG P+ L +++FY  L  + +++VDAAAGG L+ K  + A  L+E+MA++++ W ++RS  +               +I+A   +V K  S  S 
Subjt:  RKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSKRSAPK---------------KIAAGVFEVDK--SIESA

Query:  AT-------LASRPQ------------EETIEQVQYVSNF---NSRGYNNNSTPTHYHPNNRNHENFSYTNTKNVLN-LPGFALQ--------TQENKKL
         T       L S P                 EQV +V+NF   N   Y+N      Y+P  RNH NFS+ N  N L   PGF  Q         Q   ++
Subjt:  AT-------LASRPQ------------EETIEQVQYVSNF---NSRGYNNNSTPTHYHPNNRNHENFSYTNTKNVLN-LPGFALQ--------TQENKKL

Query:  EDLVEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEKVEEEPESEDYDTPTGEAEKDP
        E+L+ +++ ++              +T +    A I+N+E Q+ Q+   +S    G  P+  E+   +   AI++   K E E  +    T     EKD 
Subjt:  EDLVEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEKVEEEPESEDYDTPTGEAEKDP

Query:  SSDEAEKPEPE--------PPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCST
           + E+P  +        PP+P P          + K++    QF KF++ F  L INIPFAEAL +MP Y RFMK+ L KKRK    + V L   CS 
Subjt:  SSDEAEKPEPE--------PPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCST

Query:  RVQQ---KAPEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMV
         +Q+     P K  D GSF+VPC+ G + F   L D GAS N++PLS+F+KL +GE K T + LQLAD+S+  P GI+ENVL++V +F  P+D  V++M 
Subjt:  RVQQ---KAPEKVADPGSFSVPCSFGTYSF-RALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMV

Query:  ENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIFKAVEDSK
        E+  +P+ILGRPFL TG+ +ID+E+ +LT+RV NE+  F   +  K
Subjt:  ENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIFKAVEDSK

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGAGGACAACCCAACACGAGCCTGAAAACAAGACCAGAAAACAGACTCCGGAGACGAAACAGGCTAGAGGGTCGGGCCAAGACTGGAGAGATAGGGCCTTGGCCCAACC
CAAATGGTCGGCCTCGGCCCAAGGCCGAGGTCGACCACTCGGCCCACTTGTGTGGGCCAAGTCTTTTCATCTCCGTTCGGTCCCTGGTGCCCCCGGCTGTCTCGGTTCCA
CGCTGTCTAAGTGTTTGATTAAGTTCGAAACACTTCTACTTGATCACCCACTACCCAAGGACTATTCTAGCGCATCGTTAGAGCGACTGAAAAGCCTTATCTGGAGCCAT
TTGGCGAACACTCATTGCCCGCTGTGCCTTAGACATGGTTGCCATGGTCTAAAGTTTGAGGTGCTTGTGGAGCTTTGTGCAAGTGTGCCTCGTTGGTATTTGGAAGCGTG
TTTTTGTGGGAGGCATATGCATGATGTGGTTTTGGTGCTTGTGAGCATGTTGTTGGTTTGTTGCTTTGAGAGCATGCTGGTTTTTACTGCTTTTGGAAGTATCGAGGATG
ATCCGAACCTTGGTGGCGAGGAGGAGAACTACGTGGAAATTCCTAGAGTGTTGTTATGTTGGTTTTGTTGGTATATGCGAAAGGTTAGGGAGTTGGCCTTAGTACCGTTG
GATCCCGAGATCGAAAGAACAATTCATAGGATTCGAAGGGAGAATAGAGAAAACATCCAAATGGCCGACCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGACTACTT
TCAGCCCGTGTTTCAGGGGCAACAATCGGGGATTGTCTATGCCCCGATCAATGCCAACAACTTTGAGCTGAAGACCGGTCTCATTCAGATGGCTCGAGATTGTGCTTATA
GAGGATCGCCCACCGAGGATCCAAATTCTCATCTAAAATCATTTTTGGACATTTGTGGGACAGTAAAAATTAATGGAGTTTCTGAGGATGCCAATCGCTTACGCTTATTC
CCTTTTTCTTTGCAGGATAAAACACGAGATTGGTTGCAGTCTATTACCCCTGGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCTTTTCTAAAGAAATTCTTCCCTCC
TGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACGTTCGAACAACAATATGATGAGCAGTTGTTCGAGGCTTGTGAGCGATTTAAAGAGTTGCTGAGGAAATGCCCTC
AGCATGGATATCCCAACTGGCTTCAGGTTCAATTGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACC
GTGGAAAATGCTCGCACACTTCTAGAGGACATGGCCACCAACAGCTATCAGTGGCCATCTAAGCGGTCTGCACCTAAAAAGATTGCTGCTGGCGTGTTTGAGGTTGACAA
GTCAATTGAATCAGCTGCTACGTTAGCATCTAGACCTCAGGAGGAGACCATCGAGCAGGTTCAATATGTATCAAATTTTAATTCCAGGGGATATAATAATAATTCTACAC
CTACACATTATCACCCTAACAATAGGAACCATGAAAATTTTTCTTATACTAATACTAAGAATGTTCTTAATCTTCCTGGTTTTGCCCTTCAAACTCAAGAAAATAAAAAG
TTAGAAGATCTTGTTGAAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAATTGGAGGAGGCAGTCATTGCCATCAACACCACGGTGAATGGCCACAGTGCTGCCAT
CAAGAATATTGAGACTCAGCTGGGACAGTTGGTAAATGTTGTAAGCACCATGAATAAAGGTAAGGCCCCAGCTGAGCAAGAGAAAACCCAGATGGAGTATTGTAAGGCAA
TCATTGTGCATCAGGAGAAAGTCGAAGAGGAGCCGGAGTCTGAGGATTATGACACTCCTACTGGAGAAGCTGAGAAGGACCCATCATCAGATGAAGCTGAAAAGCCTGAA
CCTGAGCCTCCTATTCCTTCTCCCACCTTGATGGTCCTCAAAGATAGGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTAT
GAGTCTGAACATTAACATTCCTTTTGCAGAAGCTTTAGAGATGCCCCAGTACAACAGGTTCATGAAGGAATGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACCG
TATATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAGGCACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGCAGTTTTGGTACTTATTCATTTAGA
GCATTATGTGATTTAGGTGCTAGCAATAATATTATTCCTCTATCTCTGTTCAAAAAGTTAGATATAGGTGAGATTAAATCTACTCCTGTTAGGCTCCAATTGGCTGATCA
GTCTGTGGTTAAACCAGTTGGCATTATAGAAAATGTTTTAATCAGAGTAGGTAGATTTTTCCTCCCTATTGATTTACATGTTATGGATATGGTGGAAAATCCTTCAATGC
CTGTCATACTAGGAAGACCATTCCTCGTTACTGGGCGAGTGATTATTGATATTGAGCGCATGGAGCTCACTGTTAGAGTCCGGAATGAAAAAGAAATATTTAAAGCAGTT
GAAGACTCTAAGGGACACTTTGAAGTGCTTGTCATGGGCTACAAGAAAGGTGCAAGAAAGAGCACCTCTGTTGGATTCACAGAAAAGAAGCCCCTTGATGCACGATCAAC
ACGTCGAGCTAATGACGTTAAACAAGCGCTTATGGGAGGCAACCCAAGTGTTCGGGATGTGAAAAGATGCCAAAGAACTGAAAAGATTCAAGTAACAAGGAGCCAAGAGA
GGACAATCTGCCTATCAGCTTCGAGACGCCACTCTTGGAGCGTCTCTACGCTCGCTTTCCTTGTTTATTTTGGGCGGCAGCAATCTCAGCGTCGAGACGCTGTGATAAGT
TTTTCCCTTATTCATCAGGCGCGCCAGGCGACAGCGTCGAGACGCTGTCTCCTTAGCGTCTCGACGCTATCGGCAGAATTTCCTATTTATACTTCTTTTCATGCTACGGA
TTGA
mRNA sequence
ATGAGGACAACCCAACACGAGCCTGAAAACAAGACCAGAAAACAGACTCCGGAGACGAAACAGGCTAGAGGGTCGGGCCAAGACTGGAGAGATAGGGCCTTGGCCCAACC
CAAATGGTCGGCCTCGGCCCAAGGCCGAGGTCGACCACTCGGCCCACTTGTGTGGGCCAAGTCTTTTCATCTCCGTTCGGTCCCTGGTGCCCCCGGCTGTCTCGGTTCCA
CGCTGTCTAAGTGTTTGATTAAGTTCGAAACACTTCTACTTGATCACCCACTACCCAAGGACTATTCTAGCGCATCGTTAGAGCGACTGAAAAGCCTTATCTGGAGCCAT
TTGGCGAACACTCATTGCCCGCTGTGCCTTAGACATGGTTGCCATGGTCTAAAGTTTGAGGTGCTTGTGGAGCTTTGTGCAAGTGTGCCTCGTTGGTATTTGGAAGCGTG
TTTTTGTGGGAGGCATATGCATGATGTGGTTTTGGTGCTTGTGAGCATGTTGTTGGTTTGTTGCTTTGAGAGCATGCTGGTTTTTACTGCTTTTGGAAGTATCGAGGATG
ATCCGAACCTTGGTGGCGAGGAGGAGAACTACGTGGAAATTCCTAGAGTGTTGTTATGTTGGTTTTGTTGGTATATGCGAAAGGTTAGGGAGTTGGCCTTAGTACCGTTG
GATCCCGAGATCGAAAGAACAATTCATAGGATTCGAAGGGAGAATAGAGAAAACATCCAAATGGCCGACCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGACTACTT
TCAGCCCGTGTTTCAGGGGCAACAATCGGGGATTGTCTATGCCCCGATCAATGCCAACAACTTTGAGCTGAAGACCGGTCTCATTCAGATGGCTCGAGATTGTGCTTATA
GAGGATCGCCCACCGAGGATCCAAATTCTCATCTAAAATCATTTTTGGACATTTGTGGGACAGTAAAAATTAATGGAGTTTCTGAGGATGCCAATCGCTTACGCTTATTC
CCTTTTTCTTTGCAGGATAAAACACGAGATTGGTTGCAGTCTATTACCCCTGGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCTTTTCTAAAGAAATTCTTCCCTCC
TGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACGTTCGAACAACAATATGATGAGCAGTTGTTCGAGGCTTGTGAGCGATTTAAAGAGTTGCTGAGGAAATGCCCTC
AGCATGGATATCCCAACTGGCTTCAGGTTCAATTGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACC
GTGGAAAATGCTCGCACACTTCTAGAGGACATGGCCACCAACAGCTATCAGTGGCCATCTAAGCGGTCTGCACCTAAAAAGATTGCTGCTGGCGTGTTTGAGGTTGACAA
GTCAATTGAATCAGCTGCTACGTTAGCATCTAGACCTCAGGAGGAGACCATCGAGCAGGTTCAATATGTATCAAATTTTAATTCCAGGGGATATAATAATAATTCTACAC
CTACACATTATCACCCTAACAATAGGAACCATGAAAATTTTTCTTATACTAATACTAAGAATGTTCTTAATCTTCCTGGTTTTGCCCTTCAAACTCAAGAAAATAAAAAG
TTAGAAGATCTTGTTGAAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAATTGGAGGAGGCAGTCATTGCCATCAACACCACGGTGAATGGCCACAGTGCTGCCAT
CAAGAATATTGAGACTCAGCTGGGACAGTTGGTAAATGTTGTAAGCACCATGAATAAAGGTAAGGCCCCAGCTGAGCAAGAGAAAACCCAGATGGAGTATTGTAAGGCAA
TCATTGTGCATCAGGAGAAAGTCGAAGAGGAGCCGGAGTCTGAGGATTATGACACTCCTACTGGAGAAGCTGAGAAGGACCCATCATCAGATGAAGCTGAAAAGCCTGAA
CCTGAGCCTCCTATTCCTTCTCCCACCTTGATGGTCCTCAAAGATAGGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTAT
GAGTCTGAACATTAACATTCCTTTTGCAGAAGCTTTAGAGATGCCCCAGTACAACAGGTTCATGAAGGAATGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACCG
TATATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAGGCACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGCAGTTTTGGTACTTATTCATTTAGA
GCATTATGTGATTTAGGTGCTAGCAATAATATTATTCCTCTATCTCTGTTCAAAAAGTTAGATATAGGTGAGATTAAATCTACTCCTGTTAGGCTCCAATTGGCTGATCA
GTCTGTGGTTAAACCAGTTGGCATTATAGAAAATGTTTTAATCAGAGTAGGTAGATTTTTCCTCCCTATTGATTTACATGTTATGGATATGGTGGAAAATCCTTCAATGC
CTGTCATACTAGGAAGACCATTCCTCGTTACTGGGCGAGTGATTATTGATATTGAGCGCATGGAGCTCACTGTTAGAGTCCGGAATGAAAAAGAAATATTTAAAGCAGTT
GAAGACTCTAAGGGACACTTTGAAGTGCTTGTCATGGGCTACAAGAAAGGTGCAAGAAAGAGCACCTCTGTTGGATTCACAGAAAAGAAGCCCCTTGATGCACGATCAAC
ACGTCGAGCTAATGACGTTAAACAAGCGCTTATGGGAGGCAACCCAAGTGTTCGGGATGTGAAAAGATGCCAAAGAACTGAAAAGATTCAAGTAACAAGGAGCCAAGAGA
GGACAATCTGCCTATCAGCTTCGAGACGCCACTCTTGGAGCGTCTCTACGCTCGCTTTCCTTGTTTATTTTGGGCGGCAGCAATCTCAGCGTCGAGACGCTGTGATAAGT
TTTTCCCTTATTCATCAGGCGCGCCAGGCGACAGCGTCGAGACGCTGTCTCCTTAGCGTCTCGACGCTATCGGCAGAATTTCCTATTTATACTTCTTTTCATGCTACGGA
TTGA
Protein sequence
MRTTQHEPENKTRKQTPETKQARGSGQDWRDRALAQPKWSASAQGRGRPLGPLVWAKSFHLRSVPGAPGCLGSTLSKCLIKFETLLLDHPLPKDYSSASLERLKSLIWSH
LANTHCPLCLRHGCHGLKFEVLVELCASVPRWYLEACFCGRHMHDVVLVLVSMLLVCCFESMLVFTAFGSIEDDPNLGGEEENYVEIPRVLLCWFCWYMRKVRELALVPL
DPEIERTIHRIRRENRENIQMADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDANRLRLF
PFSLQDKTRDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFEQQYDEQLFEACERFKELLRKCPQHGYPNWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
VENARTLLEDMATNSYQWPSKRSAPKKIAAGVFEVDKSIESAATLASRPQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYTNTKNVLNLPGFALQTQENKK
LEDLVEAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEKVEEEPESEDYDTPTGEAEKDPSSDEAEKPE
PEPPIPSPTLMVLKDRKKKKKKKNNQVQFDKFMNAFMSLNINIPFAEALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKAPEKVADPGSFSVPCSFGTYSFR
ALCDLGASNNIIPLSLFKKLDIGEIKSTPVRLQLADQSVVKPVGIIENVLIRVGRFFLPIDLHVMDMVENPSMPVILGRPFLVTGRVIIDIERMELTVRVRNEKEIFKAV
EDSKGHFEVLVMGYKKGARKSTSVGFTEKKPLDARSTRRANDVKQALMGGNPSVRDVKRCQRTEKIQVTRSQERTICLSASRRHSWSVSTLAFLVYFGRQQSQRRDAVIS
FSLIHQARQATASRRCLLSVSTLSAEFPIYTSFHATD