; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035835 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035835
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr3:31925378..31926629
RNA-Seq ExpressionLag0035835
SyntenyLag0035835
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]2.4e-3735.76Show/hide
Query:  INSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQ-EEAEKEPEFEDYDTPTGKAEEDTSPDEAEKPE--------------
        I +  +   A +KN+E Q+GQL + ++   +G   +  E    E CKAIT+   +E E+ P  E   TPT  A    S D+ E+ E              
Subjt:  INSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQ-EEAEKEPEFEDYDTPTGKAEEDTSPDEAEKPE--------------

Query:  ----PEPPIPSPTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQ
              PPI +P L  P+  +K+K  K    QF KF++ F  ++INIPFA+ALE M  Y +F+K+ ++KKR+ +  +TV L+  CS  +Q K+P+K+ D 
Subjt:  ----PEPPIPSPTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQ

Query:  GVF--------------------------LFLAVLLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLAT
          F                           F+   L +GE+K T + LQLAD+S+  P GI+E+VL++V KF  P D  V+DM E+  +P+ILGR FLAT
Subjt:  GVF--------------------------LFLAVLLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLAT

Query:  GRVIIDIECRELTVRVKNEKEIFEAVEDSK
        GR +ID++  ELT+RV  E+ +F   +  K
Subjt:  GRVIIDIECRELTVRVKNEKEIFEAVEDSK

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]4.7e-3836.67Show/hide
Query:  INSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQ-EEAEKEPEFEDYDTPTGKAEEDTSPDEAEKPE--------------
        I +  +   A +KN+E Q+GQL + ++   +G   +  E    E CKAIT+   +E E+ P  E   TPT  A    S D+ E+ E              
Subjt:  INSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQ-EEAEKEPEFEDYDTPTGKAEEDTSPDEAEKPE--------------

Query:  ----PEPPIPSPTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQ
              PPI +P L  P+  +K+K  K    QF KF++ F  ++INIPFA+ALE M  Y +F+K+ ++KKR+ +  +TV L+  CS  +Q K+P+K+ D 
Subjt:  ----PEPPIPSPTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQ

Query:  GVF---------LFLAVLLD-----------------IGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLAT
        G F          F  VL D                 +GE+K T + LQLAD+S+  P GI+E+VL++V KF  P D  V+DM E+  +P+ILGR FLAT
Subjt:  GVF---------LFLAVLLD-----------------IGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLAT

Query:  GRVIIDIECRELTVRVKNEKEIFEAVEDSK
        GR +ID++  ELT+RV  E+ +F   +  K
Subjt:  GRVIIDIECRELTVRVKNEKEIFEAVEDSK

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]1.2e-3838.39Show/hide
Query:  INSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQ-EEAEKEPEFEDYDTPT--------GKAEED-TSPDEAEKPE-----
        I +  +   AAIKNIE Q+GQL + ++   +G   +  E    E CKAIT+   +E E+ P  E   TPT         K EED    D  E+ +     
Subjt:  INSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQ-EEAEKEPEFEDYDTPT--------GKAEED-TSPDEAEKPE-----

Query:  ---PEPPIPSPTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQG
             PPI +P L  P+  +K+K  K    QF KF++ F  ++INIPFA+ALE M  Y +F+K+ ++KKR+ +  +TV L+  CS  +Q K+P+K+ D G
Subjt:  ---PEPPIPSPTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQG

Query:  VF---------LFLAVLLDIG-----------------EIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATG
         F          F  VL D+G                 E+K T + LQLAD+S+  P GI+E+VL++V KF  P D  V+DM E+  +P+ILGR FLATG
Subjt:  VF---------LFLAVLLDIG-----------------EIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATG

Query:  RVIIDIECRELTVRVKNEKEIFE
        R +ID++  ELT+RV  E+ +F+
Subjt:  RVIIDIECRELTVRVKNEKEIFE

XP_016460253.1 PREDICTED: uncharacterized protein LOC107783747 [Nicotiana tabacum]7.6e-3636.56Show/hide
Query:  FIIAINSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQEEAEKEP---------EFEDYDTPTGKAEEDTSPD--EAEKPE
        FI A +  V   ++AI N+E Q+ QL +++S   K    +  EK   E+ KAI++   +   +P         E E  +    K + +   +  E EK  
Subjt:  FIIAINSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQEEAEKEP---------EFEDYDTPTGKAEEDTSPD--EAEKPE

Query:  PEPPIPSPTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQGVFL
         E  + +    VP    +K K+KN   QF KF++    L INIPF EAL +M  Y +F+KE L+ KRK + V  V L   CS  +Q+K+P+K+ D   F 
Subjt:  PEPPIPSPTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQGVFL

Query:  FLAVL--------------------------LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATGRVI
            +                          L++GE+K T V LQLADQS  RP GI+ENVL+RV KF  P+D  V++M EN  +P+ILGR FLATGR I
Subjt:  FLAVL--------------------------LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATGRVI

Query:  IDIECRELTVRVKNEKEIFE
        ID+   +L +RV  E+ IF+
Subjt:  IDIECRELTVRVKNEKEIFE

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]4.0e-3736Show/hide
Query:  INSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQ-EEAEKEPEFEDYDTPT--------GKAEEDTSPDEAEKPEPEPPIP
        I +  +   A +KN+E Q+GQL + ++   +G   +  E    E CKAIT+    E E+ P  E   TPT         K EE+   ++  +    PP  
Subjt:  INSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQ-EEAEKEPEFEDYDTPT--------GKAEEDTSPDEAEKPEPEPPIP

Query:  S-----PTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQGVF--
        S     P L  P    ++ +K+    QF KF++ F  ++INIPFA+ALE M  Y +F+K+ ++KKR+ +  +TV L+  CS  +Q K+P+K+ D G F  
Subjt:  S-----PTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQGVF--

Query:  -------LFLAVLLD-----------------IGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATGRVII
                F  VL D                 +GE+K T + LQLAD+S+  P GI+E+VL++V KF  P D  V+DM E+  +P+ILGR FLATGR ++
Subjt:  -------LFLAVLLD-----------------IGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATGRVII

Query:  DIECRELTVRVKNEKEIFEAVEDSK
        D++  ELT+RV  E+  F   E  K
Subjt:  DIECRELTVRVKNEKEIFEAVEDSK

TrEMBL top hitse value%identityAlignment
A0A1S3Z766 Reverse transcriptase3.7e-3636.56Show/hide
Query:  FIIAINSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQEEAEKEP---------EFEDYDTPTGKAEEDTSPD--EAEKPE
        FI A +  V   ++AI N+E Q+ QL +++S   K    +  EK   E+ KAI++   +   +P         E E  +    K + +   +  E EK  
Subjt:  FIIAINSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQEEAEKEP---------EFEDYDTPTGKAEEDTSPD--EAEKPE

Query:  PEPPIPSPTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQGVFL
         E  + +    VP    +K K+KN   QF KF++    L INIPF EAL +M  Y +F+KE L+ KRK + V  V L   CS  +Q+K+P+K+ D   F 
Subjt:  PEPPIPSPTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQGVFL

Query:  FLAVL--------------------------LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATGRVI
            +                          L++GE+K T V LQLADQS  RP GI+ENVL+RV KF  P+D  V++M EN  +P+ILGR FLATGR I
Subjt:  FLAVL--------------------------LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATGRVI

Query:  IDIECRELTVRVKNEKEIFE
        ID+   +L +RV  E+ IF+
Subjt:  IDIECRELTVRVKNEKEIFE

A0A1U7VVH6 uncharacterized protein LOC1042210975.9e-3435.19Show/hide
Query:  FIIAINSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQ---------------EEAEKEPEFEDYDTPTGKAEEDTSPDEA
        FI A +  V   ++AIKN+E Q+ QL +++S   +G   +  EK   E+ K+I++                 +E EK  E E+   P  +  +  +  E 
Subjt:  FIIAINSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQ---------------EEAEKEPEFEDYDTPTGKAEEDTSPDEA

Query:  EKPEPEPPIPSPTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQ
        E       +P P    P++ +++K  K    QF KF++    L INIPF +AL +M  Y +F+KE L+ KRK ++V  V L   CS  +Q+K+P+K+ D 
Subjt:  EKPEPEPPIPSPTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQ

Query:  GVFLFLAVL--------------------------LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLAT
        G F+    +                          L++GE+K T V LQLADQS  R  GI+EN+L+RV KF  P+D  V++M EN  +P+ILGR FLAT
Subjt:  GVFLFLAVL--------------------------LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLAT

Query:  GRVIIDIECRELTVRVKNEKEIFE
        GR IID+   +L +RV  E+ IF+
Subjt:  GRVIIDIECRELTVRVKNEKEIFE

A0A5N6LUB5 Retrotrans_gag domain-containing protein1.3e-3333.65Show/hide
Query:  VNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQEEAEKEPEFEDYDTPTGKAE-------EDTSPDEAEK-PEPEP-PIPSPTL
        +    +AI+ IE Q+GQ+  +++   KGK  +  E    E+CKA+T+   +  K  +      P  + E       ++T  D   K P  EP  +  PT+
Subjt:  VNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQEEAEKEPEFEDYDTPTGKAE-------EDTSPDEAEK-PEPEP-PIPSPTL

Query:  MVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQGVFLFLAVL-----
          P     + K +N +  + KF++ F  L+IN+PF EAL +M +Y +F+K+ L  K+K + +  V L   CS  +Q+K+PEK+ D G F    ++     
Subjt:  MVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQGVFLFLAVL-----

Query:  ---------------------LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATGRVIIDIECRELTV
                             LD+GE K T + +QLAD+SV  P GIVEN+L+++GKF  P+D  ++DM E+ ++P+ILGR FLAT R ++D+   +LT+
Subjt:  ---------------------LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATGRVIIDIECRELTV

Query:  RVKNEKEIFEAVEDS
        RV  E+ +F  ++DS
Subjt:  RVKNEKEIFEAVEDS

A0A5N6N9T2 Reverse transcriptase5.9e-3433.65Show/hide
Query:  VNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQEEAEKEPEFEDYDTPTGKAE-------EDTSPDEAEK-PEPEP-PIPSPTL
        +    + I+ IE Q+GQ+  +++   KGK  +  E    E+CKA+T+   +  K  +      PT + E       ++T  D   K P  EP  +  PT+
Subjt:  VNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQEEAEKEPEFEDYDTPTGKAE-------EDTSPDEAEK-PEPEP-PIPSPTL

Query:  MVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQGVFLFLAVL-----
          P   + +K +K+    + KF++ F  L+IN+PF EAL +M +Y +F+K++L  K+K + +  V L   CS  +Q+K+PEK+ D G F    ++     
Subjt:  MVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQGVFLFLAVL-----

Query:  ---------------------LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATGRVIIDIECRELTV
                             LD+GE K T + +QLAD+SV  P GIVEN+L+++GKF  P+D  ++DM E+ ++P+ILGR FLAT R ++D+   +LT+
Subjt:  ---------------------LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATGRVIIDIECRELTV

Query:  RVKNEKEIFEAVEDS
        RV  E+ +F  ++DS
Subjt:  RVKNEKEIFEAVEDS

A0A6P4CAL3 uncharacterized protein LOC1074723983.4e-3435.52Show/hide
Query:  HNAAIKNIETQLGQL-------VSVVSTMNKGKALAEQEKTQMEYCKAIT-----------VHQEEAEKEPEFEDYDTPTGKAEEDTSPDEAEKPEP---
        H A IKNIE Q+GQL        +V+ +         ++  + E CKAIT           + QEE  KE   E+      + E DT  D+  K E    
Subjt:  HNAAIKNIETQLGQL-------VSVVSTMNKGKALAEQEKTQMEYCKAIT-----------VHQEEAEKEPEFEDYDTPTGKAEEDTSPDEAEKPEP---

Query:  -EPPIPSPTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEMLQ-YNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQGVF-
         +P +P P     + KEK         Q+ KF+  F  L+INIPF EALE +  Y +FMKE L KKR  K   TV +   CS  +Q K+P+K+ D G F 
Subjt:  -EPPIPSPTLMVPKEKEKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEMLQ-YNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQGVF-

Query:  -------------------------LFLAVLLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATGRVI
                                 L L   L I E+K T + LQ+AD+S+ + +G+VENVL++V KFFLP+D  ++D+ E+ + P+I+GR FLAT R +
Subjt:  -------------------------LFLAVLLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATGRVI

Query:  IDIECRELTVRVKNEK---EIFEAVEDSKGQSEVL
        ID+E  EL +RV +E     +F+ ++DS  + E +
Subjt:  IDIECRELTVRVKNEK---EIFEAVEDSKGQSEVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCATCATTGCCATCAACTCAACAGTGAATGGCCACAATGCTGCCATCAAGAACATTGAGACTCAGCTAGGACAATTGGTAAGTGTTGTAAGCACCATGAATAAAGG
TAAGGCCCTAGCTGAGCAGGAGAAAACCCAGATGGAGTACTGTAAAGCAATCACTGTGCACCAGGAGGAAGCTGAAAAGGAGCCTGAGTTTGAGGATTATGACACGCCTA
CAGGGAAAGCTGAGGAGGACACATCACCAGATGAGGCTGAAAAGCCTGAACCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAAGAAAAGGAAAAAAAAAAG
AAGAAAAAGAACAATCAGGTTCAGTTTGACAAGTTTATGAATGCTTTTATGAATCTGAACATTAATATTCCTTTTGCAGAGGCATTAGAGATGCTCCAGTACAACAGGTT
CATGAAGGAATGGTTAGCAAAGAAGCGAAAGGAAAAGAGGGTTGACACCGTATATCTCGCTTCCACATGCAGCACCAGAGTACAGCATAAGGTACCTGAAAAAGTAGCAG
ACCAGGGAGTTTTTCTGTTCCTTGCAGTTTTGTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTTAGACCAGTTGGTATT
GTAGAAAATGTTTTAATCAGAGTAGGTAAATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGGAAGAACATTCCT
CGCTACTGGGCGAGTGATTATTGATATTGAGTGCAGGGAGCTCACTGTGAGAGTCAAGAATGAAAAAGAAATATTTGAAGCAGTTGAAGACTCTAAGGGACAATCTGAAG
TGCTTTTCATGGGCTACAAGAAAGGTGCAAGAAAGAGCACCTCTGTTGGATTCACAGAAAAGAAGCCTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCATCATTGCCATCAACTCAACAGTGAATGGCCACAATGCTGCCATCAAGAACATTGAGACTCAGCTAGGACAATTGGTAAGTGTTGTAAGCACCATGAATAAAGG
TAAGGCCCTAGCTGAGCAGGAGAAAACCCAGATGGAGTACTGTAAAGCAATCACTGTGCACCAGGAGGAAGCTGAAAAGGAGCCTGAGTTTGAGGATTATGACACGCCTA
CAGGGAAAGCTGAGGAGGACACATCACCAGATGAGGCTGAAAAGCCTGAACCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAAGAAAAGGAAAAAAAAAAG
AAGAAAAAGAACAATCAGGTTCAGTTTGACAAGTTTATGAATGCTTTTATGAATCTGAACATTAATATTCCTTTTGCAGAGGCATTAGAGATGCTCCAGTACAACAGGTT
CATGAAGGAATGGTTAGCAAAGAAGCGAAAGGAAAAGAGGGTTGACACCGTATATCTCGCTTCCACATGCAGCACCAGAGTACAGCATAAGGTACCTGAAAAAGTAGCAG
ACCAGGGAGTTTTTCTGTTCCTTGCAGTTTTGTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTTAGACCAGTTGGTATT
GTAGAAAATGTTTTAATCAGAGTAGGTAAATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGGAAGAACATTCCT
CGCTACTGGGCGAGTGATTATTGATATTGAGTGCAGGGAGCTCACTGTGAGAGTCAAGAATGAAAAAGAAATATTTGAAGCAGTTGAAGACTCTAAGGGACAATCTGAAG
TGCTTTTCATGGGCTACAAGAAAGGTGCAAGAAAGAGCACCTCTGTTGGATTCACAGAAAAGAAGCCTCCTTGA
Protein sequenceShow/hide protein sequence
MFIIAINSTVNGHNAAIKNIETQLGQLVSVVSTMNKGKALAEQEKTQMEYCKAITVHQEEAEKEPEFEDYDTPTGKAEEDTSPDEAEKPEPEPPIPSPTLMVPKEKEKKK
KKKNNQVQFDKFMNAFMNLNINIPFAEALEMLQYNRFMKEWLAKKRKEKRVDTVYLASTCSTRVQHKVPEKVADQGVFLFLAVLLDIGEIKSTPVKLQLADQSVVRPVGI
VENVLIRVGKFFLPIDLYVMDMIENPSMPVILGRTFLATGRVIIDIECRELTVRVKNEKEIFEAVEDSKGQSEVLFMGYKKGARKSTSVGFTEKKPP