; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028284 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028284
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr8:17575329..17576279
RNA-Seq ExpressionLag0028284
SyntenyLag0028284
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]8.6e-4035.54Show/hide
Query:  MASLTNSLNKLTSSEV---VKFISTLAEGYSKKEGQDVEEVQYVGNRPF---AQGVPNFYHPSLRNHENFSYLNTKNVLQPL--PGFASSNVEKKNNLEE
        +A+L++ ++ LT+  +    +++++ +      E    E+VQYV NR +      +PN+YHP LRNHEN SY NTKNVLQP   PGF S   EKK +LE+
Subjt:  MASLTNSLNKLTSSEV---VKFISTLAEGYSKKEGQDVEEVQYVGNRPF---AQGVPNFYHPSLRNHENFSYLNTKNVLQPL--PGFASSNVEKKNNLEE

Query:  MVALFIKEQRVLNV-------NLQTTVNNHDTALKNMEVQIGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAE
         +  F++E             N++T  +N    +KN+EVQIGQ+ + +N  Q+G  PS+ E NP+EQCK +TLRSG+++E    K               
Subjt:  MVALFIKEQRVLNV-------NLQTTVNNHDTALKNMEVQIGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAE

Query:  AQKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDRHKKKLNEEEVVP--CNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEM
              E    P  +N+          G+ KD+ +++       EE  +P   +  D    ++PP             LP+PQRF+  KLD+QF KFL++
Subjt:  AQKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDRHKKKLNEEEVVP--CNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEM

Query:  FKKLSVNIPLVDALYNMPNYGKFMKEMLSRKK
        FKK+ +NIP  DAL  MPNY KF+K+++S+K+
Subjt:  FKKLSVNIPLVDALYNMPNYGKFMKEMLSRKK

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]8.6e-4035.54Show/hide
Query:  MASLTNSLNKLTSSEV---VKFISTLAEGYSKKEGQDVEEVQYVGNRPF---AQGVPNFYHPSLRNHENFSYLNTKNVLQPL--PGFASSNVEKKNNLEE
        +A+L++ ++ LT+  +    +++++ +      E    E+VQYV NR +      +PN+YHP LRNHEN SY NTKNVLQP   PGF S   EKK +LE+
Subjt:  MASLTNSLNKLTSSEV---VKFISTLAEGYSKKEGQDVEEVQYVGNRPF---AQGVPNFYHPSLRNHENFSYLNTKNVLQPL--PGFASSNVEKKNNLEE

Query:  MVALFIKEQRVLNV-------NLQTTVNNHDTALKNMEVQIGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAE
         +  F++E             N++T  +N    +KN+EVQIGQ+ + +N  Q+G  PS+ E NP+EQCK +TLRSG+++E    K               
Subjt:  MVALFIKEQRVLNV-------NLQTTVNNHDTALKNMEVQIGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAE

Query:  AQKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDRHKKKLNEEEVVP--CNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEM
              E    P  +N+          G+ KD+ +++       EE  +P   +  D    ++PP             LP+PQRF+  KLD+QF KFL++
Subjt:  AQKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDRHKKKLNEEEVVP--CNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEM

Query:  FKKLSVNIPLVDALYNMPNYGKFMKEMLSRKK
        FKK+ +NIP  DAL  MPNY KF+K+++S+K+
Subjt:  FKKLSVNIPLVDALYNMPNYGKFMKEMLSRKK

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]4.3e-3935.15Show/hide
Query:  MASLTNSLNKLTSSEVVKFISTLAEGYSKKEGQDV--EEVQYVGNRPF---AQGVPNFYHPSLRNHENFSYLNTKNVLQPL--PGFASSNVEKKNNLEEM
        +A+L++ ++ LT+  + +    LA         +   E+VQYV NR +      +PN+YHP LRNHEN SY NTKNVLQP   PGF S   E+K +LE+ 
Subjt:  MASLTNSLNKLTSSEVVKFISTLAEGYSKKEGQDV--EEVQYVGNRPF---AQGVPNFYHPSLRNHENFSYLNTKNVLQPL--PGFASSNVEKKNNLEEM

Query:  VALFIKEQRVLNV-------NLQTTVNNHDTALKNMEVQIGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAEA
        +  F++E             N++T  +N   A+KN+EVQIGQ+ + +N  Q+G  PS+ E NP+EQCK +TLRSG+++E +  K+ +   +         
Subjt:  VALFIKEQRVLNV-------NLQTTVNNHDTALKNMEVQIGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAEA

Query:  QKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDRHKKKLNEEEVVP-CNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFK
                             + N    K   ++D+     L E +  P  +  D    ++PP             LP+PQRF+  KLD+QF KFL++FK
Subjt:  QKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDRHKKKLNEEEVVP-CNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFK

Query:  KLSVNIPLVDALYNMPNYGKFMKEMLSRKK
        K+ +NIP  DAL  MPNY KF+K+++S+K+
Subjt:  KLSVNIPLVDALYNMPNYGKFMKEMLSRKK

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]4.4e-4436.97Show/hide
Query:  MASLTNSLNKLTSSEV---VKFISTLAEGYSKKEGQDVEEVQYVGNRPF---AQGVPNFYHPSLRNHENFSYLNTKNVLQPLPGFASSNVEKKNNLEEMV
        +ASL++ ++ LT+  +    ++++  +      E    E+VQY+ NR +      +PN+YHP LRNHENFSY NTKNVLQP PGF S   EKK +LE+ +
Subjt:  MASLTNSLNKLTSSEV---VKFISTLAEGYSKKEGQDVEEVQYVGNRPF---AQGVPNFYHPSLRNHENFSYLNTKNVLQPLPGFASSNVEKKNNLEEMV

Query:  ALFIKEQRVLNV-------NLQTTVNNHDTALKNMEVQIGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAEAQ
          F++E +           N++T  +N    +KN+EVQIGQ+ + +N  Q+G  PS+ E NP+EQCK +TLRSGR++E +  K+                
Subjt:  ALFIKEQRVLNV-------NLQTTVNNHDTALKNMEVQIGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAEAQ

Query:  KISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDRHKKKLNEEEVV--PCNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFK
           +E +   PN+                      + K K+ EEE+V       D    IS P    + P      LP+PQRF+  KLD+QF KFL++FK
Subjt:  KISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDRHKKKLNEEEVV--PCNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFK

Query:  KLSVNIPLVDALYNMPNYGKFMKEMLSRKK
        K+ +NIP  DAL  MPNY KF+K+++S+K+
Subjt:  KLSVNIPLVDALYNMPNYGKFMKEMLSRKK

XP_024032903.1 uncharacterized protein LOC112095347 [Morus notabilis]9.5e-3939.1Show/hide
Query:  EEVQYVGNRPF---AQGVPNFYHPSLRNHENFSYLNTKNVLQPLPGFASSNVEKKNNLEEMVALFIKEQR-------VLNVNLQTTVNNHDTALKNMEVQ
        E+VQ+V NR F      +PN YHP LRNHENFSY N +NVLQP  GF    VEKK ++E++++ FI E R           N++T  NN +  +K++EVQ
Subjt:  EEVQYVGNRPF---AQGVPNFYHPSLRNHENFSYLNTKNVLQPLPGFASSNVEKKNNLEEMVALFIKEQR-------VLNVNLQTTVNNHDTALKNMEVQ

Query:  IGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAEAQKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDRHK
        IGQ+ + V     GK PSD EPNP++ CK +TLRSG+++E     K +E+K+++E+   E  K S+      P S      F +N               
Subjt:  IGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAEAQKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDRHK

Query:  KKLNEEEVVPCNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVDALYNMPNYGKFMKEMLSRK
                                     P      LP+PQRFK  KLD QF KFLE+FKK+ +NIP  DAL  MPNY KFMK+++++K
Subjt:  KKLNEEEVVPCNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVDALYNMPNYGKFMKEMLSRK

TrEMBL top hitse value%identityAlignment
A0A2P5AMA4 Uncharacterized protein1.3e-3032.22Show/hide
Query:  SLTNSLNKLTSSEVVKFISTLAEGYSK--KEGQDVEEVQYVGNRPF-------AQGVPNFYHPSLRNHENFSYLNTKNVLQPLPGFASSNVEKKNNLEEM
        +LT+ ++ LT+ E       +    +    +G D+E+V Y+ N  F       +  +P  YHP LRNHENFSY N +NVLQP  GF     EKK +L+++
Subjt:  SLTNSLNKLTSSEVVKFISTLAEGYSK--KEGQDVEEVQYVGNRPF-------AQGVPNFYHPSLRNHENFSYLNTKNVLQPLPGFASSNVEKKNNLEEM

Query:  VALFIKEQR-------VLNVNLQTTVNNHDTALKNMEVQIGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAEA
        ++ FI E +           N++T  NN +  +K++EVQI Q+ +++      K PSD E NP++ CK +TLRS +++E   +K    +K+K++     +
Subjt:  VALFIKEQR-------VLNVNLQTTVNNHDTALKNMEVQIGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAEA

Query:  QKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDRHKKKLNEEEVVPCNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFKK
        + IS   S   P++  L                                         I+ P             L +PQRF+  KLD QF KF+E+FKK
Subjt:  QKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDRHKKKLNEEEVVPCNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFKK

Query:  LSVNIPLVDALYNMPNYGKFMKEMLSRKK
        L +NIP  DAL  M NY KFMK+++S+K+
Subjt:  LSVNIPLVDALYNMPNYGKFMKEMLSRKK

A0A2P5BPI6 Uncharacterized protein5.8e-3437.73Show/hide
Query:  VPNFYHPSLRNHENFSYLNTKNVLQPLPGFASSNVEKKNNLEEMVALFIKE-QRVLNV------NLQTTVNNHDTALKNMEVQIGQITSAVNTLQKGKLP
        +P  Y+P LRNHENFSY NT+NVLQP PGF     EKK++LE++++ FI E +R  N       N++T  NN +  +K++EVQIGQ+ +++   Q GK P
Subjt:  VPNFYHPSLRNHENFSYLNTKNVLQPLPGFASSNVEKKNNLEEMVALFIKE-QRVLNV------NLQTTVNNHDTALKNMEVQIGQITSAVNTLQKGKLP

Query:  SDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAEAQKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDRHKKKLNEEEVVPCNHHDKG
        SD E NP++ CK +TLR+G+++E +  K+   + +K E+   E  K+ S+ S         +  F +N                                
Subjt:  SDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAEAQKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDRHKKKLNEEEVVPCNHHDKG

Query:  SHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVDALYNMPNYGKFMKEMLSRKK
                    P      LP+PQRF+  KLD QF K LE+FKKL +NIP  DAL  MPNY KFMK+++S+K+
Subjt:  SHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVDALYNMPNYGKFMKEMLSRKK

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129452.2e-2533.22Show/hide
Query:  EEVQYVGNRPFAQGVP--NFYHPSLRNHENFSYLNTKNVLQPL----PGFASSN----VEKKNNLEEMVALFIKEQRVLNVNLQTTVNNHDTALKNMEVQ
        E VQ+VGN    Q  P  N Y+P  RNH NFS+ N      P     PGF         EKK+ LEE++  +I +   +       + +   +L+N+E Q
Subjt:  EEVQYVGNRPFAQGVP--NFYHPSLRNHENFSYLNTKNVLQPL----PGFASSN----VEKKNNLEEMVALFIKEQRVLNVNLQTTVNNHDTALKNMEVQ

Query:  IGQITSAVNTLQKGKLPSDIEPNP--REQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAEAQKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDR
        +GQ+ +++N   +G LPSD + NP  +EQC+ +TLRSG+++E  ++K  E          +E + +  EG           C+  N    ++KD   DD+
Subjt:  IGQITSAVNTLQKGKLPSDIEPNP--REQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAEAQKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDDR

Query:  HKKKLNEEEVVPCNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVDALYNMPNYGKFMKEMLSRKK
         + +   + + P           PP              PFPQR +  KL++QFQKFL +FKKL +NIP  +AL  MP+Y KF+K++LS+K+
Subjt:  HKKKLNEEEVVPCNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVDALYNMPNYGKFMKEMLSRKK

A0A6P5R7L0 Reverse transcriptase1.3e-2533.11Show/hide
Query:  YSKKEGQDVEEVQYVGNRPFAQGVPNFYHPSLRNHENFSYLNTKNVLQPLPGFASSNVEKKNNLEEMVALFIKEQRVLNVNLQTTVNNHDTALKNMEVQI
        + +++   V   +  GN PF+    N Y+P  + H NFS+ N +NV +P PGF     EKKNNLE+++A             +TT+ N   +++N+EVQ+
Subjt:  YSKKEGQDVEEVQYVGNRPFAQGVPNFYHPSLRNHENFSYLNTKNVLQPLPGFASSNVEKKNNLEEMVALFIKEQRVLNVNLQTTVNNHDTALKNMEVQI

Query:  GQITSAVNTLQKGKLPSDIEPNPR--EQCKMVTLRSGRQLET--NSEKKKEEEKSKDEDERAEAQKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDD
        GQ+ SA+   ++G LPS  E NP+  E  K  TLR+GR ++T  + + +K  ++ +D  +  E  + S+E S     +  +N            +EK ++
Subjt:  GQITSAVNTLQKGKLPSDIEPNPR--EQCKMVTLRSGRQLET--NSEKKKEEEKSKDEDERAEAQKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQDD

Query:  RHKKKLNEEEVVPCNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVDALYNMPNYGKFMKEMLSRKK
        +    L  +  VP                          +PFPQR K  K D QF KFLE+FKKL + IP  +AL  M NYGKF+K++LS+K+
Subjt:  RHKKKLNEEEVVPCNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVDALYNMPNYGKFMKEMLSRKK

A0A6P9DWY0 uncharacterized protein LOC1183440261.4e-2742.13Show/hide
Query:  MASLTNSLNKLTSSEV---VKFISTLAEGYSKKEGQDVEEVQYVGNRPF---AQGVPNFYHPSLRNHENFSYLNTKNVLQPLPGFASSNVEKKNNLEEMV
        +ASL++ ++ LT+  +    ++++  +      E    E+VQY+ NR +      +PN+YH  LRNHEN SY NTKNVLQP PGF S   EKK +LE+ +
Subjt:  MASLTNSLNKLTSSEV---VKFISTLAEGYSKKEGQDVEEVQYVGNRPF---AQGVPNFYHPSLRNHENFSYLNTKNVLQPLPGFASSNVEKKNNLEEMV

Query:  ALFIKEQRVLNV-------NLQTTVNNHDTALKNMEVQIGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLE
          F++E             N++T  +N   A+KN+EVQIGQ+ + +N  Q+G  PS+ E NPREQCK +TLRSGR+L+
Subjt:  ALFIKEQRVLNV-------NLQTTVNNHDTALKNMEVQIGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCGCTGACCAACTCCCTGAACAAGTTGACTTCATCTGAGGTGGTTAAATTCATTTCCACCTTAGCTGAAGGTTATTCAAAGAAGGAAGGTCAAGATGTGGAGGA
AGTCCAGTACGTGGGAAATAGACCATTTGCTCAAGGAGTACCGAACTTCTACCACCCCAGTCTGCGCAATCACGAGAACTTCTCATATTTAAATACGAAGAATGTTTTGC
AGCCGCTGCCAGGTTTTGCATCATCCAATGTTGAAAAGAAGAATAATCTGGAGGAGATGGTGGCTCTGTTTATTAAAGAACAAAGAGTGTTGAATGTGAATCTCCAGACG
ACAGTTAATAACCACGACACAGCTCTGAAAAACATGGAAGTTCAAATAGGACAGATCACTTCAGCAGTGAACACCCTTCAAAAAGGAAAACTTCCAAGCGACATTGAACC
TAACCCCAGAGAGCAGTGCAAGATGGTGACACTGAGAAGTGGTAGACAGCTGGAGACCAATTCAGAAAAGAAGAAGGAAGAAGAGAAGAGCAAGGATGAAGATGAAAGGG
CTGAGGCACAAAAAATCTCCTCTGAAGGGTCCCAACATCCTCCTAACTCTAATGATTTAAATTGTGATTTTTCTAACAATTTTGCAGGAAAGAAGAAAGATGAAAAGCAA
GATGACAGGCATAAAAAGAAACTGAACGAGGAAGAAGTGGTTCCATGCAACCATCATGACAAAGGCTCGCACATCAGCCCGCCCAAGCGAAGGGGCGAATGTCCAACCTT
TGACTACAGGGAGTTACCTTTTCCTCAACGCTTTAAAAATGTTAAATTAGATGAGCAGTTTCAAAAGTTCCTAGAAATGTTTAAGAAGTTGTCTGTGAATATTCCCTTAG
TCGATGCCTTGTATAACATGCCTAATTATGGGAAATTCATGAAAGAAATGCTTTCTAGGAAAAAACTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCATCGCTGACCAACTCCCTGAACAAGTTGACTTCATCTGAGGTGGTTAAATTCATTTCCACCTTAGCTGAAGGTTATTCAAAGAAGGAAGGTCAAGATGTGGAGGA
AGTCCAGTACGTGGGAAATAGACCATTTGCTCAAGGAGTACCGAACTTCTACCACCCCAGTCTGCGCAATCACGAGAACTTCTCATATTTAAATACGAAGAATGTTTTGC
AGCCGCTGCCAGGTTTTGCATCATCCAATGTTGAAAAGAAGAATAATCTGGAGGAGATGGTGGCTCTGTTTATTAAAGAACAAAGAGTGTTGAATGTGAATCTCCAGACG
ACAGTTAATAACCACGACACAGCTCTGAAAAACATGGAAGTTCAAATAGGACAGATCACTTCAGCAGTGAACACCCTTCAAAAAGGAAAACTTCCAAGCGACATTGAACC
TAACCCCAGAGAGCAGTGCAAGATGGTGACACTGAGAAGTGGTAGACAGCTGGAGACCAATTCAGAAAAGAAGAAGGAAGAAGAGAAGAGCAAGGATGAAGATGAAAGGG
CTGAGGCACAAAAAATCTCCTCTGAAGGGTCCCAACATCCTCCTAACTCTAATGATTTAAATTGTGATTTTTCTAACAATTTTGCAGGAAAGAAGAAAGATGAAAAGCAA
GATGACAGGCATAAAAAGAAACTGAACGAGGAAGAAGTGGTTCCATGCAACCATCATGACAAAGGCTCGCACATCAGCCCGCCCAAGCGAAGGGGCGAATGTCCAACCTT
TGACTACAGGGAGTTACCTTTTCCTCAACGCTTTAAAAATGTTAAATTAGATGAGCAGTTTCAAAAGTTCCTAGAAATGTTTAAGAAGTTGTCTGTGAATATTCCCTTAG
TCGATGCCTTGTATAACATGCCTAATTATGGGAAATTCATGAAAGAAATGCTTTCTAGGAAAAAACTTTGA
Protein sequenceShow/hide protein sequence
MASLTNSLNKLTSSEVVKFISTLAEGYSKKEGQDVEEVQYVGNRPFAQGVPNFYHPSLRNHENFSYLNTKNVLQPLPGFASSNVEKKNNLEEMVALFIKEQRVLNVNLQT
TVNNHDTALKNMEVQIGQITSAVNTLQKGKLPSDIEPNPREQCKMVTLRSGRQLETNSEKKKEEEKSKDEDERAEAQKISSEGSQHPPNSNDLNCDFSNNFAGKKKDEKQ
DDRHKKKLNEEEVVPCNHHDKGSHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFKKLSVNIPLVDALYNMPNYGKFMKEMLSRKKL