; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019930 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019930
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationscaffold5:28991237..28994556
RNA-Seq ExpressionSpg019930
SyntenySpg019930
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043405.1 hypothetical protein E6C27_scaffold1639G00040 [Cucumis melo var. makuwa]1.8e-2352.71Show/hide
Query:  ARKEAVEDVNTSDLKKGEASTSLAKPKVVKDEKCSNSAILRYVPLSRCKKGESPFTECSESIK-------------KKLLKEGYSLPTTRKGLGYKSPEP
        A KE  + + T   +KGEASTS  K  ++ DEK SN  ILRYVPLSR +KGESPF +  + +K             KKLL+EG+ +P +RKG GYKSPEP
Subjt:  ARKEAVEDVNTSDLKKGEASTSLAKPKVVKDEKCSNSAILRYVPLSRCKKGESPFTECSESIK-------------KKLLKEGYSLPTTRKGLGYKSPEP

Query:  VRIIRKGKAKVADTNHITVEEVDDSKEKK
        + IIRK K KV D+NHITV EVD  +EK+
Subjt:  VRIIRKGKAKVADTNHITVEEVDDSKEKK

KAA0056780.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.3e-2237.89Show/hide
Query:  ERKLNLLMKAVDERDLEIAYLKNQ-------LQNRETLIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLFRDNKR
        E+K+ L ++ V + +   A + ++        + RE+L+QFGT +P+VV+F +E   +  QEK  SI+ ++EGWT+V RRKK+K +  +KE R +R+ +R
Subjt:  ERKLNLLMKAVDERDLEIAYLKNQ-------LQNRETLIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLFRDNKR

Query:  KAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPITLAEYFPRRFLDDSQGKALETVTCHIVDVVEDDDVPASS---SGTVADARKEAVEDV
          K+QK K KKK+++ K I EE +DF    + +TLA +FP RFL D Q +  E V CH +D +E++ +P  S    G   D  +  V+D+
Subjt:  KAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPITLAEYFPRRFLDDSQGKALETVTCHIVDVVEDDDVPASS---SGTVADARKEAVEDV

KAA0061113.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.6e-2728.54Show/hide
Query:  RETLIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLFRDNKRKAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPIT
        +E +++      I +  ++E T + SQEK   I++++EGWT+V RRKK+K +  +KESRL+ + +R  K+QK K KKK+++ K + E+ +DF    + +T
Subjt:  RETLIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLFRDNKRKAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPIT

Query:  LAEYFPRRFLDDSQGKALETVT-----------------CHIVDVVEDD--------DVPASSSGTVADARKEAV-----------------------ED
        LA++FP RFL D Q +    V                  C  +D  ++D        + P   SG V + R E +                       E+
Subjt:  LAEYFPRRFLDDSQGKALETVT-----------------CHIVDVVEDD--------DVPASSSGTVADARKEAV-----------------------ED

Query:  VNTSDL------------------------------------------------------------KKGEASTSLAKPKVVKDEKCSNSAILRYVPLSRC
        ++ S L                                                             K EASTS AK  +V DEK SN  ILRYVPLSR 
Subjt:  VNTSDL------------------------------------------------------------KKGEASTSLAKPKVVKDEKCSNSAILRYVPLSRC

Query:  KKGESPFTECSESIK-----------------------------------------------------------------------------KKLLKEGY
        KKGESPF E  + +K                                                                             KKLL+EG+
Subjt:  KKGESPFTECSESIK-----------------------------------------------------------------------------KKLLKEGY

Query:  SLPTTRKGLGYKSPEPVRIIRKGKAKVADTNHITVEEVDDSKEKK
        ++P +RKGLGYK PEP+RI RKGK KV D+NHITV+EVD  +EK+
Subjt:  SLPTTRKGLGYKSPEPVRIIRKGKAKVADTNHITVEEVDDSKEKK

KAA0061113.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]8.2e-1352.22Show/hide
Query:  VASRISKLIEESSKDRVAVKDNPLFEYVIPTSKQSKDTLNPDVMSVMMADVDQDERMAEMERKLNLLMKAVDERDLEIAYLKNQLQNRET
        VA  I K + ES K  + +K+NPL++    +S +SK   +PDVMSVMMAD+  +  MAEMERK+N LMKAV+ERD EI  L+ Q++ RET
Subjt:  VASRISKLIEESSKDRVAVKDNPLFEYVIPTSKQSKDTLNPDVMSVMMADVDQDERMAEMERKLNLLMKAVDERDLEIAYLKNQLQNRET

KAA0061113.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]8.4e-2625.83Show/hide
Query:  LIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLFRDNKRKAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPITLAE
        L +FGT +P+VVRF +E   + SQEK   I++++E WT+V RRKK+K +  +KE R +R+ +R  K+QK K KKK+++ K + +E +DF    + ITLA+
Subjt:  LIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLFRDNKRKAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPITLAE

Query:  YFPRRFLDDSQGKALETVTCHIVDVVEDDDVP------------------------------------------ASSSGTV-------------------
        +FP RFL D Q +    V CH ++  E++ +P                                          +SS+ TV                   
Subjt:  YFPRRFLDDSQGKALETVTCHIVDVVEDDDVP------------------------------------------ASSSGTV-------------------

Query:  ----------------------------------------------------------------------------------------ADAR--------
                                                                                                ADA+        
Subjt:  ----------------------------------------------------------------------------------------ADAR--------

Query:  ------------------------KEAVEDVNTSDLKKGEASTSLAKPKVVKDEKCSNSAILRYVPLSRCKKGESPFTECSESIK---------------
                                KE    + T   +K EASTS AK  ++ DEK SN +ILRYVPLSR KKGESPF E  + +K               
Subjt:  ------------------------KEAVEDVNTSDLKKGEASTSLAKPKVVKDEKCSNSAILRYVPLSRCKKGESPFTECSESIK---------------

Query:  --------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRIIRKGKAKVADT
                                                                      KKLL+EG+++P +RKGLGYK PEP+RI RKGK K+ D+
Subjt:  --------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRIIRKGKAKVADT

Query:  NHITVEEVDDSKEKK
        NHITV+EVD  KEK+
Subjt:  NHITVEEVDDSKEKK

KAA0063066.1 uncharacterized protein E6C27_scaffold468G001830 [Cucumis melo var. makuwa]6.7e-3129.09Show/hide
Query:  RMAEMERKLNLLMKAVDERDLEIAYLKNQ-------LQNRETLIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLF
        +  E   K+ L ++ V + +  +  + ++        + RE+L+QFG  +P+VV+F +E   + S+EK  SI++++EGW +V RRKK+K +   K+S L 
Subjt:  RMAEMERKLNLLMKAVDERDLEIAYLKNQ-------LQNRETLIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLF

Query:  RDNKRKAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPITLAEYFPRRFLDDSQGKALETVTCHIVDVVEDDDVPASS---SGTVADARKEAVEDV-----
               K+QK K KK +++ K + EE++DF  P + +TL ++FP RF  D Q +    V CH ++  E++ +P  S    G   D  +  V+D+     
Subjt:  RDNKRKAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPITLAEYFPRRFLDDSQGKALETVTCHIVDVVEDDDVPASS---SGTVADARKEAVEDV-----

Query:  -----------------------------------------------NTSDLKK-------------------------------------GEASTSLAK
                                                       N   +KK                                     GEASTS  K
Subjt:  -----------------------------------------------NTSDLKK-------------------------------------GEASTSLAK

Query:  PKVVKDEKCSNSAILRYVPLSRCKKGESPFTECSESIK----------------------------------------------KKLLKEGYSLPT-TRK
          ++ DEK SN  IL Y+PLSR KKGESPF +  + +K                                              K ++K GY   T T K
Subjt:  PKVVKDEKCSNSAILRYVPLSRCKKGESPFTECSESIK----------------------------------------------KKLLKEGYSLPT-TRK

Query:  GLGYKSPEPVRIIRKGKAKVADTNHITVEEVDDSKEKKVS
        GLGY+SPEP+RIIRKGK KV D NHIT+EEVD  +EK+ S
Subjt:  GLGYKSPEPVRIIRKGKAKVADTNHITVEEVDDSKEKKVS

TrEMBL top hitse value%identityAlignment
A0A5A7TPR5 RNase H domain-containing protein8.5e-2452.71Show/hide
Query:  ARKEAVEDVNTSDLKKGEASTSLAKPKVVKDEKCSNSAILRYVPLSRCKKGESPFTECSESIK-------------KKLLKEGYSLPTTRKGLGYKSPEP
        A KE  + + T   +KGEASTS  K  ++ DEK SN  ILRYVPLSR +KGESPF +  + +K             KKLL+EG+ +P +RKG GYKSPEP
Subjt:  ARKEAVEDVNTSDLKKGEASTSLAKPKVVKDEKCSNSAILRYVPLSRCKKGESPFTECSESIK-------------KKLLKEGYSLPTTRKGLGYKSPEP

Query:  VRIIRKGKAKVADTNHITVEEVDDSKEKK
        + IIRK K KV D+NHITV EVD  +EK+
Subjt:  VRIIRKGKAKVADTNHITVEEVDDSKEKK

A0A5A7V9E7 Uncharacterized protein3.2e-3129.09Show/hide
Query:  RMAEMERKLNLLMKAVDERDLEIAYLKNQ-------LQNRETLIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLF
        +  E   K+ L ++ V + +  +  + ++        + RE+L+QFG  +P+VV+F +E   + S+EK  SI++++EGW +V RRKK+K +   K+S L 
Subjt:  RMAEMERKLNLLMKAVDERDLEIAYLKNQ-------LQNRETLIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLF

Query:  RDNKRKAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPITLAEYFPRRFLDDSQGKALETVTCHIVDVVEDDDVPASS---SGTVADARKEAVEDV-----
               K+QK K KK +++ K + EE++DF  P + +TL ++FP RF  D Q +    V CH ++  E++ +P  S    G   D  +  V+D+     
Subjt:  RDNKRKAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPITLAEYFPRRFLDDSQGKALETVTCHIVDVVEDDDVPASS---SGTVADARKEAVEDV-----

Query:  -----------------------------------------------NTSDLKK-------------------------------------GEASTSLAK
                                                       N   +KK                                     GEASTS  K
Subjt:  -----------------------------------------------NTSDLKK-------------------------------------GEASTSLAK

Query:  PKVVKDEKCSNSAILRYVPLSRCKKGESPFTECSESIK----------------------------------------------KKLLKEGYSLPT-TRK
          ++ DEK SN  IL Y+PLSR KKGESPF +  + +K                                              K ++K GY   T T K
Subjt:  PKVVKDEKCSNSAILRYVPLSRCKKGESPFTECSESIK----------------------------------------------KKLLKEGYSLPT-TRK

Query:  GLGYKSPEPVRIIRKGKAKVADTNHITVEEVDDSKEKKVS
        GLGY+SPEP+RIIRKGK KV D NHIT+EEVD  +EK+ S
Subjt:  GLGYKSPEPVRIIRKGKAKVADTNHITVEEVDDSKEKKVS

A0A5D3BE72 Ty3-gypsy retrotransposon protein1.6e-2237.89Show/hide
Query:  ERKLNLLMKAVDERDLEIAYLKNQ-------LQNRETLIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLFRDNKR
        E+K+ L ++ V + +   A + ++        + RE+L+QFGT +P+VV+F +E   +  QEK  SI+ ++EGWT+V RRKK+K +  +KE R +R+ +R
Subjt:  ERKLNLLMKAVDERDLEIAYLKNQ-------LQNRETLIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLFRDNKR

Query:  KAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPITLAEYFPRRFLDDSQGKALETVTCHIVDVVEDDDVPASS---SGTVADARKEAVEDV
          K+QK K KKK+++ K I EE +DF    + +TLA +FP RFL D Q +  E V CH +D +E++ +P  S    G   D  +  V+D+
Subjt:  KAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPITLAEYFPRRFLDDSQGKALETVTCHIVDVVEDDDVPASS---SGTVADARKEAVEDV

A0A5D3BY54 Ty3-gypsy retrotransposon protein1.3e-2728.54Show/hide
Query:  RETLIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLFRDNKRKAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPIT
        +E +++      I +  ++E T + SQEK   I++++EGWT+V RRKK+K +  +KESRL+ + +R  K+QK K KKK+++ K + E+ +DF    + +T
Subjt:  RETLIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLFRDNKRKAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPIT

Query:  LAEYFPRRFLDDSQGKALETVT-----------------CHIVDVVEDD--------DVPASSSGTVADARKEAV-----------------------ED
        LA++FP RFL D Q +    V                  C  +D  ++D        + P   SG V + R E +                       E+
Subjt:  LAEYFPRRFLDDSQGKALETVT-----------------CHIVDVVEDD--------DVPASSSGTVADARKEAV-----------------------ED

Query:  VNTSDL------------------------------------------------------------KKGEASTSLAKPKVVKDEKCSNSAILRYVPLSRC
        ++ S L                                                             K EASTS AK  +V DEK SN  ILRYVPLSR 
Subjt:  VNTSDL------------------------------------------------------------KKGEASTSLAKPKVVKDEKCSNSAILRYVPLSRC

Query:  KKGESPFTECSESIK-----------------------------------------------------------------------------KKLLKEGY
        KKGESPF E  + +K                                                                             KKLL+EG+
Subjt:  KKGESPFTECSESIK-----------------------------------------------------------------------------KKLLKEGY

Query:  SLPTTRKGLGYKSPEPVRIIRKGKAKVADTNHITVEEVDDSKEKK
        ++P +RKGLGYK PEP+RI RKGK KV D+NHITV+EVD  +EK+
Subjt:  SLPTTRKGLGYKSPEPVRIIRKGKAKVADTNHITVEEVDDSKEKK

A0A5D3BY54 Ty3-gypsy retrotransposon protein4.0e-1352.22Show/hide
Query:  VASRISKLIEESSKDRVAVKDNPLFEYVIPTSKQSKDTLNPDVMSVMMADVDQDERMAEMERKLNLLMKAVDERDLEIAYLKNQLQNRET
        VA  I K + ES K  + +K+NPL++    +S +SK   +PDVMSVMMAD+  +  MAEMERK+N LMKAV+ERD EI  L+ Q++ RET
Subjt:  VASRISKLIEESSKDRVAVKDNPLFEYVIPTSKQSKDTLNPDVMSVMMADVDQDERMAEMERKLNLLMKAVDERDLEIAYLKNQLQNRET

A0A5D3BY54 Ty3-gypsy retrotransposon protein4.1e-2625.83Show/hide
Query:  LIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLFRDNKRKAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPITLAE
        L +FGT +P+VVRF +E   + SQEK   I++++E WT+V RRKK+K +  +KE R +R+ +R  K+QK K KKK+++ K + +E +DF    + ITLA+
Subjt:  LIQFGTLDPIVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLFRDNKRKAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPITLAE

Query:  YFPRRFLDDSQGKALETVTCHIVDVVEDDDVP------------------------------------------ASSSGTV-------------------
        +FP RFL D Q +    V CH ++  E++ +P                                          +SS+ TV                   
Subjt:  YFPRRFLDDSQGKALETVTCHIVDVVEDDDVP------------------------------------------ASSSGTV-------------------

Query:  ----------------------------------------------------------------------------------------ADAR--------
                                                                                                ADA+        
Subjt:  ----------------------------------------------------------------------------------------ADAR--------

Query:  ------------------------KEAVEDVNTSDLKKGEASTSLAKPKVVKDEKCSNSAILRYVPLSRCKKGESPFTECSESIK---------------
                                KE    + T   +K EASTS AK  ++ DEK SN +ILRYVPLSR KKGESPF E  + +K               
Subjt:  ------------------------KEAVEDVNTSDLKKGEASTSLAKPKVVKDEKCSNSAILRYVPLSRCKKGESPFTECSESIK---------------

Query:  --------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRIIRKGKAKVADT
                                                                      KKLL+EG+++P +RKGLGYK PEP+RI RKGK K+ D+
Subjt:  --------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRIIRKGKAKVADT

Query:  NHITVEEVDDSKEKK
        NHITV+EVD  KEK+
Subjt:  NHITVEEVDDSKEKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATTGAGATCAGGGAGGATCATACTCCTCTTGCTGTTGCAAGCAGGATCTCAAAGCTGATCGAAGAATCCTCTAAGGATAGGGTTGCAGTCAAAGACAACCCGCTGTTCGA
ATATGTCATTCCAACATCTAAGCAGTCAAAGGATACACTAAATCCTGATGTGATGTCCGTCATGATGGCTGATGTAGACCAAGATGAAAGAATGGCAGAGATGGAAAGAA
AACTCAATCTCTTAATGAAGGCAGTTGATGAAAGAGATCTTGAGATTGCCTATTTGAAGAACCAGCTGCAAAACCGAGAAACGTTGATTCAATTTGGGACCCTTGATCCC
ATAGTGGTTCGATTCCAAAAAGAATCCACGATGAAGGGATCCCAAGAAAAATATGTTTCCATCAAAGATGAGAACGAAGGTTGGACCCTTGTCATTCGTCGCAAAAAGCA
AAAGAAAAGTTACGCACGGAAAGAGTCCCGCCTATTTCGAGACAATAAAAGAAAGGCTAAGTCTCAAAAGAAGAAAGGAAAAAAGAAGTCAAAGAGGTCAAAGCCTATCG
TGGAAGAAAGTGAAGATTTCGTTTGCCCTCCACAACCCATAACTTTGGCAGAATACTTCCCAAGGCGCTTTCTCGATGATAGTCAAGGAAAGGCACTTGAAACTGTCACG
TGTCACATTGTGGACGTGGTGGAAGATGATGATGTCCCTGCTAGCTCCTCGGGAACGGTGGCAGATGCAAGAAAGGAAGCTGTTGAAGATGTGAATACTTCCGACCTGAA
AAAGGGTGAAGCGTCTACAAGCCTTGCGAAACCTAAGGTTGTAAAGGATGAGAAATGTTCAAATTCAGCTATCCTACGATACGTCCCCTTATCTCGATGTAAAAAGGGTG
AATCACCTTTCACTGAATGTTCAGAAAGCATAAAGAAGAAACTTCTAAAGGAAGGTTATAGTCTGCCTACAACGAGAAAAGGACTGGGATATAAGTCGCCTGAGCCAGTT
CGCATAATAAGAAAAGGGAAGGCGAAAGTGGCAGATACAAATCATATAACAGTAGAAGAGGTTGATGACTCAAAAGAAAAGAAAGTGTCGACCAACGAACTTCTGTTTTT
AGGCGCATCAGGCCACCAGTTGCTCGTGCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATTGAGATCAGGGAGGATCATACTCCTCTTGCTGTTGCAAGCAGGATCTCAAAGCTGATCGAAGAATCCTCTAAGGATAGGGTTGCAGTCAAAGACAACCCGCTGTTCGA
ATATGTCATTCCAACATCTAAGCAGTCAAAGGATACACTAAATCCTGATGTGATGTCCGTCATGATGGCTGATGTAGACCAAGATGAAAGAATGGCAGAGATGGAAAGAA
AACTCAATCTCTTAATGAAGGCAGTTGATGAAAGAGATCTTGAGATTGCCTATTTGAAGAACCAGCTGCAAAACCGAGAAACGTTGATTCAATTTGGGACCCTTGATCCC
ATAGTGGTTCGATTCCAAAAAGAATCCACGATGAAGGGATCCCAAGAAAAATATGTTTCCATCAAAGATGAGAACGAAGGTTGGACCCTTGTCATTCGTCGCAAAAAGCA
AAAGAAAAGTTACGCACGGAAAGAGTCCCGCCTATTTCGAGACAATAAAAGAAAGGCTAAGTCTCAAAAGAAGAAAGGAAAAAAGAAGTCAAAGAGGTCAAAGCCTATCG
TGGAAGAAAGTGAAGATTTCGTTTGCCCTCCACAACCCATAACTTTGGCAGAATACTTCCCAAGGCGCTTTCTCGATGATAGTCAAGGAAAGGCACTTGAAACTGTCACG
TGTCACATTGTGGACGTGGTGGAAGATGATGATGTCCCTGCTAGCTCCTCGGGAACGGTGGCAGATGCAAGAAAGGAAGCTGTTGAAGATGTGAATACTTCCGACCTGAA
AAAGGGTGAAGCGTCTACAAGCCTTGCGAAACCTAAGGTTGTAAAGGATGAGAAATGTTCAAATTCAGCTATCCTACGATACGTCCCCTTATCTCGATGTAAAAAGGGTG
AATCACCTTTCACTGAATGTTCAGAAAGCATAAAGAAGAAACTTCTAAAGGAAGGTTATAGTCTGCCTACAACGAGAAAAGGACTGGGATATAAGTCGCCTGAGCCAGTT
CGCATAATAAGAAAAGGGAAGGCGAAAGTGGCAGATACAAATCATATAACAGTAGAAGAGGTTGATGACTCAAAAGAAAAGAAAGTGTCGACCAACGAACTTCTGTTTTT
AGGCGCATCAGGCCACCAGTTGCTCGTGCTTTAG
Protein sequenceShow/hide protein sequence
IEIREDHTPLAVASRISKLIEESSKDRVAVKDNPLFEYVIPTSKQSKDTLNPDVMSVMMADVDQDERMAEMERKLNLLMKAVDERDLEIAYLKNQLQNRETLIQFGTLDP
IVVRFQKESTMKGSQEKYVSIKDENEGWTLVIRRKKQKKSYARKESRLFRDNKRKAKSQKKKGKKKSKRSKPIVEESEDFVCPPQPITLAEYFPRRFLDDSQGKALETVT
CHIVDVVEDDDVPASSSGTVADARKEAVEDVNTSDLKKGEASTSLAKPKVVKDEKCSNSAILRYVPLSRCKKGESPFTECSESIKKKLLKEGYSLPTTRKGLGYKSPEPV
RIIRKGKAKVADTNHITVEEVDDSKEKKVSTNELLFLGASGHQLLVL