CuGenDBv2

Spg034558 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg034558
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold4:13423599..13427678
RNA-Seq Expression: Spg034558
Synteny: Spg034558
Gene Ontology terms: GO:0090304 - nucleic acid metabolic process (biological process)
InterPro domains: NA
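
The genome location field above follows a `scaffold:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention, not something this record states), the span can be parsed as follows. The names `Location` and `parse_location` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'scaffold4:13423599..13427678'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(seqid, start, end)


loc = parse_location("scaffold4:13423599..13427678")
print(loc.seqid, loc.start, loc.end, loc.length)  # scaffold4 13423599 13427678 4080
```

Under the inclusive-coordinate assumption the gene spans 4080 bp on scaffold4; with half-open coordinates the figure would be 4079.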


Homology
GenBank top hits (e value, % identity, alignment)
KAA0043405.1 hypothetical protein E6C27_scaffold1639G00040 [Cucumis melo var. makuwa] | e value: 1.3e-30 | % identity: 43.75
Query:  PASSSGTV-AGPGDLSSFSIKDLLSLPQEAKKDEKCSP--VLRYAPLSRRKKGESPFTQCPKSIK-------------KKLLKEGYSLPTTRKGLGYKLP
        P  S GT  +  G+ S+ + K ++ +      DEK S   +LRY PLSRR+KGESPF + P+ +K             KKLL+EG+ +P +RKG GYK P
Subjt:  PASSSGTV-AGPGDLSSFSIKDLLSLPQEAKKDEKCSP--VLRYAPLSRRKKGESPFTQCPKSIK-------------KKLLKEGYSLPTTRKGLGYKLP

Query:  EPVRITRKGKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTRP
        EP+ I RK K KV D NHITV EVD  +E+E   QR S F  IRP V R +VF+R+ V +T+ +  Q T+S  + S  +RL+M   +E+ T      TRP
Subjt:  EPVRITRKGKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTRP

Query:  SAFRRLNM
        SAF RL++
Subjt:  SAFRRLNM

KAA0047672.1 uncharacterized protein E6C27_scaffold115G001730 [Cucumis melo var. makuwa] | e value: 3.8e-30 | % identity: 43.68
Query:  AGPGDLSSFSIKDLLSLPQEAKKDEKCSPVLRYAPLSRRKKGESPFTQCPKSIKK---KLLKEGYSL---PTTRKGLGYKLPEPVRITRKGKAKVADINH
        +G  + S+ + K ++ + +         P+LRY PLSR KKGESPF + P+ +K    ++LKE +++     T+KGLGYK PEP+RITRKGK KV D NH
Subjt:  AGPGDLSSFSIKDLLSLPQEAKKDEKCSPVLRYAPLSRRKKGESPFTQCPKSIKK---KLLKEGYSL---PTTRKGLGYKLPEPVRITRKGKAKVADINH

Query:  ITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTRPSAFRRLNM
        ITV+EVD  + +E   QR S F  I P VARA VF+R+ + E K +  Q T++  R S F+RL++   EE+        T+PSAF RL++
Subjt:  ITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTRPSAFRRLNM

KAA0047672.1 uncharacterized protein E6C27_scaffold115G001730 [Cucumis melo var. makuwa] | e value: 6.2e-12 | % identity: 32.37
Query:  ERKLNLLMKAVDERDLEMAYLKNQ--LQNRETAESKRLIQFGTLDPVVVRFRKEATMKGSQEQYNSIEDENEGWTLVVRRKKQKQSYARRSKHVVEESED
        E+K+ L ++ V + +   A + ++  L      + K L+QF T +PVVVRF +E   + SQE+   I++++EGWT+V RRKK                  
Subjt:  ERKLNLLMKAVDERDLEMAYLKNQ--LQNRETAESKRLIQFGTLDPVVVRFRKEATMKGSQEQYNSIEDENEGWTLVVRRKKQKQSYARRSKHVVEESED

Query:  FFCPPQPITLAEYFPRRFLDDGQGEALEIVTCHIVDVVEDDDVPASSSGTVAGPGDLSSFSIKDLLSLPQEAK
                       R+FL D Q E   +V CH ++  E++ +P  S        DLS F++ DLLSLPQE K
Subjt:  FFCPPQPITLAEYFPRRFLDDGQGEALEIVTCHIVDVVEDDDVPASSSGTVAGPGDLSSFSIKDLLSLPQEAK

KAA0055957.1 uncharacterized protein E6C27_scaffold319G00830 [Cucumis melo var. makuwa] | e value: 8.3e-33 | % identity: 41.18
Query:  PASSSGTV-AGPGDLSSFSIKDLLSLPQEAKKDEKCS--PVLRYAPLSRRKKGESPFTQCPKSIK-----------------------------------
        P  S GT  +G  + S+ + K ++ +      DEK S  P+LRY PLSRRKKGESPF + P+ +K                                   
Subjt:  PASSSGTV-AGPGDLSSFSIKDLLSLPQEAKKDEKCS--PVLRYAPLSRRKKGESPFTQCPKSIK-----------------------------------

Query:  --------KKLLKEGYSLPTTRKGLGYKLPEPVRITRKGKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTN
                KKLL+EG+ +P +RKGLGYK PEP+RITRKGK KV D NHITV+EVD  +E+E  +QR S F  I P VARA VF+R+ + E K +  Q T+
Subjt:  --------KKLLKEGYSLPTTRKGLGYKLPEPVRITRKGKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTN

Query:  SSTRSSVFRRLSMHVGEEESTLSTPDVTRPSAFRRLNM
        +  R S F+RL++   EE+    T   T+PSAF RL++
Subjt:  SSTRSSVFRRLSMHVGEEESTLSTPDVTRPSAFRRLNM

TYK05005.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 1.3e-30 | % identity: 25.04
Query:  RETAESKRLIQFGTLDPVVVRFRKEATMKGSQEQYNSIEDENEGWTLVVRRKK-------------------------QKQSYARRSKHVVEESEDFFCP
        RE      L +FGT +PVVVRF +E   + SQE+   IE+++E WT+V RRKK                         +K+   R+ K + +E +DF   
Subjt:  RETAESKRLIQFGTLDPVVVRFRKEATMKGSQEQYNSIEDENEGWTLVVRRKK-------------------------QKQSYARRSKHVVEESEDFFCP

Query:  PQPITLAEYFPRRFLDDGQGEALEIVTCHIVDVVEDDDVPASSSGTVAGPGDLSSFSIKDLLSLPQEAK-------------------------------
         + ITLA++FP RFL D Q E   +V CH ++  E++ +P  S        DLS F++ DLLSLPQE K                               
Subjt:  PQPITLAEYFPRRFLDDGQGEALEIVTCHIVDVVEDDDVPASSSGTVAGPGDLSSFSIKDLLSLPQEAK-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------KDEKCS--PVLRYAPLSRRKKGESPFTQCPKSIK-------
                                                                    DEK S   +LRY PLSRRKKGESPF + P+ +K       
Subjt:  -----------------------------------------------------------KDEKCS--PVLRYAPLSRRKKGESPFTQCPKSIK-------

Query:  ----------------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKLPEPVRITRK
                                                                              KKLL+EG+++P +RKGLGYKLPEP+RITRK
Subjt:  ----------------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKLPEPVRITRK

Query:  GKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTR
        GK K+ D NHITV+EVD  KE+E   QR S F  I P VARA VF+R+ V E + +  Q T++  R S F RLS+   +   T   P + R
Subjt:  GKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTR

TYK28162.1 uncharacterized protein E5676_scaffold289G00760 [Cucumis melo var. makuwa] | e value: 7.7e-31 | % identity: 38.71
Query:  PASSSGTV-AGPGDLSSFSIKDLLSLPQEAKKDEKCS--PVLRYAPLSRRKKGESPFTQCPKSIK-----------------------------------
        P  S GT  +G  + S+ + K ++ +      DEK S  P+LRY PLSRRKKGESPF + P+ +K                                   
Subjt:  PASSSGTV-AGPGDLSSFSIKDLLSLPQEAKKDEKCS--PVLRYAPLSRRKKGESPFTQCPKSIK-----------------------------------

Query:  ------------------KKLLKEGYSLPTTRKGLGYKLPEPVRITRKGKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNE
                          KKLL+EG+ +P +RKGLGYK PEP+RITRKGK KV D NHITV+EVD  +E+E  +QR S F  I P VARA VF+R+ + E
Subjt:  ------------------KKLLKEGYSLPTTRKGLGYKLPEPVRITRKGKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNE

Query:  TKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTRPSAFRRLNM
         + +  Q T++  + S F+RL++   EE+    T   T+PSAF RL++
Subjt:  TKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTRPSAFRRLNM

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TPR5 RNase H domain-containing protein | e value: 6.4e-31 | % identity: 43.75
Query:  PASSSGTV-AGPGDLSSFSIKDLLSLPQEAKKDEKCSP--VLRYAPLSRRKKGESPFTQCPKSIK-------------KKLLKEGYSLPTTRKGLGYKLP
        P  S GT  +  G+ S+ + K ++ +      DEK S   +LRY PLSRR+KGESPF + P+ +K             KKLL+EG+ +P +RKG GYK P
Subjt:  PASSSGTV-AGPGDLSSFSIKDLLSLPQEAKKDEKCSP--VLRYAPLSRRKKGESPFTQCPKSIK-------------KKLLKEGYSLPTTRKGLGYKLP

Query:  EPVRITRKGKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTRP
        EP+ I RK K KV D NHITV EVD  +E+E   QR S F  IRP V R +VF+R+ V +T+ +  Q T+S  + S  +RL+M   +E+ T      TRP
Subjt:  EPVRITRKGKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTRP

Query:  SAFRRLNM
        SAF RL++
Subjt:  SAFRRLNM

A0A5A7UMY2 Reverse transcriptase domain-containing protein | e value: 4.0e-33 | % identity: 41.18
Query:  PASSSGTV-AGPGDLSSFSIKDLLSLPQEAKKDEKCS--PVLRYAPLSRRKKGESPFTQCPKSIK-----------------------------------
        P  S GT  +G  + S+ + K ++ +      DEK S  P+LRY PLSRRKKGESPF + P+ +K                                   
Subjt:  PASSSGTV-AGPGDLSSFSIKDLLSLPQEAKKDEKCS--PVLRYAPLSRRKKGESPFTQCPKSIK-----------------------------------

Query:  --------KKLLKEGYSLPTTRKGLGYKLPEPVRITRKGKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTN
                KKLL+EG+ +P +RKGLGYK PEP+RITRKGK KV D NHITV+EVD  +E+E  +QR S F  I P VARA VF+R+ + E K +  Q T+
Subjt:  --------KKLLKEGYSLPTTRKGLGYKLPEPVRITRKGKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTN

Query:  SSTRSSVFRRLSMHVGEEESTLSTPDVTRPSAFRRLNM
        +  R S F+RL++   EE+    T   T+PSAF RL++
Subjt:  SSTRSSVFRRLSMHVGEEESTLSTPDVTRPSAFRRLNM

A0A5D3C0W6 Ty3-gypsy retrotransposon protein | e value: 6.4e-31 | % identity: 25.04
Query:  RETAESKRLIQFGTLDPVVVRFRKEATMKGSQEQYNSIEDENEGWTLVVRRKK-------------------------QKQSYARRSKHVVEESEDFFCP
        RE      L +FGT +PVVVRF +E   + SQE+   IE+++E WT+V RRKK                         +K+   R+ K + +E +DF   
Subjt:  RETAESKRLIQFGTLDPVVVRFRKEATMKGSQEQYNSIEDENEGWTLVVRRKK-------------------------QKQSYARRSKHVVEESEDFFCP

Query:  PQPITLAEYFPRRFLDDGQGEALEIVTCHIVDVVEDDDVPASSSGTVAGPGDLSSFSIKDLLSLPQEAK-------------------------------
         + ITLA++FP RFL D Q E   +V CH ++  E++ +P  S        DLS F++ DLLSLPQE K                               
Subjt:  PQPITLAEYFPRRFLDDGQGEALEIVTCHIVDVVEDDDVPASSSGTVAGPGDLSSFSIKDLLSLPQEAK-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------KDEKCS--PVLRYAPLSRRKKGESPFTQCPKSIK-------
                                                                    DEK S   +LRY PLSRRKKGESPF + P+ +K       
Subjt:  -----------------------------------------------------------KDEKCS--PVLRYAPLSRRKKGESPFTQCPKSIK-------

Query:  ----------------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKLPEPVRITRK
                                                                              KKLL+EG+++P +RKGLGYKLPEP+RITRK
Subjt:  ----------------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKLPEPVRITRK

Query:  GKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTR
        GK K+ D NHITV+EVD  KE+E   QR S F  I P VARA VF+R+ V E + +  Q T++  R S F RLS+   +   T   P + R
Subjt:  GKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTR

A0A5D3C8N8 Ribonuclease H | e value: 1.9e-30 | % identity: 43.68
Query:  AGPGDLSSFSIKDLLSLPQEAKKDEKCSPVLRYAPLSRRKKGESPFTQCPKSIKK---KLLKEGYSL---PTTRKGLGYKLPEPVRITRKGKAKVADINH
        +G  + S+ + K ++ + +         P+LRY PLSR KKGESPF + P+ +K    ++LKE +++     T+KGLGYK PEP+RITRKGK KV D NH
Subjt:  AGPGDLSSFSIKDLLSLPQEAKKDEKCSPVLRYAPLSRRKKGESPFTQCPKSIKK---KLLKEGYSL---PTTRKGLGYKLPEPVRITRKGKAKVADINH

Query:  ITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTRPSAFRRLNM
        ITV+EVD  + +E   QR S F  I P VARA VF+R+ + E K +  Q T++  R S F+RL++   EE+        T+PSAF RL++
Subjt:  ITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTRPSAFRRLNM

A0A5D3C8N8 Ribonuclease H | e value: 3.0e-12 | % identity: 32.37
Query:  ERKLNLLMKAVDERDLEMAYLKNQ--LQNRETAESKRLIQFGTLDPVVVRFRKEATMKGSQEQYNSIEDENEGWTLVVRRKKQKQSYARRSKHVVEESED
        E+K+ L ++ V + +   A + ++  L      + K L+QF T +PVVVRF +E   + SQE+   I++++EGWT+V RRKK                  
Subjt:  ERKLNLLMKAVDERDLEMAYLKNQ--LQNRETAESKRLIQFGTLDPVVVRFRKEATMKGSQEQYNSIEDENEGWTLVVRRKKQKQSYARRSKHVVEESED

Query:  FFCPPQPITLAEYFPRRFLDDGQGEALEIVTCHIVDVVEDDDVPASSSGTVAGPGDLSSFSIKDLLSLPQEAK
                       R+FL D Q E   +V CH ++  E++ +P  S        DLS F++ DLLSLPQE K
Subjt:  FFCPPQPITLAEYFPRRFLDDGQGEALEIVTCHIVDVVEDDDVPASSSGTVAGPGDLSSFSIKDLLSLPQEAK

A0A5D3DXC7 Reverse transcriptase domain-containing protein | e value: 3.7e-31 | % identity: 38.71
Query:  PASSSGTV-AGPGDLSSFSIKDLLSLPQEAKKDEKCS--PVLRYAPLSRRKKGESPFTQCPKSIK-----------------------------------
        P  S GT  +G  + S+ + K ++ +      DEK S  P+LRY PLSRRKKGESPF + P+ +K                                   
Subjt:  PASSSGTV-AGPGDLSSFSIKDLLSLPQEAKKDEKCS--PVLRYAPLSRRKKGESPFTQCPKSIK-----------------------------------

Query:  ------------------KKLLKEGYSLPTTRKGLGYKLPEPVRITRKGKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNE
                          KKLL+EG+ +P +RKGLGYK PEP+RITRKGK KV D NHITV+EVD  +E+E  +QR S F  I P VARA VF+R+ + E
Subjt:  ------------------KKLLKEGYSLPTTRKGLGYKLPEPVRITRKGKAKVADINHITVEEVDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNE

Query:  TKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTRPSAFRRLNM
         + +  Q T++  + S F+RL++   EE+    T   T+PSAF RL++
Subjt:  TKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTRPSAFRRLNM

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGGGGAGAATTTTGAAGCTCCAAGCCAGAGAATTGAGATCAGGGAGGATCATACTCCTCTTGCTGTTGCAAGCAGGATCTCAAAGCTGATTGAAGAATCCTCTAAGGA
TAAAGTTGTAGTCAAAGACAACCCGCTGTTCGAATCTGTTGTTCCAACATCTAAGCAGCCAAAGGATGCACTAAACCCTGATGTGATGTCTGTCATGATGGCTGATGTAG
ACCAGGATGAAAGAATGGTAGAGATGGAAAGAAAACTCAATCTCTTGATGAAGGCAGTTGATGAAAGAGATCTGGAGATGGCCTATTTGAAGAACCAGCTGCAAAACCGA
GAAACGGCTGAGTCTAAGAGGCTGATTCAATTTGGGACCCTTGATCCCGTAGTGGTTCGATTCCGAAAAGAAGCCACAATGAAGGGGTCCCAAGAACAATATAATTCCAT
TGAAGATGAGAATGAAGGCTGGACCCTTGTCGTTCGTCGCAAGAAGCAAAAACAAAGTTACGCACGGAGGTCAAAGCATGTTGTGGAGGAAAGTGAAGATTTCTTTTGCC
CTCCACAACCCATAACTTTGGCAGAATACTTCCCAAGGCGCTTTCTCGATGATGGTCAAGGAGAAGCACTTGAAATCGTCACTTGTCACATTGTGGACGTGGTGGAAGAT
GATGATGTCCCTGCTAGTTCCTCGGGAACGGTGGCAGGTCCAGGAGACTTATCCTCCTTCAGCATAAAGGATTTATTGTCACTTCCTCAGGAAGCTAAAAAGGATGAGAA
ATGTTCACCTGTCCTACGATACGCCCCTTTATCTCGGCGTAAAAAGGGTGAATCACCTTTCACTCAATGTCCGAAAAGCATAAAGAAGAAGCTTCTAAAGGAAGGCTATA
GTCTGCCTACGACGAGAAAAGGACTTGGATATAAATTGCCCGAACCGGTTCGCATAACAAGAAAAGGGAAGGCGAAAGTGGCAGACATAAATCATATAACAGTAGAGGAG
GTTGATGACTCAAAAGAAGAAGAGAGCGTCGACCAACGAGCTTCTGTTTTTAGGCACATCAGGCCACCAGTTGCTCGTGCTTCTGTCTTTCAGAGGATAATTGTGAATGA
AACAAAAGAAGAAAGTGCACAACTTACCAATAGCTCCACTCGATCTTCAGTTTTTCGAAGGTTAAGTATGCATGTTGGTGAAGAAGAGAGTACACTTTCAACTCCGGATG
TCACGCGACCTTCAGCTTTTCGAAGGTTAAATATGCCCGTTGGGGAAGAAGAAGGTACATTTTCAACTTCAGATGTGACTCGACCATCACCGACTTCAATATGA
mRNA sequence
ATGGGGGAGAATTTTGAAGCTCCAAGCCAGAGAATTGAGATCAGGGAGGATCATACTCCTCTTGCTGTTGCAAGCAGGATCTCAAAGCTGATTGAAGAATCCTCTAAGGA
TAAAGTTGTAGTCAAAGACAACCCGCTGTTCGAATCTGTTGTTCCAACATCTAAGCAGCCAAAGGATGCACTAAACCCTGATGTGATGTCTGTCATGATGGCTGATGTAG
ACCAGGATGAAAGAATGGTAGAGATGGAAAGAAAACTCAATCTCTTGATGAAGGCAGTTGATGAAAGAGATCTGGAGATGGCCTATTTGAAGAACCAGCTGCAAAACCGA
GAAACGGCTGAGTCTAAGAGGCTGATTCAATTTGGGACCCTTGATCCCGTAGTGGTTCGATTCCGAAAAGAAGCCACAATGAAGGGGTCCCAAGAACAATATAATTCCAT
TGAAGATGAGAATGAAGGCTGGACCCTTGTCGTTCGTCGCAAGAAGCAAAAACAAAGTTACGCACGGAGGTCAAAGCATGTTGTGGAGGAAAGTGAAGATTTCTTTTGCC
CTCCACAACCCATAACTTTGGCAGAATACTTCCCAAGGCGCTTTCTCGATGATGGTCAAGGAGAAGCACTTGAAATCGTCACTTGTCACATTGTGGACGTGGTGGAAGAT
GATGATGTCCCTGCTAGTTCCTCGGGAACGGTGGCAGGTCCAGGAGACTTATCCTCCTTCAGCATAAAGGATTTATTGTCACTTCCTCAGGAAGCTAAAAAGGATGAGAA
ATGTTCACCTGTCCTACGATACGCCCCTTTATCTCGGCGTAAAAAGGGTGAATCACCTTTCACTCAATGTCCGAAAAGCATAAAGAAGAAGCTTCTAAAGGAAGGCTATA
GTCTGCCTACGACGAGAAAAGGACTTGGATATAAATTGCCCGAACCGGTTCGCATAACAAGAAAAGGGAAGGCGAAAGTGGCAGACATAAATCATATAACAGTAGAGGAG
GTTGATGACTCAAAAGAAGAAGAGAGCGTCGACCAACGAGCTTCTGTTTTTAGGCACATCAGGCCACCAGTTGCTCGTGCTTCTGTCTTTCAGAGGATAATTGTGAATGA
AACAAAAGAAGAAAGTGCACAACTTACCAATAGCTCCACTCGATCTTCAGTTTTTCGAAGGTTAAGTATGCATGTTGGTGAAGAAGAGAGTACACTTTCAACTCCGGATG
TCACGCGACCTTCAGCTTTTCGAAGGTTAAATATGCCCGTTGGGGAAGAAGAAGGTACATTTTCAACTTCAGATGTGACTCGACCATCACCGACTTCAATATGA
Protein sequence
MGENFEAPSQRIEIREDHTPLAVASRISKLIEESSKDKVVVKDNPLFESVVPTSKQPKDALNPDVMSVMMADVDQDERMVEMERKLNLLMKAVDERDLEMAYLKNQLQNR
ETAESKRLIQFGTLDPVVVRFRKEATMKGSQEQYNSIEDENEGWTLVVRRKKQKQSYARRSKHVVEESEDFFCPPQPITLAEYFPRRFLDDGQGEALEIVTCHIVDVVED
DDVPASSSGTVAGPGDLSSFSIKDLLSLPQEAKKDEKCSPVLRYAPLSRRKKGESPFTQCPKSIKKKLLKEGYSLPTTRKGLGYKLPEPVRITRKGKAKVADINHITVEE
VDDSKEEESVDQRASVFRHIRPPVARASVFQRIIVNETKEESAQLTNSSTRSSVFRRLSMHVGEEESTLSTPDVTRPSAFRRLNMPVGEEEGTFSTSDVTRPSPTSI
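
The CDS and protein sequences above are related by translation. The following is a minimal sketch of that step, assuming the standard nuclear genetic code (NCBI translation table 1) and reading frame 1; the record does not state which table was used, and `translate` is a hypothetical helper written for illustration, not the database's own pipeline. Only the first 39 nucleotides and the last 15 nucleotides of the CDS are used as inputs.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Unknown codons (for example those containing N) become 'X'.
    Trailing bases that do not form a full codon are ignored.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nt of the CDS shown above.
print(translate("ATGGGGGAGAATTTTGAAGCTCCAAGCCAGAGAATTGAG"))  # MGENFEAPSQRIE
# Last 15 nt of the CDS shown above, ending in the TGA stop codon.
print(translate("CCGACTTCAATATGA"))  # PTSI
```

Both fragments agree with the start (`MGENFEAPSQRIE...`) and end (`...PTSI`) of the listed protein sequence, which is consistent with the standard-code assumption.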