; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001096 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001096
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr4:24421527..24422856
RNA-Seq ExpressionLag0001096
SyntenyLag0001096
Gene Ontology termsGO:0009987 - cellular process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031739508.1 uncharacterized protein LOC116403159 [Cucumis sativus]1.3e-3634.67Show/hide
Query:  AIVQSAVVRAVQSAMQAAVAGVIAGQQAQAPQN-NEALSREARCLRDFRKWDPRPFDGASKDPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNAL
        A+  + VV  V +A  A  A    G  AQ PQ     LS EA+ LRDFRK+DP+ FDG+ +DPT A+LWLSS+ET+F +M CPE+ +V   AFLL+D  +
Subjt:  AIVQSAVVRAVQSAMQAAVAGVIAGQQAQAPQN-NEALSREARCLRDFRKWDPRPFDGASKDPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNAL

Query:  IWWQSAERTIDTSNGPVTWPQFREAFFKKYYPAN------------------------AFQEASRVCGSYTGKPNRR------GIRDRVRQTVSICFCPA
        IWW++  R +      +TW QF++ F+ K++ AN                         F   SR      G    R      G+RD +R      F  A
Subjt:  IWWQSAERTIDTSNGPVTWPQFREAFFKKYYPAN------------------------AFQEASRVCGSYTGKPNRR------GIRDRVRQTVSICFCPA

Query:  HRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQNTVREKPVCNNCGKRHWRRCLLGARVCFRC
         +P   A  +R+A  +          + ++  SSGQ R+ +Q +  + Q +++ +  P    Q       +T REKP+CN CGK H  RCL+G RVC++C
Subjt:  HRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQNTVREKPVCNNCGKRHWRRCLLGARVCFRC

XP_031741726.1 uncharacterized protein LOC116403920 [Cucumis sativus]2.2e-3635.33Show/hide
Query:  AIVQSAVVRAVQSAMQAAVAGVIAGQQAQAPQ-NNEALSREARCLRDFRKWDPRPFDGASKDPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNAL
        A+  + VV  V +A  A  A    G  AQ PQ     LS EA+ LRDFRK+DP+ FDG+ +DPT A+LWLSS+ET+F +M CPE+ +V   AFLL+D  +
Subjt:  AIVQSAVVRAVQSAMQAAVAGVIAGQQAQAPQ-NNEALSREARCLRDFRKWDPRPFDGASKDPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNAL

Query:  IWWQSAERTIDTSNGPVTWPQFREAFFKKYYPAN------------------------AFQEASRVCGSYTGKPNRR------GIRDRVRQTVSICFCPA
        IWW++  R +      +TW QF+  F+ K++ AN                         F   SR      G    R      G+RD +R      F  A
Subjt:  IWWQSAERTIDTSNGPVTWPQFREAFFKKYYPAN------------------------AFQEASRVCGSYTGKPNRR------GIRDRVRQTVSICFCPA

Query:  HRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQNTVREKPVCNNCGKRHWRRCLLGARVCFRC
         +P   A  +R+A  +          + ++  SSGQ R+ +Q +  V Q +++    P    Q       +T REKP+CN CGKRH  RCL+G RVC++C
Subjt:  HRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQNTVREKPVCNNCGKRHWRRCLLGARVCFRC

XP_031742890.1 uncharacterized protein LOC116404512 [Cucumis sativus]2.2e-3635.33Show/hide
Query:  AIVQSAVVRAVQSAMQAAVAGVIAGQQAQAPQ-NNEALSREARCLRDFRKWDPRPFDGASKDPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNAL
        A+  + VV  V +A  A  A    G  AQ PQ     LS EA+ LRDFRK+DP+ FDG+ +DPT A+LWLSS+ET+F +M CPE+ +V   AFLL+D  +
Subjt:  AIVQSAVVRAVQSAMQAAVAGVIAGQQAQAPQ-NNEALSREARCLRDFRKWDPRPFDGASKDPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNAL

Query:  IWWQSAERTIDTSNGPVTWPQFREAFFKKYYPAN------------------------AFQEASRVCGSYTGKPNRR------GIRDRVRQTVSICFCPA
        IWW++  R +      +TW QF+  F+ K++ AN                         F   SR      G    R      G+RD +R      F  A
Subjt:  IWWQSAERTIDTSNGPVTWPQFREAFFKKYYPAN------------------------AFQEASRVCGSYTGKPNRR------GIRDRVRQTVSICFCPA

Query:  HRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQNTVREKPVCNNCGKRHWRRCLLGARVCFRC
         +P   A  +R+A  +          + ++  SSGQ R+ +Q +  V Q +++    P    Q       +T REKP+CN CGKRH  RCL+G RVC++C
Subjt:  HRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQNTVREKPVCNNCGKRHWRRCLLGARVCFRC

XP_031745057.1 uncharacterized protein LOC116405236 [Cucumis sativus]2.2e-3634.78Show/hide
Query:  IVQSAVVRAVQSAMQAAVAGVIAGQQAQAPQN-NEALSREARCLRDFRKWDPRPFDGASKDPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNALI
        I  + VV  V +A  A  A    G  AQ PQ     LS EA+ LRDFRK+DP+ FDG+ +DPT A+LWLSS+ET+F +M CPE+ +V   AFLL+D  +I
Subjt:  IVQSAVVRAVQSAMQAAVAGVIAGQQAQAPQN-NEALSREARCLRDFRKWDPRPFDGASKDPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNALI

Query:  WWQSAERTIDTSNGPVTWPQFREAFFKKYYPAN------------------------AFQEASRVCGSYTGKPNRR------GIRDRVRQTVSICFCPAH
        WW++  R +      +TW QF++ F+ K++ AN                         F   SR      G    R      G+RD +R      F  A 
Subjt:  WWQSAERTIDTSNGPVTWPQFREAFFKKYYPAN------------------------AFQEASRVCGSYTGKPNRR------GIRDRVRQTVSICFCPAH

Query:  RPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQNTVREKPVCNNCGKRHWRRCLLGARVCFRC
        +P   A  +R+A  +          + ++  SSGQ R+ +Q +  + Q +++ +  P    Q       +T REKP+CN CGK H  RCL+G RVC++C
Subjt:  RPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQNTVREKPVCNNCGKRHWRRCLLGARVCFRC

XP_031745532.1 uncharacterized protein LOC116405924 [Cucumis sativus]2.2e-3635.33Show/hide
Query:  AIVQSAVVRAVQSAMQAAVAGVIAGQQAQAPQ-NNEALSREARCLRDFRKWDPRPFDGASKDPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNAL
        A+  + VV  V +A  A  A    G  AQ PQ     LS EA+ LRDFRK+DP+ FDG+ +DPT A+LWLSS+ET+F +M CPE+ +V   AFLL+D  +
Subjt:  AIVQSAVVRAVQSAMQAAVAGVIAGQQAQAPQ-NNEALSREARCLRDFRKWDPRPFDGASKDPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNAL

Query:  IWWQSAERTIDTSNGPVTWPQFREAFFKKYYPAN------------------------AFQEASRVCGSYTGKPNRR------GIRDRVRQTVSICFCPA
        IWW++  R +      +TW QF+  F+ K++ AN                         F   SR      G    R      G+RD +R      F  A
Subjt:  IWWQSAERTIDTSNGPVTWPQFREAFFKKYYPAN------------------------AFQEASRVCGSYTGKPNRR------GIRDRVRQTVSICFCPA

Query:  HRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQNTVREKPVCNNCGKRHWRRCLLGARVCFRC
         +P   A  +R+A  +          + ++  SSGQ R+ +Q +  V Q +++    P    Q       +T REKP+CN CGKRH  RCL+G RVC++C
Subjt:  HRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQNTVREKPVCNNCGKRHWRRCLLGARVCFRC

TrEMBL top hitse value%identityAlignment
A0A5A7TP01 Reverse transcriptase1.6e-3231.54Show/hide
Query:  KGSWKGSHAPEAVVPPVGQENNLAGDPQVEQPTPAAEPVTVDAIQAIVQSAVVRAVQSAMQAAVAGVIAGQQAQA---------------------PQNN
        +G+ +G  A +   PPV        +P V  P     PVT   + A+ Q       Q+ +QAA+A  +A QQ QA                     P   
Subjt:  KGSWKGSHAPEAVVPPVGQENNLAGDPQVEQPTPAAEPVTVDAIQAIVQSAVVRAVQSAMQAAVAGVIAGQQAQA---------------------PQNN

Query:  EA------LSREARCLRDFRKWDPRPFDGASKDPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNALIWWQSAERTIDTSNGPVTWPQFREAFFKK
        EA      LS EA+ LRDFRK++P+ FDG+  +P  A+LWL+SIET+FR+M CPEDQKV   AF L+D    WW++AER +      +TW QFRE+F+ K
Subjt:  EA------LSREARCLRDFRKWDPRPFDGASKDPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNALIWWQSAERTIDTSNGPVTWPQFREAFFKK

Query:  YYPANA----------FQEASRVCGSYTGKPNRRG------IRDRVRQTVSIC---------FCPAHRPPNYATTVRVAELIDCHPATAPRATSEQRPSS
        ++ AN            ++       Y  + +         +RD   +T             F  A RP  +A  +R+A  +  H             +S
Subjt:  YYPANA----------FQEASRVCGSYTGKPNRRG------IRDRVRQTVSIC---------FCPAHRPPNYATTVRVAELIDCHPATAPRATSEQRPSS

Query:  GQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQ-----NTVREKPVCNNCGKRHWRRCLLGARVCFRC
        GQ R+ +   DV+ Q        P++    Q+  ++      T+RE P C  CGK H  +CL G+ VCFRC
Subjt:  GQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQ-----NTVREKPVCNNCGKRHWRRCLLGARVCFRC

A0A5A7U6Z9 Ty3-gypsy retrotransposon protein4.7e-3233.44Show/hide
Query:  GDPQVEQPTPAAEPVTVDAIQAIVQSAVVRAVQSAMQAAVAGVIAGQQAQA----------PQNNEA------LSREARCLRDFRKWDPRPFDGASKDPT
        G P+ +   PA  P      QA + +A+ +  Q  +QAA+A  +A QQ QA          P   EA      LS EA+ L+DFRK++P+ FDG+  +PT
Subjt:  GDPQVEQPTPAAEPVTVDAIQAIVQSAVVRAVQSAMQAAVAGVIAGQQAQA----------PQNNEA------LSREARCLRDFRKWDPRPFDGASKDPT

Query:  VAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNALIWWQSAERTIDTSNGPVTWPQFREAFFKKYYPAN---AFQEASRVCGSYTGKPNRRGIRDRVRQ
         A++WL+SIET+FR+M CP+DQKV  V F L+D    WW++AER +      +TW QF+E F+ K++ AN    F   SR             +RD   +
Subjt:  VAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNALIWWQSAERTIDTSNGPVTWPQFREAFFKKYYPAN---AFQEASRVCGSYTGKPNRRGIRDRVRQ

Query:  TVSIC---------FCPAHRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQNTVREKPVCNNC
        T                A RP  +A  +R+A  +  H        +++ P+ GQ R+ +   DV+ Q + +       + + + A    T+RE P    C
Subjt:  TVSIC---------FCPAHRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQNTVREKPVCNNC

Query:  GKRHWRRCLLGARVCFR
        G+ H  RCL G+ VCFR
Subjt:  GKRHWRRCLLGARVCFR

A0A5A7UHN4 Reverse transcriptase3.6e-3231.42Show/hide
Query:  GDPQVEQPTPAAEPVTVDAIQAIVQSAVVRAVQSAMQAAVAGVIAGQQAQA----------PQNNEA------LSREARCLRDFRKWDPRPFDGASKDPT
        G  + +   PA +P      QA + +A+ +  Q  +QA +A  +A QQ QA          P   EA      LS EA+ LRDFRK++P+ FDG+  +PT
Subjt:  GDPQVEQPTPAAEPVTVDAIQAIVQSAVVRAVQSAMQAAVAGVIAGQQAQA----------PQNNEA------LSREARCLRDFRKWDPRPFDGASKDPT

Query:  VAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNALIWWQSAERTIDTSNGPVTWPQFREAFFKKYYPANA----------FQEASRVCGSYTGKPNRRG
         A++WL+SIET+FR+M CPEDQKV    F L+D  + WW++AER +      +TW QF+E F+ K++ AN            ++       Y  + +   
Subjt:  VAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNALIWWQSAERTIDTSNGPVTWPQFREAFFKKYYPANA----------FQEASRVCGSYTGKPNRRG

Query:  ------IRDRVRQTVSIC---------FCPAHRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQ
              +RD   +T                A RP  +A  +R+A  +  H    P   + +  + GQ R+ +   DV+ Q + +       + + +    
Subjt:  ------IRDRVRQTVSIC---------FCPAHRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQ

Query:  QNTVREKPVCNNCGKRHWRRCLLGARVCFRC
          T+RE P C  CG+ H  RCL G+ VCFRC
Subjt:  QNTVREKPVCNNCGKRHWRRCLLGARVCFRC

A0A5A7VBY3 Reverse transcriptase7.2e-3331.74Show/hide
Query:  GDPQVEQPTPAAE---PVTVDAIQAIVQSAVVRAVQSAMQAAVAGVIAGQQAQA----------PQNNEA------LSREARCLRDFRKWDPRPFDGASK
        G P+ +   PA +   PVT D       +A+ +  Q  +QAA+A  +A QQ QA          P   EA      LS EA+ LRDFRK++P+ FDG+  
Subjt:  GDPQVEQPTPAAE---PVTVDAIQAIVQSAVVRAVQSAMQAAVAGVIAGQQAQA----------PQNNEA------LSREARCLRDFRKWDPRPFDGASK

Query:  DPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNALIWWQSAERTIDTSNGPVTWPQFREAFFKKYYPANA----------FQEASRVCGSYTGKPN
        +PT A++WL+SIET+FR+M CPE+QKV    F L+D    WW++AER +      +TW QF+E F+ K++ AN            ++       Y  + +
Subjt:  DPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNALIWWQSAERTIDTSNGPVTWPQFREAFFKKYYPANA----------FQEASRVCGSYTGKPN

Query:  RRG------IRDRVRQTVSIC---------FCPAHRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQ
                 +RD   +T                A RP  +A  +R+A  +  H        + +R + GQ R+ +   D++ Q + +       +   + 
Subjt:  RRG------IRDRVRQTVSIC---------FCPAHRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQ

Query:  ANQQNTVREKPVCNNCGKRHWRRCLLGARVCFRC
        A    T+RE P C  CG+ H  RCL G+ VCFRC
Subjt:  ANQQNTVREKPVCNNCGKRHWRRCLLGARVCFRC

A0A5D3BZN1 Reverse transcriptase7.2e-3331.74Show/hide
Query:  GDPQVEQPTPAAE---PVTVDAIQAIVQSAVVRAVQSAMQAAVAGVIAGQQAQA----------PQNNEA------LSREARCLRDFRKWDPRPFDGASK
        G P+ +   PA +   PVT D       +A+ +  Q  +QAA+A  +A QQ QA          P   EA      LS EA+ LRDFRK++P+ FDG+  
Subjt:  GDPQVEQPTPAAE---PVTVDAIQAIVQSAVVRAVQSAMQAAVAGVIAGQQAQA----------PQNNEA------LSREARCLRDFRKWDPRPFDGASK

Query:  DPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNALIWWQSAERTIDTSNGPVTWPQFREAFFKKYYPANA----------FQEASRVCGSYTGKPN
        +PT A++WL+SIET+FR+M CPE+QKV    F L+D    WW++AER +      +TW QF+E F+ K++ AN            ++       Y  + +
Subjt:  DPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNALIWWQSAERTIDTSNGPVTWPQFREAFFKKYYPANA----------FQEASRVCGSYTGKPN

Query:  RRG------IRDRVRQTVSIC---------FCPAHRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQ
                 +RD   +T                A RP  +A  +R+A  +  H        + +R + GQ R+ +   D++ Q + +       +   + 
Subjt:  RRG------IRDRVRQTVSIC---------FCPAHRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQ

Query:  ANQQNTVREKPVCNNCGKRHWRRCLLGARVCFRC
        A    T+RE P C  CG+ H  RCL G+ VCFRC
Subjt:  ANQQNTVREKPVCNNCGKRHWRRCLLGARVCFRC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTACCAGCAAGCACCTTATCATCGCCAGGTTGGGGTAAGGGTTAGCAGCAAAGGAGACTTAGTTGAGTTTGTCGAATATCAGAATCATGCCTCCTCGTGGAAGAG
CTTGTGGAAGGGGTCGTGGAAGGGGTCGCACGCCCCTGAGGCAGTTGTGCCGCCAGTGGGACAAGAAAACAATCTGGCAGGGGACCCACAAGTAGAGCAACCGACACCTG
CAGCGGAACCTGTCACGGTAGATGCTATTCAGGCAATCGTGCAGTCAGCAGTGGTCAGGGCAGTACAGTCTGCGATGCAAGCGGCAGTTGCAGGCGTGATTGCGGGGCAG
CAGGCCCAAGCGCCTCAAAATAATGAAGCATTGTCGCGAGAGGCAAGATGCTTAAGGGACTTTAGGAAGTGGGACCCCCGTCCATTCGATGGAGCATCAAAGGACCCTAC
AGTGGCGAAGTTGTGGTTGTCTTCCATTGAAACCGTCTTTCGTCACATGAATTGTCCGGAAGACCAGAAGGTATATCGTGTCGCCTTTCTGTTGCAAGACAATGCCTTGA
TTTGGTGGCAGTCGGCCGAAAGAACCATAGACACCAGTAATGGACCTGTGACATGGCCCCAGTTCAGGGAAGCGTTCTTCAAGAAATATTACCCTGCAAATGCGTTTCAA
GAAGCAAGCAGAGTTTGTGGCTCTTACACAGGGAAGCCGAACCGTAGAGGAATACGAGACAGAGTTCGCCAGACTGTCTCGATTTGCTTTTGCCCTGCCCATCGACCACC
AAACTACGCCACGACAGTCAGAGTGGCTGAGTTAATAGATTGTCATCCAGCAACTGCACCTCGAGCGACCTCGGAACAAAGACCCTCTTCAGGTCAGAATAGGAGGTTCG
ACCAAACATCGGACGTAGTACAACAATCCCAAGACATAGTCGTGAATCCACCTCAAGCCCAAGCACAGGGGCAACAAGCTAACCAACAGAATACTGTTCGCGAGAAACCG
GTGTGTAATAATTGTGGAAAGCGCCACTGGAGGCGTTGCTTGCTGGGAGCTCGTGTGTGTTTTCGATGTGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCTACCAGCAAGCACCTTATCATCGCCAGGTTGGGGTAAGGGTTAGCAGCAAAGGAGACTTAGTTGAGTTTGTCGAATATCAGAATCATGCCTCCTCGTGGAAGAG
CTTGTGGAAGGGGTCGTGGAAGGGGTCGCACGCCCCTGAGGCAGTTGTGCCGCCAGTGGGACAAGAAAACAATCTGGCAGGGGACCCACAAGTAGAGCAACCGACACCTG
CAGCGGAACCTGTCACGGTAGATGCTATTCAGGCAATCGTGCAGTCAGCAGTGGTCAGGGCAGTACAGTCTGCGATGCAAGCGGCAGTTGCAGGCGTGATTGCGGGGCAG
CAGGCCCAAGCGCCTCAAAATAATGAAGCATTGTCGCGAGAGGCAAGATGCTTAAGGGACTTTAGGAAGTGGGACCCCCGTCCATTCGATGGAGCATCAAAGGACCCTAC
AGTGGCGAAGTTGTGGTTGTCTTCCATTGAAACCGTCTTTCGTCACATGAATTGTCCGGAAGACCAGAAGGTATATCGTGTCGCCTTTCTGTTGCAAGACAATGCCTTGA
TTTGGTGGCAGTCGGCCGAAAGAACCATAGACACCAGTAATGGACCTGTGACATGGCCCCAGTTCAGGGAAGCGTTCTTCAAGAAATATTACCCTGCAAATGCGTTTCAA
GAAGCAAGCAGAGTTTGTGGCTCTTACACAGGGAAGCCGAACCGTAGAGGAATACGAGACAGAGTTCGCCAGACTGTCTCGATTTGCTTTTGCCCTGCCCATCGACCACC
AAACTACGCCACGACAGTCAGAGTGGCTGAGTTAATAGATTGTCATCCAGCAACTGCACCTCGAGCGACCTCGGAACAAAGACCCTCTTCAGGTCAGAATAGGAGGTTCG
ACCAAACATCGGACGTAGTACAACAATCCCAAGACATAGTCGTGAATCCACCTCAAGCCCAAGCACAGGGGCAACAAGCTAACCAACAGAATACTGTTCGCGAGAAACCG
GTGTGTAATAATTGTGGAAAGCGCCACTGGAGGCGTTGCTTGCTGGGAGCTCGTGTGTGTTTTCGATGTGGCTAG
Protein sequenceShow/hide protein sequence
MAYQQAPYHRQVGVRVSSKGDLVEFVEYQNHASSWKSLWKGSWKGSHAPEAVVPPVGQENNLAGDPQVEQPTPAAEPVTVDAIQAIVQSAVVRAVQSAMQAAVAGVIAGQ
QAQAPQNNEALSREARCLRDFRKWDPRPFDGASKDPTVAKLWLSSIETVFRHMNCPEDQKVYRVAFLLQDNALIWWQSAERTIDTSNGPVTWPQFREAFFKKYYPANAFQ
EASRVCGSYTGKPNRRGIRDRVRQTVSICFCPAHRPPNYATTVRVAELIDCHPATAPRATSEQRPSSGQNRRFDQTSDVVQQSQDIVVNPPQAQAQGQQANQQNTVREKP
VCNNCGKRHWRRCLLGARVCFRCG