; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018111 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018111
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr5:16614498..16622387
RNA-Seq ExpressionLag0018111
SyntenyLag0018111
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]6.7e-5740.65Show/hide
Query:  MRKVRELTLVPFDPEIERTLSRIRRESKENHIQMANPNPEE-PKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCAYRGSPMEDPNSHLKS
        MR+ R   ++P DPEIERTL  +RR      + MA  + E  P+ ++DY +PV     S I+  PINANNFELK  LI M +   + GSP++DPN HL  
Subjt:  MRKVRELTLVPFDPEIERTLSRIRRESKENHIQMANPNPEE-PKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCAYRGSPMEDPNSHLKS

Query:  FLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGY
        FL+IC TVK+NGV+ED I L LFPFSL+DKAR WLQS+ PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG 
Subjt:  FLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGY

Query:  PDWLQWPSERSGPKKIAAGVFEIDNVSALQAQMSSLANAFLKFSESSNRSKLEEAVIALN-------NTVNGHSAAIKNIETQLGQLAVSVHHEEETQLV
        PDWLQ     +G       + +  +   L ++ +  A A L+   S+N     E  +A           +   SA +  +  Q+  L      +    L 
Subjt:  PDWLQWPSERSGPKKIAAGVFEIDNVSALQAQMSSLANAFLKFSESSNRSKLEEAVIALN-------NTVNGHSAAIKNIETQLGQLAVSVHHEEETQLV

Query:  EEDETTEPEELTGE-VEEGTTSNEAEKLNPEPSISSP
                 E + E V+     N   + NP P+   P
Subjt:  EEDETTEPEELTGE-VEEGTTSNEAEKLNPEPSISSP

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]2.1e-7453.38Show/hide
Query:  LVPFDPEIERTLSRIRRESKENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCAYRGSPMEDPNSHLKSFLDICGTVK
        L+P DPEI+RT  R  R       +MA    E PK IRDYFQP     Q GI+  PIN NNFELK GLIQMAR+ A+RG   EDP+ HL+SFL+ICGTVK
Subjt:  LVPFDPEIERTLSRIRRESKENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCAYRGSPMEDPNSHLKSFLDICGTVK

Query:  LNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQ----
        +NGVS DAI L LFPFSLQD+A+DWL++IPP SITTW+ L QAFL K+FPPAK+ +LRTEIGTF+Q  DEQL+EAWER+K+LLR+CPQHGYPDWLQ    
Subjt:  LNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQ----

Query:  -----------------------------------------WPSERSGPK-KIAAGVFEIDNVSALQAQMSSLANAFLKFS
                                                 WP ER+ P    AAG++E+D V++L+AQM+SL NA  K +
Subjt:  -----------------------------------------WPSERSGPK-KIAAGVFEIDNVSALQAQMSSLANAFLKFS

XP_021279280.1 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 [Herrania umbratica]1.8e-5744.59Show/hide
Query:  MRKVRELTLVPFDPEIERTLSRIRRE--------------SKENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCA-Y
        M++   L LVPFDP+IERT  R RRE              +  N     N  PE  + +RDY  P+ Q     I    INANNFE+K   IQM +    +
Subjt:  MRKVRELTLVPFDPEIERTLSRIRRE--------------SKENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCA-Y

Query:  RGSPMEDPNSHLKSFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWE
         G P +DPNSHL +FL+IC T K NGV++DAI L LFPFSL+DKA+ WL S+P GSITTW+ L Q FL KFFPPAKT K+R +I +F Q   E L+EAWE
Subjt:  RGSPMEDPNSHLKSFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWE

Query:  RFKELLRKCPQHGYPDWL---------------------------------------------QWPSERSGPKKIAAGVFEIDNVSALQAQMSSLA
        RFKELLR+CP HG PDWL                                             QWPSERSG +K A G +EID +  L  Q+++L+
Subjt:  RFKELLRKCPQHGYPDWL---------------------------------------------QWPSERSGPKKIAAGVFEIDNVSALQAQMSSLA

XP_021279860.1 uncharacterized protein LOC110413413 [Herrania umbratica]2.3e-5744.59Show/hide
Query:  MRKVRELTLVPFDPEIERTLSRIRRE--------------SKENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCA-Y
        M++   L LVPFDP+IERT  R RRE              +  N     N  PE  + +RDY  P+ Q     I    INANNFE+K   IQM +    +
Subjt:  MRKVRELTLVPFDPEIERTLSRIRRE--------------SKENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCA-Y

Query:  RGSPMEDPNSHLKSFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWE
         G P +DPNSHL +FL+IC T K NGV++DAI L LFPFSL+DKA+ WL S+P GSITTW+ L Q FL KFFPPAKT K+R +I +F Q   E L+EAWE
Subjt:  RGSPMEDPNSHLKSFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWE

Query:  RFKELLRKCPQHGYPDWL---------------------------------------------QWPSERSGPKKIAAGVFEIDNVSALQAQMSSLA
        RFKELLR+CP HG PDWL                                             QWPSERSG +K A G +EID +  L  Q+++L+
Subjt:  RFKELLRKCPQHGYPDWL---------------------------------------------QWPSERSGPKKIAAGVFEIDNVSALQAQMSSLA

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]5.5e-5943.32Show/hide
Query:  MRKVRELTLVPFDPEIERT------LSRIRRES-KENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCAYRGSPMEDP
        MR+ R L L+  DPE ERT      + R  RE+  E  ++ AN + ++ + IRDY +PV  +  SGI    I A NFELK GLI M +   + G+ +EDP
Subjt:  MRKVRELTLVPFDPEIERT------LSRIRRES-KENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCAYRGSPMEDP

Query:  NSHLKSFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRK
        N+HL SFL+IC TVK+NGV+EDAI L LF FSL+DKA+ W QS+P GSITTWD L Q FL K+FPP+K+ +LR EI  F+Q   E  +EAWERFK+LLR+
Subjt:  NSHLKSFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRK

Query:  CPQHGYPDWL---------------------------------------------QWPSERSGPKKIAAGVFEIDNVSALQAQMSSLANAFLKFSESSNR
        CPQHG+  W+                                             QWPSERSG KK+ AG+ E+D ++AL AQ++SL N  +  +   N+
Subjt:  CPQHGYPDWL---------------------------------------------QWPSERSGPKKIAAGVFEIDNVSALQAQMSSLANAFLKFSESSNR

Query:  SKLEEAV
          ++  +
Subjt:  SKLEEAV

TrEMBL top hitse value%identityAlignment
A0A3S3N117 Retrotrans_gag domain-containing protein2.6e-5442.4Show/hide
Query:  MRKVRELTLVPFDPEIERTLSRIRRESK-ENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQM-ARDCAYRGSPMEDPNSHLK
        MR+ + L LVP DPEIERTL R+++E K ++  ++     +  + + DY  P+     S I    I ANNFE+K  +IQM A    + G P +DPN+H+ 
Subjt:  MRKVRELTLVPFDPEIERTLSRIRRESK-ENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQM-ARDCAYRGSPMEDPNSHLK

Query:  SFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHG
        +FL++C T K NGV++DA+ L L PFSL+DKA+ WL S+P  +ITTWD L + FL KFFPP KTVK+R +I TF Q   E L+EAWER+KELLRKCP HG
Subjt:  SFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHG

Query:  YPDWL---------------------------------------------QWPSERSGPKKIAAGVFEIDNVSALQAQMSSLA
         P W+                                             QWPS+    KKI  GV E+D++SAL AQ+++L+
Subjt:  YPDWL---------------------------------------------QWPSERSGPKKIAAGVFEIDNVSALQAQMSSLA

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129458.6e-5844.59Show/hide
Query:  MRKVRELTLVPFDPEIERTLSRIRRE--------------SKENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCA-Y
        M++   L LVPFDP+IERT  R RRE              +  N     N  PE  + +RDY  P+ Q     I    INANNFE+K   IQM +    +
Subjt:  MRKVRELTLVPFDPEIERTLSRIRRE--------------SKENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCA-Y

Query:  RGSPMEDPNSHLKSFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWE
         G P +DPNSHL +FL+IC T K NGV++DAI L LFPFSL+DKA+ WL S+P GSITTW+ L Q FL KFFPPAKT K+R +I +F Q   E L+EAWE
Subjt:  RGSPMEDPNSHLKSFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWE

Query:  RFKELLRKCPQHGYPDWL---------------------------------------------QWPSERSGPKKIAAGVFEIDNVSALQAQMSSLA
        RFKELLR+CP HG PDWL                                             QWPSERSG +K A G +EID +  L  Q+++L+
Subjt:  RFKELLRKCPQHGYPDWL---------------------------------------------QWPSERSGPKKIAAGVFEIDNVSALQAQMSSLA

A0A6J0ZYV0 uncharacterized protein LOC1104134131.1e-5744.59Show/hide
Query:  MRKVRELTLVPFDPEIERTLSRIRRE--------------SKENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCA-Y
        M++   L LVPFDP+IERT  R RRE              +  N     N  PE  + +RDY  P+ Q     I    INANNFE+K   IQM +    +
Subjt:  MRKVRELTLVPFDPEIERTLSRIRRE--------------SKENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCA-Y

Query:  RGSPMEDPNSHLKSFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWE
         G P +DPNSHL +FL+IC T K NGV++DAI L LFPFSL+DKA+ WL S+P GSITTW+ L Q FL KFFPPAKT K+R +I +F Q   E L+EAWE
Subjt:  RGSPMEDPNSHLKSFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWE

Query:  RFKELLRKCPQHGYPDWL---------------------------------------------QWPSERSGPKKIAAGVFEIDNVSALQAQMSSLA
        RFKELLR+CP HG PDWL                                             QWPSERSG +K A G +EID +  L  Q+++L+
Subjt:  RFKELLRKCPQHGYPDWL---------------------------------------------QWPSERSGPKKIAAGVFEIDNVSALQAQMSSLA

A0A6P6X8N4 uncharacterized protein LOC1137407881.6e-5150.91Show/hide
Query:  RKVRELTLVPFDPEIERTLSRIRRESKENHIQ------------------MANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDC
        R  RE+ L  FDPEIERTL R RR +     Q                  MA   P   + +RD+  P   + Q+ I    +NANNFE+K  LIQM +  
Subjt:  RKVRELTLVPFDPEIERTLSRIRRESKENHIQ------------------MANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDC

Query:  AYRGSPMEDPNSHLKSFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEA
         Y G+  EDPNSHL +FL+IC T+K NGVSEDAI L LFPFSL+DK   WLQS  P + TTW  L +AFL KFFPP KT KLR +I +F QQ  E L+E 
Subjt:  AYRGSPMEDPNSHLKSFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEA

Query:  WERFKELLRKCPQHGYPDWL
        WER++EL RKCP HG PDWL
Subjt:  WERFKELLRKCPQHGYPDWL

A0A6P6XAQ1 Reverse transcriptase3.5e-5142.28Show/hide
Query:  NPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCAYRGSPMEDPNSHLKSFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSI
        N    + +RD+  P  Q  Q+ IV   +NANNFE+K  LIQM +   Y G+  EDPNSHL +FL+IC T+K NGVSEDAI L LFPFSL+DKA+ WLQS 
Subjt:  NPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCAYRGSPMEDPNSHLKSFLDICGTVKLNGVSEDAICLHLFPFSLQDKARDWLQSI

Query:  PPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWL----------------------------------
        PP + TTWD L +AFL KFFPP KT KLR +I +F QQ  E L+EAWER++EL R+CP HG PDWL                                  
Subjt:  PPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWL----------------------------------

Query:  -----------QWPSERSGPKKIAAGVFEIDNVSALQAQMSSLANAFLKFSESSNRSKLEEAVIALNNTVNG
                   QW +ER G  +  AG+ E+D ++ L A+M ++    +K       S   + V+  + T+ G
Subjt:  -----------QWPSERSGPKKIAAGVFEIDNVSALQAQMSSLANAFLKFSESSNRSKLEEAVIALNNTVNG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAAAGGTTAGGGAATTGACCTTAGTTCCCTTTGACCCAGAAATCGAGAGAACTTTAAGTAGAATTCGAAGGGAAAGTAAAGAAAATCATATCCAAATGGCCAATCC
TAATCCCGAGGAGCCTAAGCCTATTAGGGACTACTTTCAGCCGGTCTTTCAGGAGCAACAGTCGGGGATTGTCTATGCCCCGATCAATGCTAACAATTTTGAGCTGAAAA
TGGGTCTCATACAAATGGCTCGAGATTGTGCTTACAGAGGATCACCCATGGAGGATCCAAATTCTCACCTGAAATCCTTCCTAGACATCTGTGGGACGGTAAAATTAAAT
GGTGTGTCTGAAGATGCCATTTGCTTACATTTATTTCCTTTCTCTTTGCAGGATAAAGCGAGGGACTGGTTACAATCTATTCCACCTGGGAGCATTACCACTTGGGATGC
TTTGGTCCAGGCATTCCTAAAGAAGTTCTTCCCTCCTGCTAAGACGGTCAAGCTGAGGACCGAAATTGGGACATTCCAGCAACAATTCGATGAGCAGCTGTTTGAGGCTT
GGGAAAGATTTAAAGAGCTGCTAAGGAAGTGCCCTCAGCATGGCTACCCCGACTGGCTTCAGTGGCCATCTGAGCGGTCGGGACCGAAAAAGATTGCAGCTGGGGTGTTC
GAGATTGACAACGTAAGTGCACTACAGGCCCAGATGTCTTCCTTGGCTAATGCATTTTTAAAATTCTCAGAATCAAGTAACAGGTCAAAGCTCGAGGAAGCCGTGATTGC
CCTAAACAACACTGTCAATGGTCATTCTGCAGCCATCAAGAACATAGAGACTCAGTTGGGACAACTGGCCGTCTCTGTGCATCACGAAGAGGAGACTCAATTAGTTGAGG
AGGATGAGACAACTGAACCAGAGGAACTCACTGGAGAAGTTGAAGAAGGCACCACCTCAAATGAAGCTGAAAAGCTTAATCCTGAGCCTTCTATCTCGTCTCCTACTGTT
TTAGTTCCTAAATCAAAGAAAAAGAAAAATTCTAAGATAGGGATATCGAGTTCAATTGCATGTGGGAGAATCATGTATGCTCGGGAGAGCACTGGATCTTCCTTGCTATC
CTCAAGCCTTTGGAACGTATTGTGGGATCCTCAAGTGATGGAATTTAAGGAGAAGAGCTTGAAGAACAAGCATATGGAGGTGGGTATGGTTAGTTTTGAGCTAGCAAGGG
GACCTAATGGACCTACACATCATGGGCTCCAAAGACCCTCAGCTGCACTCTTCTCACTGCAGAATATTTCTGTGTCCACGGATATCGACCAATACTACAAATCAAAGGAC
AACCCTAAACGAACAGGAGTTCATAGCAAGCTCAGGATTAAGATCGAGTTACCTAGGTCATCGTATGAAATAGTCAGTCTTAACAGTAAACGACGTTGTAAAGATCACGT
CGTTTGCCTCATTCGCAAAGTAGGCTGCATCCATAGTGTTACCAGAACAAGTTGTTTCGTTGTCCAAGTTTTCTTCAACGCAGACGTAGATCCCAGAGTAGTTCGTACGA
TGAAAGCTGGAATAGATGATCACACTGACATGTGGCTGATGTGCAAGCTGGTGAAAGATGATTGGGGCATGCAAGCCAATTTCTTGGAGAAGTCAGCCTCCTCAGCCTAT
AAGAAGCGCCCAGCTGCCCATGAGAAAAGATACGGATGGAGAGAGGGGGAGAGCATAAAGCAAAAAGAAAGAAACCAGAAGAAGAAAGATTTTCGCGACGCAAACCTATT
TTTGCGACAAAATGTATTGTGTTGTGTTTTTCTATCTCAGCATGAGCTAATTTACTTTTATCAAGGCTTAGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGAAAGGTTAGGGAATTGACCTTAGTTCCCTTTGACCCAGAAATCGAGAGAACTTTAAGTAGAATTCGAAGGGAAAGTAAAGAAAATCATATCCAAATGGCCAATCC
TAATCCCGAGGAGCCTAAGCCTATTAGGGACTACTTTCAGCCGGTCTTTCAGGAGCAACAGTCGGGGATTGTCTATGCCCCGATCAATGCTAACAATTTTGAGCTGAAAA
TGGGTCTCATACAAATGGCTCGAGATTGTGCTTACAGAGGATCACCCATGGAGGATCCAAATTCTCACCTGAAATCCTTCCTAGACATCTGTGGGACGGTAAAATTAAAT
GGTGTGTCTGAAGATGCCATTTGCTTACATTTATTTCCTTTCTCTTTGCAGGATAAAGCGAGGGACTGGTTACAATCTATTCCACCTGGGAGCATTACCACTTGGGATGC
TTTGGTCCAGGCATTCCTAAAGAAGTTCTTCCCTCCTGCTAAGACGGTCAAGCTGAGGACCGAAATTGGGACATTCCAGCAACAATTCGATGAGCAGCTGTTTGAGGCTT
GGGAAAGATTTAAAGAGCTGCTAAGGAAGTGCCCTCAGCATGGCTACCCCGACTGGCTTCAGTGGCCATCTGAGCGGTCGGGACCGAAAAAGATTGCAGCTGGGGTGTTC
GAGATTGACAACGTAAGTGCACTACAGGCCCAGATGTCTTCCTTGGCTAATGCATTTTTAAAATTCTCAGAATCAAGTAACAGGTCAAAGCTCGAGGAAGCCGTGATTGC
CCTAAACAACACTGTCAATGGTCATTCTGCAGCCATCAAGAACATAGAGACTCAGTTGGGACAACTGGCCGTCTCTGTGCATCACGAAGAGGAGACTCAATTAGTTGAGG
AGGATGAGACAACTGAACCAGAGGAACTCACTGGAGAAGTTGAAGAAGGCACCACCTCAAATGAAGCTGAAAAGCTTAATCCTGAGCCTTCTATCTCGTCTCCTACTGTT
TTAGTTCCTAAATCAAAGAAAAAGAAAAATTCTAAGATAGGGATATCGAGTTCAATTGCATGTGGGAGAATCATGTATGCTCGGGAGAGCACTGGATCTTCCTTGCTATC
CTCAAGCCTTTGGAACGTATTGTGGGATCCTCAAGTGATGGAATTTAAGGAGAAGAGCTTGAAGAACAAGCATATGGAGGTGGGTATGGTTAGTTTTGAGCTAGCAAGGG
GACCTAATGGACCTACACATCATGGGCTCCAAAGACCCTCAGCTGCACTCTTCTCACTGCAGAATATTTCTGTGTCCACGGATATCGACCAATACTACAAATCAAAGGAC
AACCCTAAACGAACAGGAGTTCATAGCAAGCTCAGGATTAAGATCGAGTTACCTAGGTCATCGTATGAAATAGTCAGTCTTAACAGTAAACGACGTTGTAAAGATCACGT
CGTTTGCCTCATTCGCAAAGTAGGCTGCATCCATAGTGTTACCAGAACAAGTTGTTTCGTTGTCCAAGTTTTCTTCAACGCAGACGTAGATCCCAGAGTAGTTCGTACGA
TGAAAGCTGGAATAGATGATCACACTGACATGTGGCTGATGTGCAAGCTGGTGAAAGATGATTGGGGCATGCAAGCCAATTTCTTGGAGAAGTCAGCCTCCTCAGCCTAT
AAGAAGCGCCCAGCTGCCCATGAGAAAAGATACGGATGGAGAGAGGGGGAGAGCATAAAGCAAAAAGAAAGAAACCAGAAGAAGAAAGATTTTCGCGACGCAAACCTATT
TTTGCGACAAAATGTATTGTGTTGTGTTTTTCTATCTCAGCATGAGCTAATTTACTTTTATCAAGGCTTAGCTTAG
Protein sequenceShow/hide protein sequence
MRKVRELTLVPFDPEIERTLSRIRRESKENHIQMANPNPEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKMGLIQMARDCAYRGSPMEDPNSHLKSFLDICGTVKLN
GVSEDAICLHLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQWPSERSGPKKIAAGVF
EIDNVSALQAQMSSLANAFLKFSESSNRSKLEEAVIALNNTVNGHSAAIKNIETQLGQLAVSVHHEEETQLVEEDETTEPEELTGEVEEGTTSNEAEKLNPEPSISSPTV
LVPKSKKKKNSKIGISSSIACGRIMYARESTGSSLLSSSLWNVLWDPQVMEFKEKSLKNKHMEVGMVSFELARGPNGPTHHGLQRPSAALFSLQNISVSTDIDQYYKSKD
NPKRTGVHSKLRIKIELPRSSYEIVSLNSKRRCKDHVVCLIRKVGCIHSVTRTSCFVVQVFFNADVDPRVVRTMKAGIDDHTDMWLMCKLVKDDWGMQANFLEKSASSAY
KKRPAAHEKRYGWREGESIKQKERNQKKKDFRDANLFLRQNVLCCVFLSQHELIYFYQGLA