CuGenDBv2

Lag0018496 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018496
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr5:28336423..28338957
RNA-Seq Expression: Lag0018496
Synteny: Lag0018496
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR005162 - Retrotransposon gag domain
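
The genome location above is given as chromosome:start..end. As a minimal sketch, the snippet below splits such a string into its parts and reports the span length. It assumes the coordinates are 1-based and inclusive, which the record itself does not state, and the helper name parse_region is made up for illustration.

```python
import re


def parse_region(region):
    """Split 'chrom:start..end' into (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", region)
    if match is None:
        raise ValueError(f"unrecognised region string: {region!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


print(parse_region("chr5:28336423..28338957"))
# → ('chr5', 28336423, 28338957, 2535)
```

Under that assumption the gene spans 2535 bp on chr5.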


Homology
GenBank top hits (e value, % identity, alignment)
XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia] | e value: 3.1e-67 | % identity: 50.93
Query:  KWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFA
        +W+ +L+A +  + C D  K+ G VF+L+ +A  WW S+A AEDHAN  I W RFKDLLYDYY+ ETV+D KEAEFL L QG+++V QYERKFT L RFA
Subjt:  KWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFA

Query:  PDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHH
         +L+     KIKRF+KGL + IRG V L RP ++AEA+ GALIMDK+V+ K        SS GVKRK  P  A    +AP+ Q +   + P+C TC + H
Subjt:  PDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHH

Query:  SGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALK-APPVQAQGGNQRARVFALTKEETDDEDVVL
        +GQCWTG K  F C +E H+AR+CP    AN ++   + +P V  QG NQRARVFALT++E  D + V+
Subjt:  SGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALK-APPVQAQGGNQRARVFALTKEETDDEDVVL

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia] | e value: 3.2e-72 | % identity: 52.57
Query:  SITKWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALF
        ++ +W+ +L+AL+  + C D  K+ G VF+L+ +A  WW SVA AED+AN PI W RFK+LLYDYY+ ETV+D KEAEFL L QG+++V QYERKFT L 
Subjt:  SITKWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALF

Query:  RFAPDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCN
        RFA +L+ T   KIKRF+KGLR+ IRG V L RP T+AEA+ GAL+MDK+V+ K  P     SS GVKRK P   A    +AP++Q +   + P+C TC 
Subjt:  RFAPDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCN

Query:  RHHSGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALK-APPVQAQGGNQRARVFALTKEETDDEDVVL
        + H+GQCWTG K  F C +EGH+AR+CP    AN ++   +  PPV  QG NQRARVFALT++E  D + V+
Subjt:  RHHSGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALK-APPVQAQGGNQRARVFALTKEETDDEDVVL

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia] | e value: 6.0e-63 | % identity: 40.67
Query:  KWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFA
        +WV +L+AL+  + C+D  K+ G VF+L+ +A  WW+SVA AEDHAN P++W RFKDLLY+YYF    R++K  EFLRLTQGS+TV QYERKFT L RF 
Subjt:  KWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFA

Query:  PDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHH
           V T + KI +FI GLR EI+G + L  P T+A A+  AL+MDK    + Q +    S+ GVKRK     A   ++  +   +     P+C +C ++H
Subjt:  PDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHH

Query:  SGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALKAPPVQA-QGGNQRARVFALTKEETDDEDVVL-------------------------------
        +  CW G K+ F C KEGH+ R+C   G +N +  + K P   A QGG Q ARVFALT+ + +  + V+                               
Subjt:  SGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALKAPPVQA-QGGNQRARVFALTKEETDDEDVVL-------------------------------

Query:  ------------------------QQIVKFGVVSIAGQTLEARLIQLNMHDCDVILGMD
                                 Q+VK G +S  GQTLE  LIQLNM D DVILGMD
Subjt:  ------------------------QQIVKFGVVSIAGQTLEARLIQLNMHDCDVILGMD

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia] | e value: 1.1e-64 | % identity: 40.67
Query:  KWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFA
        +WV +L+AL+  + C+D  K+ G VF+L+ +A  WW+SVA AEDHAN P++W RFKDLLY+YYF  TVR++K  EFLRLTQGS+TV +YERKFT L RF 
Subjt:  KWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFA

Query:  PDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHH
           + T + KI +FI GLR EI+G + L  P T+A A+  AL+MDK    + Q +    SS GVKRK     +  P++  +   +     P+C +C + H
Subjt:  PDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHH

Query:  SGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALKAP-PVQAQGGNQRARVFALTKEETDDEDVVL-------------------------------
        +G CW G ++ + C KEGH+AR+CP  G +N +    + P    AQGG  RARVFALT+ + +  + V+                               
Subjt:  SGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALKAP-PVQAQGGNQRARVFALTKEETDDEDVVL-------------------------------

Query:  ------------------------QQIVKFGVVSIAGQTLEARLIQLNMHDCDVILGMD
                                 Q+VK G +S  GQTLE +LIQL+M D DVILGMD
Subjt:  ------------------------QQIVKFGVVSIAGQTLEARLIQLNMHDCDVILGMD

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia] | e value: 1.3e-65 | % identity: 45.37
Query:  MNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFAPDLVNTPERKIK
        ++C +  K+ G VF+L+ +A  WW SVA AEDHAN PI+W RFKDLLYDYY+ +T++D KEAEFL  + G++TV QYERKFT L  FA +L+ T   KIK
Subjt:  MNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFAPDLVNTPERKIK

Query:  RFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHHSGQCWTGDKVGF
        RF+KGLR+ IRG V L RP T+AEA+ G LIMD +V+  +QP     SS GVKRK  P  A  P +AP++  +   + P+C +C +  +GQCWTG++  F
Subjt:  RFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHHSGQCWTGDKVGF

Query:  NCDKEGHYARQCPTKGEANAEKPALKA-PPVQAQGG-----NQRARV---------------------------FALTKEETDDEDVVLQQIVKFGVVSI
         C +EGH+AR+C +   AN ++   +A P V  QGG     N  A V                           F L+        ++  Q+V+ G +S 
Subjt:  NCDKEGHYARQCPTKGEANAEKPALKA-PPVQAQGG-----NQRARV---------------------------FALTKEETDDEDVVLQQIVKFGVVSI

Query:  AGQTLEARLIQLNMHDCDVILGMD
          QTLEARLIQL+M D DVILGMD
Subjt:  AGQTLEARLIQLNMHDCDVILGMD

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DL73 uncharacterized protein LOC111022144 | e value: 1.5e-67 | % identity: 50.93
Query:  KWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFA
        +W+ +L+A +  + C D  K+ G VF+L+ +A  WW S+A AEDHAN  I W RFKDLLYDYY+ ETV+D KEAEFL L QG+++V QYERKFT L RFA
Subjt:  KWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFA

Query:  PDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHH
         +L+     KIKRF+KGL + IRG V L RP ++AEA+ GALIMDK+V+ K        SS GVKRK  P  A    +AP+ Q +   + P+C TC + H
Subjt:  PDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHH

Query:  SGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALK-APPVQAQGGNQRARVFALTKEETDDEDVVL
        +GQCWTG K  F C +E H+AR+CP    AN ++   + +P V  QG NQRARVFALT++E  D + V+
Subjt:  SGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALK-APPVQAQGGNQRARVFALTKEETDDEDVVL

A0A6J1DQB9 Reverse transcriptase | e value: 2.9e-63 | % identity: 40.67
Query:  KWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFA
        +WV +L+AL+  + C+D  K+ G VF+L+ +A  WW+SVA AEDHAN P++W RFKDLLY+YYF    R++K  EFLRLTQGS+TV QYERKFT L RF 
Subjt:  KWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFA

Query:  PDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHH
           V T + KI +FI GLR EI+G + L  P T+A A+  AL+MDK    + Q +    S+ GVKRK     A   ++  +   +     P+C +C ++H
Subjt:  PDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHH

Query:  SGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALKAPPVQA-QGGNQRARVFALTKEETDDEDVVL-------------------------------
        +  CW G K+ F C KEGH+ R+C   G +N +  + K P   A QGG Q ARVFALT+ + +  + V+                               
Subjt:  SGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALKAPPVQA-QGGNQRARVFALTKEETDDEDVVL-------------------------------

Query:  ------------------------QQIVKFGVVSIAGQTLEARLIQLNMHDCDVILGMD
                                 Q+VK G +S  GQTLE  LIQLNM D DVILGMD
Subjt:  ------------------------QQIVKFGVVSIAGQTLEARLIQLNMHDCDVILGMD

A0A6J1DUM2 uncharacterized protein LOC111023247 | e value: 1.5e-72 | % identity: 52.57
Query:  SITKWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALF
        ++ +W+ +L+AL+  + C D  K+ G VF+L+ +A  WW SVA AED+AN PI W RFK+LLYDYY+ ETV+D KEAEFL L QG+++V QYERKFT L 
Subjt:  SITKWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALF

Query:  RFAPDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCN
        RFA +L+ T   KIKRF+KGLR+ IRG V L RP T+AEA+ GAL+MDK+V+ K  P     SS GVKRK P   A    +AP++Q +   + P+C TC 
Subjt:  RFAPDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCN

Query:  RHHSGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALK-APPVQAQGGNQRARVFALTKEETDDEDVVL
        + H+GQCWTG K  F C +EGH+AR+CP    AN ++   +  PPV  QG NQRARVFALT++E  D + V+
Subjt:  RHHSGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALK-APPVQAQGGNQRARVFALTKEETDDEDVVL

A0A6J1DWP4 uncharacterized protein LOC111025215 | e value: 5.3e-65 | % identity: 40.67
Query:  KWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFA
        +WV +L+AL+  + C+D  K+ G VF+L+ +A  WW+SVA AEDHAN P++W RFKDLLY+YYF  TVR++K  EFLRLTQGS+TV +YERKFT L RF 
Subjt:  KWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFA

Query:  PDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHH
           + T + KI +FI GLR EI+G + L  P T+A A+  AL+MDK    + Q +    SS GVKRK     +  P++  +   +     P+C +C + H
Subjt:  PDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHH

Query:  SGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALKAP-PVQAQGGNQRARVFALTKEETDDEDVVL-------------------------------
        +G CW G ++ + C KEGH+AR+CP  G +N +    + P    AQGG  RARVFALT+ + +  + V+                               
Subjt:  SGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALKAP-PVQAQGGNQRARVFALTKEETDDEDVVL-------------------------------

Query:  ------------------------QQIVKFGVVSIAGQTLEARLIQLNMHDCDVILGMD
                                 Q+VK G +S  GQTLE +LIQL+M D DVILGMD
Subjt:  ------------------------QQIVKFGVVSIAGQTLEARLIQLNMHDCDVILGMD

A0A6J1DYU5 uncharacterized protein LOC111025517 | e value: 6.2e-66 | % identity: 45.37
Query:  MNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFAPDLVNTPERKIK
        ++C +  K+ G VF+L+ +A  WW SVA AEDHAN PI+W RFKDLLYDYY+ +T++D KEAEFL  + G++TV QYERKFT L  FA +L+ T   KIK
Subjt:  MNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRLTQGSMTVVQYERKFTALFRFAPDLVNTPERKIK

Query:  RFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHHSGQCWTGDKVGF
        RF+KGLR+ IRG V L RP T+AEA+ G LIMD +V+  +QP     SS GVKRK  P  A  P +AP++  +   + P+C +C +  +GQCWTG++  F
Subjt:  RFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPILPLCATCNRHHSGQCWTGDKVGF

Query:  NCDKEGHYARQCPTKGEANAEKPALKA-PPVQAQGG-----NQRARV---------------------------FALTKEETDDEDVVLQQIVKFGVVSI
         C +EGH+AR+C +   AN ++   +A P V  QGG     N  A V                           F L+        ++  Q+V+ G +S 
Subjt:  NCDKEGHYARQCPTKGEANAEKPALKA-PPVQAQGG-----NQRARV---------------------------FALTKEETDDEDVVLQQIVKFGVVSI

Query:  AGQTLEARLIQLNMHDCDVILGMD
          QTLEARLIQL+M D DVILGMD
Subjt:  AGQTLEARLIQLNMHDCDVILGMD

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
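
In the alignment blocks above, the middle line of each triplet marks identical residues with the residue letter and conservative substitutions with "+". As extracted here, the Subjt lines appear to repeat the Query lines, so the middle line is the only place the comparison can be read from. The sketch below counts identities and positives for one query/midline pair. The function name midline_stats is hypothetical, and it assumes the "Query:" label and leading padding have already been stripped so that both strings start at the same alignment column.

```python
def midline_stats(query, midline):
    """Return (identities, positives, columns) for one alignment segment.

    query   : the aligned query residues (may contain '-' gap characters)
    midline : the BLAST-style match line for the same columns; letters mark
              identities, '+' marks conservative substitutions, and spaces
              mark mismatches or gaps
    """
    # Trailing spaces on the midline are often lost in extraction; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query segment")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# First ten columns of the first GenBank alignment shown above.
print(midline_stats("KWVADLKALF", "+W+ +L+A +"))
# → (3, 8, 10)
```

Summing these counts over every segment of a hit gives figures for the displayed region only, which need not match the % identity reported in the hit line.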

Sequences
CDS sequence
ATGATGAATCGCCGCAATCTCCTTGCCTTGCGCAGCGAGACCGGCTTCCAGGGCTTCCATTCGCGCTCCATACGAGTAGACATTGTCAAACCAGACATACTTCACTGGGG
GTCTATCACCTACACTGTCACCGAATATCAGAATGCCTTTGTTGAGGGTAGACAGATTCTTGATGCCTCCCTTATGGCTAATGAACTTATCGATGAATGGAGAAGGAAGA
ACTGTAAAGGTGTGACACCCGAGGATAGGAGGGGTCGCAGGGGAATGTCGGGGTCGTCAGGTGCCAAATCCGGATTCCGAATCCTGGGCCTAGGGCGTTACAGTTGGTAT
CAGAGCGGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGGTGTCTAGGGTTATGGTCTTCCTCGTTTCTCCTCTCCATCACCAAGTGGGTCGCTGATTT
AAAGGCACTGTTTGACCTCATGAACTGTAATGATCCCCTGAAGATTGGAGGAACCGTCTTCATACTCAAGGATGACGCTCGCATGTGGTGGAAGTCTGTGGCAACCGCCG
AAGATCATGCCAATCAGCCGATTTCATGGGAAAGGTTCAAAGACCTGTTGTACGATTATTACTTCTCGGAGACAGTCAGAGATGACAAAGAAGCTGAGTTCCTCCGTTTG
ACCCAGGGAAGTATGACCGTAGTGCAGTATGAGAGGAAGTTCACTGCGTTGTTTCGCTTCGCTCCTGACCTTGTCAACACACCGGAAAGGAAGATCAAGAGATTCATAAA
AGGCCTCCGTGAGGAAATTCGTGGTTCTGTGGCCTTGTGCAGACCCATGACTTTCGCTGAAGCGCTCTCAGGCGCGTTGATCATGGATAAGAATGTGGCAAGGAAGATGC
AACCTCGCTGGGGAGAAGCTTCATCGCCTGGTGTTAAAAGGAAGCCTCCTCCCATGCCCGCACGTTCGCCGACCAAGGCCCCTCGTCAACAGCAGAGGCCAACTCCCATC
CTCCCTCTGTGCGCTACGTGCAACAGGCACCATTCTGGTCAATGCTGGACAGGTGATAAGGTCGGTTTCAATTGCGATAAAGAAGGACATTATGCCAGACAGTGTCCCAC
TAAAGGCGAAGCAAATGCTGAGAAGCCAGCCCTTAAGGCCCCACCCGTGCAAGCTCAAGGTGGAAATCAGAGGGCGCGTGTCTTTGCCCTCACGAAAGAAGAAACAGATG
ATGAGGATGTCGTGTTACAGCAAATAGTCAAGTTCGGAGTAGTCTCGATTGCAGGTCAGACCCTTGAAGCGAGGTTAATCCAATTGAATATGCACGACTGCGATGTAATC
TTGGGCATGGACTGA
mRNA sequence
ATGATGAATCGCCGCAATCTCCTTGCCTTGCGCAGCGAGACCGGCTTCCAGGGCTTCCATTCGCGCTCCATACGAGTAGACATTGTCAAACCAGACATACTTCACTGGGG
GTCTATCACCTACACTGTCACCGAATATCAGAATGCCTTTGTTGAGGGTAGACAGATTCTTGATGCCTCCCTTATGGCTAATGAACTTATCGATGAATGGAGAAGGAAGA
ACTGTAAAGGTGTGACACCCGAGGATAGGAGGGGTCGCAGGGGAATGTCGGGGTCGTCAGGTGCCAAATCCGGATTCCGAATCCTGGGCCTAGGGCGTTACAGTTGGTAT
CAGAGCGGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGGTGTCTAGGGTTATGGTCTTCCTCGTTTCTCCTCTCCATCACCAAGTGGGTCGCTGATTT
AAAGGCACTGTTTGACCTCATGAACTGTAATGATCCCCTGAAGATTGGAGGAACCGTCTTCATACTCAAGGATGACGCTCGCATGTGGTGGAAGTCTGTGGCAACCGCCG
AAGATCATGCCAATCAGCCGATTTCATGGGAAAGGTTCAAAGACCTGTTGTACGATTATTACTTCTCGGAGACAGTCAGAGATGACAAAGAAGCTGAGTTCCTCCGTTTG
ACCCAGGGAAGTATGACCGTAGTGCAGTATGAGAGGAAGTTCACTGCGTTGTTTCGCTTCGCTCCTGACCTTGTCAACACACCGGAAAGGAAGATCAAGAGATTCATAAA
AGGCCTCCGTGAGGAAATTCGTGGTTCTGTGGCCTTGTGCAGACCCATGACTTTCGCTGAAGCGCTCTCAGGCGCGTTGATCATGGATAAGAATGTGGCAAGGAAGATGC
AACCTCGCTGGGGAGAAGCTTCATCGCCTGGTGTTAAAAGGAAGCCTCCTCCCATGCCCGCACGTTCGCCGACCAAGGCCCCTCGTCAACAGCAGAGGCCAACTCCCATC
CTCCCTCTGTGCGCTACGTGCAACAGGCACCATTCTGGTCAATGCTGGACAGGTGATAAGGTCGGTTTCAATTGCGATAAAGAAGGACATTATGCCAGACAGTGTCCCAC
TAAAGGCGAAGCAAATGCTGAGAAGCCAGCCCTTAAGGCCCCACCCGTGCAAGCTCAAGGTGGAAATCAGAGGGCGCGTGTCTTTGCCCTCACGAAAGAAGAAACAGATG
ATGAGGATGTCGTGTTACAGCAAATAGTCAAGTTCGGAGTAGTCTCGATTGCAGGTCAGACCCTTGAAGCGAGGTTAATCCAATTGAATATGCACGACTGCGATGTAATC
TTGGGCATGGACTGA
Protein sequence
MMNRRNLLALRSETGFQGFHSRSIRVDIVKPDILHWGSITYTVTEYQNAFVEGRQILDASLMANELIDEWRRKNCKGVTPEDRRGRRGMSGSSGAKSGFRILGLGRYSWY
QSGVVPVDWPRKSRLFGCLGLWSSSFLLSITKWVADLKALFDLMNCNDPLKIGGTVFILKDDARMWWKSVATAEDHANQPISWERFKDLLYDYYFSETVRDDKEAEFLRL
TQGSMTVVQYERKFTALFRFAPDLVNTPERKIKRFIKGLREEIRGSVALCRPMTFAEALSGALIMDKNVARKMQPRWGEASSPGVKRKPPPMPARSPTKAPRQQQRPTPI
LPLCATCNRHHSGQCWTGDKVGFNCDKEGHYARQCPTKGEANAEKPALKAPPVQAQGGNQRARVFALTKEETDDEDVVLQQIVKFGVVSIAGQTLEARLIQLNMHDCDVI
LGMD
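
The protein sequence should correspond to a translation of the CDS sequence listed earlier. A minimal sketch for checking this is given below. It assumes the standard nuclear genetic code and a CDS that starts in frame, and the function name translate is illustrative rather than part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the record
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 15 nucleotides of the CDS in this record.
print(translate("ATGATGAATCGCCGC"))  # → MMNRR
print(translate("TTGGGCATGGACTGA"))  # → LGMD
```

Passing the full CDS text, line breaks included, and comparing the result with the protein sequence with its line breaks removed would confirm that the two entries agree.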