CuGenDBv2

Lag0001731 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0001731
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr4:34783998..34786112
RNA-Seq Expression: Lag0001731
Synteny: Lag0001731
Gene Ontology terms: NA
InterPro domains: NA
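
The genome location field uses a `chrom:start..end` layout. A minimal sketch of splitting it into its parts is given below; the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr4:34783998..34786112")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2115 bp on chr4.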


Homology
GenBank top hits (e value, % identity, alignment)
XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] (e value: 8.8e-13; % identity: 24)
Query:  LSTVSWFLFSVPGIWDSTKFCDNFIPEEAHLIPSLPIPRAELQDQPIWHFEKSEIFTIKSGYHLA--QPAAFQALPSFSLSTVSSWWK------------
        +S VS  +    G W      D F P+EA  I S+PI R   +D+ IW++EK+ +++++SGY +A       QA  S S   V  WW             
Subjt:  LSTVSWFLFSVPGIWDSTKFCDNFIPEEAHLIPSLPIPRAELQDQPIWHFEKSEIFTIKSGYHLA--QPAAFQALPSFSLSTVSSWWK------------

Query:  ------------------------------CGQSREDVLHVFWLCKVAQGHWHKSQF----------------------------HSFWHSRPSNPFSPD
                                      CG++ ED +H+FW+CK A+  W  S+F                               W+ R +  F+  
Subjt:  ------------------------------CGQSREDVLHVFWLCKVAQGHWHKSQF----------------------------HSFWHSRPSNPFSPD

Query:  SKGGAARPTDLALREIRDFYYQKW--ISFGPICCFPGSSVLFALYCLILVELGVIGISLCSTHFGIRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYV
        +K        + L E  + Y  ++      PI     ++        IL +    GI      + I  DASF  +    G+GII+ +   QV  +A +Y 
Subjt:  SKGGAARPTDLALREIRDFYYQKW--ISFGPICCFPGSSVLFALYCLILVELGVIGISLCSTHFGIRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYV

Query:  GLKFLLIQQRASLLQKVFGLLKKWFQVFLVEMDFLRPYRILTSQVEDVSKLGLLFSGFRRGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED
         L+ +     A  +  V GL        L     + P       +ED+S+ G +    +       +  F F  R GN   H LA+ AL   E  +WMED
Subjt:  GLKFLLIQQRASLLQKVFGLLKKWFQVFLVEMDFLRPYRILTSQVEDVSKLGLLFSGFRRGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED

XP_022150944.1 uncharacterized protein LOC111018973 [Momordica charantia] (e value: 7.7e-09; % identity: 33.1)
Query:  IRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYVGLKFLLIQQRASLLQKVFGL---------LKKWFQVFLVEMDFLRPYRILTSQVEDVSKLGLLFS
        + VDA+F+  + + G+G+I+RD    V+L+A+R +         RAS +  V G          ++  F  F +E D LR + +LT+   D S++G+L S
Subjt:  IRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYVGLKFLLIQQRASLLQKVFGL---------LKKWFQVFLVEMDFLRPYRILTSQVEDVSKLGLLFS

Query:  GFRRGLIDH-RNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED
          +  L  H     F FT R+GN   H LAQLAL     ++W+E+
Subjt:  GFRRGLIDH-RNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina] (e value: 4.5e-09; % identity: 20.18)
Query:  DSTVSSLFSVPGILDSTKVCDNFIPEEARLTLSLPIPRAELQDQLIWNFEKSEIFTIKSGYHLAQSAAFQALPSFSLSTVSWFLFSVPGIWDSTKFCDNF
        D+TV+ L           +  +F PE+A   + +P+P+   +DQLIW+++K   +++KSGY +A    F   PS                      C N 
Subjt:  DSTVSSLFSVPGILDSTKVCDNFIPEEARLTLSLPIPRAELQDQLIWNFEKSEIFTIKSGYHLAQSAAFQALPSFSLSTVSWFLFSVPGIWDSTKFCDNF

Query:  IPEEAHLIPSLPIPRAELQDQPIWHF-------EKSEIFTIKSGYHLAQPAAFQALPSFSLSTVSSWWK-----------CGQSREDVLHVFWLCKVAQG
                           DQ +W F       EK +IF  ++ + L             L T  + WK           C    E V H    C  A+ 
Subjt:  IPEEAHLIPSLPIPRAELQDQPIWHF-------EKSEIFTIKSGYHLAQPAAFQALPSFSLSTVSSWWK-----------CGQSREDVLHVFWLCKVAQG

Query:  HWHKSQFH---------------SFWHSRPSNPFSPDSKGGAARPTDLALREIRDFYYQKWISFG----PICCFPGSSVLFALYCLILVELGVI---GIS
         W  S                   FW  + +       +G        A+ + R+    KW+  G    P+     +  +   +  I     V    G +
Subjt:  HWHKSQFH---------------SFWHSRPSNPFSPDSKGGAARPTDLALREIRDFYYQKWISFG----PICCFPGSSVLFALYCLILVELGVI---GIS

Query:  LCSTHFG--------IRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYVGLKFLLIQQRASLLQKVFGLLKKWFQVF-LVEMDFLRPYRILTSQVEDVS
             +         + VDA+      + G+G+++RD       +A++ + L   +    A+ ++    + +K    F + E D L    ++  +   ++
Subjt:  LCSTHFG--------IRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYVGLKFLLIQQRASLLQKVFGLLKKWFQVF-LVEMDFLRPYRILTSQVEDVS

Query:  KLGLLFSGFRRGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED
        ++G L S  +  L + +N K   +PR  N   H LA+LAL ++E  +W+++
Subjt:  KLGLLFSGFRRGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED

XP_024044510.1 uncharacterized protein LOC112100177 [Citrus clementina] (e value: 2.2e-08; % identity: 20.16)
Query:  WDSTKFCDNFIPEEAHLIPSLPIPRAELQDQPIWHFEKSEIFTIKSGYHLAQPAAFQALPSFSLSTVSSW------------------------------
        W+  +   +F    +  I  LP+PR    D  IW F+K   ++ KSGY +A    F+  PS S S+ S W                              
Subjt:  WDSTKFCDNFIPEEAHLIPSLPIPRAELQDQPIWHFEKSEIFTIKSGYHLAQPAAFQALPSFSLSTVSSW------------------------------

Query:  WK-----------CGQSREDVLHVFWLCKVAQGHWHKSQFHSFWHSRPSNPFSPDSKGGAARPTDLALREIRDFYYQKWISFG----PICCFPGSSVLFA
        WK           CG  REDV+H    CK  +  W K++F+             D K  A +     ++E+     ++    G    P+     +  +  
Subjt:  WK-----------CGQSREDVLHVFWLCKVAQGHWHKSQFHSFWHSRPSNPFSPDSKGGAARPTDLALREIRDFYYQKWISFG----PICCFPGSSVLFA

Query:  LYCLILVELGVIGISL-----------CSTHFGIRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYVGLKFLLIQQRASLLQKVFGLLKKWFQV----F
         Y  I      + +                 + + VDA+   +T   G+G+++R+   ++  +A++ V  +  ++   A  +  +FG ++  FQV     
Subjt:  LYCLILVELGVIGISL-----------CSTHFGIRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYVGLKFLLIQQRASLLQKVFGLLKKWFQV----F

Query:  LVEMDFLRPYRILTSQVEDVSKLGLLFSGFRRGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED
        ++E D      +  ++   ++++    +  +  L      +  F PR  N     LA+LAL  +   +W+E+
Subjt:  LVEMDFLRPYRILTSQVEDVSKLGLLFSGFRRGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED

XP_024046732.1 uncharacterized protein LOC112101057 [Citrus clementina] (e value: 6.5e-08; % identity: 19.71)
Query:  WDSTKFCDNFIPEEAHLIPSLPIPRAELQDQPIWHFEKSEIFTIKSGYHLAQPAAFQALPSFSLSTV--------SSWWKCGQSREDVLHVFWLCKVAQG
        W++     +F  E A +I S+P+PR    D+ +WH++K   +T+KSGY +A    +   P+ S  ++             C    ED  H F  CKVA+ 
Subjt:  WDSTKFCDNFIPEEAHLIPSLPIPRAELQDQPIWHFEKSEIFTIKSGYHLAQPAAFQALPSFSLSTV--------SSWWKCGQSREDVLHVFWLCKVAQG

Query:  HWHKSQFHSFWHSRPSNPFSPDSKGGAARPTDLALREIRDFYYQKWISFGPICCFPGS-----SVLFALYCLILVELGVIGISLCSTH------------
         W  S   S              +    + +   +  +   +++ W +   +  F G      S++      +     V G    S              
Subjt:  HWHKSQFHSFWHSRPSNPFSPDSKGGAARPTDLALREIRDFYYQKWISFGPICCFPGS-----SVLFALYCLILVELGVIGISLCSTH------------

Query:  -----FGIRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYVGLKFLLIQQRASLLQKVFGL---LKKWFQVFLVEMDFLRPYRILTSQVEDVSKLGLLF
             + + VDA+      L G+G+++R+   QV ++A++    +  +    A  ++  +GL   L+      ++E D +    +  S+     ++    
Subjt:  -----FGIRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYVGLKFLLIQQRASLLQKVFGL---LKKWFQVFLVEMDFLRPYRILTSQVEDVSKLGLLF

Query:  SGFRRGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED
        S         ++      PR  N   H LA+ AL   E   W ++
Subjt:  SGFRRGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED

TrEMBL top hits (e value, % identity, alignment)
A0A2H5Q972 Uncharacterized protein (Fragment) (e value: 5.4e-08; % identity: 21.53)
Query:  WDSTKFCDNFIPEEAHLIPSLPIPRAELQDQPIWHFEKSEIFTIKSGYHLAQPAAFQALPSFSLSTVSSW------------------------------
        WD      +F   +A +I  +P+PR   +D+ IWHF KS  +T+KSGY  A    F A+PS S  + + W                              
Subjt:  WDSTKFCDNFIPEEAHLIPSLPIPRAELQDQPIWHFEKSEIFTIKSGYHLAQPAAFQALPSFSLSTVSSW------------------------------

Query:  WK-----------CGQSREDVLHVFWLCKVAQGHWHKSQFHSFWHSRPSNPFSPDSKGGAARPTDLALREIRDFYYQKWIS----------FGPICCFPG
        WK           C    E+V H    CK A+  W  S F +   + P         G     ++  +       + KW +            P      
Subjt:  WK-----------CGQSREDVLHVFWLCKVAQGHWHKSQFHSFWHSRPSNPFSPDSKGGAARPTDLALREIRDFYYQKWIS----------FGPICCFPG

Query:  SSVLFALYCLI---LVELGVIGISLCSTHF-GIRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYVGLKFLLIQQRASLLQKVFGLLK-KWFQVFLVEM
        +  +   Y  +     ++  +G +     F  I  DA+      L G+G ++RD   QV  +A++       +    A  ++    + K    +  ++E 
Subjt:  SSVLFALYCLI---LVELGVIGISLCSTHF-GIRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYVGLKFLLIQQRASLLQKVFGLLK-KWFQVFLVEM

Query:  DFLRPYRILTSQVEDVSKLGLLFSGFR--RGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVW
        D      ++ ++    S++  +    +  +   DH +C  ++T RS N   H LA+LAL + E  VW
Subjt:  DFLRPYRILTSQVEDVSKLGLLFSGFR--RGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVW

A0A5E4FZN9 PREDICTED: retrotransposon (e value: 1.2e-07; % identity: 22.41)
Query:  PSFSLSTVSWFLFSVPGIWDSTKFCDNFIPEEAHLIPSLPIPRAELQDQPIWHFEKSEIFTIKSGYHLA--QPAAFQALPSFSLSTVSSWWK--------
        P   LST+   LF+  G W+     D F  +E      +P+      D  IWH+E++ ++++KSGY LA  +       PS  +   S +WK        
Subjt:  PSFSLSTVSWFLFSVPGIWDSTKFCDNFIPEEAHLIPSLPIPRAELQDQPIWHFEKSEIFTIKSGYHLA--QPAAFQALPSFSLSTVSSWWK--------

Query:  ----------------CGQ------------------SREDVLHVFWLCKVAQGHWHKS------------QFHSFWHS---------------------
                        CGQ                    E VLH  WLC+ A+  W  S             F   WH+                     
Subjt:  ----------------CGQ------------------SREDVLHVFWLCKVAQGHWHKS------------QFHSFWHS---------------------

Query:  RPSNPFSPDSKGGAA-----RPTDLALREIRDFYYQKWISFGPICCFPGSSVLFALYCLILVELGVIGISLCSTHFGIRVDASFQPNTGLVGIGIILRDR
           N F  + K   A     R T LA +E  +         G       SS    L+       G+         + I VD + +    + G+G+++R+ 
Subjt:  RPSNPFSPDSKGGAA-----RPTDLALREIRDFYYQKWISFGPICCFPGSSVLFALYCLILVELGVIGISLCSTHFGIRVDASFQPNTGLVGIGIILRDR

Query:  FDQVFLSAMRYVGLKFLLIQQRASLLQKVFGL---LKKWFQVFLVEMDFLRPYRILTSQVEDVSKLGLLFSGFRRGLIDHRNCKFLFTPRSGNVTVHRLA
          +   + +R +   +    ++  L+  + GL   +   F   ++EMD       + S  E     GLL       L + R     +TPRSGN   H LA
Subjt:  FDQVFLSAMRYVGLKFLLIQQRASLLQKVFGL---LKKWFQVFLVEMDFLRPYRILTSQVEDVSKLGLLFSGFRRGLIDHRNCKFLFTPRSGNVTVHRLA

Query:  QLALCRQENRVWMED
        Q A    E   W+E+
Subjt:  QLALCRQENRVWMED

A0A6J1DAR4 uncharacterized protein LOC111018954 (e value: 4.2e-13; % identity: 24)
Query:  LSTVSWFLFSVPGIWDSTKFCDNFIPEEAHLIPSLPIPRAELQDQPIWHFEKSEIFTIKSGYHLA--QPAAFQALPSFSLSTVSSWWK------------
        +S VS  +    G W      D F P+EA  I S+PI R   +D+ IW++EK+ +++++SGY +A       QA  S S   V  WW             
Subjt:  LSTVSWFLFSVPGIWDSTKFCDNFIPEEAHLIPSLPIPRAELQDQPIWHFEKSEIFTIKSGYHLA--QPAAFQALPSFSLSTVSSWWK------------

Query:  ------------------------------CGQSREDVLHVFWLCKVAQGHWHKSQF----------------------------HSFWHSRPSNPFSPD
                                      CG++ ED +H+FW+CK A+  W  S+F                               W+ R +  F+  
Subjt:  ------------------------------CGQSREDVLHVFWLCKVAQGHWHKSQF----------------------------HSFWHSRPSNPFSPD

Query:  SKGGAARPTDLALREIRDFYYQKW--ISFGPICCFPGSSVLFALYCLILVELGVIGISLCSTHFGIRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYV
        +K        + L E  + Y  ++      PI     ++        IL +    GI      + I  DASF  +    G+GII+ +   QV  +A +Y 
Subjt:  SKGGAARPTDLALREIRDFYYQKW--ISFGPICCFPGSSVLFALYCLILVELGVIGISLCSTHFGIRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYV

Query:  GLKFLLIQQRASLLQKVFGLLKKWFQVFLVEMDFLRPYRILTSQVEDVSKLGLLFSGFRRGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED
         L+ +     A  +  V GL        L     + P       +ED+S+ G +    +       +  F F  R GN   H LA+ AL   E  +WMED
Subjt:  GLKFLLIQQRASLLQKVFGLLKKWFQVFLVEMDFLRPYRILTSQVEDVSKLGLLFSGFRRGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED

A0A6J1DBJ7 uncharacterized protein LOC111018973 (e value: 3.7e-09; % identity: 33.1)
Query:  IRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYVGLKFLLIQQRASLLQKVFGL---------LKKWFQVFLVEMDFLRPYRILTSQVEDVSKLGLLFS
        + VDA+F+  + + G+G+I+RD    V+L+A+R +         RAS +  V G          ++  F  F +E D LR + +LT+   D S++G+L S
Subjt:  IRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYVGLKFLLIQQRASLLQKVFGL---------LKKWFQVFLVEMDFLRPYRILTSQVEDVSKLGLLFS

Query:  GFRRGLIDH-RNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED
          +  L  H     F FT R+GN   H LAQLAL     ++W+E+
Subjt:  GFRRGLIDH-RNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED

A0A803NST3 Uncharacterized protein (e value: 8.3e-09; % identity: 22.31)
Query:  FSVPGI-WDSTKFCDNFIPEEAHLIPSLPIPRAELQDQPIWHFEKSEIFTIKSGYHLAQPAAFQALPSFSLSTVSSWWK---------------------
        F  P + WD+ +    F P  AH I S+P+P    QD  +W    S IF++KSGYHL+  +A  +L   S S+ S WWK                     
Subjt:  FSVPGI-WDSTKFCDNFIPEEAHLIPSLPIPRAELQDQPIWHFEKSEIFTIKSGYHLAQPAAFQALPSFSLSTVSSWWK---------------------

Query:  ---------------------CGQSREDVLHVFWLCKVAQGHWHKSQFHSF---------WHSRPSNPFSPDSKGGAARPTDLAL-------REIRDFYY
                             CG   E V H  + C+  +  W  +QF S+         +H      ++  SK   A    ++        + ++   +
Subjt:  ---------------------CGQSREDVLHVFWLCKVAQGHWHKSQFHSF---------WHSRPSNPFSPDSKGGAARPTDLAL-------REIRDFYY

Query:  QKWISFGPICCFPGSSVLFALYCLILVELGVIG---ISLCSTHFGIRVDASFQPNTGLVGIGIILRDRFDQVFLS-AMRYVGLKFLLIQQRASLLQKVFG
           +   P+  F  + +      L + +L  +    I        + VDA+     G VG G ++R+    V  + A  Y G   +   +  +LL  +  
Subjt:  QKWISFGPICCFPGSSVLFALYCLILVELGVIG---ISLCSTHFGIRVDASFQPNTGLVGIGIILRDRFDQVFLS-AMRYVGLKFLLIQQRASLLQKVFG

Query:  LLKKWFQVFLVEMDFLRPYRILTSQVEDVSKLGLLFSGFRRGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED
         + + F V  VE D       +    +D+S  G L    +  L    +     T R+ N    +LA  A    E  VW+ D
Subjt:  LLKKWFQVFLVEMDFLRPYRILTSQVEDVSKLGLLFSGFRRGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGATAGCACTGTAAGTTCTCTATTTTCTGTTCCTGGTATTTTGGATTCGACTAAGGTCTGTGATAATTTTATTCCTGAGGAGGCCCGTCTAACCCTATCTCTCCCGAT
TCCAAGGGCAGAGCTGCAAGACCAACTGATTTGGAACTTCGAGAAATCAGAGATTTTTACTATCAAAAGTGGATATCATTTGGCCCAATCTGCTGCTTTTCAGGCTCTTC
CGTCCTTTTCGCTCTCTACTGTTTCATGGTTCCTGTTTTCTGTTCCTGGTATTTGGGATTCGACTAAGTTCTGTGATAATTTTATTCCTGAGGAGGCCCATCTAATCCCA
TCTCTCCCGATTCCAAGGGCGGAGCTGCAAGACCAACCGATTTGGCACTTCGAGAAATCAGAGATTTTTACTATCAAAAGTGGATATCATTTGGCCCAACCTGCTGCTTT
TCAGGCTCTTCCGTCCTTTTCGCTCTCTACTGTTTCATCTTGGTGGAAGTGTGGTCAATCAAGGGAAGACGTTCTCCATGTGTTTTGGCTTTGTAAGGTAGCTCAGGGTC
ATTGGCATAAGTCTCAGTTCCACTCATTTTGGCATTCGCGCCCATCTAATCCCTTCTCTCCCGATTCCAAGGGCGGAGCTGCAAGACCAACTGATTTGGCACTTCGAGAA
ATCAGAGATTTTTACTATCAAAAGTGGATATCATTTGGCCCAATCTGCTGCTTTCCAGGCTCTTCCGTCCTTTTCGCTCTCTACTGTTTAATCTTGGTGGAACTTGGGGT
CATTGGCATAAGTCTTTGTTCCACTCATTTTGGCATTCGCGTCGATGCTAGTTTCCAACCGAATACGGGATTGGTTGGGATTGGCATTATCCTTCGTGATAGATTTGATC
AGGTGTTCTTATCTGCGATGCGCTACGTGGGGCTCAAATTTCTATTGATTCAGCAGAGGGCCTCGCTGTTGCAGAAGGTTTTCGGTTTGCTCAAGAAATGGTTTCAAGTC
TTTTTAGTTGAGATGGATTTTTTGCGCCCTTATCGCATTCTTACCTCACAGGTTGAAGATGTATCGAAATTGGGATTGTTGTTTTCTGGTTTTCGACGTGGCCTCATTGA
TCACCGGAATTGCAAATTTCTGTTTACCCCTCGGTCTGGGAATGTTACAGTTCATCGTCTTGCTCAATTAGCTCTATGTCGACAAGAGAATCGTGTGTGGATGGAAGATT
GA
mRNA sequence
ATGGATAGCACTGTAAGTTCTCTATTTTCTGTTCCTGGTATTTTGGATTCGACTAAGGTCTGTGATAATTTTATTCCTGAGGAGGCCCGTCTAACCCTATCTCTCCCGAT
TCCAAGGGCAGAGCTGCAAGACCAACTGATTTGGAACTTCGAGAAATCAGAGATTTTTACTATCAAAAGTGGATATCATTTGGCCCAATCTGCTGCTTTTCAGGCTCTTC
CGTCCTTTTCGCTCTCTACTGTTTCATGGTTCCTGTTTTCTGTTCCTGGTATTTGGGATTCGACTAAGTTCTGTGATAATTTTATTCCTGAGGAGGCCCATCTAATCCCA
TCTCTCCCGATTCCAAGGGCGGAGCTGCAAGACCAACCGATTTGGCACTTCGAGAAATCAGAGATTTTTACTATCAAAAGTGGATATCATTTGGCCCAACCTGCTGCTTT
TCAGGCTCTTCCGTCCTTTTCGCTCTCTACTGTTTCATCTTGGTGGAAGTGTGGTCAATCAAGGGAAGACGTTCTCCATGTGTTTTGGCTTTGTAAGGTAGCTCAGGGTC
ATTGGCATAAGTCTCAGTTCCACTCATTTTGGCATTCGCGCCCATCTAATCCCTTCTCTCCCGATTCCAAGGGCGGAGCTGCAAGACCAACTGATTTGGCACTTCGAGAA
ATCAGAGATTTTTACTATCAAAAGTGGATATCATTTGGCCCAATCTGCTGCTTTCCAGGCTCTTCCGTCCTTTTCGCTCTCTACTGTTTAATCTTGGTGGAACTTGGGGT
CATTGGCATAAGTCTTTGTTCCACTCATTTTGGCATTCGCGTCGATGCTAGTTTCCAACCGAATACGGGATTGGTTGGGATTGGCATTATCCTTCGTGATAGATTTGATC
AGGTGTTCTTATCTGCGATGCGCTACGTGGGGCTCAAATTTCTATTGATTCAGCAGAGGGCCTCGCTGTTGCAGAAGGTTTTCGGTTTGCTCAAGAAATGGTTTCAAGTC
TTTTTAGTTGAGATGGATTTTTTGCGCCCTTATCGCATTCTTACCTCACAGGTTGAAGATGTATCGAAATTGGGATTGTTGTTTTCTGGTTTTCGACGTGGCCTCATTGA
TCACCGGAATTGCAAATTTCTGTTTACCCCTCGGTCTGGGAATGTTACAGTTCATCGTCTTGCTCAATTAGCTCTATGTCGACAAGAGAATCGTGTGTGGATGGAAGATT
GA
Protein sequence
MDSTVSSLFSVPGILDSTKVCDNFIPEEARLTLSLPIPRAELQDQLIWNFEKSEIFTIKSGYHLAQSAAFQALPSFSLSTVSWFLFSVPGIWDSTKFCDNFIPEEAHLIP
SLPIPRAELQDQPIWHFEKSEIFTIKSGYHLAQPAAFQALPSFSLSTVSSWWKCGQSREDVLHVFWLCKVAQGHWHKSQFHSFWHSRPSNPFSPDSKGGAARPTDLALRE
IRDFYYQKWISFGPICCFPGSSVLFALYCLILVELGVIGISLCSTHFGIRVDASFQPNTGLVGIGIILRDRFDQVFLSAMRYVGLKFLLIQQRASLLQKVFGLLKKWFQV
FLVEMDFLRPYRILTSQVEDVSKLGLLFSGFRRGLIDHRNCKFLFTPRSGNVTVHRLAQLALCRQENRVWMED
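
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code, which the record does not state. `translate` is a hypothetical helper, and only the first 30 and last 18 nucleotides of the CDS listed above are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


first_codons = "ATGGATAGCACTGTAAGTTCTCTATTTTCT"  # first 30 nt of the CDS
last_codons = "GTGTGGATGGAAGATTGA"               # last 18 nt of the CDS

print(translate(first_codons))
print(translate(last_codons))
```

The two fragments translate to `MDSTVSSLFS` and `VWMED*`, which agree with the start and end of the protein sequence listed above. Joining the CDS lines into one string and passing it to `translate` would extend the check to the full sequence.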